Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
12
4
2
Jay Gala
jaygala24
Follow
amr4444444444's profile picture
himanshubeniwal's profile picture
kurianbenoy's profile picture
4 followers
·
4 following
https://jaygala24.github.io/
jaygala24
jaygala24
AI & ML interests
Machine Learning, Natural Language Processing, Language and Vision Intersection, Fairness and Biases
Recent Activity
updated
a collection
about 5 hours ago
RL post-training
updated
a model
about 6 hours ago
jaygala24/Qwen3-4B-DAPO-math-reasoning
published
a model
about 6 hours ago
jaygala24/Qwen3-4B-DAPO-math-reasoning
View all activity
Organizations
jaygala24
's models
25
Sort: Recently updated
jaygala24/Qwen3-4B-DAPO-math-reasoning
Text Generation
•
4B
•
Updated
about 4 hours ago
jaygala24/Qwen3-4B-RLOO-math-reasoning
Text Generation
•
4B
•
Updated
3 days ago
•
174
jaygala24/Qwen3-1.7B-RLOO-math-reasoning
Text Generation
•
2B
•
Updated
4 days ago
•
706
jaygala24/Qwen2.5-3B-RLOO-math-reasoning
Text Generation
•
3B
•
Updated
4 days ago
•
659
jaygala24/Qwen2.5-1.5B-RLOO-math-reasoning
Text Generation
•
2B
•
Updated
4 days ago
•
626
jaygala24/Qwen2.5-0.5B-RLOO-math-reasoning
Text Generation
•
0.5B
•
Updated
4 days ago
•
587
jaygala24/Qwen3-1.7B-DAPO-math-reasoning
Text Generation
•
2B
•
Updated
4 days ago
•
563
jaygala24/Qwen2.5-3B-DAPO-math-reasoning
Text Generation
•
3B
•
Updated
4 days ago
•
569
jaygala24/Qwen2.5-1.5B-DAPO-math-reasoning
Text Generation
•
2B
•
Updated
4 days ago
•
561
jaygala24/Qwen2.5-0.5B-DAPO-math-reasoning
Text Generation
•
0.5B
•
Updated
4 days ago
•
577
jaygala24/Qwen3-4B-ReMax-math-reasoning
Text Generation
•
4B
•
Updated
10 days ago
•
805
jaygala24/Qwen3-4B-GRPO-math-reasoning
Text Generation
•
4B
•
Updated
10 days ago
•
862
jaygala24/Qwen3-4B-GRPO-KL-math-reasoning
Text Generation
•
4B
•
Updated
10 days ago
•
1.04k
jaygala24/Qwen3-1.7B-ReMax-math-reasoning
Text Generation
•
2B
•
Updated
10 days ago
•
865
jaygala24/Qwen3-1.7B-GRPO-math-reasoning
Text Generation
•
2B
•
Updated
10 days ago
•
821
jaygala24/Qwen3-1.7B-GRPO-KL-math-reasoning
Text Generation
•
2B
•
Updated
10 days ago
•
812
jaygala24/Qwen2.5-3B-ReMax-math-reasoning
Text Generation
•
3B
•
Updated
10 days ago
•
444
jaygala24/Qwen2.5-3B-GRPO-math-reasoning
Text Generation
•
3B
•
Updated
10 days ago
•
803
jaygala24/Qwen2.5-3B-GRPO-KL-math-reasoning
Text Generation
•
3B
•
Updated
10 days ago
•
778
jaygala24/Qwen2.5-1.5B-ReMax-math-reasoning
Text Generation
•
2B
•
Updated
10 days ago
•
438
jaygala24/Qwen2.5-1.5B-GRPO-math-reasoning
Text Generation
•
2B
•
Updated
10 days ago
•
567
jaygala24/Qwen2.5-1.5B-GRPO-KL-math-reasoning
Text Generation
•
2B
•
Updated
10 days ago
•
524
jaygala24/Qwen2.5-0.5B-ReMax-math-reasoning
Text Generation
•
0.5B
•
Updated
10 days ago
•
443
jaygala24/Qwen2.5-0.5B-GRPO-math-reasoning
Text Generation
•
0.5B
•
Updated
10 days ago
•
569
jaygala24/Qwen2.5-0.5B-GRPO-KL-math-reasoning
Text Generation
•
0.5B
•
Updated
10 days ago
•
536