Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Jiwon Jeon's picture
5

Jiwon Jeon

jwjeonn

AI & ML interests

None yet

Recent Activity

upvoted a paper about 14 hours ago
Rebellious Student: Reversing Teacher Signals for Reasoning Exploration with Self-Distilled RLVR
updated a model 5 days ago
jwjeonn/GRPO-Qwen-Qwen3-4B-dapo_math
published a model 5 days ago
jwjeonn/GRPO-Qwen-Qwen3-4B-dapo_math
View all activity

Organizations

None yet

models 19

jwjeonn/GRPO-Qwen-Qwen3-4B-dapo_math

Updated 5 days ago

jwjeonn/SDPO-Qwen-Qwen3-4B-deepscaler_math-reprompt-tur0.0

Updated 6 days ago

jwjeonn/SDPO-Qwen-Qwen3-4B-dapo_math-reprompt-tur0.0

Updated 6 days ago

jwjeonn/SDPO20-kl-sdpobottom20-Qwen3-4B-dapo_math-origPrompt

Updated 6 days ago

jwjeonn/iterWsdpo_review-grpo20-maxbuff2000-sdpoepoch2-Qwen3-4B-dapo_math-reprompt-tur0.0_persist

Updated 13 days ago

jwjeonn/IterWSDPO-grpo20-sdpo20-Qwen3-4B-dapo_math-org-prompt-tur0.0

Updated 13 days ago

jwjeonn/GRPO-Qwen-Qwen3-4B-dapo_math-reprompt

Updated 13 days ago

jwjeonn/GRPO-Qwen-Qwen3-4B-Instruct-2507-dapo_math-reprompt

Updated 13 days ago

jwjeonn/IterSDPO-grpo30-sdpo10-Qwen3-4B-deepscaler_math

Updated 13 days ago

jwjeonn/SDPO-Qwen-Qwen3-4B-Instruct-2507-dapo_math-reprompt-tur0.0

Updated 13 days ago
View 19 models

datasets 1

jwjeonn/divtraj-data

Updated Mar 26 • 7.04k
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs