Hugging Face
In a Training Loop 🔄

Peng Wang

stillarrow
https://peter-peng-w.github.io/

AI & ML interests

None yet

Recent Activity

updated a model 41 minutes ago
stillarrow/qwen2.5-coder-1.5b-instruct__grpo_no_std_code_hidden_only_shortcut_guard
published a model 41 minutes ago
stillarrow/qwen2.5-coder-1.5b-instruct__grpo_no_std_code_hidden_only_shortcut_guard
updated a model about 4 hours ago
stillarrow/qwen2.5-math-7b__math_subject_proportional_cluster-246fecfa-et_mix_lambda_no_drift_off_ratio_100

Organizations

None yet

Models (5)

stillarrow/qwen2.5-coder-1.5b-instruct__grpo_no_std_code_hidden_only_shortcut_guard

Updated 41 minutes ago

stillarrow/qwen2.5-math-7b__math_subject_proportional_cluster-246fecfa-et_mix_lambda_no_drift_off_ratio_100

Updated about 4 hours ago

stillarrow/qwen2.5-math-7b__skill_accuracy_binning_max_entrop-0939fc56-policy_lambda_no_drift_off_ratio_100

Updated about 4 hours ago

stillarrow/qwen2.5-math-7b__skill_accuracy_binning_max_entrop-6bc47709-et_mix_lambda_no_drift_off_ratio_100

Updated 1 day ago • 21

stillarrow/qwen2.5-math-7b__skill_accuracy_binning_max_entrop-aabaf976-policy_lambda_no_drift_off_ratio_100

Updated 1 day ago • 18

Datasets (1)

stillarrow/MATH

Viewer • Updated Sep 25, 2025 • 26.5k • 37