Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
2
16
2
Tongyao
PRO
tyzhu
Follow
seasaltflavour's profile picture
1 follower
·
1 following
tongyao-zhu
AI & ML interests
Natural Language Processing
Recent Activity
published
a model
about 1 month ago
tyzhu/checkpoints-llama32-1b-mathpros1sfeos_diff_intra-2k
updated
a model
about 1 month ago
tyzhu/checkpoints-llama32-1b-mathpros1sfeos_blk64-2k
updated
a model
about 1 month ago
tyzhu/checkpoints-llama32-1b-mathpros10sfeos_diff_intra_shift_uniform-4k
View all activity
Organizations
None yet
tyzhu
's models
37
Sort: Recently updated
tyzhu/checkpoints-llama32-1b-mathpros1sfeos_diff_intra-2k
Updated
Apr 2
tyzhu/checkpoints-llama32-1b-mathpros1sfeos_blk64-2k
Updated
Apr 2
tyzhu/checkpoints-llama32-1b-mathpros10sfeos_diff_intra_shift_uniform-4k
Updated
Apr 2
tyzhu/checkpoints-llama32-1b-mathpros10sfeos_intra-4k
Updated
Apr 1
tyzhu/checkpoints-llama32-1b-mathpros10sfeos_intra-2k_4node
Updated
Apr 1
tyzhu/checkpoints-llama32-1b-mathpros10sfeos_diff_intra_shift-2k_2node
Updated
Apr 1
tyzhu/checkpoints-llama32-1b-mathpros10sfeos_blk256-2k-correct-but-missing-second-last
Updated
Apr 1
tyzhu/checkpoints-llama32-1b-mathpros10sfeos_blk256-2k
Updated
Apr 1
tyzhu/checkpoints-llama32-3b-mathprosfeos_intra_zero1_tp2-2k
Updated
Apr 1
tyzhu/checkpoints
Updated
Apr 1
tyzhu/nanotron
Updated
Mar 31
tyzhu/blk2k_models_mar29
Updated
Mar 29
tyzhu/epochs-10-bs-2048-len-1024-qwen3-dllm-repro
Updated
Mar 28
tyzhu/mdlm_v3
Updated
Mar 27
tyzhu/llama32-3b-nt
Updated
Mar 19
tyzhu/checkpoints-llama32-1b-mathpros1sfeos_diff_intra-2k_4node_22000_hf
1B
•
Updated
Feb 10
•
1
tyzhu/checkpoints-llama32-1b-mathpros10sfeos_diff_intra-2k_4node_22000_hf
1B
•
Updated
Feb 10
•
1
tyzhu/fsp_tiny_LLaMA_1b_code_4k_step50000
Updated
Jan 29
tyzhu/fep_tiny_LLaMA_1b_code_4k_step50000
Updated
Jan 28
tyzhu/tiny_LLaMA_1b_code_4k_step50000
Updated
Jan 27
tyzhu/llama32-1b-nt
Updated
Jan 18
tyzhu/opencoder484
Text Generation
•
Updated
Dec 26, 2025
•
7
tyzhu/opencoder-1.5b-pystack80-opcanneal20-50ksteps
Updated
Nov 29, 2025
tyzhu/sokoban-1.5b-coord-baseline-rl1000
Updated
Nov 20, 2025
tyzhu/opencoder-1.5b-oppt80-opcanneal20-25ksteps-4nodes-4k
Updated
Nov 18, 2025
tyzhu/olmo-1b-finecode-5ksteps
Updated
Nov 9, 2025
tyzhu/webinsv1clear-grpo-qwen3-4b
Updated
Nov 7, 2025
tyzhu/SPA-frozenlake-qwen2.5-1.5b-instruct
2B
•
Updated
Oct 29, 2025
•
1
tyzhu/SPA-sudoku-qwen2.5-1.5b-instruct
2B
•
Updated
Oct 29, 2025
•
2
tyzhu/SPA-sokoban-qwen2.5-1.5b-instruct
0.4B
•
Updated
Oct 29, 2025
•
3
Previous
1
2
Next