Large Language Models Explore by Latent Distilling Paper • 2604.24927 • Published 13 days ago • 72 • 7
SWE-chat: Coding Agent Interactions From Real Users in the Wild Paper • 2604.20779 • Published 18 days ago • 14 • 5
OneVL: One-Step Latent Reasoning and Planning with Vision-Language Explanation Paper • 2604.18486 • Published 20 days ago • 90 • 4
REAM: Merging Improves Pruning of Experts in LLMs Paper • 2604.04356 • Published Apr 6 • 9 • 4
Embarrassingly Simple Self-Distillation Improves Code Generation Paper • 2604.01193 • Published Apr 1 • 47 • 7
Omnilingual MT: Machine Translation for 1,600 Languages Paper • 2603.16309 • Published Mar 17 • 22 • 5
TERMINATOR: Learning Optimal Exit Points for Early Stopping in Chain-of-Thought Reasoning Paper • 2603.12529 • Published Mar 13 • 19 • 3
Accent Vector: Controllable Accent Manipulation for Multilingual TTS Without Accented Data Paper • 2603.07534 • Published Mar 8 • 5 • 3
Yor-Sarc: A gold-standard dataset for sarcasm detection in a low-resource African language Paper • 2602.18964 • Published Feb 21 • 1 • 4
Late-to-Early Training: LET LLMs Learn Earlier, So Faster and Better Paper • 2602.05393 • Published Feb 5 • 8 • 3
Chronicals: A High-Performance Framework for LLM Fine-Tuning with 3.51x Speedup over Unsloth Paper • 2601.02609 • Published Jan 6 • 2 • 2
EPAS: Efficient Training with Progressive Activation Sharing Paper • 2601.19089 • Published Jan 27 • 1 • 1