david-thrower/HelixLM-tiny-10k-samples-s1-8942pt-s2-700it-20260428 Text Generation • 19.8M • Updated 12 days ago • 318
david-thrower/HelixLM-tiny-10k-samples-s1-8942pt-s2-700it-20260428 Text Generation • 19.8M • Updated 12 days ago • 318
david-thrower/HelixLM-tiny-10k-samples-s1-8942pt-s2-700it-20260427 Text Generation • 19.8M • Updated 13 days ago • 13
david-thrower/HelixLM-tiny-10k-samples-s1-8942pt-s2-700it-20260427 Text Generation • 19.8M • Updated 13 days ago • 13
SmolLM3 pretraining datasets Collection datasets used in SmolLM3 pretraining • 15 items • Updated Aug 12, 2025 • 48