Updated • 13k
• 196
Viewer
• Updated • 170M • 21.2k
• 94
Viewer
• Updated • 621M • 28.4k
• 88
Locutusque/UltraTextbooks
Viewer
• Updated • 5.52M • 2.41k
• 199
PrimeIntellect/StackV1-popular
Viewer
• Updated • 93M • 1.63k
• 2
Viewer
• Updated • 11.7M • 242
• 6
EleutherAI/the_pile_deduplicated
Viewer
• Updated • 134M • 22.2k
• 112
HIT-TMG/KaLM-embedding-pretrain-data
Viewer
• Updated • 23.7M • 1.25k
• 21
suriyagunasekar/stackoverflow-with-meta-data
Viewer
• Updated • 19.9M • 4.82k
• 12
Viewer
• Updated • 13.6M • 98
• 5
Viewer
• Updated • 3.71M • 1.34M
• 684
Viewer
• Updated • 474M • 182
• 4
EleutherAI/deep-ignorance-annealing-mix
Viewer
• Updated • 89M • 1.18k
• 2
Viewer
• Updated • 10.2M • 483
• 5
Viewer
• Updated • 1.76M • 34.7k
• 406
Viewer
• Updated • 167M • 5.69k
• 70
Locutusque/deeplm-training-data
Viewer
• Updated • 2.17M • 214
• 3
nvidia/Llama-Nemotron-Post-Training-Dataset
Viewer
• Updated • 3.91M • 4.68k
• 662
Updated • 1.2M
• 256
EssentialAI/essential-web-v1.0
Preview
• Updated • 47.8k
• 224
Viewer
• Updated • 15.2B • 144k
• 78