admarcosai 's Collections Models
updated
openchat/openchat-3.5-1210
Text Generation
• 7B • Updated • 1.63k
• • 278
MoE-Mamba: Efficient Selective State Space Models with Mixture of
Experts
Paper
• 2401.04081
• Published • 74
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open
Language Models
Paper
• 2402.03300
• Published • 145
0.4B • Updated • 404k
• 238
Paper
• 2412.15115
• Published • 379
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for
Fast, Memory Efficient, and Long Context Finetuning and Inference
Paper
• 2412.13663
• Published • 163
Paper
• 2412.08905
• Published • 123