HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents Paper • 2604.07430 • Published Apr 8 • 187
Qwen3.5 Omni Offline Demo 🌍 Chat with a multimodal AI using text, audio, images, or video • Running • Agents • 145
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled Image-Text-to-Text • 28B • Updated Apr 6 • 241k • 2.84k
TADA Collection TADA: A Generative Framework for Speech Modeling via Text-Acoustic Dual Alignment | https://huggingface.co/papers/2602.23068 • 7 items • Updated Mar 24 • 71
Article Introducing Daggr: Chain apps programmatically, inspect visually • merve, ysharma, abidlabs, hysts, pcuenq • Jan 29 • 107
GutenOCR: A Grounded Vision-Language Front-End for Documents Paper • 2601.14490 • Published Jan 20 • 37