Stream-R1: Reliability-Perplexity Aware Reward Distillation for Streaming Video Generation Paper • 2605.03849 • Published 4 days ago • 116
OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents Paper • 2605.05185 • Published 3 days ago • 90
MolmoAct2: Action Reasoning Models for Real-world Deployment Paper • 2605.02881 • Published 5 days ago • 263
GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents Paper • 2604.26752 • Published 10 days ago • 100
Video Analysis and Generation via a Semantic Progress Function Paper • 2604.22554 • Published 15 days ago • 63
Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond Paper • 2604.22748 • Published 15 days ago • 224
LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model Paper • 2604.20796 • Published 17 days ago • 239
Tstars-Tryon 1.0: Robust and Realistic Virtual Try-On for Diverse Fashion Items Paper • 2604.19748 • Published 18 days ago • 249
OneVL: One-Step Latent Reasoning and Planning with Vision-Language Explanation Paper • 2604.18486 • Published 19 days ago • 90
KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance Paper • 2604.12627 • Published 25 days ago • 100
Running on Zero MCP Featured 256 Qwen Image Edit 2511 Fast 🏆 256 Fast 4 step inference of Qwen Image Edit 2511