GridProbe: Posterior-Probing for Adaptive Test-Time Compute in Long-Video VLMs Paper • 2605.10762 • Published 6 days ago • 2
ReflectDrive-2: Reinforcement-Learning-Aligned Self-Editing for Discrete Diffusion Driving Paper • 2605.04647 • Published 11 days ago • 9
LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model Paper • 2604.20796 • Published 25 days ago • 240
HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents Paper • 2604.07430 • Published Apr 8 • 187
VideoZeroBench: Probing the Limits of Video MLLMs with Spatio-Temporal Evidence Verification Paper • 2604.01569 • Published Apr 2 • 13
AIBench: Evaluating Visual-Logical Consistency in Academic Illustration Generation Paper • 2603.28068 • Published Mar 31 • 13
CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence Paper • 2603.28032 • Published Mar 30 • 342
daaxila/twitter-wrmmm520-2026.03.20-2034970421603471732-HK5qx-wAICzbLdjh-part1 Viewer • Updated Apr 3 • 1 • 20 • 1