GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents Paper • 2604.26752 • Published 7 days ago • 97
Web2BigTable: A Bi-Level Multi-Agent LLM System for Internet-Scale Information Search and Extraction Paper • 2604.27221 • Published 7 days ago • 31
How Far Are We from Genuinely Useful Deep Research Agents? Paper • 2512.01948 • Published Dec 1, 2025 • 58
KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance Paper • 2604.12627 • Published 22 days ago • 100
FORGE: Fine-grained Multimodal Evaluation for Manufacturing Scenarios Paper • 2604.07413 • Published 28 days ago • 95
GEMS: Agent-Native Multimodal Generation with Memory and Skills Paper • 2603.28088 • Published Mar 30 • 85
Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model Paper • 2603.21986 • Published Mar 23 • 125
CoachMe: Decoding Sport Elements with a Reference-Based Coaching Instruction Generation Model Paper • 2509.11698 • Published Sep 15, 2025 • 1
Game Plan: What AI can do for Football, and What Football can do for AI Paper • 2011.09192 • Published Nov 18, 2020 • 1
Large Scale Generative AI Text Applied to Sports and Music Paper • 2402.15514 • Published Jan 31, 2024 • 3
BoxMind: Closed-loop AI strategy optimization for elite boxing validated in the 2024 Olympics Paper • 2601.11492 • Published Jan 16 • 1
Stepping VLMs onto the Court: Benchmarking Spatial Intelligence in Sports Paper • 2603.09896 • Published Mar 10 • 28
EnterpriseOps-Gym: Environments and Evaluations for Stateful Agentic Planning and Tool Use in Enterprise Settings Paper • 2603.13594 • Published Mar 13 • 148
MM-Zero: Self-Evolving Multi-Model Vision Language Models From Zero Data Paper • 2603.09206 • Published Mar 10 • 53