[ICLR 2026] Everything in Its Place: Benchmarking Spatial Intelligence of Text-to-Image Models
Zengbin Wang
MuMing0102
·
AI & ML interests
Agentic AI, Multimodal LLM, Computer Vision
Recent Activity
upvoted a paper 3 days ago
Learning Agentic Policy from Action Guidance authored a paper 10 days ago
Visually-Guided Policy Optimization for Multimodal Reasoning