Open to Collab

Boxi Yu

Bertsekas

https://boxiyu.github.io/

AI & ML interests

Coding Agent, Automated Operator

Recent Activity

upvoted a paper 3 days ago

FrontierSmith: Synthesizing Open-Ended Coding Problems at Scale

upvoted a paper about 1 month ago

Combee: Scaling Prompt Learning for Self-Improving Language Model Agents

liked a model 2 months ago

Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled

authored 2 papers 11 months ago

How Should I Build A Benchmark? Revisiting Code-Related Benchmarks For LLMs

Paper • 2501.10711 • Published Jan 18, 2025 • 1

UTBoost: Rigorous Evaluation of Coding Agents on SWE-Bench

Paper • 2506.09289 • Published Jun 10, 2025 • 2