·
AI & ML interests
None yet
Organizations
view article AutoBench Goes Scientific: Rigorous Validation for a Dynamic, Open-Source LLM Benchmark
PeterKruger
• • 4
view article AutoBench Third Run: Revolutionizing LLM Evaluation with Record-Breaking Scale, Accuracy, and a New Home at autobench.org
PeterKruger
• • 6
upvoted a paper about 1 year ago