Neer Vana's picture

3 2

Neer Vana

Neervana

·

AI & ML interests

None yet

Organizations

upvoted an article 7 months ago

Article

AutoBench Goes Scientific: Rigorous Validation for a Dynamic, Open-Source LLM Benchmark

PeterKruger

•

Oct 29, 2025

• 4

upvoted an article 9 months ago

Article

AutoBench Third Run: Revolutionizing LLM Evaluation with Record-Breaking Scale, Accuracy, and a New Home at autobench.org

PeterKruger

•

Aug 20, 2025

• 6

upvoted a paper about 1 year ago

Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers

Paper • 2504.20752 • Published Apr 29, 2025 • 96