Investigating the Robustness of LLMs on Math Word Problems Paper • 2406.15444 • Published May 30, 2024
Expert Upcycling: Shifting the Compute-Efficient Frontier of Mixture-of-Experts Paper • 2604.19835 • Published 8 days ago • 18
A Unified Evaluation Framework for Novelty Detection and Accommodation in NLP with an Instantiation in Authorship Attribution Paper • 2305.05079 • Published May 8, 2023
Expert Upcycling: Shifting the Compute-Efficient Frontier of Mixture-of-Experts Paper • 2604.19835 • Published 8 days ago • 18
Expert Upcycling: Shifting the Compute-Efficient Frontier of Mixture-of-Experts Paper • 2604.19835 • Published 8 days ago • 18
Polymath: A Challenging Multi-modal Mathematical Reasoning Benchmark Paper • 2410.14702 • Published Oct 6, 2024 • 1
Polymath: A Challenging Multi-modal Mathematical Reasoning Benchmark Paper • 2410.14702 • Published Oct 6, 2024 • 1
Running Featured 1.33k FineWeb: decanting the web for the finest text data at scale 🍷 1.33k Explore and download the FineWeb web‑text dataset
LongBoX: Evaluating Transformers on Long-Sequence Clinical Tasks Paper • 2311.09564 • Published Nov 16, 2023
InstructABSA: Instruction Learning for Aspect Based Sentiment Analysis Paper • 2302.08624 • Published Feb 16, 2023 • 3