BenchFlow

company

https://benchflow.ai

AI & ML interests

None defined yet.

Recent Activity

xdotli updated a dataset 9 days ago

benchflow/skillsbench-trajectories-apr2026

xdotli published a dataset 9 days ago

benchflow/skillsbench-trajectories-apr2026

xdotli updated a dataset 11 days ago

benchflow/skillsbench-data

View all activity

Papers

ClawsBench: Evaluating Capability and Safety of LLM Productivity Agents in Simulated Workspaces

SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks

View all Papers

Collections 1

models 0

None public yet

datasets 6

benchflow/skillsbench-trajectories-apr2026

Updated 9 days ago • 54

benchflow/skillsbench-data

Viewer • Updated 11 days ago • 94.3k • 61

benchflow/ClawsBench

Viewer • Updated 23 days ago • 7.83k • 473 • 1

benchflow/artifacts

Preview • Updated Jan 22 • 19

benchflow/skills_parquet

Viewer • Updated Jan 16 • 35.5k • 11 • 1

benchflow/skills

Updated Jan 14 • 46