arxiv:2412.04537
rokosbasilisk
rb
AI & ML interests
Large multi-modal world models
Recent Activity
updated a dataset about 2 months ago: antieval/frontier_sweep_evals
published a dataset about 2 months ago: antieval/frontier_sweep_evals
updated a dataset about 2 months ago: antieval/swebench-trajectories