Kimi-K2.6 / .eval_results /terminal_bench_2.yaml
bigeagle's picture
Add evaluation results for HLE, GPQA, AIME, HMMT, SWE-Bench, and Terminal-Bench (#4)
d9cb81b
raw
history blame contribute delete
224 Bytes
- dataset:
id: harborframework/terminal-bench-2.0
task_id: terminalbench_2
value: 66.7
date: '2026-04-20'
source:
url: https://huggingface.co/moonshotai/Kimi-K2.6
name: Model Card
user: SaylorTwift