- dataset:
    id: Idavidrein/gpqa
    task_id: diamond
  value: 88.89
  date: "2026-04-25"
  source:
    url: https://huggingface.co/FINAL-Bench/Darwin-28B-Opus
    name: Darwin-28B-Opus Benchmark (3-stage Adaptive Evaluation)
    user: vidraft