FINAL_Bench

Team

community

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

SeaWolf-AI updated a dataset 3 days ago

FINAL-Bench/service-urls

SeaWolf-AI updated a Space 3 days ago

FINAL-Bench/model-galaxy

SeaWolf-AI new activity 3 days ago

FINAL-Bench/Darwin-28B-KR:New Release: Darwin-60B-DUO: Two SOTAs, One Endpoint — 88.38% on GPQA Diamond 🚀

View all activity

Papers

Darwin Family: MRI-Trust-Weighted Evolutionary Merging for Training-Free Scaling of Language-Model Reasoning

View all Papers

Articles

Training-Free Reasoning at 88.89% on GPQA Diamond: How Darwin Family Hit Frontier Scores Without a Single Gradient Step

17 days ago

• 18

Darwin-TTS: We Gave a TTS Model 3% of an LLM's Brain — It Started Showing Emotion

Apr 15

• 13

🏟️ Smol AI WorldCup: A 5-Axis Benchmark That Reveals What Small Language Models Can Really Do

Mar 10

• 38

MARL: Runtime Middleware That Reduces LLM Hallucination Without Fine-Tuning

Mar 9

• 16

Structural Problems in AI Benchmarking and the Case for a Unified Evaluation Framework

Mar 8

• 12

Do Bubbles Form When Tens of Thousands of AIs Simulate Capitalism?

Feb 24

• 17

FINAL Bench: The Real Bottleneck to AGI Is Self-Correction

Feb 21

• 20

View all articles

SeaWolf-AI

updated a dataset 3 days ago

FINAL-Bench/service-urls

Viewer • Updated 3 days ago • 1 • 7.93k • 1

SeaWolf-AI

updated a Space 3 days ago

Model Galaxy

🌌

Darwin family + 2026 trending models on the HF galaxy

SeaWolf-AI

in FINAL-Bench/Darwin-28B-KR 3 days ago

New Release: Darwin-60B-DUO: Two SOTAs, One Endpoint — 88.38% on GPQA Diamond 🚀

#1 opened 3 days ago by

SeaWolf-AI

in FINAL-Bench/Darwin-2B-Opus-LoRA 3 days ago

New Release: Darwin-60B-DUO: Two SOTAs, One Endpoint — 88.38% on GPQA Diamond 🚀

#1 opened 3 days ago by

SeaWolf-AI

in FINAL-Bench/Darwin-28B-KR-Legal 3 days ago

New Release: Darwin-60B-DUO: Two SOTAs, One Endpoint — 88.38% on GPQA Diamond 🚀

#2 opened 3 days ago by

SeaWolf-AI

in FINAL-Bench/Darwin-9B-MFP4 3 days ago

New Release: Darwin-60B-DUO: Two SOTAs, One Endpoint — 88.38% on GPQA Diamond 🚀

#1 opened 3 days ago by

SeaWolf-AI

in FINAL-Bench/Darwin-28B-Coder 3 days ago

New Release: Darwin-60B-DUO: Two SOTAs, One Endpoint — 88.38% on GPQA Diamond 🚀

#2 opened 3 days ago by

SeaWolf-AI

in FINAL-Bench/Darwin-35B-A3B-Opus-Q8-GGUF 3 days ago

New Release: Darwin-60B-DUO: Two SOTAs, One Endpoint — 88.38% on GPQA Diamond 🚀

#2 opened 3 days ago by

SeaWolf-AI

in FINAL-Bench/Darwin-2B-Opus 3 days ago

New Release: Darwin-60B-DUO: Two SOTAs, One Endpoint — 88.38% on GPQA Diamond 🚀

#2 opened 3 days ago by

SeaWolf-AI

in FINAL-Bench/Darwin-4B-Opus 3 days ago

New Release: Darwin-60B-DUO: Two SOTAs, One Endpoint — 88.38% on GPQA Diamond 🚀

#1 opened 3 days ago by

SeaWolf-AI

in FINAL-Bench/Darwin-28B-REASON 3 days ago

New Release: Darwin-60B-DUO: Two SOTAs, One Endpoint — 88.38% on GPQA Diamond 🚀

#2 opened 3 days ago by

SeaWolf-AI

in FINAL-Bench/Darwin-9B-Opus 3 days ago

New Release: Darwin-60B-DUO: Two SOTAs, One Endpoint — 88.38% on GPQA Diamond 🚀

#1 opened 3 days ago by

SeaWolf-AI

in FINAL-Bench/Darwin-28B-Opus 3 days ago

New Release: Darwin-60B-DUO: Two SOTAs, One Endpoint — 88.38% on GPQA Diamond 🚀

#4 opened 3 days ago by

SeaWolf-AI

in FINAL-Bench/Darwin-TTS-1.7B-Cross 3 days ago

New Release: Darwin-60B-DUO: Two SOTAs, One Endpoint — 88.38% on GPQA Diamond 🚀

#2 opened 3 days ago by

SeaWolf-AI

in FINAL-Bench/Darwin-27B-Opus 3 days ago

New Release: Darwin-60B-DUO: Two SOTAs, One Endpoint — 88.38% on GPQA Diamond 🚀

#3 opened 3 days ago by

SeaWolf-AI

in FINAL-Bench/Darwin-4B-Genesis 3 days ago

New Release: Darwin-60B-DUO: Two SOTAs, One Endpoint — 88.38% on GPQA Diamond 🚀

#5 opened 3 days ago by

SeaWolf-AI

in FINAL-Bench/Darwin-4B-David 3 days ago

New Release: Darwin-60B-DUO: Two SOTAs, One Endpoint — 88.38% on GPQA Diamond 🚀

#2 opened 3 days ago by

SeaWolf-AI

in FINAL-Bench/Darwin-9B-NEG 3 days ago

New Release: Darwin-60B-DUO: Two SOTAs, One Endpoint — 88.38% on GPQA Diamond 🚀

#2 opened 3 days ago by

SeaWolf-AI

in FINAL-Bench/Darwin-31B-Opus 3 days ago

New Release: Darwin-60B-DUO: Two SOTAs, One Endpoint — 88.38% on GPQA Diamond 🚀

#3 opened 3 days ago by

SeaWolf-AI

in FINAL-Bench/Darwin-36B-Opus 3 days ago

New Release: Darwin-60B-DUO: Two SOTAs, One Endpoint — 88.38% on GPQA Diamond 🚀

#11 opened 3 days ago by

SeaWolf-AI

AI & ML interests

Recent Activity

Papers

Articles

Training-Free Reasoning at 88.89% on GPQA Diamond: How Darwin Family Hit Frontier Scores Without a Single Gradient Step

Darwin-TTS: We Gave a TTS Model 3% of an LLM's Brain — It Started Showing Emotion

"Darwin-27B-Opus: Surpassing the Foundation Model Without Training"

Darwin V6: Diagnostic-Guided Evolutionary Model Merging

"The Child That Surpassed Both Parents Through MRI-Guided Evolutionary Merge"

Introducing WM Bench: A Benchmark for Cognitive Intelligence in World Models

🏟️ Smol AI WorldCup: A 5-Axis Benchmark That Reveals What Small Language Models Can Really Do

MARL: Runtime Middleware That Reduces LLM Hallucination Without Fine-Tuning

Structural Problems in AI Benchmarking and the Case for a Unified Evaluation Framework

Do Bubbles Form When Tens of Thousands of AIs Simulate Capitalism?

FINAL Bench: The Real Bottleneck to AGI Is Self-Correction

Team members 1

FINAL-Bench's activity

Model Galaxy

New Release: Darwin-60B-DUO: Two SOTAs, One Endpoint — 88.38% on GPQA Diamond 🚀

New Release: Darwin-60B-DUO: Two SOTAs, One Endpoint — 88.38% on GPQA Diamond 🚀

New Release: Darwin-60B-DUO: Two SOTAs, One Endpoint — 88.38% on GPQA Diamond 🚀

New Release: Darwin-60B-DUO: Two SOTAs, One Endpoint — 88.38% on GPQA Diamond 🚀

New Release: Darwin-60B-DUO: Two SOTAs, One Endpoint — 88.38% on GPQA Diamond 🚀

New Release: Darwin-60B-DUO: Two SOTAs, One Endpoint — 88.38% on GPQA Diamond 🚀

New Release: Darwin-60B-DUO: Two SOTAs, One Endpoint — 88.38% on GPQA Diamond 🚀

New Release: Darwin-60B-DUO: Two SOTAs, One Endpoint — 88.38% on GPQA Diamond 🚀

New Release: Darwin-60B-DUO: Two SOTAs, One Endpoint — 88.38% on GPQA Diamond 🚀

New Release: Darwin-60B-DUO: Two SOTAs, One Endpoint — 88.38% on GPQA Diamond 🚀

New Release: Darwin-60B-DUO: Two SOTAs, One Endpoint — 88.38% on GPQA Diamond 🚀

New Release: Darwin-60B-DUO: Two SOTAs, One Endpoint — 88.38% on GPQA Diamond 🚀

New Release: Darwin-60B-DUO: Two SOTAs, One Endpoint — 88.38% on GPQA Diamond 🚀

New Release: Darwin-60B-DUO: Two SOTAs, One Endpoint — 88.38% on GPQA Diamond 🚀

New Release: Darwin-60B-DUO: Two SOTAs, One Endpoint — 88.38% on GPQA Diamond 🚀

New Release: Darwin-60B-DUO: Two SOTAs, One Endpoint — 88.38% on GPQA Diamond 🚀

New Release: Darwin-60B-DUO: Two SOTAs, One Endpoint — 88.38% on GPQA Diamond 🚀

New Release: Darwin-60B-DUO: Two SOTAs, One Endpoint — 88.38% on GPQA Diamond 🚀