reasoning-degeneration-dev/ttt-discover-circle_packing_26-qwen3-8b-v2 Viewer • Updated Apr 4 • 24 • 22
reasoning-degeneration-dev/ttt-discover-circle_packing_26-qwen3-8b-v1 Viewer • Updated Apr 4 • 20 • 28 • 1
Algorithmic SFT vs Distillation Collection 10 LoRA adapters + 6 datasets. Algo template SFT vs QwQ distillation on Qwen2.5-1.5B-Instruct across 4 reasoning domains. • 16 items • Updated Apr 1