kumapo/qwen3-0.6b-sft-lora-rank256-1phase-assistant-only-loss Text Generation • 0.6B • Updated about 2 hours ago
kumapo/qwen3-0.6b-sft-lora-rank128-1phase-assistant-only-loss Text Generation • 0.6B • Updated about 2 hours ago
kumapo/qwen3-0.6b-sft-lora-rank512-1phase-assistant-only-loss Text Generation • 0.6B • Updated about 4 hours ago
kumapo/qwen3-0.6b-sft-lora-rank2048-1phase-assistant-only-loss Text Generation • 0.6B • Updated 1 day ago
kumapo/qwen3-0.6b-sft-lora-rank2048-2phase-assisistant-only-loss Text Generation • 0.6B • Updated 1 day ago • 2
kumapo/qwen3_235b_a22b_thinking_textbookreasoning_ugphysics_aops_mini 235B • Updated Sep 26, 2025 • 2
kumapo/qwen3_235b_a22b_thinking_tbrminifiltered01_ugpfiltered01_olymomniliveumathreasoning 235B • Updated Sep 26, 2025 • 2
kumapo/qwen3_235b_a22b_thinking_tbrminifiltered01nomath_olymomniliveumathreasoning 235B • Updated Sep 25, 2025 • 1
kumapo/qwen3_235b_a22b_thinking_tbrnomathphysics_olymomniiisliveumathreasoning_ugphysics 235B • Updated Sep 25, 2025 • 1
kumapo/qwen3_235b_a22b_thinking_tbrbalancedminifiltered01_ugpfiltered01_olymomniliveumathreasoning 235B • Updated Sep 25, 2025 • 1
kumapo/qwen3_235b_a22b_thinking_tbrnomath_olymomni2liveumathreasoning 235B • Updated Sep 25, 2025 • 1
kumapo/llm-jp-3-1.8b-jaster-dev-2634-ichikara-003-001-1 Text Generation • 2B • Updated Aug 10, 2025 • 5
kumapo/llm-jp-3-1.8b-ichikara-003-001-1-jaster-dev-first100 Text Generation • 2B • Updated Aug 10, 2025 • 4