inference-optimization/Qwen3-30B-A3B-Instruct-2507_7.0_bits_mode_heuristic • 27B • Updated 1 day ago • 15
inference-optimization/Qwen3-30B-A3B-Instruct-2507_6.5_bits_mode_heuristic • 25B • Updated 1 day ago • 15
inference-optimization/Qwen3-30B-A3B-Instruct-2507_6.0_bits_mode_heuristic • 23B • Updated 1 day ago • 15
inference-optimization/Qwen3-30B-A3B-Instruct-2507_5.5_bits_mode_heuristic • 22B • Updated 2 days ago • 14
inference-optimization/Qwen3-30B-A3B-Instruct-2507_5.5_bits_mode_hybrid • 22B • Updated 2 days ago • 12
inference-optimization/Qwen3-30B-A3B-Instruct-2507_5.0_bits_mode_heuristic • 20B • Updated 2 days ago • 14
inference-optimization/Qwen3-30B-A3B-Instruct-2507_5.0_bits_mode_hybrid • 20B • Updated 2 days ago • 19
inference-optimization/Qwen3-30B-from-Qwen3-235B_resps-speculators.eagle3-ckpt3 • 0.5B • Updated 5 days ago • 23
inference-optimization/gpt-oss-120b-from-qwen235b-ckpt3-speculator.eagle3 • 0.9B • Updated 5 days ago • 39
inference-optimization/gpt-oss-120b-from-qwen235b-ckpt1-speculator.eagle3 • 0.9B • Updated 5 days ago • 24
inference-optimization/gpt-oss-120b-from-qwen235b-ckpt0-speculator.eagle3 • 0.9B • Updated 5 days ago • 23
inference-optimization/Qwen3-30B-from-Qwen3-235B_resps-speculators.eagle3-ckpt2 • 0.5B • Updated 5 days ago • 22