Block Diffusion for Flash Speculative Decoding
AI & ML interests
Efficient AI
Papers
DFlash: Block Diffusion for Flash Speculative Decoding
ParoQuant: Pairwise Rotation Quantization for Efficient Reasoning LLM Inference
Pairwise Rotation Quantization for Efficient Reasoning LLM Inference
- ParoQuant: Pairwise Rotation Quantization for Efficient Reasoning LLM Inference
  Paper • 2511.10645 • Published • 14
- z-lab/Qwen3.6-27B-PARO
  Image-Text-to-Text • 6B • Updated • 3.48k • 27
- z-lab/Qwen3.6-35B-A3B-PARO
  Image-Text-to-Text • 6B • Updated • 5.05k • 6
- z-lab/gemma-4-31B-it-PARO
  Image-Text-to-Text • 6B • Updated • 1.07k • 22
Models 48
z-lab/gemma4-12B-it-DFlash
0.7B • Updated • 355 • 7
z-lab/MiniMax-M2.7-DFlash
Text Generation • 0.6B • Updated • 539 • 25
z-lab/MiniMax-M2.5-DFlash
Text Generation • 2B • Updated • 179 • 8
z-lab/Qwen3.6-35B-A3B-DFlash
Text Generation • 0.4B • Updated • 144k • 255
z-lab/Qwen3.5-9B-DFlash
Text Generation • 1B • Updated • 14.5k • 36
z-lab/Qwen3.5-4B-DFlash
Text Generation • 0.6B • Updated • 15.3k • 29
z-lab/Qwen3.5-35B-A3B-DFlash
Text Generation • 0.4B • Updated • 6.42k • 39
z-lab/Qwen3.5-27B-DFlash
Text Generation • 2B • Updated • 8.27k • 110
z-lab/Qwen3.5-122B-A10B-DFlash
Text Generation • 0.8B • Updated • 10.4k • 18
z-lab/Qwen3.5-397B-A17B-DFlash
Text Generation • 1B • Updated • 4.81k • 7
Datasets 8
z-lab/glm52-cc
Viewer • Updated • 2.26k • 14
z-lab/long-code-test
Viewer • Updated • 1k • 57 • 2
z-lab/kimi-k26-regen
Viewer • Updated • 1M • 5 • 3
z-lab/humaneval-long
Viewer • Updated • 1k • 29
z-lab/gsm8k-filtered
Viewer • Updated • 1.31k • 15
z-lab/mt-bench-filtered
Viewer • Updated • 79 • 3
z-lab/mbpp-sanitized-filtered
Viewer • Updated • 256 • 7
z-lab/humaneval-filtered
Viewer • Updated • 137 • 11