inference-optimization/ctest-Qwen3.5-9B-sliding-window-all-speculator.dflash 2B • Updated 5 days ago • 39
inference-optimization/ctest-Qwen3.5-9B-sliding-window-speculator.dflash 2B • Updated 6 days ago • 56
inference-optimization/ctest-Qwen3.6-27B-speculator-dataset Viewer • Updated 12 days ago • 5.61k • 28
inference-optimization/Qwen3-235B-A22B-Thinking-2507-quantized.w4a16 Text Generation • 32B • Updated 20 days ago • 193
inference-optimization/Qwen3-235B-A22B-Thinking-2507-quantized.w8a8 Text Generation • 235B • Updated 20 days ago • 184
inference-optimization/Qwen3-235B-A22B-Instruct-2507-quantized.w4a16 Text Generation • 32B • Updated 20 days ago • 226
RedHatAI/Qwen3-235B-A22B-Instruct-2507-quantized.w8a8 Text Generation • 235B • Updated 20 days ago • 193
inference-optimization/ctest-subset-Qwen3.5-397B-A17B-FP8-dynamic-speculator-dataset Viewer • Updated 21 days ago • 10k • 73
inference-optimization/final-ctest-Qwen3-8B-speculator-dataset Viewer • Updated 27 days ago • 10k • 61