unsloth/Kimi-K2.5-GGUF


danielhanchen committed 8 days ago

Commit 955c264 · verified · 1 Parent(s): 44c5713

Update README.md

Files changed (1)

README.md +1 -1

README.md CHANGED

@@ -29,9 +29,9 @@ library_name: transformers
 <h2 style="margin-top: 0rem;">Kimi K2.5 Usage Guidelines</h2>
 </div>
-- No vision support at the moment.
 - It is recommended to have at least 240GB unified memory or RAM/VRAM to run the small quants. With 16GB VRAM and 256 RAM, expect 5+ tokens/sec.
 - For best results, use any 2-bit XL quant or above (requires >380GB unified memory /combined RAM + VRAM).
 - To run the model in **full precision**, you can use the 4-bit or 5-bit quants. You can use any higher just to be safe.
 - For complete detailed instructions (sampling parameters etc.), see our guide: [docs.unsloth.ai/models/kimi-k2.5](https://docs.unsloth.ai/models/kimi-k2.5)

 <h2 style="margin-top: 0rem;">Kimi K2.5 Usage Guidelines</h2>
 </div>
 - It is recommended to have at least 240GB unified memory or RAM/VRAM to run the small quants. With 16GB VRAM and 256 RAM, expect 5+ tokens/sec.
 - For best results, use any 2-bit XL quant or above (requires >380GB unified memory /combined RAM + VRAM).
+- No vision support at the moment.
 - To run the model in **full precision**, you can use the 4-bit or 5-bit quants. You can use any higher just to be safe.
 - For complete detailed instructions (sampling parameters etc.), see our guide: [docs.unsloth.ai/models/kimi-k2.5](https://docs.unsloth.ai/models/kimi-k2.5)
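
The memory guidance in the README bullets above can be sketched as a small check. This is only an illustration, not part of the commit or of any Unsloth tooling: the function name `recommend_quant_tier` and its return strings are hypothetical, and the only values taken from the README are the 240GB and 380GB thresholds.

```python
# Thresholds quoted from the README bullets (combined RAM + VRAM, in GB).
SMALL_QUANT_MIN_GB = 240   # "at least 240GB ... to run the small quants"
XL_2BIT_MIN_GB = 380       # "2-bit XL quant or above (requires >380GB ...)"


def recommend_quant_tier(ram_gb: float, vram_gb: float = 0.0) -> str:
    """Map combined memory to the tier named in the usage guidelines.

    Hypothetical helper: the tier labels are illustrative, not official names.
    """
    total_gb = ram_gb + vram_gb
    if total_gb > XL_2BIT_MIN_GB:
        return "2-bit XL or above"
    if total_gb >= SMALL_QUANT_MIN_GB:
        return "small quants"
    return "below recommended minimum"


if __name__ == "__main__":
    # The README's own example machine: 16GB VRAM and 256GB RAM.
    print(recommend_quant_tier(ram_gb=256, vram_gb=16))
```

The sketch sums RAM and VRAM because the README states both requirements in terms of unified or combined memory; it says nothing about throughput, which the README only estimates for one configuration.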