Update README.md
README.md CHANGED
@@ -29,9 +29,9 @@ library_name: transformers
 <h2 style="margin-top: 0rem;">Kimi K2.5 Usage Guidelines</h2>
 </div>
 
-- No vision support at the moment.
 - It is recommended to have at least 240GB unified memory or RAM/VRAM to run the small quants. With 16GB VRAM and 256 RAM, expect 5+ tokens/sec.
 - For best results, use any 2-bit XL quant or above (requires >380GB unified memory /combined RAM + VRAM).
+- No vision support at the moment.
 - To run the model in **full precision**, you can use the 4-bit or 5-bit quants. You can use any higher just to be safe.
 - For complete detailed instructions (sampling parameters etc.), see our guide: [docs.unsloth.ai/models/kimi-k2.5](https://docs.unsloth.ai/models/kimi-k2.5)
 
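
The memory guidance in the README lines shown in this diff can be turned into a quick pre-flight check. The following is a minimal sketch and not part of the commit: the function name `suggest_quant_tier` and the returned labels are hypothetical, and it assumes the 240GB and 380GB figures apply to combined RAM + VRAM (or to total unified memory), as the README text suggests.

```python
def suggest_quant_tier(ram_gb: float, vram_gb: float = 0.0) -> str:
    """Map available memory to the README's guidance (hypothetical helper).

    On unified-memory machines, pass the total as ram_gb and leave vram_gb at 0.
    Thresholds are taken from the README text: at least 240GB for the small
    quants, more than 380GB for 2-bit XL quants or above.
    """
    if ram_gb < 0 or vram_gb < 0:
        raise ValueError("memory sizes must be non-negative")

    total_gb = ram_gb + vram_gb
    if total_gb > 380:
        return "2-bit XL or above"
    if total_gb >= 240:
        return "small quants"
    return "below recommended minimum"


if __name__ == "__main__":
    # The README's own example: 16GB VRAM with 256GB RAM (272GB combined).
    print(suggest_quant_tier(ram_gb=256, vram_gb=16))  # prints "small quants"
```

The check only compares totals; it does not estimate throughput, so the README's "5+ tokens/sec" figure is not modeled here.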