Quantized Models (GGUF, IQ, Imatrix)
Collection
Various GGUF quantizations of small models. Models with a "checkmark" are personal favorites. An "orange arrow" means it's being uploaded. • 97 items • Updated • 71
Discussions
- General discussion and feedback.
Feedback is always welcome for potential issues with quants and as a way to help the author improve on the next iteration, your comments are appreciated!
SillyTavern
The complete AIO recommended preset:
[SillyTavern Presets]
3-bit
4-bit
5-bit
6-bit
8-bit
16-bit