Collection of Quantized Models for MoE
Krishna Teja Chitty-Venkata
AI & ML interests
LLM Optimization, Neural Architecture Search, Quantization, Pruning
Recent Activity
updated a model 1 day ago: RedHatAI/NVIDIA-Nemotron-3-Super-120B-A12B-BF16
updated a model 7 days ago: inference-optimization/Qwen3.6-35B-A3B-7.0-bits-mode-noise
updated a model 7 days ago: inference-optimization/Qwen3.6-35B-A3B-7.0-bits-mode-hybrid