zentorch TorchAO Quantized Models - PyTorch 2.10 Collection TorchAO quantized models for AMD EPYC CPU inference. The inference stack includes vLLM (0.15.0 to 0.18.0), PyTorch 2.10, and zentorch 5.2.1. • 4 items • Updated 6 days ago
Mutual Adversarial Training: Learning together is better than going alone Paper • 2112.05005 • Published Dec 9, 2021
A2Q: Accumulator-Aware Quantization with Guaranteed Overflow Avoidance Paper • 2308.13504 • Published Aug 25, 2023