1 14

Max Kleinegger

mkleinegger

AI & ML interests

None yet

Recent Activity

updated a model 4 days ago

mkleinegger/testing

published a model 4 days ago

mkleinegger/testing

upvoted a collection 7 days ago

GSQ

View all activity

Organizations

updated a model 4 days ago

mkleinegger/testing

8B • Updated 4 days ago • 14

published a model 4 days ago

mkleinegger/testing

8B • Updated 4 days ago • 14

upvoted a collection 7 days ago

GSQ

Collection

GSQ: Highly-Accurate Low-Precision Scalar Quantization for LLMs via Gumbel-Softmax Sampling, https://huggingface.co/papers/2604.18556 • 9 items • Updated 25 days ago • 9

updated a model 18 days ago

mkleinegger/llama3.1-8b-2bit

8B • Updated 18 days ago • 21

published a model 18 days ago

mkleinegger/llama3.1-8b-2bit

8B • Updated 18 days ago • 21

updated a model about 1 month ago

daslab-testing/Apertus-8B-Instruct-2509-vLLM-FP8

8B • Updated May 18 • 4

upvoted 2 papers about 1 month ago

GSQ: Highly-Accurate Low-Precision Scalar Quantization for LLMs via Gumbel-Softmax Sampling

Paper • 2604.18556 • Published Apr 20 • 8

MatryoshkaLoRA: Learning Accurate Hierarchical Low-Rank Representations for LLM Fine-Tuning

Paper • 2605.07850 • Published May 8 • 18

published a model about 1 month ago

daslab-testing/Apertus-8B-Instruct-2509-vLLM-FP8

8B • Updated May 18 • 4

updated a model about 1 month ago

daslab-testing/Apertus-v1.1-4B-Instruct-vLLM-FP8

4B • Updated May 18 • 4

published a model about 1 month ago

daslab-testing/Apertus-v1.1-4B-Instruct-vLLM-FP8

4B • Updated May 18 • 4

updated a model about 2 months ago

mkleinegger/test2

31B • Updated Apr 22 • 2

published a model about 2 months ago

mkleinegger/test2

31B • Updated Apr 22 • 2

authored a paper 4 months ago

MatGPTQ: Accurate and Efficient Post-Training Matryoshka Quantization

Paper • 2602.03537 • Published Feb 3 • 5

upvoted 4 papers 4 months ago

QuEST: Stable Training of LLMs with 1-Bit Weights and Activations

Paper • 2502.05003 • Published Feb 7, 2025 • 44

Bridging the Gap Between Promise and Performance for Microscaling FP4 Quantization

Paper • 2509.23202 • Published Sep 27, 2025 • 30

CAGE: Curvature-Aware Gradient Estimation For Accurate Quantization-Aware Training

Paper • 2510.18784 • Published Oct 21, 2025 • 2

WUSH: Near-Optimal Adaptive Transforms for LLM Quantization

Paper • 2512.00956 • Published Nov 30, 2025 • 23

commented a paper 4 months ago

MatGPTQ: Accurate and Efficient Post-Training Matryoshka Quantization

Paper • 2602.03537 • Published Feb 3 • 5 •

upvoted a paper 4 months ago

MatGPTQ: Accurate and Efficient Post-Training Matryoshka Quantization

Paper • 2602.03537 • Published Feb 3 • 5

Max Kleinegger

AI & ML interests

Recent Activity

Organizations

mkleinegger's activity