andy s's picture

andy s

andysalerno

·

AI & ML interests

None yet

Recent Activity

liked a model 3 days ago

cyankiwi/Qwen3.6-35B-A3B-AWQ-4bit

liked a model 3 days ago

QuantTrio/Qwen3.6-35B-A3B-AWQ

liked a model 4 days ago

Qwen/Qwen3.6-35B-A3B-FP8

View all activity

Organizations

upvoted a paper 3 months ago

WUSH: Near-Optimal Adaptive Transforms for LLM Quantization

Paper • 2512.00956 • Published Nov 30, 2025 • 23

upvoted a paper 4 months ago

Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Paper • 2512.20848 • Published Dec 23, 2025 • 42

upvoted a paper 11 months ago

Attention Is All You Need

Paper • 1706.03762 • Published Jun 12, 2017 • 121

upvoted an article 12 months ago

Article

Comparing sub 50GB Llama 4 Scout quants (KLD/Top P)

Apr 9, 2025

•

45

upvoted a collection over 1 year ago

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated May 5, 2025 • 305

upvoted a collection almost 2 years ago

📀 Dataset comparison models

1.8B models trained on 350BT to compare different pretraining datasets • 8 items • Updated Jun 12, 2024 • 42

upvoted 2 papers about 2 years ago

EasyQuant: An Efficient Data-free Quantization Algorithm for LLMs

Paper • 2403.02775 • Published Mar 5, 2024 • 13

Nemotron-4 15B Technical Report

Paper • 2402.16819 • Published Feb 26, 2024 • 46

upvoted a collection about 2 years ago

OpenMath

A collection of models and datasets introduced in "OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset" • 15 items • Updated about 2 hours ago • 46

upvoted a paper over 2 years ago

Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models

Paper • 2401.01335 • Published Jan 2, 2024 • 69

upvoted a collection over 2 years ago

Tulu V2 Suite

The set of models associated with the paper "Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2" • 19 items • Updated Dec 23, 2025 • 45