GSQ: Highly-Accurate Low-Precision Scalar Quantization for LLMs via Gumbel-Softmax Sampling Paper • 2604.18556 • Published Apr 20 • 7
Nonsense Helps: Prompt Space Perturbation Broadens Reasoning Exploration Paper • 2605.05566 • Published 16 days ago • 37
Echoes as Anchors: Probabilistic Costs and Attention Refocusing in LLM Reasoning Paper • 2602.06600 • Published Feb 6 • 3
PowerInfer-2: Fast Large Language Model Inference on a Smartphone Paper • 2406.06282 • Published Jun 10, 2024 • 40
ResRL: Boosting LLM Reasoning via Negative Sample Projection Residual Reinforcement Learning Paper • 2605.00380 • Published 22 days ago • 7
EMO: Pretraining Mixture of Experts for Emergent Modularity Paper • 2605.06663 • Published 16 days ago • 12
Evaluating the Progression of Large Language Model Capabilities for Small-Molecule Drug Design Paper • 2604.16279 • Published Apr 17 • 1
lablab-ai-amd-developer-hackathon/CyberSecQwen-4B Text Generation • 4B • Updated 15 days ago • 736 • 11
Article CyberSecQwen-4B: Why Defensive Cyber Needs Small, Specialized, Locally-Runnable Models lablab-ai-amd-developer-hackathon • 14 days ago • 8
Article EMO: Pretraining mixture of experts for emergent modularity allenai • 15 days ago • 38
ÜberWeb: Insights from Multilingual Curation for a 20-Trillion-Token Dataset Paper • 2602.15210 • Published Feb 25 • 1
Kakugo: Distillation of Low-Resource Languages into Small Language Models Paper • 2601.14051 • Published Jan 20 • 1