OpenGVLab/InternVL3_5-GPT-OSS-20B-A4B-Preview Image-Text-to-Text • 0.4B • Updated Aug 29, 2025 • 36.5k • 83
The Smol Training Playbook 📚 Space The secrets to building world-class LLMs • Running on CPU Upgrade • Featured • 3.01k
pplx-embed Collection Diffusion-LM for Dense and Contextual Retrieval • 7 items • Updated 13 days ago • 26
MoEs Collection This collection holds the list of all the first MoE releases from research labs. • 14 items • Updated 22 days ago • 4
The Ultra-Scale Playbook 🌌 Space The ultimate guide to training LLM on large GPU Clusters • Running • 3.71k