view article Article KV Caching Explained: Optimizing Transformer Inference Efficiency not-lain • Jan 30, 2025 • 325
view article Article LateOn-Code & ColGrep: LightOn unveils state-of-the-art code retrieval models and code search tooling lightonai • Feb 12 • 56
view article Article We Got Claude to Fine-Tune an Open Source LLM burtenshaw, evalstate • Dec 4, 2025 • 624
view article Article Building the Hugging Face MCP Server +2 evalstate, julien-c, coyotte508, abidlabs • Jul 10, 2025 • 67
view article Article NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks nvidia • Aug 11, 2025 • 76
view article Article Tiny Agents in Python: a MCP-powered agent in ~70 lines of code +2 celinah, julien-c, Wauplin, evalstate • May 23, 2025 • 172
AdaptThink: Reasoning Models Can Learn When to Think Paper • 2505.13417 • Published May 19, 2025 • 83
view article Article Tiny Agents: an MCP-powered agent in 50 lines of code julien-c • Apr 25, 2025 • 308
view article Article A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality +2 saurabhdash, olivernan, ArashAhmadian, johndang-cohere • Mar 4, 2025 • 78