10 16

Artemiy

3fgson

lskog7

AI & ML interests

Recent Activity

liked a model 9 days ago

ai-sage/GigaChat3.1-702B-A36B

upvoted a paper 26 days ago

Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders

liked a model 3 months ago

apple/FastVLM-7B

View all activity

Organizations

None yet

liked a model 9 days ago

ai-sage/GigaChat3.1-702B-A36B

Text Generation • 715B • Updated 8 days ago • 2.02k • 24

liked a model 3 months ago

apple/FastVLM-7B

Text Generation • 8B • Updated Sep 3, 2025 • 1.82k • 270

liked 3 models 5 months ago

liked a dataset 5 months ago

HuggingFaceTB/training-guide-nanotron-configs

Viewer • Updated Dec 22, 2025 • 2 • 129 • 9

liked a Space 5 months ago

The Smol Training Playbook

📚

3.1k

The secrets to building world-class LLMs

liked 2 models 5 months ago

zai-org/GLM-4.5

Text Generation • 358B • Updated Aug 11, 2025 • 67.5k • • 1.4k

MiniMaxAI/MiniMax-M2

Text Generation • 229B • Updated Dec 23, 2025 • 56.9k • • 1.49k

liked a Space 5 months ago

FineVision: Open Data is All You Need

📝

221

A new open-source dataset for training VLMs

liked 3 models 7 months ago

bird-of-paradise/deepseek-mla

Text Generation • Updated Feb 27, 2025 • 19

ai-sage/GigaChat-20B-A3B-instruct

Text Generation • 21B • Updated Jun 25, 2025 • 426 • 50

ai-forever/FRIDA

Feature Extraction • 0.8B • Updated May 26, 2025 • 551k • • 135

liked a Space 7 months ago

The Ultra-Scale Playbook

🌌

3.77k

The ultimate guide to training LLM on large GPU Clusters

liked a dataset 8 months ago

nebius/SWE-rebench

Viewer • Updated Dec 23, 2025 • 27.9k • 61.4k • 59

liked a Space about 1 year ago

FineWeb: decanting the web for the finest text data at scale

🍷

1.33k

Read a detailed overview of the FineWeb web‑scale text dataset