Wen Z

scifish

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper 21 days ago

Scaling Embeddings Outperforms Scaling Experts in Language Models

Paper • 2601.21204 • Published 22 days ago • 99

liked a model 5 months ago

meituan-longcat/LongCat-Flash-Chat

Text Generation • 562B • Updated Sep 24, 2025 • 25.2k • 525

liked a Space 8 months ago

Predict Memory (nanotron/predict_memory)

🧮

106

Estimate GPU memory usage for transformer training

liked a Space 12 months ago

The Ultra-Scale Playbook

🌌

3.7k

The ultimate guide to training LLM on large GPU Clusters

liked a Space almost 2 years ago

Gradio Leaderboard Component

💻

Display and filter leaderboard data

liked a Space about 2 years ago

AnimeGANv2

⚡

1.35k

Generate anime-style portrait from your photo

liked a dataset over 2 years ago

bigcode/commitpack

Updated Feb 4, 2025 • 15.8k • 77

liked a model over 2 years ago

zai-org/chatglm2-6b

Updated Aug 4, 2024 • 424k • 2.06k

liked a dataset over 2 years ago

Salesforce/wikitext

Viewer • Updated Jan 4, 2024 • 3.71M • 851k • 634

liked 4 models over 2 years ago

liked 4 models almost 3 years ago

naonovn/Lora

Updated Jun 13, 2023 • 110

samle/sd-webui-models

Updated Nov 21, 2023 • 245

jomcs/NeverEnding_Dream-Feb19-2023

Updated Jun 27, 2023 • 207

joaoalvarenga/bloom-8bit

Text Generation • Updated Jul 14, 2022 • 31 • 75