j2's picture

j2

ej2

·

AI & ML interests

None yet

Recent Activity

upvoted a collection 22 days ago

liked a dataset 2 months ago

updated a model 4 months ago

ej2/Holmes_moe_history

View all activity

Organizations

None yet

upvoted a collection 22 days ago

DeepSeek-V4

4 items • Updated 22 days ago • 640

upvoted 3 articles 5 months ago

Article

Aligning to What? Rethinking Agent Generalization in MiniMax M2

MiniMax-AI

•

Oct 30, 2025

• 43

Article

What makes good reasoning data

MiniMax-AI

•

Oct 30, 2025

• 44

Article

Why Did MiniMax M2 End Up as a Full Attention Model?

MiniMax-AI

•

Oct 30, 2025

• 80

upvoted a collection 5 months ago

Qwen3

84 items • Updated Dec 31, 2025 • 1.78k

upvoted a collection 8 months ago

RWKV World v3 Corpus

RWKV World v3.0 Dataset for training RWKV-7 Goose World v3 models • 64 items • Updated Mar 9, 2025 • 3

upvoted a paper 12 months ago

3D Gaussian Splatting for Real-Time Radiance Field Rendering

Paper • 2308.04079 • Published Aug 8, 2023 • 202

upvoted an article 12 months ago

Article

Introduction to 3D Gaussian Splatting

dylanebert

•

Sep 18, 2023

• 137

upvoted 2 papers about 1 year ago

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

Paper • 2501.18585 • Published Jan 30, 2025 • 61

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published Dec 6, 2024 • 161

upvoted an article about 1 year ago

Article

Fine-tuning Llama 2 70B using PyTorch FSDP

+2

smangrul, sgugger, lewtun, philschmid

•

Sep 13, 2023

• 32

upvoted a collection over 1 year ago

"Physics of Language Models" series

7 items • Updated Dec 22, 2025 • 53