arxiv:2602.05711
Loser Cheems
JingzeShi
AI & ML interests
I like training small languge models.
Recent Activity
liked a model 23 days ago
BAAI/OpenSeek-Mid-v1 updated a model about 1 month ago
JingzeShi/flash-sparse-attention published a model about 1 month ago
JingzeShi/flash-sparse-attention