39 36 21

Denis Kuznedelev

SpiridonSunRotator

https://github.com/Godofnothing

Godofnothing

AI & ML interests

Model compression, computer vision, NLP

Recent Activity

liked a model 23 days ago

AlexWortega/ml-intern-v4-100m-tinystories-20260512-1721

upvoted a paper 25 days ago

MatryoshkaLoRA: Learning Accurate Hierarchical Low-Rank Representations for LLM Fine-Tuning

upvoted a paper 2 months ago

Reasoning Shift: How Context Silently Shortens LLM Reasoning

View all activity

Organizations

liked a model 23 days ago

AlexWortega/ml-intern-v4-100m-tinystories-20260512-1721

Text Generation • 0.1B • Updated 25 days ago • 3.43k • 3

upvoted a paper 25 days ago

MatryoshkaLoRA: Learning Accurate Hierarchical Low-Rank Representations for LLM Fine-Tuning

Paper • 2605.07850 • Published 30 days ago • 18

upvoted a paper 2 months ago

Reasoning Shift: How Context Silently Shortens LLM Reasoning

Paper • 2604.01161 • Published Apr 1 • 32

upvoted an article 3 months ago

Article

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

•

Jan 27

• 76

upvoted a paper 3 months ago

LK Losses: Direct Acceptance Rate Optimization for Speculative Decoding

Paper • 2602.23881 • Published Feb 27 • 18

liked a dataset 3 months ago

ma-xu/fine-t2i

Viewer • Updated Feb 20 • 727k • 38.6k • 108

upvoted a paper 3 months ago

MatGPTQ: Accurate and Efficient Post-Training Matryoshka Quantization

Paper • 2602.03537 • Published Feb 3 • 5

liked a model 4 months ago

black-forest-labs/FLUX.2-klein-4B

Image-to-Image • Updated Feb 24 • 380k • • 705

New activity in Skywork/unipic_nano_2images 4 months ago

Fix of cat command

#2 opened 4 months ago by

SpiridonSunRotator

upvoted 2 papers 4 months ago

DASH: Faster Shampoo via Batched Block Preconditioning and Efficient Inverse-Root Solvers

Paper • 2602.02016 • Published Feb 2 • 13

Quartet II: Accurate LLM Pre-Training in NVFP4 by Improved Unbiased Gradient Estimation

Paper • 2601.22813 • Published Jan 30 • 62

liked a model 4 months ago

tencent/HunyuanImage-3.0-Instruct-Distil

Image-to-Image • 83B • Updated Feb 3 • 2.67k • 59

New activity in tencent/HunyuanImage-3.0-Instruct-Distil 4 months ago

OOM on 4 GPU

#3 opened 4 months ago by

SpiridonSunRotator

New activity in tencent/HunyuanImage-3.0-Instruct 4 months ago

cuBLAS error on image generation

#6 opened 4 months ago by

SpiridonSunRotator

New activity in tencent/HunyuanImage-3.0-Instruct-Distil 4 months ago

Issues with loading the model

#2 opened 4 months ago by

SpiridonSunRotator

updated a model 5 months ago

yresearch/Alice-AI-ART-dev

Text-to-Image • Updated Dec 30, 2025 • 1

published a model 5 months ago

yresearch/Alice-AI-ART-dev

Text-to-Image • Updated Dec 30, 2025 • 1

upvoted a paper 6 months ago

WUSH: Near-Optimal Adaptive Transforms for LLM Quantization

Paper • 2512.00956 • Published Nov 30, 2025 • 23

updated a model 7 months ago

ISTA-DASLab/Kimi-K2-Thinking-GPTQ-2b-32g-experts

1.1T • Updated Nov 20, 2025 • 2

upvoted a paper 7 months ago

TiDAR: Think in Diffusion, Talk in Autoregression

Paper • 2511.08923 • Published Nov 12, 2025 • 129

Denis Kuznedelev

AI & ML interests

Recent Activity

Organizations

SpiridonSunRotator's activity

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

Fix of cat command

OOM on 4 GPU

cuBLAS error on image generation

Issues with loading the model