13 9 132

Pritish Mishra

pritish

AI & ML interests

Machine Learning, Computer Vision, NLP, ODML, ML Ops

Recent Activity

liked a model 26 days ago

Qwen/Qwen3.5-397B-A17B

upvoted an article 27 days ago

KV Caching Explained: Optimizing Transformer Inference Efficiency

liked a model about 2 months ago

arcee-ai/Trinity-Large-Preview

View all activity

Organizations

None yet

upvoted an article 27 days ago

Article

KV Caching Explained: Optimizing Transformer Inference Efficiency

Jan 30, 2025

•

244

upvoted an article about 2 months ago

Article

Transformers v5: Simple model definitions powering the AI ecosystem

Dec 1, 2025

•

306

upvoted a collection 3 months ago

NVIDIA Nemotron v3

Collection

Open, Production-ready Enterprise Models • 12 items • Updated 3 days ago • 195

upvoted an article 6 months ago

Article

Welcome EmbeddingGemma, Google's new efficient embedding model

Sep 4, 2025

•

273

upvoted an article 8 months ago

Article

Say hello to `hf`: a faster, friendlier Hugging Face CLI ✨

Jul 25, 2025

•

upvoted an article 10 months ago

Article

You could have designed state of the art positional encoding

Nov 25, 2024

•

454

upvoted an article 12 months ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Mar 12, 2025

•

490

upvoted a paper about 1 year ago

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Paper • 2412.09596 • Published Dec 12, 2024 • 97

upvoted a paper over 1 year ago

Unveiling Encoder-Free Vision-Language Models

Paper • 2406.11832 • Published Jun 17, 2024 • 54

Pritish Mishra

AI & ML interests

Recent Activity

Organizations

pritish's activity

KV Caching Explained: Optimizing Transformer Inference Efficiency

Transformers v5: Simple model definitions powering the AI ecosystem

Welcome EmbeddingGemma, Google's new efficient embedding model

Say hello to `hf`: a faster, friendlier Hugging Face CLI ✨

You could have designed state of the art positional encoding

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM