1 6

Sharath Turuvekere Sreenivas

sharathts

AI & ML interests

Learning algorithms, LLM efficiency: Knowledege distillation and compression.

Recent Activity

upvoted an article 6 days ago

Nemotron 3 Nano 4B: A Compact Hybrid Model for Efficient Local AI

published an article 6 days ago

Nemotron 3 Nano 4B: A Compact Hybrid Model for Efficient Local AI

upvoted a paper 4 months ago

Nemotron Elastic: Towards Efficient Many-in-One Reasoning LLMs

View all activity

Organizations

upvoted an article 6 days ago

Article

Nemotron 3 Nano 4B: A Compact Hybrid Model for Efficient Local AI

6 days ago

•

published an article 6 days ago

Article

Nemotron 3 Nano 4B: A Compact Hybrid Model for Efficient Local AI

6 days ago

•

upvoted a paper 4 months ago

Nemotron Elastic: Towards Efficient Many-in-One Reasoning LLMs

Paper • 2511.16664 • Published Nov 20, 2025 • 29

upvoted a paper 7 months ago

NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model

Paper • 2508.14444 • Published Aug 20, 2025 • 46

published an article 7 months ago

Article

Supercharge Edge AI With High‑Accuracy Reasoning Using NVIDIA Nemotron Nano 2 9B

Aug 18, 2025

•

updated 3 models 7 months ago

published 2 models 7 months ago

nvidia/NVIDIA-Nemotron-Nano-12B-v2-Base

Text Generation • Updated Nov 4, 2025 • 2.19k • 89

nvidia/NVIDIA-Nemotron-Nano-9B-v2-Base

Text Generation • 9B • Updated Nov 4, 2025 • 172k • 43

upvoted a collection 11 months ago

Nemotron-H

Collection

Mamba-Transformer hybrid models • 10 items • Updated about 4 hours ago • 32

authored 2 papers 11 months ago

LLM Pruning and Distillation in Practice: The Minitron Approach

Paper • 2408.11796 • Published Aug 21, 2024 • 58

Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models

Paper • 2504.03624 • Published Apr 4, 2025 • 17

New activity in nvidia/Llama-3.1-Minitron-4B-Width-Base over 1 year ago

Teacher correction training hyperparameters

#13 opened over 1 year ago by

hjlee1371

upvoted a paper over 1 year ago

LLM Pruning and Distillation in Practice: The Minitron Approach

Paper • 2408.11796 • Published Aug 21, 2024 • 58

authored a paper over 1 year ago

Compact Language Models via Pruning and Knowledge Distillation

Paper • 2407.14679 • Published Jul 19, 2024 • 39

upvoted a paper over 1 year ago

Compact Language Models via Pruning and Knowledge Distillation

Paper • 2407.14679 • Published Jul 19, 2024 • 39

Sharath Turuvekere Sreenivas

AI & ML interests

Recent Activity

Organizations

sharathts's activity

Nemotron 3 Nano 4B: A Compact Hybrid Model for Efficient Local AI

Nemotron 3 Nano 4B: A Compact Hybrid Model for Efficient Local AI

Supercharge Edge AI With High‑Accuracy Reasoning Using NVIDIA Nemotron Nano 2 9B

Teacher correction training hyperparameters