
Masoud Hashemi

masoudhashemi


Recent Activity

liked a Space 21 days ago

aminediroHF/trainer-generator-bf16-mismatch

liked a Space 27 days ago

AdithyaSK/rl-environments-guide

upvoted an article about 2 months ago

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries


liked a Space 21 days ago

Defeating the trainer-generator precision mismatch in TRL

🎯

Download research PDF (Pro access required)

liked a Space 27 days ago

The ultimate guide to RL environments: building and scaling them in the LLM era

📝

175

Building and scaling RL environments for LLM training

upvoted an article about 2 months ago

Article

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego • Mar 10 • 159

upvoted 2 papers 2 months ago

Terminal Agents Suffice for Enterprise Automation

Paper • 2604.00073 • Published Mar 31 • 96

CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents

Paper • 2603.24440 • Published Mar 25 • 98

upvoted an article 2 months ago

Article

A New Framework for Evaluating Voice Agents (EVA)

ServiceNow-AI • Mar 24 • 95

upvoted a paper 3 months ago

EnterpriseOps-Gym: Environments and Evaluations for Stateful Agentic Planning and Tool Use in Enterprise Settings

Paper • 2603.13594 • Published Mar 13 • 149

liked a model 5 months ago

LLM360/K2-V2

Updated Jan 26 • 123 • 32

liked a Space 5 months ago

AI Deadlines

⚡

761

Track upcoming AI conference deadlines in one place

liked a dataset 6 months ago

nvidia/Nemotron-Agentic-v1

Preview • Updated Dec 15, 2025 • 3.7k • 166

liked a Space 6 months ago

Apriel Chat

💬

ServiceNow-AI model chat

published an article 6 months ago

Article

Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance

ServiceNow-AI • Dec 9, 2025 • 84

liked a model 6 months ago

ServiceNow-AI/Apriel-1.6-15b-Thinker

Image-Text-to-Text • 15B • Updated Dec 22, 2025 • 547 • 300

liked a dataset 6 months ago

open-thoughts/OpenThoughts-Agent-v1-SFT

Viewer • Updated Jan 27 • 15.2k • 2.64k • 93

upvoted an article 6 months ago

Article

Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance

ServiceNow-AI • Dec 9, 2025 • 84

updated a model 6 months ago

ServiceNow-AI/Apriel-1.6-15b-Thinker

Image-Text-to-Text • 15B • Updated Dec 22, 2025 • 547 • 300

upvoted a paper 8 months ago

Apriel-Nemotron-15B-Thinker

Paper • 2508.10948 • Published Aug 13, 2025 • 6

upvoted an article 8 months ago

Article

Smol2Operator: Post-Training GUI Agents for Computer Use

A-Mahla, merve, sergiopaniego, reach-vb, lewtun • Sep 23, 2025 • 138

upvoted a collection 8 months ago

Apriel-1.5-15B-Thinker

Collection

3 items • Updated Oct 2, 2025 • 76

liked a Space 8 months ago

DNR-Bench

⚡

DNR-Bench leaderboard for RLMs
