M Saad Salman

MSS444

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 hour ago

FireRed-OCR Technical Report

upvoted a paper about 1 hour ago

Tool-R0: Self-Evolving LLM Agents for Tool-Learning from Zero Data

upvoted a paper about 1 hour ago

Efficient RLVR Training via Weighted Mutual Information Data Selection

Organizations

None yet

upvoted 11 papers about 1 hour ago

Code2Math: Can Your Code Agent Effectively Evolve Math Problems Through Exploration?

Paper • 2603.03202 • Published 1 day ago • 5

Surgical Post-Training: Cutting Errors, Keeping Knowledge

Paper • 2603.01683 • Published 3 days ago • 9

How Controllable Are Large Language Models? A Unified Evaluation across Behavioral Granularities

Paper • 2603.02578 • Published 2 days ago • 21

Qwen3-Coder-Next Technical Report

Paper • 2603.00729 • Published 4 days ago • 23

BeyondSWE: Can Current Code Agent Survive Beyond Single-Repo Bug Fixing?

Paper • 2603.03194 • Published 1 day ago • 48

Beyond Language Modeling: An Exploration of Multimodal Pretraining

Paper • 2603.03276 • Published 1 day ago • 53

upvoted 3 papers 2 days ago

Agentic Code Reasoning

Paper • 2603.01896 • Published 3 days ago • 9

Learn Hard Problems During RL with Reference Guided Fine-tuning

Paper • 2603.01223 • Published 3 days ago • 12

Tool Verification for Test-Time Reinforcement Learning

Paper • 2603.02203 • Published 2 days ago • 4

upvoted 2 papers 3 days ago

Exploratory Memory-Augmented LLM Agent via Hybrid On- and Off-Policy Optimization

Paper • 2602.23008 • Published 7 days ago • 34

CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation

Paper • 2602.24286 • Published 5 days ago • 70

upvoted 4 papers 10 days ago

Discovering Multiagent Learning Algorithms with Large Language Models

Paper • 2602.16928 • Published 14 days ago • 16

"What Are You Doing?": Effects of Intermediate Feedback from Agentic LLM In-Car Assistants During Multi-Step Processing

Paper • 2602.15569 • Published 16 days ago • 13

Arcee Trinity Large Technical Report

Paper • 2602.17004 • Published 14 days ago • 17

Unified Latents (UL): How to train your latents

Paper • 2602.17270 • Published 14 days ago • 57
