RL - a mapuna Collection

Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
- Website
- Community
- Solutions
Log In
Sign Up

mapuna 's Collections

RL

updated 21 days ago

Inverse Reinforcement Learning Meets Large Language Model Post-Training: Basics, Advances, and Opportunities

Paper • 2507.13158 • Published Jul 17, 2025 • 24
DataChef: Cooking Up Optimal Data Recipes for LLM Adaptation via Reinforcement Learning

Paper • 2602.11089 • Published Feb 11 • 18
Experiential Reinforcement Learning

Paper • 2602.13949 • Published Feb 15 • 75
Self-Distilled Agentic Reinforcement Learning

Paper • 2605.15155 • Published 26 days ago • 111

Collection guide
Browse collections

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs