Sukesh Perla
hitchhiker3010
10 followers · 34 following
sukesh-perla
AI & ML interests
None yet
Recent Activity
upvoted a collection 4 days ago: Environment Hub
reacted to sergiopaniego's post with 🔥 4 days ago
New TRL + OpenEnv example! 🔥 Fine-tune an LLM to play Sudoku using an RL env via OpenEnv. Includes a script that runs on 1 or multiple GPUs with vLLM, plus a Colab-ready notebook. Enjoy!
Notebook: https://colab.research.google.com/github/huggingface/trl/blob/main/examples/notebooks/openenv_sudoku_grpo.ipynb
Script: https://github.com/huggingface/trl/blob/main/examples/scripts/openenv/sudoku.py
upvoted an article 6 days ago: Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective
Organizations
hitchhiker3010's Spaces (2)
Token Visualizer (status: Sleeping)
Visualize tokens from text using a tokenizer
Quickdraw (status: Runtime error)