Submitted by
Wei Xiong
AI & ML interests
Workflow of Reinforcement Learning from Human Feedback (RLHF). Blog: https://rlhflow.github.io/
Recent Activity
View all activity
Workflow of Reinforcement Learning from Human Feedback (RLHF). Blog: https://rlhflow.github.io/