Hugging Face

Xiaoyang Cao

Sean13
https://xiaoyangcao1113.github.io/
  • XiaoyangCao1113
  • xiaoyangcao

AI & ML interests

RLHF, Deep Reinforcement Learning

Recent Activity

updated a model 10 days ago
Sean13/repo-best-llama-re-dpo
published a model 10 days ago
Sean13/repo-best-llama-re-dpo
updated a model 10 days ago
Sean13/repo-best-llama-dpo

Organizations

None yet

Sean13's models (66)

Sean13/mistral-7b-instruct-v0.2-rsimpo-full

Text Generation • 7B • Updated Sep 6, 2025 • 4

Sean13/mistral-7b-instruct-v0.2-ipo-full

Text Generation • 7B • Updated Aug 19, 2025 • 1

Sean13/mistral-7b-instruct-v0.2-slic_hf-full

Text Generation • 7B • Updated Aug 11, 2025

Sean13/mistral-7b-instruct-v0.2-rslic_hf-full

Updated Aug 8, 2025

Sean13/mistral-7b-instruct-v0.2-ripo-full

Text Generation • 7B • Updated Aug 3, 2025 • 4

Sean13/mistral-7b-instruct-v0.2-emdpo-full

7B • Updated Jul 24, 2025