jaycehaha
jaycehaha
ยท
AI & ML interests
AGI, reinforcement learning, foundation model, causal inference
Recent Activity
upvoted a paper 7 days ago
On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models liked a model about 1 month ago
cyankiwi/Qwen3-Omni-30B-A3B-Instruct-AWQ-4bit liked a dataset 7 months ago
qihoo360/Light-R1-SFTDataOrganizations
None yet