yujunzhou/SFT_Advanced_Risk_Self_Grading_Qwen3-4B • Text Generation • 4B • Updated • 5
yujunzhou/SFT_Advanced_Risk_Self_Grading_llama • Text Generation • 8B • Updated • 5
yujunzhou/SFT_Advanced_Risk_Self_Grading_Qwen3-4B-Base • Text Generation • 4B • Updated • 3
yujunzhou/SFT_Advanced_Risk_Reward_Tampering_Qwen3-4B • Text Generation • 4B • Updated • 11
yujunzhou/Advanced_Risk_Self_Grading_llama • 8B • Updated
yujunzhou/SFT_Advanced_Risk_Reward_Tampering_Qwen3-4B-Base • Text Generation • 4B • Updated • 6
yujunzhou/SFT_Advanced_Risk_Reward_Tampering_llama • Text Generation • 8B • Updated • 1
yujunzhou/SFT_Advanced_Risk_Situation_Aware_Qwen3-4B-Base • Text Generation • 4B • Updated • 5
yujunzhou/SFT_Advanced_Risk_Situation_Aware_Qwen3-4B • Text Generation • 4B • Updated • 1
yujunzhou/SFT_Advanced_Risk_Situation_Aware_llama • Text Generation • 8B • Updated • 7
yujunzhou/SFT_Advanced_Risk_Summarization_Qwen3-4B-Base • Text Generation • 4B • Updated • 2
yujunzhou/SFT_Advanced_Risk_Summarization_Qwen3-4B • Text Generation • 4B • Updated • 4
yujunzhou/SFT_Advanced_Risk_Summarization_llama • Text Generation • 8B • Updated • 2
yujunzhou/Advanced_Risk_Situation_Aware_llama
yujunzhou/Advanced_Risk_Situation_Aware_Qwen3-4B-Base • 4B • Updated • 9
yujunzhou/MATH-TTT-OctoThinker-8B-Hybrid-Base-TTRL • 8B • Updated • 1
yujunzhou/Math-Train-Self-Consistency-Qwen3-4B-Base • 4B • Updated
yujunzhou/MATH-TTT-OctoThinker-8B-Hybrid-Base-Semantic-ClipHigh-Ent0.001 • 8B • Updated • 2
yujunzhou/Math-Train-EM-RL-Token-Qwen3-4B-Base • 4B • Updated
yujunzhou/Math-Train-EM-RL-Sequence-Qwen3-4B-Base • 4B • Updated
yujunzhou/Advanced_Risk_Advanced_Risk_Self_Grading_llama_situation_aware • 8B • Updated
yujunzhou/Advanced_Risk_Advanced_Risk_Self_Grading_Qwen3-4B_situation_aware • 4B • Updated
yujunzhou/Advanced_Risk_Advanced_Risk_Self_Grading_Qwen3-4B-Base_situation_aware • 4B • Updated
yujunzhou/Advanced_Risk_Advanced_Risk_Summarization_llama_situation_aware • 8B • Updated
yujunzhou/MATH-TTT-OctoThinker-8B-Hybrid-Base-TTRL-MATH_TRAIN • Updated
yujunzhou/Advanced_Risk_Advanced_Risk_Reward_Tampering_llama_situation_aware • 8B • Updated
yujunzhou/Advanced_Risk_Advanced_Risk_Summarization_Qwen3-4B_situation_aware • 4B • Updated
yujunzhou/Advanced_Risk_Advanced_Risk_Reward_Tampering_Qwen3-4B_situation_aware • 4B • Updated
yujunzhou/Advanced_Risk_Advanced_Risk_Reward_Tampering_Qwen3-4B-Base_situation_aware • 4B • Updated
yujunzhou/Advanced_Risk_Advanced_Risk_Summarization_Qwen3-4B-Base_situation_aware • 4B • Updated