tencent/Hunyuan-MT-7B-fp8
Translation • 8B • Updated • 1.72k • 33
None defined yet.
HRBench: Benchmarking and Understanding Thinking-Mode Switch Strategies in Hybrid-Reasoning LLMs
Efficient Agentic Reinforcement Learning with On-Policy Intrinsic Knowledge Boundary Enhancement