rlvr-weak-supervision Collection Models from "When Can LLMs Learn to Reason with Weak Supervision?" — Llama-3.2-3B with continual pre-training and Thinking SFT. • 3 items • Updated 27 days ago • 2
rlvr-weak-supervision Collection Models from "When Can LLMs Learn to Reason with Weak Supervision?" — Llama-3.2-3B with continual pre-training and Thinking SFT. • 3 items • Updated 27 days ago • 2
CoDaS: AI Co-Data-Scientist for Biomarker Discovery via Wearable Sensors Paper • 2604.14615 • Published Apr 16 • 8