lablab-ai-amd-developer-hackathon/Qwen-security-builder-14b Text Generation • 15B • Updated 22 days ago • 313 • 1
IDEALLab/Qwen2.5-Coder-14B-Instruct-GRPO-SDS-Hero-seed101 Reinforcement Learning • 15B • Updated 19 days ago • 15
IDEALLab/Qwen2.5-Coder-14B-Instruct-GRPO-SDS-Hero-seed202 Reinforcement Learning • 15B • Updated 19 days ago • 13
IDEALLab/Qwen2.5-Coder-14B-Instruct-GRPO-SDS-Hero-seed303 Reinforcement Learning • 15B • Updated 19 days ago • 11
IDEALLab/Qwen2.5-Coder-14B-Instruct-GRPO-SDS-Ablation-Oracle-seed101 Reinforcement Learning • 15B • Updated 19 days ago • 14
IDEALLab/Qwen2.5-Coder-14B-Instruct-GRPO-SDS-Ablation-Oracle-seed202 Reinforcement Learning • 15B • Updated 19 days ago • 14
IDEALLab/Qwen2.5-Coder-14B-Instruct-GRPO-SDS-Ablation-Oracle-seed303 Reinforcement Learning • 15B • Updated 19 days ago • 14
IDEALLab/Qwen2.5-Coder-14B-Instruct-GRPO-SDS-Ablation-Diversity-seed101 Reinforcement Learning • 15B • Updated 19 days ago • 12