Edit Models filters
Apps
Inference Providers
Active filters: multi-agent
36n9/Vehuiah-Draco-20260425_054202
36n9/Vehuiah-Draco-20260425_054238
36n9/Vehuiah-Draco-20260425_054312
36n9/Vehuiah-Draco-20260425_054347
36n9/Vehuiah-Draco-20260425_054423
36n9/Vehuiah-Draco-20260425_054459
pragunk/PropagationShield
Text Generation • 8B • Updated • 462
Bharath-1608/negotiation-agent-grpo
Updated
ujjwalpardeshi/chakravyuh-analyzer-lora-v2
Text Generation • Updated • 37
Steven668866/qwen3-8b-grpo-teaching-phase1
Updated • 26
Timusgeorge/SynthAudit-Qwen2.5-3B-GRPO
Text Generation • Updated • 65
Bharavi/rpoe-x-qwen-0.5b-grpo
Reinforcement Learning • 0.5B • Updated • 24
M134pra/neon-syndicate-qwen25-sft
Text Generation • 0.5B • Updated • 487
srikrish2004/sentinel-qwen3-4b-grpo
Text Generation • Updated • 69
IshikaMahadar/hiring-fleet-grpo-adapter
Text Generation • Updated • 26
garvitsachdeva/spindleflow-rl
Reinforcement Learning • Updated • 162
Prathamesh0292/market-rl-stage1
Reinforcement Learning • Updated
helloAK96/chaosops-grpo-lora
Text Generation • Updated • 80
kartikraut09/ecocloud-grpo-qwen
Text Generation • 0.5B • Updated • 332
helloAK96/chaosops-grpo-lora-p2
Text Generation • Updated • 75
OnurDemircioglu/OmniGPT-355M-Instruct
0.4B • Updated • 137 • 1
132ragini/triage-wars-llm
Reinforcement Learning • Updated
helloAK96/chaosops-grpo-lora-p3a
Text Generation • Updated • 84
RavichandraNayakar/openenv-grpo-merged
Reinforcement Learning • 8B • Updated • 85
balarajr/triage-hospital-agent
Text Generation • 4B • Updated • 181
nothr/boardroom-grpo-lora-L2-best
Text Generation • Updated • 85
coliseum034/coliseum-defender-grpo-live
Reinforcement Learning • Updated • 44
hirann/immunoorg2-grpo-0.5b
Updated
balarajr/triage-qwen2.5-7b-grpo
Text Generation • 4B • Updated • 257