Fine Tuning Datasets
ethanolivertroy/nist-cybersecurity-training • Viewer • Updated Oct 22, 2025 • 531k • 1.33k • 52
darkknight25/Vulnerable_Programming_Dataset • Updated May 24, 2025 • 75 • 1
WNT3D/Ultimate-Offensive-Red-Team • Viewer • Updated Aug 23, 2025 • 25.6k • 506 • 146
yevzh1/Ultimate-Offensive-Red-Team • Viewer • Updated Jan 4 • 25.6k • 69
Bench Datasets
Idavidrein/gpqa • Benchmark • Updated Mar 5 • 1.25k • 123k • 455
openai/gsm8k • Benchmark • Updated Mar 23 • 17.6k • 919k • 1.37k
princeton-nlp/SWE-bench_Verified • Viewer • Updated Feb 18, 2025 • 500 • 940k • 353
ScaleAI/SWE-bench_Pro • Benchmark • Updated Feb 23 • 731 • 68.2k • 123
Gym
ServiceNow-AI/EnterpriseOps-Gym • Viewer • Updated Apr 30 • 2.56k • 6.71k • 89
allenai/MolmoWeb-HumanSkills • Viewer • Updated Apr 13 • 116k • 1.64k • 14
allenai/MolmoWeb-SyntheticSkills • Viewer • Updated Apr 13 • 5.55k • 312 • 7
allenai/MolmoWeb-SyntheticTrajs • Viewer • Updated Apr 10 • 108k • 1.42k • 10