SWE-bench

community

https://swe-bench.com

SWE-bench

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

john-b-yang authored a paper about 2 months ago

SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains?

john-b-yang authored a paper about 2 months ago

OpenThoughts: Data Recipes for Reasoning Models

john-b-yang authored a paper about 2 months ago

LongCodeBench: Evaluating Coding LLMs at 1M Context Windows

View all activity

Papers

SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains?

SWE-bench: Can Language Models Resolve Real-World GitHub Issues?

View all Papers

Organization Card

Community About org cards

SWE-bench

We are a team of researchers across Stanford University and Princeton University working on LMs and AI systems for software engineering.

In this organization, you will find the assets for several projects in the SWE-* research ecosystem, notably:

SWE-bench (Lite, Verified, Multimodal, Multilingual)
SWE-smith
SWE-agent

Collections 3

View 3 collections

models 3

datasets 17

SWE-bench/SWE-prime

Viewer • Updated Apr 13 • 1.36k • 82

SWE-bench/SWE-smith-cpp

Viewer • Updated Mar 9 • 5.12k • 327

SWE-bench/SWE-smith-ts

Viewer • Updated Feb 28 • 5.03k • 327

SWE-bench/SWE-bench_Verified

Benchmark • Updated Feb 27 • 500 • 68.3k • 99

SWE-bench/SWE-smith-java

Viewer • Updated Feb 12 • 7.47k • 411

SWE-bench/SWE-smith-rs

Viewer • Updated Feb 6 • 5.31k • 317 • 2

SWE-bench/SWE-smith-js

Viewer • Updated Jan 20 • 6.07k • 392

SWE-bench/SWE-smith-py

Viewer • Updated Dec 18, 2025 • 50.9k • 1.38k • 5

SWE-bench/SWE-smith-go

Viewer • Updated Dec 18, 2025 • 8.21k • 474

SWE-bench/SWE-smith-php

Viewer • Updated Dec 18, 2025 • 1 • 60

View 17 datasets

SWE-bench

AI & ML interests

Recent Activity

Papers

SWE-bench

Collections 3

SWE-bench/SWE-bench_Verified

SWE-bench/SWE-bench_Multilingual

SWE-bench/SWE-bench_Multimodal

SWE-bench/SWE-bench_Lite

SWE-bench/SWE-smith-py

SWE-bench/SWE-smith-go

SWE-bench/SWE-smith-rs

SWE-bench/SWE-smith-cpp

SWE-bench/SWE-bench_Verified

SWE-bench/SWE-bench_Multilingual

SWE-bench/SWE-bench_Multimodal

SWE-bench/SWE-bench_Lite

SWE-bench/SWE-smith-py

SWE-bench/SWE-smith-go

SWE-bench/SWE-smith-rs

SWE-bench/SWE-smith-cpp

models 3

SWE-bench/SWE-agent-LM-7B

SWE-bench/SWE-Rater-32B

SWE-bench/SWE-agent-LM-32B

datasets 17

SWE-bench/SWE-prime

SWE-bench/SWE-smith-cpp

SWE-bench/SWE-smith-ts

SWE-bench/SWE-bench_Verified

SWE-bench/SWE-smith-java

SWE-bench/SWE-smith-rs

SWE-bench/SWE-smith-js

SWE-bench/SWE-smith-py

SWE-bench/SWE-smith-go

SWE-bench/SWE-smith-php

AI & ML interests

Recent Activity

Papers

Team members 7

SWE-bench

Collections 3

models 3 Sort: Recently updated

datasets 17 Sort: Recently updated

models 3

datasets 17