SWE-bench
SWE-bench
Organization for maintaining the SWE-bench/agent projects
Pinned Loading
Repositories
Showing 6 of 6 repositories
- experiments Public
Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task.
SWE-bench/experiments’s past year of commit activity