SWE-bench-Live Leaderboard

Evaluating your AI system on latest software engineering tasks.

Loading... methods evaluated on Lite set
# Model % Resolved % Applied % Loc Success Date

Loading leaderboard data...

Select a time range and method to analyze solved/unsolved instances

Jan 2024 Apr 2025

Submit your results

We coordinate results submission via Pull Requests, see SWE-bench-Live/submissions for instructions.

Acknowledgement

SWE-bench-Live is built upon the foundation of SWE-bench. We extend our gratitude to the original SWE-bench team for their pioneering work in software engineering evaluation benchmarks.

Citation

If you use SWE-bench-Live in your research, please cite:

@article{zhang2025swebenchgoeslive,
  title={SWE-bench Goes Live!},
  author={Linghao Zhang and Shilin He and Chaoyun Zhang and Yu Kang and Bowen Li and Chengxing Xie and Junhao Wang and Maoquan Wang and Yufan Huang and Shengyu Fu and Elsie Nallipogu and Qingwei Lin and Yingnong Dang and Saravan Rajmohan and Dongmei Zhang},
  journal={arXiv preprint arXiv:2505.23419},
  year={2025}
}