SWE Paper List
Paper List:
- SWE-bench Goes Live! arXiv:2505.23419
- SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks arXiv:2506.10954
- SWE-rebench V2: Language-Agnostic SWE Task Collection at Scale arXiv:2602.23866
- SWE-Universe: Scale Real-World Verifiable Environments to Millions arXiv:2602.02361
- DockSmith: Scaling Reliable Coding Environments via an Agentic Docker Builder arXiv:2602.00592
- Multi-Docker-Eval: A ‘Shovel of the Gold Rush’ Benchmark on Automatic Environment Building for Software Engineering arXiv:2512.06915
- Scaling Agentic Verifier for Competitive Coding arXiv:2602.04254
- SWE-MiniSandbox: Container-Free Reinforcement Learning for Building Software Engineering Agents arXiv:2602.04254
- Immersion in the GitHub Universe: Scaling Coding Agents to Mastery arXiv:2602.09892
SWE-bench Goes Live!
SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks
SWE-rebench V2: Language-Agnostic SWE Task Collection at Scale
SWE-Universe: Scale Real-World Verifiable Environments to Millions
DockSmith: Scaling Reliable Coding Environments via an Agentic Docker Builder
Multi-Docker-Eval: A ‘Shovel of the Gold Rush’ Benchmark on Automatic Environment Building for Software Engineering
Scaling Agentic Verifier for Competitive Coding
SWE-MiniSandbox: Container-Free Reinforcement Learning for Building Software Engineering Agents
Immersion in the GitHub Universe: Scaling Coding Agents to Mastery
SWE Paper List