Terminal Bench measures real-world engineering capability, not cherry-picked demos.
We build agents that ship
Focused on reliability, not demos. Open research, pragmatic engineering, and a ruthless bar for real-world performance.
Open by default
We publish research and interfaces so teams can build on top with confidence.
Terminal-native
Lives where engineers work. Understands large repos and executes end-to-end tasks.
Measured results
Benchmarked on real engineering suites—not cherry-picked demos.
our mission
OpenBlock decentralizes the means of production.
We believe the means of software production should be accessible to everyone, not just a few large labs. Our mission is to provide agents that empower individuals and teams to create and innovate in software development. By sharing knowledge and resources, we aim to make software production a collaborative effort open to all.