Software Engineer, Productivity - Inference Runtime

OpenAI

San Francisco | Full-time | Posted yesterday

About the role

ABOUT THE TEAM

We’re hiring a Developer Productivity Engineer to support OpenAI’s Inference Runtime teams. These teams own the systems responsible for serving models reliably, efficiently, and safely across Codex, ChatGPT, API, and internal research workloads. In this role, you’ll help scale the engineering systems, safeguards, and developer workflows that enable our teams to move quickly without compromising reliability or performance.

This role sits at the intersection of developer experience, CI/CD infrastructure, release engineering, production readiness, and inference systems reliability. You’ll work on the tooling and operational foundations that support model launches, inference optimizations, cloud provider integrations, and large-scale deployments across a rapidly evolving inference stack.

ABOUT THE ROLE

We’re looking for an autonomous, high-ownership engineer who cares deeply about making other engineers faster, safer, and more confident.