Software Engineer, Productivity - Inference Runtime
Openai
San FranciscoFull-timeYesterday
Looking for more like this? See all Software Engineer jobs.
About the role
ABOUT THE TEAM
We’re hiring a Developer Productivity engineer to support OpenAI’s Inference Runtime teams. These teams own the systems responsible for serving models reliably, efficiently, and safely across Codex, ChatGPT, API, and internal research workloads. We’re hiring a Developer Productivity Engineer to help scale the engineering systems, safeguards, and developer workflows that enable our teams to move quickly without compromising reliability or performance.
This role sits at the intersection of developer experience, CI/CD infrastructure, release engineering, production readiness, and inference systems reliability. You’ll work on the tooling and operational foundations that support model launches, inference optimizations, cloud provider integrations, and large-scale deployments across a rapidly evolving inference stack.
ABOUT THE ROLE
We’re looking for an autonomous, high-ownership engineer who cares deeply about making other engineers faster, safer, and more confident.
A
More at Openai
- Audio Software Engineer, Consumer DevicesSan Francisco
- Procurement Enablement LeadSan Francisco
- Head of GTM Business Operations & Strategy, Safety & SecuritySan Francisco
- People Data ScientistSan Francisco
- Design Verification, Forward Deployed EngineeringSan Francisco
- Senior Leasing & Strategy Manager, Real Estate & WorkplaceSan Francisco