Engineering Manager, Runtime Fabric
Baseten
San FranciscoFull-time6d ago
About the role
ABOUT BASETEN
Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. We're growing quickly and recently raised our $300M Series E https://www.baseten.co/blog/announcing-baseten-s-300m-series-e/, backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction. Join us and help build the platform engineers turn to to ship AI products.
THE ROLE
Container runtimes were designed for general-purpose software workloads. AI inference is not a general-purpose workload.
Running large models at production scale exposes cracks in every layer of the container stack: runtimes unaware of GPU memory constraints, images that take minutes to pull when a model needs to scale to thousands of replicas, a
More at Baseten
- Software Engineer - CapacitySan Francisco
- Product Manager, Developer ExperienceSan Francisco
- Technical Program Manager, InfrastructureSan Francisco
- Strategic Finance Associate / Sr. AssociateSan Francisco
- Assistant General Counsel, Infrastructure & ComputeSan Francisco
- Head of Legal OperationsSan Francisco