Skip to content

Engineering Manager (AI Inference)

Perplexity

San FranciscoFull-time18d ago

About the role

ABOUT THE ROLE We are looking for an Inference Engineering Manager to lead our AI Inference team. This is a unique opportunity to build and scale the infrastructure that powers Perplexity's products and APIs, serving millions of users with state-of-the-art AI capabilities. You will own the technical direction and execution of our inference systems while building and leading a world-class team of inference engineers. Our current stack includes Python, PyTorch, Rust, C++, and Kubernetes. You will help architect and scale the large-scale deployment of machine learning models behind Perplexity's Comet, Sonar, Search, Deep Research products. WHY PERPLEXITY? - Build SOTA systems that are the fastest in the industry with cutting-edge technology - High-impact work on a smaller team with significant ownership and autonomy - Opportunity to build 0-to-1 infrastructure from scratch rather than maintaining legacy systems - Work on the full spectrum: reducing cost, scaling traffic, and p

More at Perplexity