Machine Learning, Platform Engineer
Togetherai
San Francisco 30d ago
<h3><strong>About the Role</strong></h3>
<p>Our team focuses on enabling custom models and dedicated inference on Together. We are responsible for building a container platform, optimizing autoscaling, minimizing cold starts, achieving the best end-to-end model performance, and providing a best-in-class developer experience with great tooling. We often focus on video or audio generation across the stack: CUDA kernels, PyTorch optimization, inference engines, container orchestration, queueing theory, etc. An ideal candidate will be great at profiling/optimization but know the word Kubernetes, or be intimately familiar with multi-cluster scheduling and have some sense of ML bottlenecks.</p>
<h3>Responsibilities</h3>
<ul>
<li>New hires may work on multi-cluster orchestration, portfolio optimization, predictive autoscaling, control planes, model bring-up, model optimization, APIs for managing deployments, inference worker SDKs, and