AI Infrastructure Engineer
Togetherai
San Francisco$190k – $270k6d ago
Looking for more like this? See all DevOps Engineer jobs.
About the role
About the Role
As an AI Infrastructure Engineer at Together, you are responsible for keeping all user-facing services and production systems running smoothly. You are a blend of a pragmatic operator and a software engineer that applies sound engineering principles, operational discipline, and mature automation to our operating environments and codebase.
You specialize in systems (operating systems, storage subsystems, networking), while implementing best practices for availability, reliability and scalability, with varied interests in algorithms and distributed systems.
Responsibilities
• Participate in on-call rotation (Pagerduty) to respond to production incidents
• Build and run our infrastructure with Ansible, Terraform, and Kubernetes to enable scaling to a massive number of concurrent users
• Build monitoring systems
More at Togetherai
- Technical Account Manager (TAM), AI FactorySan Francisco
- Forward Deployed Engineer (GPU Clusters)San Francisco · $270k – $300k
- Director, Support EngineeringSan Francisco · $290k – $310000k
- Software Engineer - Storage & Observability (Early Career)San Francisco · $165k – $200k
- Software Engineer - Storage & Observability (Early Career)San Francisco
- Finance Analytics EngineerSan Francisco