Research Engineer, Safeguards Labs
Anthropic
San Francisco, CA | New York City, NYToday
About the role
<div class="content-intro"><h2><strong>About Anthropic</strong></h2>
<p>Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.</p></div><h2><strong>About the Team</strong></h2>
<p>Safeguards Labs is a new team operating at the intersection of research and engineering, chartered to investigate novel safety methods that protect Claude and the people who use it. We prototype new approaches to safe models, usage safeguards, and production safety — pressure-testing ideas through offline analysis and subsets of traffic before they graduate into production systems run by our partner Safeguards teams. Our work overlaps closely with account abuse
More at Anthropic
- Partner Business Systems & AI Operations LeadSan Francisco, CA
- Trade Compliance CounselSan Francisco, CA | New York City, NY | Washington, DC
- Director, Technical Accounting – M&A and InvestmentsSan Francisco, CA
- Product Counsel, Claude PlatformSan Francisco, CA | New York City, NY
- Sales Strategy & Operational Excellence LeadSan Francisco, CA | New York City, NY
- Solutions Architect, Applied AI (Commercial)San Francisco, CA | New York City, NY