Researcher, Agentic Post-Training
OpenAI
San Francisco | Full-time | 5d ago
About the role
Team Description
OpenAI is looking for exceptional researchers to join the Post-Training Frontiers team, which is responsible for post-training the agentic models we ship across Codex, the API, ChatGPT Thinking, and ChatGPT Pro. The Post-Training Frontiers team sets up the pipeline for deciding which integrations can go into the post-training run, develops its own horizontal improvements to the model, and trains the final model.
The role requires working on the most impactful horizontal improvements for the next model, which could include factuality, instruction following, function calling, multi-agent collaboration, calibrated reasoning effort, tool use, or improving taste in our models. You might build or improve our grading stack, improve our user-data flywheel, or automate our processes to make large post-training runs faster, more reliable, and easier for researchers to use.
This is a team for people who want their work to land directly in models used by hundreds of millions of