Skip to content

Researcher, Agentic Post-Training

Openai

San FranciscoFull-time5d ago

About the role

Team Description OpenAI is looking for exceptional researchers to join the Post-Training Frontiers team, which is responsible for post-training the agentic models we ship across Codex, the API, ChatGPT Thinking, and ChatGPT Pro. The Post-Training Frontiers team sets up the pipeline for deciding which integrations can go into the post-training run, develops its own horizontal improvements to the model, and trains the final model. The role requires working on the most impactful horizontal improvements for the next model, which could include factuality, instruction following, function calling, multi-agent collaboration, calibrated reasoning effort, tool use, or improving taste in our models. You might build or improve our grading stack, improve our user-data flywheel, or automate our processes to make large post-training runs faster, more reliable, and easier for researchers to use. This is a team for people who want their work to land directly in models used by hundreds of millions of

More at Openai