A

Software Engineer, RL Data

Anthropic
1 day ago
Full-time
Remote
Worldwide
Remote Engineering

About Anthropic

Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.

About the role

Anthropic's RL Data team builds the systems that produce high-quality reinforcement learning data for Claude: data collection pipelines, human feedback tooling, the execution environments RL tasks run in, and the quality assurance that keeps training data trustworthy at scale. Our goal is to make Claude genuinely great at complex, real-world work β€” and to point those capabilities at the things that matter most, including AI safety research and beneficial deployments of AI. (To be upfront: this is dual-use work β€” it advances general capabilities too, though we aim to differentially advance the beneficial ones.)