ML Engineer

ResearchSan FranciscoFull-time

Develop and refine the reward models and evaluation frameworks that determine how effectively our environments train AI capabilities. Partner with frontier labs to design benchmarks that meaningfully measure model progress.

PyTorchEvaluationLLMsResearch

ML Engineer

Apply