ML Engineer

Research | San Francisco | Full-time

Develop and refine the reward models and evaluation frameworks that determine how effectively our environments train AI capabilities. Partner with frontier labs to design benchmarks that meaningfully measure model progress.

PyTorch, Evaluation, LLMs, Research

Apply

Send us your details and we'll get back to you if there's a fit.
