Develop and refine the reward models and evaluation frameworks that determine how effectively our environments train AI capabilities. Partner with frontier labs to design benchmarks that meaningfully measure model progress.
Send us your details and we'll get back to you if there's a fit.