Lead the development of the infrastructure that could define how society governs increasingly powerful AI systems.
As AI systems become increasingly capable, one question is becoming unavoidable: how can powerful systems be independently scrutinized when the organizations involved cannot fully share what they know?
AI developers are protecting frontier models worth billions of dollars. Auditors, safety researchers, and regulators often rely on sensitive evaluation methods, proprietary benchmarks, and restricted datasets that cannot be publicly disclosed. Yet meaningful oversight depends on both sides being able to collaborate.
At OpenMined, we're building the privacy-preserving infrastructure that makes trustworthy oversight possible without requiring trust. Using privacy-enhancing technologies, secure computation, and distributed systems, we're creating the rails that allow AI developers, auditors, safety institutes, researchers, and regulators to run evaluations without exposing models, datasets, prompts, or proprietary methodologies.
Independent AI evaluation is still an emerging field, and many of the standards, systems, and operational models that will govern how frontier AI is assessed have yet to be established. As the TPM leading this effort, you'll help define them. Working with leading AI labs, AI safety institutes, researchers, and public institutions, you'll shape product strategy, drive ecosystem adoption, and help turn privacy-preserving evaluation from an emerging concept into critical infrastructure.
The challenges you'll tackle span distributed systems, cryptography, privacy-preserving computation, AI evaluation, and institutional trust. Success means expanding adoption among AI labs and evaluators, enabling increasingly sophisticated evaluation workflows, and delivering strategic integrations that demonstrate how independent oversight can work in practice.
If successful, the systems you help build could become part of the foundation that allows increasingly powerful AI models to be evaluated safely, independently, and credibly without requiring organizations to expose their most valuable assets. In a future where AI governance depends on trustworthy mechanisms for independent evaluation, your work will make those mechanisms possible.
Responsibilities
Requirements
Nice-to-Haves