Full-Time

Head of Evaluations

Posted on 7/25/2024

DeepMind

1,001-5,000 employees

Develops artificial general intelligence systems

Consulting
Hardware
Enterprise Software
AI & Machine Learning
Healthcare

Senior, Expert

London, UK

Category
AI Research
AI & Machine Learning
Required Skills
Data Science
Requirements
  • Demonstrated prior experience designing and implementing audits or evaluations of cutting-edge AI systems.
  • Deep understanding of AI technologies and machine learning principles.
  • Expertise in data science, statistics, algorithmic auditing, or other relevant fields.
  • Demonstrable expertise in identifying ways to scale evaluations work and drive efficiencies, from new modalities through to established models.
  • Demonstrated experience in managing a high-performing interdisciplinary team in a fast-paced environment.
  • Demonstrated ability to lead cross-functional teams, foster collaboration, and influence outcomes.
  • Experience working with ethics and safety topics associated with AI development in a technology company such as child safety, privacy, representational harms and discrimination, misinformation, or other areas of content or model risks.
  • Proven ability to engage with and influence a range of internal stakeholders, from researchers and engineers through to senior leadership, as well as external partners, from academia through to suppliers.
  • Excellent communication skills, both written and verbal, with the ability to effectively communicate complex technical concepts to a wide range of audiences.
Responsibilities
  • Lead and manage a team of specialists, fostering a collaborative and high-performing environment, providing mentorship and guidance to team members.
  • Build and manage a roadmap for developing evaluations that examine responsibility and safety questions for Google DeepMind’s most groundbreaking models, spanning internal assurance evaluations, external evaluations, and policy-aligned evaluations for model teams.
  • Proactively engage with industry-wide developments in this space to promote best practices in Google DeepMind.
  • Identify areas where new testing approaches are required, working with Google DeepMind research and engineering teams to:
      • Drive the development of new evaluations for upcoming model releases;
      • Oversee automated evaluations that support model development evaluations and assurance; and
      • Engage external organisations to provide insights on the responsibility and safety aspects of Google DeepMind models.
  • Work closely with other areas of ReDI to develop a deep understanding of key policy areas which guide and inform the evaluations being run.
  • Continuously identify ways to scale evaluations work and drive efficiencies, including identifying right-sized approaches to manage evaluations for new modalities and capabilities through to more established models.
  • Coordinate with relevant teams across the organization, such as those working on autonomy and cybersecurity, to create a comprehensive overview of the safety profile of Google DeepMind models.
  • Proactively improve the responsibility and safety of Google DeepMind’s models by sharing and communicating insights from evaluations with model development teams, senior stakeholders and decision makers, and broader Google teams.
  • Work closely with Google DeepMind and Google teams to ensure external commitments are upheld.

This company leads in the field of artificial general intelligence (AGI), with notable applications across healthcare, energy management, and biotechnology. Their work in early diagnostic tools for eye diseases, optimizing energy usage in major data centers, and groundbreaking contributions to protein structure prediction underlines their commitment to harnessing AI for diverse practical applications. The company's dedication to pushing the boundaries of AI technology not only propels the industry forward but also creates a dynamic and impactful working environment for its employees.

Company Stage

M&A

Total Funding

$503.3M

Headquarters

London, United Kingdom

Founded

2010

Simplify's Take

What believers are saying

  • DeepMind's advancements in AI-driven drug discovery, such as collaborations with Lilly and Novartis, promise significant contributions to healthcare and pharmaceuticals.
  • The introduction of AlphaCode 2 and other AI models showcases DeepMind's continuous innovation and leadership in competitive programming and AI research.
  • DeepMind's AI tools for music creation and weather forecasting demonstrate the company's versatility and potential to revolutionize multiple industries.

What critics are saying

  • The backlash against Google's Gemini AI model could impact DeepMind's reputation and trustworthiness in the AI community.
  • The competitive landscape in AI is intense, with rapid advancements from other tech giants potentially overshadowing DeepMind's innovations.

What makes DeepMind unique

  • DeepMind's pioneering work in AI, such as the development of AlphaFold and AlphaCode, sets it apart as a leader in both scientific research and practical AI applications.
  • The company's integration with Google's vast resources and data infrastructure provides a significant competitive advantage over standalone AI firms.
  • DeepMind's focus on ethical AI and its collaborations with industry leaders like Lilly and Novartis highlight its commitment to impactful and responsible AI innovation.
INACTIVE