Full-Time

Research Scientist

Reinforcement Learning

DeepMind

1-10 employees

AI research organization pursuing safe AGI

No salary listed

London, UK

In Person

Category
AI & Machine Learning (1)
Required Skills
Data Structures & Algorithms
Reinforcement Learning
Requirements
  • A passion for reinforcement learning.
  • A research track record in reinforcement learning, including peer-reviewed publications.
  • Strong implementation ability and comfort working in research codebases.
  • Evidence of owning experiments end-to-end, including analysis and interpretation.
  • Strong communication skills and a bias toward clarity and honesty regarding results.
  • High agency and drive: you push projects forward, prioritize effectively, and take initiative.
Responsibilities
  • Initiating or pursuing novel research directions, by proposing and testing research hypotheses.
  • Implementing algorithm ideas and running end-to-end experiments, including setup, execution, analysis, and iteration.
  • Sharing your skills and knowledge with other researchers.
  • Building or improving infrastructure for research at scale.
  • Designing evaluations and ablations that answer real questions and change minds.
  • Analyzing results carefully, including debugging and failure analysis.
  • Communicating clearly through plots, writeups, and paper-ready narratives and figures.
  • Contributing to a culture of first-principles thinking, high standards, and direct, constructive feedback.
Desired Qualifications
  • PhD in machine learning preferred, or equivalent practical experience.
  • Experience with reinforcement learning for sequence models, post-training, preference-based learning, or agentic systems.
  • Experience with modern research stacks (e.g., JAX/Flax or PyTorch) and scaling experiments.
  • Strong experimental taste: good judgment regarding baselines, ablations, and what is worth testing.
  • Comfort with scaling, evaluation methodologies, and diagnosing complex failure modes.
  • A focus on craft: you care about doing excellent work while maintaining a high velocity.

DeepMind conducts artificial intelligence research aimed at advancing general problem-solving capabilities, with safety and ethics guiding every project. It develops AI systems and learning algorithms that tackle complex tasks such as diagnosing eye diseases, reducing data center energy use, and predicting 3D protein shapes. These systems learn from large datasets and optimize models for specialized tasks, often using deep neural networks and reinforcement learning, and are then deployed to assist scientists, doctors, and engineers. What sets DeepMind apart is its emphasis on rigorous fundamental research published in top journals, a strong safety and ethics framework, and collaboration on real-world scientific challenges rather than purely commercial applications. The company’s long-term goal is to solve intelligence and build more general, capable problem-solving systems that can help society address major scientific and humanitarian problems.

Company Size

1-10

Company Stage

Acquired

Total Funding

$533M

Headquarters

London, UK

Founded

2010

Simplify Jobs

Simplify's Take

What believers are saying

  • GraphCast excels in extreme weather prediction, partnering with disaster agencies.
  • AlphaFold collaborates with DNDi on Chagas and Leishmaniasis therapeutics.
  • Open GraphCast deployment since September 2024 enables enterprise licensing.

What critics are saying

  • Top AI talent exits DeepMind due to Google merger bureaucracy.
  • Isomorphic Labs secures Eli Lilly and Novartis drug discovery deals.
  • NVIDIA's PhysicsNeMo reimplements GraphCast, commoditizing the technology.

What makes DeepMind unique

  • AlphaFold predicts protein structures, catalyzing biology progress since 2021.
  • GraphCast delivers 90% accurate 10-day weather forecasts in under one minute.
  • AlphaGo defeated Go champion Lee Sedol in 2016 using reinforcement learning.

Benefits

Performance Bonus

Company News

Epium
Jul 24th, 2025
Google DeepMind unveils Aeneas, a tool to decode ancient Latin inscriptions

Google DeepMind has introduced Aeneas, a novel Artificial Intelligence software designed to assist historians in interpreting ancient Latin inscriptions.

Firmsuggest
Jul 22nd, 2025
OpenAI vs Google DeepMind: Who Really Won the 2025 AI Math Battle?

In one of the boldest displays of AI reasoning ever recorded, both OpenAI and Google DeepMind have hit gold - literally - at the 2025 International Mathematical Olympiad (IMO).

Scale by Tech
Jul 13th, 2025
DeepMind Launches GenAI Processors: Python Library Accelerates Real-Time Multimodal AI Workflows

Imobisoft
Jul 8th, 2025
This Week in AI: 12 Breakthroughs You Shouldn't Miss (July 2025)

DeepMind unveils AlphaGenome: Mapping DNA's "Dark Matter"

Robotics and Automation News
Jun 26th, 2025
Google DeepMind launches new vision language action model to 'put AI directly into local robotic devices'

Google DeepMind has introduced an efficient, on-device robotics model designed for general-purpose dexterity and rapid task adaptation.