Full-Time

Senior Data Engineer

Posted on 7/25/2025

Prima Mente

Prima Mente

11-50 employees

AI-driven neuroscience research for brain health

No salary listed

London, UK + 1 more

More locations: San Francisco, CA, USA

Hybrid

Preference for in-person work; remote is possible for interview stages.

Category
Data & Analytics (1)
Required Skills
Kubernetes
Microsoft Azure
Airflow
Apache Flink
Apache Beam
Apache Spark
Apache Kafka
AWS
Terraform
Data Analysis
Google Cloud Platform
Requirements
  • 4+ years of experience building data infrastructure or data platforms with demonstrated ability to solve complex distributed systems problems independently
  • Experience building infrastructure for large-scale data processing pipelines (both batch and streaming) using tools like Spark, Kafka, Apache Flink, Apache Beam, and with proprietary solutions like Nebius
  • Experience designing and implementing large-scale data storage systems (feature stores, timeseries DBs) for ML use cases, with strong familiarity with relational databases, data warehouses, object storage, and expertise in DB schema design
  • Experience with ML infrastructure and have worked at companies that use ML for core business functions
  • Experience building data pipelines for external data sources that are observable, debuggable, and verifiably correct, having dealt with challenges like data versioning, point-in-time correctness, and evolving schemas
  • Strong distributed systems and infrastructure skills - comfortable scaling and debugging Kubernetes services, writing Terraform, and working with orchestration tools like Flyte, Airflow, or Temporal
  • Experience with cloud platforms (AWS, GCP, Azure) and container technologies
  • Strong software engineering skills with ability to write easy-to-extend and well-tested code
  • Excellent communication skills and experience collaborating within multidisciplinary teams
  • Comfortable with ambiguity and a fast-moving environment, with a bias for action
  • Learn and pick up new skills quickly
Responsibilities
  • Owning and scaling our data infrastructure by several orders of magnitude to handle > 100 petabyte-scale multi-omic datasets, including data pipelines, distributed data processing, and storage systems
  • Building a unified feature store for all our ML models and biological data analysis workflows
  • Efficiently storing and loading petabytes of data for ML bio data
  • Processing and storing predictions and evaluation metrics for large-scale biological forecasting and analysis models
  • Implementing data versioning and point-in-time correctness systems for evolving biological datasets
  • Building observable, debuggable data pipelines that handle the complexity of multi-omic data sources
Desired Qualifications
  • Familiarity with bioinformatics or biological data handling (this will be supported by our in-house bioinformatics team)
  • Knowledge of data governance, compliance, and security standards relevant to healthcare or biotech

Prima Mente combines AI with neuroscience to study the human brain. It generates its own biological datasets and builds advanced AI models to translate discoveries into clinical and research applications, focusing on understanding brain biology and developing treatments for neurological conditions. The company uses a multidisciplinary team and a flat, collaborative culture to tackle complex problems with multi-omics and multi-modal models, sharing findings through influential publications. Its goal is to protect against neurological diseases and improve cognitive health at scale, aligning with global health priorities (SDG 3). Compared with others, Prima Mente differentiates itself through in-house data generation, a strong emphasis on neurobiology and AI integration, and a collaborative, fast-paced environment aimed at delivering real patient impact.

Company Size

11-50

Company Stage

N/A

Total Funding

N/A

Headquarters

London, United Kingdom

Founded

2022

Simplify Jobs

Simplify's Take

What believers are saying

  • Pleiades achieves 0.89 accuracy detecting Alzheimer's from blood.
  • $1M Alzheimer's AI Prize win boosts credibility and partnerships.
  • 1,000-patient study launches next month with NHS SANDBOX pilot.

What critics are saying

  • Biomni-AD's superior AI erodes PARTHENON's lead within 6-12 months.
  • NVIDIA's GPU shift obsoletes Pleiades, halting pilots in 12-24 months.
  • 48-person team fails scaling proteomics, delaying 2026 generalizability.

What makes Prima Mente unique

  • Prima Mente integrates AI with neuroscience via Pleiades epigenome model.
  • PARTHENON virtual lab compresses weeks of experiments into minutes.
  • Multi-omics wet lab generates high-throughput data for Alzheimer's biomarkers.

Help us improve and share your feedback! Did you find this helpful?

Benefits

Professional Development Budget

Growth Opportunities

Company News

PR Newswire
Mar 23rd, 2026
Alzheimer's research AI prize doubles to $2M as two teams win $1M each

The Alzheimer's Disease Data Initiative has doubled its prize competition, awarding $1 million each to two winners — Biomni-AD and Prima Mente — in the Alzheimer's Insights AI Prize. The competition, backed by Bill Gates, sought agentic AI solutions to accelerate Alzheimer's and dementia research. Launched in August 2025, the prize drew over 180 submissions. Biomni-AD developed an AI "co-scientist" that performs time-consuming research tasks in minutes with high accuracy. Prima Mente created PARTHENON, a virtual laboratory platform that compresses weeks of experimental work into minutes. Both solutions will be made freely available to researchers worldwide through the AD Data Initiative's platform. With Alzheimer's projected to affect 152 million people by 2050, the competition reflects urgent demand for innovative approaches to tackle fragmented research data.

INACTIVE