Full-Time

Senior Healthcare Data Engineer

National Center for Advancing Translation Sciences

Posted on 8/30/2025

Axle

Axle

201-500 employees

Translational research informatics and data science

Compensation Overview

$130k - $169k/yr

North Bethesda, MD, USA

In Person

Category
Data & Analytics (1)
Required Skills
Python
Git
Apache Spark
SQL
Docker
Hadoop
Requirements
  • Bachelor's or Master's degree in Computer Science, Data Engineering, Bioinformatics, or a related field, with 8+ years of hands-on experience in data engineering (or 5+ years with a Master's).
  • Expert-level proficiency in Python and SQL, with a proven track record of building and maintaining complex, large-scale data pipelines and ETL processes.
  • Significant experience with healthcare data is essential. You must have deep, practical knowledge of common data models (CDMs), particularly OMOP and/or FHIR, and experience with clinical terminologies (e.g., ICD, SNOMED, RxNorm).
  • Strong experience with big data technologies (e.g., Apache Spark, Hadoop) and containerization using Docker for creating reproducible and scalable workflows.
  • Proficiency with version control (Git) and CI/CD practices for data infrastructure.
  • An architectural mindset with the ability to design for scalability, reliability, and security.
Responsibilities
  • Architect and Modernize National-Scale Data Pipelines: Design, develop, and optimize robust, disease-agnostic data acquisition and ingestion pipelines built to handle the complexity and scale of N3C.
  • Master Data Integration and Harmonization: Tackle the complex challenge of harmonizing heterogeneous clinical data from countless sources. You will maintain and enhance the OMOP harmonization pipeline, improve interoperability between common data models (e.g., OMOP, PCORNet, FHIR), and ensure consistency for critical data like medications and lab values.
  • Build the Future with Dynamic Workspaces: Be a key technical player in developing the infrastructure for N3C's new Dynamic Workspaces. You will help build the systems that provision secure, project-specific analytical environments, giving researchers access to the specific data they need while providing institutions granular control.
  • Champion Data Quality and Governance: Develop and implement sophisticated data quality frameworks, creating dashboards and feedback loops to ensure our data partners and researchers have transparent insight into data completeness, consistency, and quality.
  • Innovate with Advanced Technologies: Integrate critical new data sources, including national mortality data and CMS. You will link datasets and help build the processes for integrating novel data types like geospatial and environmental data.
  • Collaborate and Lead: Work alongside a world-class team of scientists, project managers, and engineers to translate scientific needs into technical solutions. You will provide technical leadership and mentorship, driving best practices in an agile, mission-focused environment.
Desired Qualifications
  • Experience designing and deploying data solutions on cloud platforms (AWS, GCP, Azure).
  • Proficiency with modern workflow management systems (e.g., Nextflow, Snakemake, Airflow).
  • Experience with privacy-preserving record linkage (PPRL) techniques and the challenges of working with de-identified patient data.
  • Familiarity with federated data systems and architectures.
  • Experience working in a regulated data environment (e.g., FISMA, HIPAA).

Axle Informatics provides specialized informatics solutions for translational research, health informatics, and data science to biomedical research centers and healthcare organizations. Its offerings are customized software and data management platforms that help researchers collect, integrate, analyze, and visualize large research datasets, with end-to-end tools that automate data aggregation and deliver analytics, dashboards, and decision-support features. The company differentiates itself with an integrated, scientifically informed approach that bridges data science with application development, specifically focused on translational research and clinical data work rather than generic software. Axle’s goal is to advance public health by moving biomedical discoveries from the lab to bedside, improving healthcare outcomes.

Company Size

201-500

Company Stage

N/A

Total Funding

N/A

Headquarters

Rockville, Maryland

Founded

2002

Simplify Jobs

Simplify's Take

What believers are saying

  • Axle won $21M NIH NCATS STPS contract for scientific support in 2026.
  • Axle secured 3-year NIAID VRC/NIC task beating one bidder recently.
  • Axle’s ML pipeline boosted NIH funding by 3% averting 20% cuts.

What critics are saying

  • NIH FY2026 cuts slash 15% translational research funding in 3-6 months.
  • Tempus AI captures 25% NIH bioinformatics contracts in 12-18 months.
  • DNAnexus federated platform halts 40% Axle deployments in 6-9 months.

What makes Axle unique

  • Axle deploys fine-tuned LLMs for biomedical NLP and sentiment analysis.
  • Axle provides Healthcare Authentication Solution reducing fraud in privacy-first care.
  • Reid Simon leads AI, Cloud, and Cybersecurity expansions at NIH institutes.

Help us improve and share your feedback! Did you find this helpful?

Benefits

Health Insurance

Dental Insurance

Vision Insurance

Paid Vacation

Paid Holidays

401(k) Company Match

Educational Benefits for Career Growth

Employee Referral Bonus

Flexible Spending Accounts

Company News

W.W. Grainger, Inc.
Jul 25th, 2023
“Octo awarded $64.7M IT Infrastructure Call Order to support NCI’s Cancer research”

Octo, in partnership with Unissant, Axle Informatics, and TRex, has been awarded an IT Infrastructure and Operations Call Order to support the NCI’s OCIO.

GlobeNewswire
Jan 12th, 2023
Digital Pathology Market Worth $1.86 Billion by 2030 -

For instance, in May 2020, Indica Labs (U.S.), a provider of digital pathology solutions, collaborated with information technology consulting companies, Octo (U.S.) and Axle Informatics (U.S.) and the National Institutes of Health (NIH) (U.S.), to develop an online collection of high-resolution histopathology images of tissues from COVID-19 patients using Indica’s HALO Link platform.

INACTIVE