(ID: 2024-6798)
Axle is a bioscience and information technology company that offers advancements in translational research, biomedical informatics, and data science applications to research centers and healthcare organizations nationally and abroad. With experts in biomedical science, software engineering, and program management, we focus on developing and applying research tools and techniques to empower decision-making and accelerate research discoveries. We work with some of the top research organizations and facilities in the country, including multiple institutes at the National Institutes of Health (NIH).
Axle is seeking a Data Engineer - Tool Abstraction to join our vibrant team at the National Institutes of Health (NIH) supporting the National Center for Advancing Translational Sciences (NCATS) located in Rockville, MD.
Axle Informatics is a bioinformatics and information technology company that offers innovative computer services, informatics, and enterprise solutions to research centers and healthcare organizations worldwide. Our mission is to empower decision-making and accelerate discovery in translational research. We are looking for a Data Engineer to support clinical and research data projects through pipeline development, data ingestion, harmonization, and containerization.
Key Responsibilities:
Design, build, and maintain scalable and efficient data pipelines for clinical and research datasets.
Automate data extraction, transformation, and loading (ETL) processes to support downstream analysis.
Collaborate with data science teams to ensure data is harmonized and consistent across different systems and workflows.
Ingest and harmonize large-scale clinical and research datasets from multiple sources.
Implement best practices for data standardization and cleaning to support various data users, including researchers, clinicians, and analysts.
Work closely with clinical data teams to ensure that datasets comply with research and healthcare standards.
Develop and optimize workflows that automate data processing, using a workflow language such as Snakemake or Nextflow.
Support continuous integration and delivery practices within data workflows.
Utilize Docker for containerization of workflows and data pipelines to ensure reproducibility and scalability.
Collaborate with multidisciplinary teams of data scientists, bioinformaticians, and software engineers to ensure data processes align with project goals.
Document pipeline processes and workflows to ensure transparency and reproducibility.
Qualifications:
Master’s degree in Computer Science, Data Engineering, Bioinformatics, or a related field.
Proven experience building data pipelines and automating ETL processes.
Strong experience working with clinical data and familiarity with healthcare data standards and Common Data Models (e.g., CDISC, OMOP).
Hands-on experience with at least one workflow management system (e.g., Snakemake, Nextflow, or a similar tool).
Experience with Docker and containerization practices for deploying reproducible workflows.
Excellent scripting and programming skills in languages such as SQL, Python, or Bash.
Nice to Have:
Experience with cloud-based services (e.g., AWS, GCP, or Azure) for data processing and pipeline scaling.
Familiarity with big data frameworks (e.g., Apache Spark or Hadoop).
Knowledge of version control systems, such as Git, for workflow management and collaboration.
Disclaimer: The above description is meant to illustrate the general nature of work and
level of effort being performed by individuals assigned to this position or job description. This is not
restricted as a complete list of all skills, responsibilities, duties, and/or assignments required. Individuals
may be required to perform duties outside of their position, job description or responsibilities as needed.
The diversity of Axle’s employees is a tremendous asset. We are firmly committed to providing equal opportunity
in all aspects of employment and will not tolerate any illegal discrimination or harassment based on age, race,
gender, religion, national origin, disability, marital status, covered veteran status, sexual orientation,
status with respect to public assistance, or other characteristics protected under state, federal, or local law.
We are also committed to deterring those who aid, abet, or induce discrimination or coerce others to discriminate.
Accessibility: If you need an accommodation as part of the employment process, please contact: [email protected]
This role has a market-competitive salary with an anticipated base compensation range listed below. Actual salaries will vary depending on a candidate’s experience, qualifications, skills, and location.