Data Engineer/Senior Data Engineer
Bioinformatics
Updated on 2/7/2024
Flagship Pioneering

501-1,000 employees

Originates biotech ventures for health and sustainability
Company Overview
Flagship Pioneering stands out as a leader in the biotechnology industry, having originated and nurtured over 100 scientific ventures, including high-profile companies like Moderna and Indigo Agriculture. The company's culture is rooted in developing transformative products for human health and sustainability, as evidenced by their creation of platform companies like Generate Biomedicines and Tessera Therapeutics. Their competitive edge lies in their ability to explore new frontiers in genetics, as demonstrated by their recent unveiling of Quotient Therapeutics, a venture aimed at creating transformative medicines.
Venture Capital

Company Stage

N/A

Total Funding

$6.4B

Founded

2000

Headquarters

Cambridge, Massachusetts

Growth & Insights
Headcount

6 month growth

6%

1 year growth

16%

2 year growth

56%
Locations
Cambridge, MA, USA
Experience Level
Entry
Junior
Mid
Senior
Expert
Desired Skills
Python
Airflow
R
SQL
Tableau
AWS
Data Analysis
CategoriesNew
Data & Analytics
Biology & Biotech
Requirements
  • B.S. or M.S. in computer science or related field and 3+ years of industry experience
  • Extensive experience with database technologies, architecture, and management.
  • Programming experience in scripting languages such as Python, R, SQL, and version control tools.
  • Understanding of ideating and creating UI design deliverables.
  • Working knowledge of AWS services, including EC2, S3, RDS, FSx Lustre, Athena, Glue, Lambda, Batch.
  • Application lifecycle knowledge, including best practices such as code review, unit/integration testing, and documentation.
  • Ability to design and implement backend models using logic and API endpoints for complex scientific workflows/entities (i.e.. Benchling).
  • Adaptive, creative, and a quick learner, efficient in multi-tasking and troubleshooting.
  • Demonstrated ability to successfully work in cross-functional and third-party teams with an emphasis on teamwork, collaboration, and communication.
  • Experience with workflow orchestration frameworks such as Nextflow, Snakemake, Airflow or AWS Step Functions.
Responsibilities
  • Design, create, and manage database systems that allow biologists to investigate data independently.
  • Automate workflows for data retrieval or storage in electronic lab notebooks (ELNs),such as Benchling.
  • Collaborate with a team of scientists to help automate data analysis pipelines and build data infrastructure.
  • Define, contribute to, and proactively communicate data engineering standards and practices establishing repeatable templates and frameworks and efficient usage of cloud services and tools.
  • Communicate insights and progress to the scientific team and executive management.
Desired Qualifications
  • NGS pipeline development experience with virology, pathology, molecular biology, or similar datasets.
  • Ability designing and implementing a variety of software platforms through the API framework.
  • Familiarity with continuous integration/continuous deployment (CI/CD) pipelines.
  • Experience with data visualization tools, such as Tableau, Spotfire, or Retool.