Facebook pixel

ML Scientist for Molecular Omics
Confirmed live in the last 24 hours
San Bruno, CA, USA
Experience Level
Desired Skills
Data Analysis
Google Cloud Platform
  • Ph.D. in computational biology, genetics, biomedical informatics, biostatistics, bioinformatics, computer science, machine learning or a related discipline, or equivalent practical experience (e.g., a Masters degree plus 2 years in relevant industry experience)
  • Experience using and developing cutting-edge methods for analyzing NGS sequencing and/or proteomics data sets
  • Strong fundamentals in applied multivariate statistics
  • Expertise in machine learning (including deep-learning); familiarity with machine learning application on molecular omics data
  • Strong programming skills in Python, or strong programming skills in R and experience in Python
  • Interest in uncovering novel disease biology
  • Ability to communicate effectively and collaborate with people of diverse backgrounds and job functions in a fast-paced startup environment
  • Passion for making a difference in the world
Desired Qualifications
  • Familiarity with common deep learning toolkits such as tensorflow, pytorch, keras
  • Experience with modeling sequencing artifacts (e.g. GC content, fragment length bias, overdispersion, etc.) and interpretation of QC measurements to guide assay development
  • Expertise with NGS data processing tools (samtools, GATK, IGV, etc)
  • Experience working with diverse functional genomic assays (RNA/DNase/ATAC/ChIP-seq, etc); exposure to CRISPR-based experiments a plus
  • Some understanding of human physiology or disease biology (especially cancer, metabolism, or neurodegeneration)
  • Publication record of high-quality work in biomedical, machine learning, or statistics venues
  • Proficiency in modern software development tools, such as: Linux environment (including shell/Bash scripting), version control practices and tools (e.g., Git), or modern workflow management frameworks (Snakemake, Cromwell/CWL/WDL, NextFlow, etc)
  • Familiarity with cloud computing services (e.g., AWS or GCP) and workflow management tools or batch scheduling systems (e.g. SLURM)
  • Proficiency in C++ or other compiled, statically-typed languages
  • Experience with database languages (e.g., SQL)

51-200 employees

Data-driven drug discovery & development
Company Overview
Insitro’s mission is to use machine learning and data at scale to transform the way that drugs are discovered and developed for patients. The company is developing predictive machine learning models to discover underlying biologic state based on human cohort data and in-house generated cellular data at scale to advance novel targets and patient biomarkers, design therapeutics, and inform clinical strategy.
  • Excellent medical, dental, and vision coverage
  • Excellent mental health and well-being support
  • Open vacation policy
  • Access to free onsite baristas & cafe with daily lunch and breakfast
  • Access to free onsite fitness center
  • Commuter benefits
  • Paid parental leave
  • Competitive pay and 401(k) matching
  • Flexible work schedule (on site and remote)