Internship

Data Science & Machine Learning Intern

Confirmed live in the last 24 hours

Insitro

Insitro

201-500 employees

Machine learning for drug discovery efficiency

Compensation Overview

$35 - $65/hr

San Bruno, CA, USA

Interns will have access to onsite facilities and a commuter bus service from various locations around the Bay Area.

Category
Applied Machine Learning
Data Science
AI & Machine Learning
Data & Analytics
Required Skills
LLM
Python
Data Science
Machine Learning
Requirements
  • Working towards a BS, MS, or Ph.D. in engineering, computational biology, systems biology, computer science, mathematics, statistics, life science, chemistry, physics, or a related field.
  • Proficiency in one or more general-purpose programming languages. We primarily use Python.
  • Interest in using and developing brand new statistical and machine learning methods inspired by real problems.
  • Curiosity about human physiology or disease biology.
  • Committed to writing high-quality, well-commented code and documentation.
  • Ability to communicate effectively and collaborate with people of diverse backgrounds and job functions.
  • Passion for making a difference in the world.
Responsibilities
  • Leverage publicly available single cell transcriptomics resources to extract insights about disease mechanisms relevant to the therapeutic areas.
  • Develop, productionize, and deploy cutting-edge ML approaches to integrate large-scale multi-modal phenotypic datasets.
  • Develop workflows to enable post-GWAS (Genome-Wide Association Scan) analysis of results, e.g., fine-mapping.
  • Enable higher throughput annotation and exploration of candidate genes from our discovery efforts.
  • Design statistical methods to improve rare variant burden tests, and methods to improve power in longitudinal phenotypes.
  • Develop ML models for imputing disease-relevant phenotypes from high-content clinical imaging datasets, e.g., MRI, PET-CT.
  • Develop ML methods for disentangling and genetically interpreting axes of variation in complex phenotypes.
  • Use LLMs to extract disease-relevant information from medical records.
  • Build rich embedding models using DNA-Encoded Library (DEL) data on the scale of billions of compounds, and use these representations for downstream drug discovery tasks such as hit-discovery.
  • Explore generative models of small molecules in various data modalities such as 2D and 3D representations for hit-to-lead drug discovery efforts.
  • Develop new geometric deep learning methods to better characterize nuanced molecular properties and relationships.
  • Identify and prototype novel microscopy-driven phenotyping workflows, including hardware acquisition, post-processing, and featurization.
  • Develop robust software tooling to support the deployment of new and existing methods for general use by insitro scientists.
  • Optimize existing microscopy acquisition methods in both hardware and software, using ML feature outputs to benchmark improvements.
Desired Qualifications
  • First-hand experience with biological data, preferably using computational approaches.
  • Passion for learning how to work with diverse functional genomic assays (RNA/DNase/ATAC/ChIP-seq, etc).
  • Interest in learning how to analyze single-cell RNA-seq data.
  • Solid understanding of computational chemistry, including virtual screening (classic QSAR modeling, structure based drug-discovery), library design, etc.
  • Demonstrated ability to use and develop cutting edge statistical and machine learning methods inspired by real problems.
  • Experience with machine and deep Learning frameworks (e.g., scikit-learn, PyTorch, etc.).
  • Demonstrated ability to write high-quality, production-ready code (readable, well-tested, with well-designed APIs).
  • Experience in Linux environment, database languages (e.g., SQL, No-SQL) and version control practices and tools such as Git.
  • Publications of high-quality work in relevant computational biology, bioinformatics, systems biology, life sciences, or biomedical venues, including journals and conferences.
  • Passionate about solving problems, asking questions and learning independently.
  • Familiarity with the SciPy/PyData ecosystem (numpy, pandas, scipy, dask etc.).
  • Familiarity with cloud computing services (AWS or GCP).
  • Familiarity with statistical analysis software, e.g., R.

Insitro specializes in drug discovery and development by using machine learning and biological tools to predict successful medicine pathways. Their predictive models aim to reduce the costly failures common in pharmaceutical research and development. The team, consisting of scientists and engineers, collaborates to generate data that enhances drug development efficiency. Insitro serves pharmaceutical companies and healthcare providers, likely generating revenue through selling insights or partnerships.

Company Size

201-500

Company Stage

Series C

Total Funding

$643M

Headquarters

San Francisco, California

Founded

2018

Simplify Jobs

Simplify's Take

What believers are saying

  • Insitro received $25 million from Bristol Myers Squibb for ALS target discovery.
  • The company raised over $600 million in venture capital funding.
  • Insitro appointed AI expert Emily Fox as Senior VP of AI/ML.

What critics are saying

  • Competition from well-funded companies like Xaira Therapeutics challenges Insitro's market position.
  • Ethical concerns in AI drug development may affect Insitro's public perception.
  • Data privacy issues could lead to regulatory challenges for Insitro.

What makes Insitro unique

  • Insitro integrates machine learning with high-throughput biology for drug discovery.
  • The company builds predictive models to accelerate target selection and drug development.
  • Insitro's team combines expertise in science, engineering, and drug discovery.

Help us improve and share your feedback! Did you find this helpful?

Benefits

Excellent medical, dental, and vision coverage

Excellent mental health and well-being support

Open vacation policy

Access to free onsite baristas & cafe with daily lunch and breakfast

Access to free onsite fitness center

Commuter benefits

Paid parental leave

Competitive pay and 401(k) matching

Flexible work schedule (on site and remote)

Growth & Insights and Company News

Headcount

6 month growth

0%

1 year growth

0%

2 year growth

4%
Business Wire
Dec 18th, 2024
insitro Receives $25 Million in Milestone Payments from Bristol Myers Squibb for the Achievement of Discovery Milestones and the Selection of First Novel Genetic Target for ALS

insitro receives $25 million in milestone payments from Bristol Myers Squibb for the achievement of discovery milestones and the selection of first novel genetic target for ALS.

Stock Titan
Dec 18th, 2024
insitro Receives $25 Million in Milestone Payments from Bristol Myers Squibb for the Achievement of Discovery Milestones and the Selection of First Novel Genetic Target for ALS

insitro receives $25 million in milestone payments from Bristol Myers Squibb for the achievement of discovery milestones and the selection of first novel genetic target for ALS.

San Diego Business Journal
Jul 1st, 2024
Iambic Raises $153 Million Series B

Other competitors in the biotech space that use AI include Xaira Therapeutics, which has raised over $1 billion in funding and San Fransico-based Insitro, which has raised over $600 million in venture capital funding.

Business Wire
Apr 30th, 2024
Insitro Hires Ai And Machine Learning Visionary, Emily Fox, Ph.D., As Senior Vice President Of Ai/Ml

SOUTH SAN FRANCISCO, Calif.--(BUSINESS WIRE)--insitro, a machine learning-powered drug discovery and development company, today announced the appointment of Emily Fox, Ph.D., as senior vice president of AI/machine learning. In this role, she will oversee those areas as well as data science and computational biology, inclusive of data modalities that span genetics, omics, imaging, clinical data, and molecular design. Dr. Fox, a professor in the Department of Statistics and Department of Computer Science at Stanford University, has made groundbreaking contributions in the application of machine learning in healthcare, with her pioneering work directly translating into patient impact. "AI leaders of Emily’s caliber who simultaneously have an understanding of biology and health are rare, and we are privileged to have recruited her to lead our AI/ML teams,” said Daphne Koller, Ph.D., co-founder and CEO. “Her groundbreaking work in machine learning, alongside her track record in translating research into impactful applications in healthcare, aligns perfectly with our mission to revolutionize drug discovery through data-driven approaches

Elk Valley Times
Apr 4th, 2024
Leading Clinical Research Innovator, Amy Abernethy, M.D., Ph.D., Joins insitro Board of Directors

insitro, a machine learning-powered drug discovery and development company, today announced that Amy Abernethy, M.D., Ph.D, has joined the company's board of directors.