Senior Data Scientist
Clinical Informatics
Confirmed live in the last 24 hours
Valo Health

51-200 employees

AI-powered drug discovery & development
Company Overview
Valo is a mission-driven technology company that was created with the belief that the drug discovery and development process can and should be better—faster, less expensive, and with a higher probability of success.
AI & Machine Learning
Data & Analytics
Biotechnology
B2B

Company Stage

Series C

Total Funding

$570.2M

Founded

2019

Headquarters

Boston, Massachusetts

Growth & Insights
Headcount

6 month growth

-1%

1 year growth

-16%

2 year growth

-25%
Locations
Cambridge, MA, USA
Experience Level
Entry
Junior
Mid
Senior
Expert
Desired Skills
Python NLTK
Python
Data Science
R
Apache Spark
SQL
AWS
Pandas
Linux/Unix
Snowflake
CategoriesNew
AI & Machine Learning
Data & Analytics
Requirements
  • Bachelor’s with 5+ years or MS / PhD with 1+ years in a medically-related field with a quantitative focus (public health, health economics, biostatistics) or in in a quantitative field (computer science, statistics, computational biology, biomedical engineering)
  • 3+ years of experience working with real world data, clinical trial data and/or preclinical data
  • Fluency in Python and relevant data science packages (e.g. pandas)
  • Familiarity and/or experience with Spark (pyspark), cloud computing (AWS), Linux environments, and shell scripting
  • Experience with SQL connectors (e.g. Snowflake), and unstructured/text analysis (e.g. regular expressions, NLTK, spaCy/medspaCy)
Responsibilities
  • Develop clinical concepts, medical code mappings, and data dictionaries for real-world data types
  • Define and implement data harmonization standards and curation strategies for real-world patient data products
  • Lead the development and maintenance of user-driven tools/packages to generalize data processes and automate data transformation
  • Collaborate with pre-clinical data science and clinical teams to define data requirements
  • Contribute to writing reports and presentations to non-technical internal audiences and external clients
  • Create high-quality, analysis-ready data sets and products to be used across the organization
  • Articulate and break down large problems into solvable pieces
Desired Qualifications
  • Experience in R and SQL
  • Experience in designing python libraries or packages in other languages
  • Familiarity with traditional drug discovery and development processes and approaches
  • Familiarity with common clinical practices, human physiology, or epidemiology of cardiovascular, metabolic, and renal disease areas