Data Engineering Co-Op
Research & Development
Posted on 9/7/2023
INACTIVE
Sleep fitness optimization company
Company Overview
Eight Sleep is obsessed with helping people get sleep fit. They believe sleep, just like fitness, can be measured and improved. That's why they develop technology that goes beyond data, and actually helps unlock the best rest possible, night after night.
Consumer Goods
Hardware
B2C
Company Stage
Series C
Total Funding
$163.3M
Founded
2014
Headquarters
New York, New York
Growth & Insights
Headcount
6 month growth
↑ 22%1 year growth
↑ 22%2 year growth
↑ 75%Locations
Cambridge, MA, USA
Experience Level
Intern
Desired Skills
AWS
Data Analysis
Data Science
Google Cloud Platform
Jupyter
Git
NumPy
Pandas
SQL
Tableau
Python
Looker
Power BI
CategoriesNew
AI & Machine Learning
Data & Analytics
Requirements
- Bachelor's or Master's degree (or currently pursuing) in Computer Science, Software Engineering, Data Science, or a related field
- Highly proficient in SQL, AWS S3, GitHub, and Python (Pandas, Scipy, Numpy, Jupyter Notebooks, PyCharm, seaborn/matplotlib/plotly)
- Experience with data manipulation, cleaning, and analysis, and data visualization
- Ability to write clean, maintainable, and efficient code using best practices
- Strong time-management skills, and excellent attention to detail while moving at a fast pace
- Flexibility: willingness to jump into new projects or assignments (e.g. help collecting data)
- 6+ months of work experience
- Experience working with wearable and/or human physiology data, or knowledge of sleep and physiology
- Basic understanding of HIPAA, GCP, and IRB compliance for data privacy
- Human research experience
- Proficiency in one or more dashboard development tools, such as Tableau, Power BI, or Looker
Responsibilities
- Data engineering:
- Manipulating, cleaning, and transforming raw data into a usable format for analysis (requires time aligning and parsing data from multiple sources with varying sample rates)
- Writing, testing, and maintaining Python and SQL scripts for data analysis, and compiling results into tables and presentations
- Extracting and cleaning temperature and physiological data to support future Machine Learning model development
- Ensure good data quality & privacy:
- Developing and implementing processes to ensure that data are accurate, complete, and consistent throughout the data collection period of a study
- Building databases and workflows that ensure subjects' PII are protected for data privacy (in accordance with HIPAA and IRB compliance)
- Tooling:
- Enhancing our existing data quality scripts by building dashboards, scripts, and/or efficient methods to check incoming study data quality to identify problems with sensors and/or trials
- Developing and maintaining tools and processes to improve efficiency and consistency across projects, including abstracting one-off scripts into reusable tools and developing new features for daily quality checks and study management