Senior Data Engineer
Updated on 11/30/2023
Spotter

51-200 employees

Provides growth capital and data insights for YouTube creators.
Company Overview
Spotter stands out as a leading provider of growth capital tailored to YouTube creators, offering them the resources and freedom to expand their businesses while maintaining full control over their content. The company's unique approach includes providing in-depth data insights to optimize content performance, and has already resulted in deploying over $600 million to creators, with a goal to reach $1 billion in investment by 2023. With a vast catalog spanning over 250,000 videos and generating 75 billion monthly watch-time minutes, Spotter offers a transparent, efficient, and brand-safe media solution to advertisers.
B2B

Company Stage

Later Stage VC

Total Funding

$230.6M

Founded

2019

Headquarters

Los Angeles, California

Growth & Insights
Headcount

6 month growth

0%

1 year growth

10%

2 year growth

193%
Locations
Culver City, CA, USA
Experience Level
Entry
Junior
Mid
Senior
Expert
Desired Skills
Apache Spark
AWS
Data Analysis
Pandas
Redshift
SQL
Apache Flink
Python
CategoriesNew
Data & Analytics
Requirements
  • Bachelor’s degree, preferably in Computer Science or Computer Information Systems
  • 4+ years of software engineering experience
  • 3+ years of data engineering experience with Apache Spark or Apache Flink
  • 3+ years of experience running software and services in the cloud
  • Proficiency in working with DataFrame APIs (Pandas and Spark) for parallel and single node processing
  • Proficiency using advanced languages and techniques with Python, Scala, etc. with modern data optimized file formats such as Parquet and Avro
  • Proficiency with SQL on RDBMS and data warehouse solutions like Redshift
Responsibilities
  • Develop and maintain scalable data pipelines, including:
  • ETL pipelines, both single and multi-node solutions
  • Build data quality assurance steps for new and existing pipelines
  • Create derived datasets with augmented properties
  • Work on analytics ready datasets to power internal and creator facing tools
  • Troubleshoot issues when they arise, working directly with internal data consumers
  • Automate pipeline runs with scheduling and orchestration tools
  • Work with large scale datasets
  • Work with/use various external APIs to enhance data
  • Setup database tables for analytics users to consume the data collected by the Data Engineering team
  • Work with big data technologies to improve data availability and data quality in the cloud (AWS)
  • Mentor members of the team
Desired Qualifications
  • Experience with YouTube APIs
  • Experience with data acquisition from external APIs at large scale / in parallel processing
  • Experience with Data-Lake technologies
  • Experience with AWS Glue metastore
  • Experience with Data-Mesh approaches
  • Experience with data cataloging, data lineage and data governance tools and approaches
  • Experience with vector databases