Principal Data Engineer
Updated on 3/24/2023
Locations
Remote
Experience Level
Entry
Junior
Mid
Senior
Expert
Desired Skills
Apache Spark
AWS
Apache Kafka
Data Analysis
Elasticsearch
Java
MongoDB
Scala
Natural Language Processing (NLP)
Python
Requirements
- Scala, Java, Python
- Apache Spark, Spark Streaming, Databricks/Delta Lake, Apache Storm, Elasticsearch, Apache Nifi
- Kafka, MongoDB, Redis
- AWS
- Bachelor's degree (or higher) in Computer Science, Engineering, or similar and/or relevant work experience
- Experience providing technical leadership at the enterprise level for the design of information technology systems
- Crafted and implemented operational data stores, as well as data lakes in production environments
- Ability to analyze, diagnose and resolve complex architectural problems using industry standard engineering principles
- Design and build data ingestion pipelines and ETL processing, including stream processing, while factoring in performance and cost
- Identify and solve issues concerning data management to improve data quality
- Clean, prepare and optimize data for ingestion and consumption
- Experience solving performance problems with Lucene based search solutions like Elasticsearch or Solr
- 9+ years experience in server-side/back-end full cycle product development in a production environment
- 4+ years developing with Apache Spark, including Structured Streaming. Experience with Databricks is a big plus
- Knowledge of Scala or Java with exposure to or interest in Scala
- Leads and mentors other team members
- Provides partners with coaching and feedback in order to build effective teams
- Provides effective support to cross-functional teams
Responsibilities
- Solve complex real-time data collection & analysis problems with cutting edge technical solutions
- Iterate on our high performance and scalable platform for massive data collection, real-time analytics, NLP, machine learning, and backend data services
- Build high performance, scalable, real-time, server-side technologies
- Write scalable code with extensive test coverage, working in a professional software engineering environment with source control, dev/stage/production release cycles, continuous integration, and deployment
- Work closely with product management, design, quality assurance and operations teams to understand our customers' needs and effectively translate them to technical specifications
- Lead projects from translating product requirements into architecture to production