Simplify Logo

Full-Time

AI Data Engineer Lead – Principal

Posted on 11/2/2023

Grindr

Grindr

201-500 employees

Location-based social networking app for LGBTQ

Compensation Overview

$208k - $244.5kAnnually

+ Equity + Bonus

Senior, Expert

San Francisco, CA, USA

Category
Applied Machine Learning
AI & Machine Learning
Data Analysis
Data Engineering
Data & Analytics
Required Skills
Bash
Kubernetes
Microsoft Azure
Python
Airflow
NoSQL
R
Git
Apache Spark
SQL
Java
Docker
AWS
Pandas
Natural Language Processing (NLP)
Data Analysis
Snowflake
Google Cloud Platform
Requirements
  • Bachelors in Computer Science, Mathematics, Physics, or a related field
  • 5+ years of experience as a data engineer building production-level pre/post-processing data pipelines for ML/DL models, including 2+ years of technical leadership experience
  • Experience in statistical analysis & visualization on datasets using Pandas or R
  • Experience designing and building highly available, distributed systems of data extraction, ingestion, normalization and processing of large data sets in real time as well as batch
  • Demonstrated prior experience in creating data pipelines for text data sets NLP/ large language models
  • Excellent coding skills in Python, Java, bash, SQL, and expertise with Git version control
  • Experience using big data technologies (Snowflake, Airflow, Kubernetes, Docker, Helm, Spark, pySpark)
  • Experience with any public cloud environment - AWS, GCP or Azure
  • Significant experience with relational databases and query authoring (SQL) as well as NoSQL databases like DynamoDB etc
  • Experience building and maintaining ETL (managing high-quality reliable ETL pipelines)
Responsibilities
  • Dive into dataset and design, implement and scale data pre/post processing pipelines of ML models
  • Work on applied ML solutions in the areas of data mining, cleaning, normalizing and modeling
  • Collaborate with engineers in conceptualizing, planning and implementing data engineering initiatives
  • Design and build data platforms & frameworks for processing high volumes of data, in real time as well as batch
  • Build data processing streams for cleaning and modeling text data for LLMs
  • Research and evaluate new technologies in the big data space to guide continuous improvement
  • Collaborate with multi-functional teams to help tune the performance of large data applications
  • Work with Privacy and Security team on data governance, risk and compliance initiatives
  • Work on initiatives to ensure stability, performance and reliability of data infrastructure

Grindr, a leading location-based social networking app specifically tailored for LGBTQ individuals, connects users globally and offers convenience with a web-based version eliminating the need for downloads. It offers a platform that not only fosters community and personal connections but also prioritizes inclusiveness and diversity, embodying values that are crucial in today's workplace. Employing cutting-edge location-based technology to enhance user experience, this company is at the forefront of digital innovation in social networking.

Company Stage

IPO

Total Funding

$643M

Headquarters

Los Angeles, California

Founded

2009

INACTIVE