Full-Time

Senior Data Engineer

Posted on 10/31/2025

Speak

Speak

201-500 employees

AI-powered mobile app for English practice

Compensation Overview

$170k - $240k/yr

San Francisco, CA, USA

Remote

Category
Data & Analytics (1)
Required Skills
Redshift
Python
Airflow
BigQuery
SQL
Machine Learning
AWS
Data Analysis
Snowflake
Google Cloud Platform
Requirements
  • 5+ years of relevant experience
  • Data Modeling: Deep understanding of big data warehouses (BigQuery, Snowflake, Redshift), theories, principles, and practices. Ability to design, implement, and manage data warehouses effectively.
  • Programming Skills: Strong programming skills in Python and SQL. Ability to write efficient, reliable, and maintainable code.
  • Data Pipeline and ETL Development: Experience in building and optimizing data pipelines, architectures, and datasets. Familiarity with ETL (extract, transform, load) processes and tools.
  • Big Data Technologies: Experience with end-to-end data platform beyond creating pipelines, such as data ingestion, reverse ETL, visualization, data observability, etc.
  • Cloud Computing: Knowledge of cloud services (GCP, AWS, dbt) and understanding of how to leverage them for data processing and storage solutions.
  • Data Analysis and Visualization: Ability to analyze data to identify patterns, anomalies, and insights. Proficiency in using data visualization tools (e.g. Mode) to communicate findings clearly.
  • Debugging Skills: Strong problem-solving skills and the ability to approach complex challenges methodically including data inconsistency issues.
  • Effective Communication: Ability to communicate technical information to non-technical stakeholders clearly and effectively. This includes writing documentation, presenting findings, and collaborating on projects.
Responsibilities
  • Design and Build Data Infrastructure: You'll architect and implement robust, scalable data pipelines using Airflow for orchestration and dbt for transformation that ensure efficient data flow and processing. Your work will be critical in managing the ingestion, storage, and accessibility of data from various sources, ensuring our platform's backbone is strong and reliable.
  • Enable Data-Driven Decisions: By collaborating with cross-functional teams, you will develop and deploy tools and frameworks that facilitate data access and analysis, empowering product and business teams to make informed decisions.
  • Optimize Data Architecture: Constantly evaluate and refine the data architecture to support our growing data needs and ensure optimal performance. This includes managing a data warehouse and various data sources, as well as implementing best practices for data modeling, data quality, and data governance.
  • Support Machine Learning Projects: Work closely with analysts and machine learning engineers by providing them with clean, structured data for building and deploying predictive models that enhance personalized learning experiences and engagement strategies.
  • Innovate and Experiment: Stay ahead of the curve by researching and implementing cutting-edge technologies and methodologies in data engineering and analytics.
  • Collaborate Across Teams: As a key player in the engineering team, you'll work closely with product managers, analysts, and other engineers to bring data-driven products and features from concept to launch.

Speak provides an AI-powered mobile app for practicing English through realistic, simulated conversations without a live tutor. Users interact with the app to engage in everyday scenarios, and the AI analyzes responses to offer immediate feedback and personalized learning paths. The product operates on a subscription model with multiple tiers, plus potential in-app purchases or premium features, enabling users to learn at their own pace and schedule. What sets Speak apart from competitors is its ability to reproduce natural conversational practice through advanced AI combined with an easy-to-use mobile interface, removing the need for live tutoring. Speak’s goal is to make English learning more accessible and efficient for students and professionals around the world by providing flexible, on-demand practice anytime, anywhere.

Company Size

201-500

Company Stage

Series C

Total Funding

$165.8M

Headquarters

San Francisco, California

Founded

2015

Simplify Jobs

Simplify's Take

What believers are saying

  • Over 25 million personalized lessons in 2024 improve AI speech recognition.
  • Series C $78M from Accel in December 2024 doubles valuation to $1B.
  • Offices in Seoul, Tokyo, Ljubljana penetrate Asia-Pacific and Europe markets.

What critics are saying

  • Duolingo Max GPT-4o conversations erode differentiation by Q4 2025.
  • OpenAI GPT-5 pricing hikes explode API costs 30% by February 2026.
  • ChatGPT native voice mode drives subscription cancellations by November 2025.

What makes Speak unique

  • Speak uses OpenAI's real-time API for dynamic two-way AI dialogues.
  • Mic2Speak provides real-time phrase translation and personalized vocabulary lists.
  • AI generates custom on-demand conversation scenarios for immersive roleplay.

Help us improve and share your feedback! Did you find this helpful?

Benefits

Standard tech startup benefits

Stipends for commuting, fitness, & learning

Annual company offsite.

Growth & Insights and Company News

Headcount

6 month growth

0%

1 year growth

0%

2 year growth

1%
CIO
Apr 11th, 2025
Speak secures Series A funding from Altman

Sam Altman has invested in various sectors including AI, energy, anti-aging, and edtech. He participated in Series B funding for Slope, a B2B payment automation platform, and Wrap, a coding assistant tool. In 2019, he signed a letter of intent to invest $51 million in Rain AI's chip through OpenAI. In 2022, Speak, an AI-based language learning app, received Series A funding from Altman and later secured further investments from OpenAI's startup fund. Recent investments include AirOps, CrewAI, and Exowatt.

36Kr
Jan 3rd, 2025
Speak raises $78M, hits $1B valuation

OpenAI has invested in Speak, a language learning app that emphasizes speaking and listening through generative AI. In December 2024, Speak raised $78 million in a Series C round, led by Accel, reaching a $1 billion valuation. This follows a $20 million Series B round just six months prior. Speak's approach involves AI-generated real-life scenarios for immersive language practice. The app has over 10 million downloads and offers courses in multiple languages, with plans to expand further.

Crunchbase
Dec 13th, 2024
Speak raises $78M, reaches unicorn status

Speak, an edtech startup, raised a $78 million Series C led by Accel, doubling its valuation to $1 billion in under six months. The San Francisco-based company, founded in 2016, uses AI to help users learn new languages and has created over 25 million personalized lessons this year. Speak has raised a total of $162 million.

WowTale
Dec 11th, 2024
AI 영어 학습 '스픽', 1,094억원 시리즈C 투자 유치...유니콘 등극 - 와우테일

 AI 기반 영어 학습 솔루션 ‘스픽(Speak)’을 운영하는 스픽이지랩스코리아가 시리즈C 투자 라운드에서 약 1,094억 원(7,800만 달러)을 유치하며 유니콘 기업으로 진입했다고 11일 밝혔다.

Bloomberg L.P.
Dec 10th, 2024
OpenAI-Backed Language Tutor Startup Doubles Value to $1 Billion

Speak’s app uses an artificial intelligence “conversational partner” to help people become fluent in English.

INACTIVE