Full-Time

Data Platform Engineer

US

Confirmed live in the last 24 hours

Onehouse

Onehouse

51-200 employees

Cloud-native managed lakehouse service provider

Data & Analytics

Mid

Sunnyvale, CA, USA

Required Skills
Kubernetes
Airflow
Apache Flink
Apache Spark
SQL
Java
Gradle
Maven
Requirements
  • 3+ years of experience in building and operating data pipelines in Apache Spark or Apache Flink.
  • 2+ years of experience with workflow orchestration tools like Apache Airflow, Dagster.
  • Proficient in Java, Maven, Gradle and other build and packaging tools.
  • Adept at writing efficient SQL queries and troubleshooting query plans.
  • Experience managing large-scale data on cloud storage.
  • Great problem-solving skills, eye for details.
  • Operational excellence in monitoring, deploying, and testing job workflows.
  • Open-minded, collaborative, self-starter, fast-mover.
Responsibilities
  • Be the thought leader around all things data engineering within the company - schemas, frameworks, data models.
  • Implement new sources and connectors to seamlessly ingest data streams.
  • Building scalable job management on Kubernetes to ingest, store, manage and optimize petabytes of data on cloud storage.
  • Optimize Spark or Flink applications to flexibly run in batch or streaming modes based on user needs, optimize latency vs throughput.
  • Tune clusters for resource efficiency and reliability, to keep costs low, while still meeting SLAs

Onehouse offers a cloud-native managed lakehouse service, combining the usability of a warehouse with the scalability of a data lake. It is powered by Apache Hudi technology and prioritizes interoperability across table formats, compute engines, and cloud providers.

Company Stage

Series A

Total Funding

$33M

Headquarters

Menlo Park, California

Founded

2021

Growth & Insights
Headcount

6 month growth

0%

1 year growth

18%

2 year growth

280%