Full-Time

Data Platform Engineer

Confirmed live in the last 24 hours

Onehouse

Onehouse

51-200 employees

Data lakehouse solution for efficient data management

Data & Analytics
Enterprise Software
AI & Machine Learning

Compensation Overview

$215k - $250kAnnually

+ Equity Compensation

Mid, Senior

Sunnyvale, CA, USA

Hybrid position based in Sunnyvale, CA.

Category
DevOps & Infrastructure
Cloud Engineering
Required Skills
Kubernetes
Apache Flink
Apache Spark
SQL
Java
Gradle
Maven
Requirements
  • 3+ years of experience in building and operating data pipelines in Apache Spark or Apache Flink.
  • 2+ years of experience with workflow orchestration tools like Apache Airflow, Dagster.
  • Proficient in Java, Maven, Gradle and other build and packaging tools.
  • Adept at writing efficient SQL queries and trouble shooting query plans.
  • Experience managing large-scale data on cloud storage.
  • Great problem-solving skills, eye for details. Can debug failed jobs and queries in minutes.
  • Operational excellence in monitoring, deploying, and testing job workflows.
  • Open-minded, collaborative, self-starter, fast-mover.
Responsibilities
  • Be the thought leader around all things data engineering within the company - schemas, frameworks, data models.
  • Implement new sources and connectors to seamlessly ingest data streams.
  • Building scalable job management on Kubernetes to ingest, store, manage and optimize petabytes of data on cloud storage.
  • Optimize Spark or Flink applications to flexibly run in batch or streaming modes based on user needs, optimize latency vs throughput.
  • Tune clusters for resource efficiency and reliability, to keep costs low, while still meeting SLAs.

Onehouse.ai offers a data lakehouse solution that helps businesses manage and optimize their data efficiently. Their main product is a fully managed service that allows clients to organize various types of data seamlessly, using formats like Apache Hudi, Apache Iceberg, and Delta Lake. This service automates data management tasks such as clustering, compaction, and encryption, making it easier for businesses to handle their data without needing extensive engineering resources. Onehouse.ai stands out from competitors with its usage-based pricing model, which significantly reduces data management costs by 50% or more compared to traditional cloud data warehouses. The company's goal is to simplify data management for businesses of all sizes, enabling them to scale their data operations while minimizing expenses.

Company Stage

Series B

Total Funding

$66.1M

Headquarters

San Francisco, California

Founded

2021

Growth & Insights
Headcount

6 month growth

12%

1 year growth

10%

2 year growth

96%
Simplify Jobs

Simplify's Take

What believers are saying

  • Onehouse secured $35M in Series B funding, indicating strong investor confidence.
  • Partnerships with Microsoft and Google enhance Onehouse's strategic positioning.
  • The launch of a vector embeddings generator aligns with AI/ML market demand.

What critics are saying

  • Increased competition from major cloud providers like Microsoft and Google.
  • Rapid advancements in AI/ML by tech giants may overshadow Onehouse's offerings.
  • Emergence of new data streaming solutions could divert potential clients.

What makes Onehouse unique

  • Onehouse offers a cloud-native managed lakehouse service built on Apache Hudi.
  • The platform supports multiple table formats, ensuring flexibility and compatibility.
  • Onehouse's usage-based pricing model reduces data management costs significantly.

Help us improve and share your feedback! Did you find this helpful?