Senior Software Engineer



51-200 employees

Cloud-native managed lakehouse service for data interoperability

Data & Analytics

$150,000 - $220,000



Seattle, WA, USA + 1 more

Required Skills
Microsoft Azure
Apache Spark
Data Analysis
Google Cloud Platform
  • Bachelor's degree in Computer Science or related field.
  • 7+ years of experience in software engineering or SRE roles, with a focus on large scale distributed systems.
  • Strong coding skills in at least one programming language, such as Java, Python, or Go.
  • Strong conviction in software development best practices, including version control, automated testing, and continuous integration and delivery.
  • Excellent problem-solving, triaging, and debugging skills in large-scale distributed systems.
  • Experience with managing kubernetes clusters and applications at scale.
  • Experience deploying applications on one or more cloud platforms such as AWS, Google Cloud Platform or Microsoft Azure.
  • Experience defining and owning reliability focussed systems and processes (e.g. Incident Management, Post-mortem).
  • Experience with software development related compliance processes (e.g. Soc2, FedRAMP).
  • Experience with the following tech stack: Infrastructure-as-code (e.g. Terraform, Cloudformation), Automation frameworks (e.g. Jenkins, CircleCI), Monitoring stacks (e.g. Prometheus and ELK), Cloud security management (e.g IAM, SSO), Data processing technologies like Spark
  • Build and own our reliability engineering practice from the ground up, owning our entire production infrastructure and operational posture.
  • Establish a culture of reliability across engineering by providing a comprehensive incident management platform that is being used for instrumentation, operability, and around incidents.
  • Design, implement and maintain new services, tools, and monitoring to support service reliability and alerting.
  • Serve as an active member of our SRE team, responding to and managing high severity incidents or any situations concerning the wellbeing and continuous operation of our mission-critical systems.
  • Collaborate with your stakeholders across engineering teams to ensure continuous adoption of best practices, rollout scenarios for the space, and that services are designed with reliability in mind.
  • Continuously analyze and evaluate the tradeoffs of the existing designs and make recommendations based on new technologies and industry best practices.
  • Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity management and launch reviews.
  • Maintain services once they are live by measuring and monitoring availability, latency and overall system health through an intimate understanding of how the critical parts of our site work.
  • Contribute to better incident management posture and retrospectives, driving improvements in our overall reliability and incident response time as well as on-call runbooks and post-mortem reports.
  • Drive our compliance posture; ensuring that all our products and processes comply with relevant regulations and standards, especially during compliance audits.

Onehouse stands out as a leading company in the data management sector, offering a unique cloud-native managed lakehouse service built on Apache Hudi, a technology developed by its founding team during their tenure at Uber. The company's strength lies in its ability to merge the simplicity of a warehouse with the vastness of a data lake, providing engineers with a streamlined process for setting up their data lakes. With a team of experienced professionals from Uber, LinkedIn, Confluent, and Amazon, Onehouse guarantees wide interoperability for data across table formats, multiple compute engines, and various cloud providers, ensuring that your infrastructure remains vendor-independent.

Company Stage

Series A

Total Funding



Menlo Park, California



Growth & Insights

6 month growth


1 year growth


2 year growth