Simplify Logo

Full-Time

Staff Software Engineer

Datalake

Posted on 12/12/2023

Dremio Corporation

Dremio Corporation

201-500 employees

Open data lakehouse platform, self-service analytics

Data & Analytics
Consulting
Enterprise Software

Compensation Overview

$154.5k - $209.1kAnnually

Senior, Expert

Remote in USA

Category
Software Engineering
Required Skills
Microsoft Azure
Apache Spark
SQL
Java
AWS
Apache Hive
Hadoop
Data Analysis
Google Cloud Platform
Requirements
  • 8+ years of industry experience
  • B.S./M.S/Equivalent in Computer Science or a related technical field or equivalent experience
  • Fluency in Java, C++ or another modern language
  • Strong database fundamentals including SQL, performance, and schema design
  • Background in large scale data processing systems (e.g., Hadoop, Spark, etc.)
  • Understanding of distributed file systems such as S3, ADLS, or HDFS
  • Experience with Apache Iceberg, Parquet, AVRO and/or Delta Lake
  • Experience with Hive and AWS Glue
Responsibilities
  • Develop core components for Dremio’s query engine
  • Deliver key features and feature enhancements for customers in the Datalake
  • Work with open source projects like Apache Iceberg, Parquet, Arrow and Calcite
  • Own design, implementation, testing, and support of next-generation features
  • Collaborate with Product Management to innovate and deliver on customer requirements
  • Understand and reason about concurrency and parallelization to deliver scalability and performance
  • Solve complex technical problems and customer issues while improving telemetry and instrumentation
  • Work with engineering leaders to establish solid designs/architecture for upcoming features
  • Develop future leaders of Dremio by providing continuous mentorship and coaching of junior software engineers
Desired Qualifications
  • Hands-on experience with distributed query engines, query processing or optimization, distributed systems, concurrency control, data replication, code generation, or storage systems
  • Hands-on experience with AWS, Azure and Google Cloud Platform

Dremio offers an open data lakehouse platform that combines self-service analytics with data warehouse functionality and data lake flexibility, leveraging technologies such as Apache Arrow, Apache Iceberg, and Apache Parquet to deliver sub-second performance at a fraction of the cost. The platform's products, including Dremio Sonar and Dremio Arctic, enable improved governance, data quality, and query performance, making it the ideal fit for data-driven departments and organizations of any size.

Company Stage

Series E

Total Funding

$410M

Headquarters

Santa Clara, California

Founded

2015

Growth & Insights
Headcount

6 month growth

2%

1 year growth

2%

2 year growth

-15%

Benefits

Health, Dental, and Vision Insurance

401(k)

Stock Options

Work From Home

Office Events

Parental Leave Benefits

Paid Time Off

INACTIVE