Full-Time

Staff Data Engineer

Posted on 6/27/2024

Trunk

51-200 employees

Platform for managing code repositories

Enterprise Software

Compensation Overview

$200k - $245k Annually

+ Equity Range: 0.3% - 0.5%

Expert

San Francisco, CA, USA

Hybrid position in San Francisco.

Category
Data Engineering
Data & Analytics
Required Skills
Kotlin
Rust
Python
Pytorch
Apache Spark
Apache Kafka
Java
Pandas
Go
Scala
Natural Language Processing (NLP)
NumPy
C/C++
Requirements
  • 10-12+ years of experience as a software engineer with a strong understanding of key concepts in distributed systems
  • 10-12+ years of experience in building and deploying data applications, with a track record of regularly shipping new features
  • Fluency in at least two of these languages: Java/Scala/Kotlin, Python, Go, Rust, or C++
  • Good understanding and practical experience with partitioning, replication, map-reduce, indexing, and CAP theorem
  • Experience with distributed storage systems (S3, HDFS, Hive, ClickHouse, Elastic, etc.), distributed processing engines (Spark, etc.), and message queues (Kafka, SQS, etc.)
  • Passion for building large-scale ML applications and improving software engineers' productivity
  • Understanding of key concepts in natural language processing, machine learning, or statistical analysis
  • (Nice to have) Some experience with the machine learning stack (pandas, PyTorch, NumPy, scikit-learn, transformers, etc.)
Responsibilities
  • Build fault-tolerant and scalable data pipelines
  • Design efficient data storage, collaborating with product engineers to create fast and reliable data-driven features
  • Debug, profile, and optimize distributed data-intensive applications to improve their latency, accuracy, resource consumption, and throughput
  • Design and build observability for data quality and accuracy
  • Integrate ML models such as Llama to analyze data and create features

Trunk.io provides a platform that improves code quality without slowing developers down. Its main product is a tool for managing code repositories, which lets developers store and share their code. A key feature is parallel merge queues, which allow multiple code changes to be merged at the same time and so speed up the development process. This sets Trunk.io apart from competitors that typically handle merges sequentially. Its primary users are software developers and businesses with in-house development teams, who benefit from improved code quality and faster development cycles. The company aims to help clients release products more quickly and stay competitive, and it operates on a subscription-based model for access to its services.

Company Stage

Series A

Total Funding

$27.8M

Headquarters

San Francisco, California

Founded

2018

Growth & Insights
Headcount

6 month growth

22%

1 year growth

29%

2 year growth

229%

Simplify's Take

What believers are saying

  • Growing demand for CI/CD tools boosts interest in Trunk's CI Analytics.
  • Remote work trends increase demand for Trunk's collaboration and code management tools.
  • Focus on software supply chain security enhances demand for Trunk's code quality solutions.

What critics are saying

  • Competition from established CI/CD platforms like Jenkins may hinder market penetration.
  • Rapid advancements in AI-driven code analysis tools could outpace Trunk's offerings.
  • Reliance on GitHub Actions poses risks if API policies or pricing change.

What makes Trunk unique

  • Trunk.io offers parallel merge queues, enhancing code merging efficiency.
  • The platform provides real-time code correction and formatting tools.
  • Trunk's CI Analytics offers deep historical analysis of GitHub Actions workflows.

INACTIVE