Full-Time

Staff Engineer

Fleet Performance

Confirmed live in the last 24 hours

DigitalOcean

DigitalOcean

1,001-5,000 employees

Cloud computing platform for developers and businesses

Data & Analytics
Enterprise Software

Compensation Overview

$230k - $270kAnnually

+ Bonus + Equity Compensation

Senior, Expert

Remote in USA

This is a remote role.

Category
Applied Machine Learning
AI & Machine Learning
Required Skills
Python
Ruby
Go
Linux/Unix
Requirements
  • Bachelor's or Master's degree in Computer Science, Mathematics, Statistics or Computer/Electrical Engineering or equivalent work experience
  • Extensive knowledge of Linux kernel, hypervisors, and open-source operating systems
  • 7+ experience with performance measurement tools such as profilers, eBPF, XDP, fio, TPCC, MLPerf, and NCCL
  • 5+ years developing strategies for managing, monitoring, and analyzing infrastructure, applications and services
  • Strong proficiency in Go, Python, and/or Ruby
  • Deep understanding of kernel performance aspects, including scheduling, context switching, and hardware acceleration
  • Expertise in distributed systems performance, including tracing and debugging methodologies
  • Knowledge of GPU technology, GPU fabrics, and programming for multi-GPU workloads
  • Demonstrated ability to solve complex problems at scale
  • Strong security mindset with proactive approach to implementing best practices
  • Excellent cross-team collaboration and communication skills
  • Leadership experience in skills development and mentorship
  • Professional-level written and spoken English with strong presentation abilities
Responsibilities
  • Develop and implement comprehensive performance metrics, analysis tools, and reporting systems
  • Lead initiatives to enhance shared infrastructure, balancing performance optimization with rigorous security standards
  • Collaborate with hardware engineering teams and vendors to continuously validate GPU fabric performance
  • Engage with the open-source Linux community to advance virtualization technologies and integrate them into our fleet
  • Conduct in-depth performance analysis of the Linux kernel, virtualization layer, storage, and network stack to devise optimization strategies
  • Identify system bottlenecks proactively and drive optimizations across the hypervisor software stack
  • Work cross-functionally to harness new performance capabilities from evolving hardware architectures
  • Enhance test frameworks, harnesses, and pipelines to ensure robust performance validation
  • Investigate and resolve virtual machine downtime and performance issues in our production environment
  • Participate in on-call rotations as needed to support system reliability

DigitalOcean provides cloud computing services that enable developers and businesses to build, deploy, and scale applications efficiently. Its platform offers a range of fully managed services that simplify the process of managing infrastructure, allowing users to focus on software development. DigitalOcean stands out from competitors by emphasizing simplicity, a strong community, and open-source support, which helps users quickly get started and find solutions to their challenges. The company's goal is to empower developers and small to medium-sized businesses to innovate and grow by providing the tools and resources they need to succeed in the cloud.

Company Stage

IPO

Total Funding

$168.5M

Headquarters

New York City, New York

Founded

2012

Growth & Insights
Headcount

6 month growth

8%

1 year growth

13%

2 year growth

28%
Simplify Jobs

Simplify's Take

What believers are saying

  • The appointment of experienced leaders like Wade Wegner and Bratin Saha signals strong strategic direction and potential for growth.
  • Partnerships with companies like LinkDaddy enhance DigitalOcean's ecosystem, providing additional value to customers.
  • The continuous enhancement of services, such as the introduction of Managed OpenSearch and advanced MongoDB configurations, demonstrates DigitalOcean's commitment to innovation and customer needs.

What critics are saying

  • The competitive cloud services market, dominated by giants like AWS, Azure, and Google Cloud, poses a significant challenge to DigitalOcean's market share.
  • Legal issues, such as the recent case with the Dutch gambling regulator, could impact the company's reputation and operational stability.

What makes DigitalOcean unique

  • DigitalOcean's focus on simplicity and community support sets it apart from larger, more complex cloud service providers like AWS and Google Cloud.
  • Their fully managed offerings, such as Managed OpenSearch and MongoDB, provide specialized solutions that cater specifically to developers and SMBs.
  • DigitalOcean's revamped App Platform emphasizes cost efficiency and scalability, making it particularly attractive for startups and growing technology businesses.

Help us improve and share your feedback! Did you find this helpful?

Benefits

Remote-first

Full health coverage

Wellness coverage

Flexible vacation time

Team-building & social events

401(k) plans

ESPP

Education support

Partner support

Employee giving