Full-Time

Staff Engineer

Fleet Performance

DigitalOcean

1,001-5,000 employees

Cloud computing platform for developers and businesses

Consumer Software
Enterprise Software

Compensation Overview

$230k - $270k Annually

+ Bonus + Equity Compensation

Senior, Expert

Remote in USA

This is a remote role.

Category
Applied Machine Learning
AI & Machine Learning
Required Skills
Python
Ruby
Go
Linux/Unix
Requirements
  • Bachelor's or Master's degree in Computer Science, Mathematics, Statistics, or Computer/Electrical Engineering, or equivalent work experience
  • Extensive knowledge of the Linux kernel, hypervisors, and open-source operating systems
  • 7+ years of experience with performance measurement tools such as profilers, eBPF, XDP, fio, TPCC, MLPerf, and NCCL
  • 5+ years developing strategies for managing, monitoring, and analyzing infrastructure, applications and services
  • Strong proficiency in Go, Python, and/or Ruby
  • Deep understanding of kernel performance aspects, including scheduling, context switching, and hardware acceleration
  • Expertise in distributed systems performance, including tracing and debugging methodologies
  • Knowledge of GPU technology, GPU fabrics, and programming for multi-GPU workloads
  • Demonstrated ability to solve complex problems at scale
  • Strong security mindset with a proactive approach to implementing best practices
  • Excellent cross-team collaboration and communication skills
  • Leadership experience in skills development and mentorship
  • Professional-level written and spoken English with strong presentation abilities
Responsibilities
  • Develop and implement comprehensive performance metrics, analysis tools, and reporting systems
  • Lead initiatives to enhance shared infrastructure, balancing performance optimization with rigorous security standards
  • Collaborate with hardware engineering teams and vendors to continuously validate GPU fabric performance
  • Engage with the open-source Linux community to advance virtualization technologies and integrate them into our fleet
  • Conduct in-depth performance analysis of the Linux kernel, virtualization layer, storage, and network stack to devise optimization strategies
  • Identify system bottlenecks proactively and drive optimizations across the hypervisor software stack
  • Work cross-functionally to harness new performance capabilities from evolving hardware architectures
  • Enhance test frameworks, harnesses, and pipelines to ensure robust performance validation
  • Investigate and resolve virtual machine downtime and performance issues in our production environment
  • Participate in on-call rotations as needed to support system reliability

DigitalOcean provides cloud computing services designed to help developers and businesses focus on building software. Its platform offers mission-critical infrastructure and fully managed services that enable users to quickly build, deploy, and scale applications. DigitalOcean's products work by providing a user-friendly interface and tools that simplify the management of cloud resources, allowing users to allocate computing power, storage, and networking capabilities as needed. What sets DigitalOcean apart from its competitors is its emphasis on simplicity, a strong community, and dedicated customer support, which makes it easier for users to navigate cloud computing. The company's goal is to empower developers and businesses to innovate and grow by reducing the time spent on infrastructure management.

Company Stage

IPO

Total Funding

$168.5M

Headquarters

New York City, New York

Founded

2012

Growth & Insights
Headcount

6 month growth

9%

1 year growth

16%

2 year growth

30%

Simplify's Take

What believers are saying

  • The appointment of experienced leaders like Wade Wegner and Bratin Saha signals strong strategic direction and potential for growth.
  • Partnerships with companies like LinkDaddy enhance DigitalOcean's ecosystem, providing additional value to customers.
  • The continuous enhancement of services, such as the introduction of Managed OpenSearch and advanced MongoDB configurations, demonstrates DigitalOcean's commitment to innovation and customer needs.

What critics are saying

  • The competitive cloud services market, dominated by giants like AWS, Azure, and Google Cloud, poses a significant challenge to DigitalOcean's market share.
  • Legal issues, such as the recent case with the Dutch gambling regulator, could impact the company's reputation and operational stability.

What makes DigitalOcean unique

  • DigitalOcean's focus on simplicity and community support sets it apart from larger, more complex cloud service providers like AWS and Google Cloud.
  • Their fully managed offerings, such as Managed OpenSearch and MongoDB, provide specialized solutions that cater specifically to developers and SMBs.
  • DigitalOcean's revamped App Platform emphasizes cost efficiency and scalability, making it particularly attractive for startups and growing technology businesses.

Benefits

Remote-first

Full health coverage

Wellness coverage

Flexible vacation time

Team-building & social events

401(k) plans

ESPP

Education support

Partner support

Employee giving