Full-Time

Principal Software Engineer

Architecture, AI/ML

Updated on 12/21/2024

DigitalOcean

DigitalOcean

1,001-5,000 employees

Cloud computing platform for developers and businesses

Data & Analytics
Enterprise Software

Compensation Overview

$225k - $338kAnnually

+ Bonus + Equity Compensation

Senior, Expert

Remote in USA

This is a remote role.

Category
Backend Engineering
FinTech Engineering
Software Engineering
Required Skills
Tensorflow
Pytorch
Requirements
  • Proven expertise in large-scale cloud and AI/ML services, and a deep understanding of cloud computing’s potential in enhancing AI/ML applications.
  • Demonstrated ability to lead and mentor large software and AI/ML teams.
  • Experience with web and cloud-native services is a must-have, with experience deploying scalable AI/ML solutions in production.
  • Adept at Systems Thinking with an ability to decompose complex problems into simple, straight-forward solutions, including AI/ML-specific challenges like model drift and data dependency management.
  • Strong grasp of system interdependencies, limitations, and expertise in AI/ML optimization techniques for performance, scalability, and accuracy.
  • Hands-on experience in AI/ML frameworks and libraries, such as TensorFlow, PyTorch, or Scikit-Learn, and model-serving frameworks such as TensorFlow Serving or ONNX.
  • Proven experience in developing and deploying models for performance-intensive applications at web-scale.
  • Understanding of the MLOps lifecycle, including data engineering, model training, validation, deployment, and monitoring.
  • Understanding of key HPC technologies including RDMA, InfiniBand/RoCE, GPUDirect and other storage technologies.
  • Knowledge in performance, scalability, enterprise system architecture, and engineering best practices with an emphasis on the integration of AI/ML.
  • Leverage knowledge of open-source, industry standards, and prior art in architecture decisions with AI/ML considerations.
  • Balance technical leadership and savvy with strong business judgment to make the right decisions about technology, demonstrating simplicity and creativity.
  • Master’s degree or higher preferred in Computer Science, AI/ML, or a related field.
  • 15+ years professional experience in web-scale system software development.
  • 5+ years experience demonstrating an established track record in Deep Learning and Machine Learning.
  • 3+ years recent experience as an ML engineer, data science engineer, or similar.
  • In-depth experience in two or more of the following areas: Cloud Computing, Storage, Networking, Platform-as-a-Service, Infrastructure-as-a-Service, Software-as-a-Service.
  • Excellent communication skills at all levels.
Responsibilities
  • Working at the forefront of cloud, distributed computing, and AI/ML technologies.
  • Serving as the architect driving the technical strategy and direction for our large-scale cloud services, including machine learning model deployment and orchestration.
  • Developing AI/ML models to optimize cloud infrastructure, improve system reliability, and enhance user experience.
  • Building and refining machine learning pipelines and frameworks to support scalable AI/ML solutions.
  • Owning the primary responsibility for establishing a pragmatic long-term technical direction for our software services, ensuring alignment with our customers, business goals, and internal teams.
  • Leading a team of highly passionate technical leads to evolve our service architecture, with alignment across several product technical roadmaps.
  • Leading by example through direct contribution and providing direction in establishing development and operational practices, with specific attention to AI/ML model lifecycle management.
  • Serving as the technical lead on our most demanding, cross-functional projects.
  • Actively mentoring individuals and the engineering community on advanced technical issues, including best practices in AI/ML.

DigitalOcean provides cloud computing services that enable developers and businesses to build, deploy, and scale applications efficiently. Its platform offers a range of fully managed services, allowing users to focus on software development rather than infrastructure management. DigitalOcean stands out from competitors by emphasizing simplicity, a supportive community, and open-source resources, making it accessible for startups and small to medium-sized businesses. The company's goal is to empower users to innovate and grow their businesses by providing the tools and support needed to streamline the development process.

Company Stage

IPO

Total Funding

$168.5M

Headquarters

New York City, New York

Founded

2012

Growth & Insights
Headcount

6 month growth

8%

1 year growth

14%

2 year growth

29%
Simplify Jobs

Simplify's Take

What believers are saying

  • Growing demand for cloud-native tools aligns with DigitalOcean's offerings.
  • Expansion of the global developer community increases potential customer base.
  • Rising adoption of Kubernetes supports DigitalOcean's Kubernetes services.

What critics are saying

  • Increased competition from Vultr threatens DigitalOcean's market share.
  • Potential over-reliance on automated scaling could lead to unexpected costs.
  • Expansion in New York may lead to increased operational costs.

What makes DigitalOcean unique

  • DigitalOcean offers a no DevOps required experience for developers.
  • The company focuses on simplicity and open source to attract developers.
  • DigitalOcean's customer service and community support are key differentiators.

Help us improve and share your feedback! Did you find this helpful?

Benefits

Remote-first

Full health coverage

Wellness coverage

Flexible vacation time

Team-building & social events

401(k) plans

ESPP

Education support

Partner support

Employee giving