Full-Time

Head of Cloud Inference

Posted on 6/6/2025

Modular

Modular

201-500 employees

Simplifies AI infrastructure for businesses

Compensation Overview

$297k - $363k/yr

+ Bonus + Equity + Benefits

Senior, Expert

Mountain View, CA, USA

Candidates based in the United States are welcome to apply. You can work remotely from home or from our office in Los Altos, CA.

Category
Applied Machine Learning
AI Research
AI & Machine Learning
Required Skills
Machine Learning
Requirements
  • 7+ years of experience in people management.
  • 10+ years of experience in the field of cloud infrastructure.
  • Proven experience in developing production-quality high-performance software.
  • Proven experience in AI/ML infrastructure, model serving, or related field.
  • Proven experience in managing large teams and managing of managers.
  • A robust understanding of the principles cloud infrastructure and running large scale systems.
Responsibilities
  • Lead and scale the cloud inference organization to build next-generation cluster-level AI inference solutions.
  • Drive product strategy and design decisions for enterprise-grade cloud inference serving platform, work closely with customers to ensure their success while advancing Modular platform as the best platform for AI inference in production.
  • Coach, mentor, and develop a high-performance engineering team while fostering cross-functional collaboration with product and customer support teams.
  • Navigate fast-paced environment with changing priorities while establishing technical expertise and cutting-edge technology adoption.
  • Collaborate with engineering leaders to deliver integrated AI inference deployment solutions for enterprise customers.

Modular provides a platform that simplifies AI infrastructure for businesses in various industries, including technology, engineering, and finance. Its product suite is designed to be integrated and composable, allowing teams to develop, deploy, and innovate more efficiently. The platform offers direct access to industry experts, helping clients address infrastructure challenges and meet their Service Level Agreements (SLAs) and Service Level Objectives (SLOs). Unlike many competitors, Modular focuses on creating a user-friendly experience and emphasizes a culture of teamwork, learning, and diversity. The company's goal is to continuously enhance its platform, recently adding support for dynamic shapes in AI workloads to improve flexibility and efficiency.

Company Size

201-500

Company Stage

Late Stage VC

Total Funding

$130M

Headquarters

Palo Alto, California

Founded

2022

Simplify Jobs

Simplify's Take

What believers are saying

  • Modular's collaboration with NVIDIA enhances its platform's hardware-software integration.
  • The shift to .com domains strengthens Modular's market positioning and investor trust.
  • Rising AI-first programming languages create opportunities for Modular's unified platform.

What critics are saying

  • Dependency on NVIDIA's technology could impact Modular's platform capabilities.
  • Fragmentation in AI programming languages may affect Modular's platform adoption.
  • The shift from .ai to .com domains may affect Modular's innovative brand perception.

What makes Modular unique

  • Modular unifies AI development and deployment, enhancing developer velocity and efficiency.
  • The platform integrates compilers and runtimes for heterogeneous computing environments.
  • Modular's culture emphasizes teamwork, diversity, and continuous innovation in AI infrastructure.

Help us improve and share your feedback! Did you find this helpful?

Benefits

Health Insurance

401(k) Company Match

Unlimited Paid Time Off

Stock Options

Growth & Insights and Company News

Headcount

6 month growth

2%

1 year growth

1%

2 year growth

13%
Tech Startups
Apr 7th, 2025
From .Ai To .Com: The Quiet Domain Rebrand Sweeping Startup Ecosystem

When generative AI took off in 2022 following the popularity of ChatGPT, launching a startup on a .ai domain felt like the obvious move. It signaled your company was part of the next wave of innovation.¬†It instantly signaled to investors, journalists, and users that you were building in AI.Now, a shift is happening. Some of the most promising startups in AI are quietly moving away from their .ai domains and switching to .com. It‚Äôs not about trend-chasing‚Äîit‚Äôs about positioning.This isn’t a flood, but it‚Äôs enough to raise eyebrows. And it says something about how branding, trust, and long-term ambition play a bigger role in how startups choose to present themselves.Why Startups Went With .AI DomainsIn the early days, .ai domains were easy to get and made a lot of sense. Short .coms are hard to come by and often expensive

VentureBeat
May 21st, 2024
Mojo Rising: The Resurgence Of Ai-First Programming Languages

Join us in returning to NYC on June 5th to collaborate with executive leaders in exploring comprehensive methods for auditing AI models regarding bias, performance, and ethical compliance across diverse organizations. Find out how you can attend here. Blink, and you might just miss the invention of yet another programming language. The old joke goes that programmers spend 20% of their time coding and 80% of their time deciding what language to use. In fact, there are so many programming languages out there that we are not sure how many we actually have. It’s probably safe to say there are at least 700 programming languages lingering in various states of use and misuse

Modular
Dec 4th, 2023
Modular to bring NVIDIA Accelerated Computing to the MAX Platform

Today, Modular is excited to announce that it is collaborating with NVIDIA to bring the power of NVIDIA GPUs, CPUs and CUDA software to the Modular Accelerated Execution (MAX) Platform.

Modular
Aug 25th, 2023
Modular: The Case for a Next-Generation AI Developer Platform

We are building a next-generation AI developer platform for the world. Read our latest post on how The Case for a Next-Generation AI Developer Platform

Modular
Aug 25th, 2023
Modular: We’ve raised $100M to fix AI infrastructure for the world's developers

We are building a next-generation AI developer platform for the world. Read our latest post on how We’ve raised $100M to fix AI infrastructure for the world's developers

INACTIVE