Full-Time

Kernel Engineer

Confirmed live in the last 24 hours

Modular

Modular

201-500 employees

Simplifies AI infrastructure for businesses

Compensation Overview

$166.5k - $242k/yr

+ Bonus + Equity + Benefits

Mid, Senior

Mountain View, CA, USA + 1 more

More locations: Remote in Canada

Candidates must be based in the US or Canada. Onboarding for all new hires is conducted onsite at our Los Altos, CA office.

Category
Embedded Engineering
Software Engineering
Required Skills
Machine Learning
Assembly
C/C++
Requirements
  • In-depth knowledge of C++ and low-level (micro)architectural performance is required.
  • 4+ years of experience working on complex code and systems.
  • Experience with performance modeling and performance data analysis.
  • Understanding of Parallelization techniques for ML / HPC Acceleration.
  • Deep interest in machine learning technologies and use cases.
  • Creativity and curiosity for solving complex problems, a team-oriented attitude that enables you to work well with others, and alignment with our culture.
Responsibilities
  • Design and optimize high-performance ML numeric and data manipulation kernels/operators.
  • Utilize low-level C/C++/Assembly programming to achieve state of the art performance. Your work will also entail potentially introducing new novel compiler and tools support.
  • Work with compiler, framework, runtime and performance teams to deliver end-to-end performance that fully utilizes today’s complex server and mobile systems.
  • Collaborate with architects and hardware engineers to co-design future accelerators, including ISA for new hardware features and evolving ISA.
  • Collaborate with machine learning researchers to guide system development for future ML trends.
Desired Qualifications
  • Some knowledge of compiler fundamentals is valuable, as is familiarity with kernel authoring paradigms (i.e., OpenMP, CUDA, Halide, Rise/Lift, or others)
  • Experience with performance profilers, performance data analysis tools, visualization tools, and debugging or experience working with embedded systems
  • Experience working with distributed/parallel programming models and an understanding of parallel hardware.
  • Experience developing firmware for accelerators and embedded programming.
  • Experience with HPC programming and accelerator languages such as CUDA, OpenCL, SYCL, etc.

Modular provides a platform that simplifies AI infrastructure for businesses in various industries, including technology, engineering, and finance. Its product suite is designed to be integrated and composable, allowing teams to develop, deploy, and innovate more efficiently. The platform offers direct access to industry experts, helping clients address infrastructure challenges and meet their Service Level Agreements (SLAs) and Service Level Objectives (SLOs). Unlike competitors, Modular focuses on creating a user-friendly experience and emphasizes teamwork, learning from failures, and promoting diversity and inclusion within its culture. The company's goal is to continuously innovate and enhance its platform, recently introducing support for dynamic shapes in AI workloads to improve flexibility and efficiency.

Company Size

201-500

Company Stage

Late Stage VC

Total Funding

$130M

Headquarters

Palo Alto, California

Founded

2022

Simplify Jobs

Simplify's Take

What believers are saying

  • Modular's collaboration with NVIDIA enhances its platform's hardware-software integration.
  • The shift to .com domains strengthens Modular's market positioning and investor trust.
  • Rising AI-first programming languages create opportunities for Modular's unified platform.

What critics are saying

  • Dependency on NVIDIA's technology could impact Modular's platform capabilities.
  • Fragmentation in AI programming languages may affect Modular's platform adoption.
  • The shift from .ai to .com domains may affect Modular's innovative brand perception.

What makes Modular unique

  • Modular unifies AI development and deployment, enhancing developer velocity and efficiency.
  • The platform integrates compilers and runtimes for heterogeneous computing environments.
  • Modular's culture emphasizes teamwork, diversity, and continuous innovation in AI infrastructure.

Help us improve and share your feedback! Did you find this helpful?

Benefits

Health Insurance

401(k) Company Match

Unlimited Paid Time Off

Stock Options

Growth & Insights and Company News

Headcount

6 month growth

0%

1 year growth

1%

2 year growth

6%
Tech Startups
Apr 7th, 2025
From .Ai To .Com: The Quiet Domain Rebrand Sweeping Startup Ecosystem

When generative AI took off in 2022 following the popularity of ChatGPT, launching a startup on a .ai domain felt like the obvious move. It signaled your company was part of the next wave of innovation.¬†It instantly signaled to investors, journalists, and users that you were building in AI.Now, a shift is happening. Some of the most promising startups in AI are quietly moving away from their .ai domains and switching to .com. It‚Äôs not about trend-chasing‚Äîit‚Äôs about positioning.This isn’t a flood, but it‚Äôs enough to raise eyebrows. And it says something about how branding, trust, and long-term ambition play a bigger role in how startups choose to present themselves.Why Startups Went With .AI DomainsIn the early days, .ai domains were easy to get and made a lot of sense. Short .coms are hard to come by and often expensive

VentureBeat
May 21st, 2024
Mojo Rising: The Resurgence Of Ai-First Programming Languages

Join us in returning to NYC on June 5th to collaborate with executive leaders in exploring comprehensive methods for auditing AI models regarding bias, performance, and ethical compliance across diverse organizations. Find out how you can attend here. Blink, and you might just miss the invention of yet another programming language. The old joke goes that programmers spend 20% of their time coding and 80% of their time deciding what language to use. In fact, there are so many programming languages out there that we are not sure how many we actually have. It’s probably safe to say there are at least 700 programming languages lingering in various states of use and misuse

Modular
Dec 4th, 2023
Modular to bring NVIDIA Accelerated Computing to the MAX Platform

Today, Modular is excited to announce that it is collaborating with NVIDIA to bring the power of NVIDIA GPUs, CPUs and CUDA software to the Modular Accelerated Execution (MAX) Platform.

Modular
Aug 25th, 2023
Modular: The Case for a Next-Generation AI Developer Platform

We are building a next-generation AI developer platform for the world. Read our latest post on how The Case for a Next-Generation AI Developer Platform

Modular
Aug 25th, 2023
Modular: We’ve raised $100M to fix AI infrastructure for the world's developers

We are building a next-generation AI developer platform for the world. Read our latest post on how We’ve raised $100M to fix AI infrastructure for the world's developers