Full-Time

Kernel Engineer

Posted on 1/31/2025

Modular

Modular

201-500 employees

Simplifies AI infrastructure for businesses

Data & Analytics
AI & Machine Learning

Compensation Overview

$166.5k - $242kAnnually

+ Annual Target Bonus + Equity + Benefits

Mid, Senior

Mountain View, CA, USA + 1 more

More locations: Remote in Canada

Candidates based in the US or Canada are welcome to apply. The job can be performed remotely or in the office in Los Altos, CA.

Category
Embedded Engineering
Software Engineering
Required Skills
Machine Learning
Assembly
C/C++

You match the following Modular's candidate preferences

Employers are more likely to interview you if you match these preferences:

Degree
Experience
Requirements
  • In-depth knowledge of C++ and low-level (micro)architectural performance is required.
  • 4+ years of experience working on complex code and systems.
  • Experience with performance modeling and performance data analysis.
  • Understanding of Parallelization techniques for ML / HPC Acceleration.
  • Deep interest in machine learning technologies and use cases.
  • Creativity and curiosity for solving complex problems, a team-oriented attitude that enables you to work well with others, and alignment with our culture.
Responsibilities
  • Design and optimize high-performance ML numeric and data manipulation kernels/operators.
  • Utilize low-level C/C++/Assembly programming to achieve state of the art performance. Your work will also entail potentially introducing new novel compiler and tools support.
  • Work with compiler, framework, runtime and performance teams to deliver end-to-end performance that fully utilizes today’s complex server and mobile systems.
  • Collaborate with architects and hardware engineers to co-design future accelerators, including ISA for new hardware features and evolving ISA.
  • Collaborate with machine learning researchers to guide system development for future ML trends.
Desired Qualifications
  • Some knowledge of compiler fundamentals is valuable, as is familiarity with kernel authoring paradigms (i.e., OpenMP, CUDA, Halide, Rise/Lift, or others)
  • Experience with performance profilers, performance data analysis tools, visualization tools, and debugging or experience working with embedded systems
  • Experience working with distributed/parallel programming models and an understanding of parallel hardware.
  • Experience developing firmware for accelerators and embedded programming.
  • Experience with HPC programming and accelerator languages such as CUDA, OpenCL, SYCL, etc.

Modular provides a platform that simplifies AI infrastructure for businesses in various industries, including technology, engineering, and finance. Its product suite is designed to be integrated and composable, allowing teams to develop, deploy, and innovate more efficiently. The platform offers direct access to industry experts, helping clients address infrastructure challenges and meet their Service Level Agreements (SLAs) and Service Level Objectives (SLOs). Unlike competitors, Modular focuses on creating a user-friendly experience and emphasizes a culture of teamwork, learning, and diversity. The company's goal is to continuously enhance its platform, recently adding support for dynamic shapes in AI workloads to improve flexibility and efficiency.

Company Stage

Late Stage VC

Total Funding

$126.5M

Headquarters

Palo Alto, California

Founded

2022

Growth & Insights
Headcount

6 month growth

-2%

1 year growth

-7%

2 year growth

9%
Simplify Jobs

Simplify's Take

What believers are saying

  • Growing demand for AI infrastructure supports Modular's scalable business model.
  • Interest in AI-first programming languages may increase platform adoption.
  • Edge computing trend creates opportunities for Modular's edge-to-datacenter distribution.

What critics are saying

  • Rapid evolution of AI languages may cause fragmentation and compatibility issues.
  • Dependency on NVIDIA hardware could impact Modular's flexibility and bargaining power.
  • Investor pressure from $100M funding may lead to strategic misalignments.

What makes Modular unique

  • Modular unifies AI development and deployment with a 'just works' approach.
  • The platform supports dynamic shapes, enhancing flexibility and efficiency for AI workloads.
  • Collaboration with NVIDIA boosts platform capabilities with high-performance computing solutions.

Help us improve and share your feedback! Did you find this helpful?

Benefits

Health Insurance

401(k) Company Match

Unlimited Paid Time Off

Stock Options