AI CPU Performance Engineer

51-200 employees

Unified AI frameworks and hardware platform

AI & Machine Learning

Compensation Overview

$166,500 - $242,000 Annually

+ Annual target bonus + Equity + WFH stipends + Flexible paid time off + Team building events

Mid, Senior

Remote in USA + 1 more

Requirements

  • 3+ years of relevant experience working on complex code and systems.
  • In-depth knowledge of C++ and low-level CPU micro-architecture.
  • Experience with performance modeling and analysis.
  • Understanding of parallelization and vectorization techniques for AI applications.
  • Deep interest in AI technologies and use cases.
  • Creativity and curiosity for solving complex problems, a team-oriented attitude that enables you to work well with others, and alignment with our culture.

Responsibilities

  • Design and optimize high-performance ML numeric and data manipulation operators for CPUs.
  • Utilize low-level C++ and assembly programming to achieve state-of-the-art performance.
  • Work with other teams at Modular to ensure that performance libraries can be utilized as effectively and efficiently as possible.
  • Perform micro-benchmarking, workload characterization, competitive analysis, bottleneck identification, and optimization.
  • Collaborate with machine learning researchers to guide system development for future ML trends.

Modular offers an AI developer platform that unifies AI frameworks and hardware, aimed at better performance and lower cost. The platform includes a new programming language, Mojo, and a modular engine that supports model deployment, compute portability, and scalability.

Company Stage

Series A
