Full-Time

AI GPU Performance Engineer

Updated on 9/5/2024

Modular

51-200 employees

Simplifies AI infrastructure for businesses

Hardware
Enterprise Software
AI & Machine Learning

Compensation Overview

$135k - $242k Annually

+ Bonus + Equity + Benefits

Mid, Senior

Remote in USA + 1 more

Category
Applied Machine Learning
Deep Learning
AI & Machine Learning
Required Skills
CUDA
Requirements
  • Deep understanding of computer architecture (memory hierarchies, caching, etc.) and its impact on algorithm design.
  • 3+ years of relevant experience working on complex code and software systems.
  • Self-motivated and independent, with the ability to execute on agreed-upon specifications.
  • Experience with GPU programming languages such as CUDA or OpenCL.
  • Creativity and curiosity for solving complex problems, a team-oriented attitude that enables you to work well with others, and alignment with our culture.
Responsibilities
  • Design, develop, and optimize high-performance AI numeric and data manipulation kernels/operators for GPUs.
  • Achieve state-of-the-art performance by leveraging software and micro-architectural features of GPUs.
  • Work with compiler, framework, runtime, and serving teams to deliver end-to-end performance that fully utilizes GPU workstations and servers.
  • Collaborate with machine learning researchers to guide system development for future ML trends.

Modular provides a platform that simplifies AI infrastructure for businesses in various industries, including technology, engineering, and finance. Its product suite is designed to be integrated and composable, allowing teams to develop, deploy, and innovate more efficiently. The platform offers direct access to industry experts, helping clients address infrastructure challenges and meet their Service Level Agreements (SLAs) and Service Level Objectives (SLOs). Unlike competitors, Modular focuses on creating a user-friendly experience and emphasizes a culture of teamwork, learning, and diversity. The company aims to continuously innovate and enhance its platform; it recently introduced support for dynamic shapes in AI workloads to improve flexibility and efficiency.

Company Stage

Series A

Total Funding

$130M

Headquarters

Palo Alto, California

Founded

2022

Growth & Insights
Headcount

6 month growth

9%

1 year growth

40%

2 year growth

827%

Simplify's Take

What believers are saying

  • The recent $100M funding round positions Modular.com for rapid growth and further innovation in AI infrastructure.
  • Partnerships with industry leaders like NVIDIA enhance the platform's capabilities and market credibility.
  • A strong company culture focused on teamwork, diversity, and continuous learning makes Modular.com an attractive workplace for top talent.

What critics are saying

  • The highly competitive AI infrastructure market could make it challenging for Modular.com to maintain its unique value proposition.
  • Dependence on partnerships, such as with NVIDIA, may pose risks if these collaborations face any disruptions.

What makes Modular unique

  • Modular.com offers a unified and extensible platform specifically designed for AI infrastructure, setting it apart from more generalized cloud service providers.
  • The company's focus on dynamic shapes for AI workloads enhances the flexibility and efficiency of its platform, a feature not commonly found in competitor offerings.
  • Collaboration with NVIDIA to integrate GPUs, CPUs, and CUDA software into the MAX Platform provides a significant performance boost, distinguishing Modular.com from other AI infrastructure providers.