Full-Time

Senior Performance Software Engineer

Deep Learning Libraries

Posted on 5/9/2025

NVIDIA

NVIDIA

10,001+ employees

Designs GPUs and AI computing solutions

Compensation Overview

$184k - $425.5k/yr

+ Equity

Senior

Company Historically Provides H1B Sponsorship

Austin, TX, USA + 4 more

More locations: Redmond, WA, USA | Santa Clara, CA, USA | Durham, NC, USA | Hillsboro, OR, USA

Category
Applied Machine Learning
Deep Learning
AI & Machine Learning
Software Engineering
Required Skills
CUDA
Assembly
C/C++
Requirements
  • Masters or PhD degree or equivalent experience in Computer Science, Computer Engineering, Applied Math, or related field
  • 6+ years of relevant industry experience
  • Demonstrated strong C++ programming and software design skills, including debugging, performance analysis, and test design
  • Experience with performance-oriented parallel programming, even if it’s not on GPUs (e.g. with OpenMP or pthreads)
  • Solid understanding of computer architecture and some experience with assembly programming
Responsibilities
  • Writing highly tuned compute kernels, mostly in C++ CUDA, to perform core deep learning operations (e.g. matrix multiplies, convolutions, normalizations)
  • Following general software engineering best practices including support for regression testing and CI/CD flows
  • Collaborating with teams across NVIDIA:
  • CUDA compiler team on generating optimal assembly code
  • Deep learning training and inference performance teams on which layers require optimization
  • Hardware and architecture teams on the programming model for new deep learning hardware features
Desired Qualifications
  • Tuning BLAS or deep learning library kernel code
  • CUDA/OpenCL GPU programming
  • Numerical methods and linear algebra
  • LLVM, TVM tensor expressions, or TensorFlow MLIR

NVIDIA designs and manufactures graphics processing units (GPUs) and system on a chip units (SoCs) for various markets, including gaming, professional visualization, data centers, and automotive. Their main products are GPUs that enhance gaming experiences and support professional applications, along with AI and high-performance computing platforms tailored for developers and data scientists. NVIDIA stands out from competitors by offering a combination of hardware and software solutions, including cloud-based services like NVIDIA CloudXR and NGC, which enable scalable applications in AI and machine learning. The company's goal is to drive innovation in technology and provide advanced solutions that cater to a wide range of clients, from gamers to enterprises.

Company Size

10,001+

Company Stage

IPO

Headquarters

Santa Clara, California

Founded

1993

Simplify Jobs

Simplify's Take

What believers are saying

  • Collaboration with Utilidata enhances NVIDIA's position in the energy sector.
  • Acquisition of Lepton AI strengthens NVIDIA's cloud service offerings.
  • Involvement in nEye Systems boosts data transfer speeds and efficiency.

What critics are saying

  • Silicon photonics technology could challenge NVIDIA's electrical interconnects.
  • AI21's funding round may intensify competition in AI infrastructure.
  • Integration challenges with Lepton AI could impact operational efficiency.

What makes NVIDIA unique

  • NVIDIA leads in AI and HPC with cutting-edge GPU technology.
  • The company excels in gaming and professional visualization markets.
  • NVIDIA's cloud services offer scalable solutions for AI and machine learning.

Help us improve and share your feedback! Did you find this helpful?

Benefits

Company Equity

401(k) Company Match

Growth & Insights and Company News

Headcount

6 month growth

1%

1 year growth

1%

2 year growth

0%
Business Insider
May 9th, 2025
Nvidia-backed Israeli AI startup AI21 is raising a $300 million funding round

AI21, an Israeli startup building its own large language models (LLMs), is raising a $300 million funding round, a source said.

Canary Media
Apr 29th, 2025
Utilidata raises $60M for smarter grids

Utilidata has raised $60 million to explore the potential of AI chips in enhancing grid intelligence. Collaborating with Nvidia and utility partners like Portland General Electric and Duquesne Light, the projects aim to gather detailed grid data, particularly regarding distributed energy resources like rooftop solar and EV chargers. These efforts focus on optimizing grid operations through virtual power plants and distributed energy resource management systems, leveraging real-time data and communication.

SiliconANGLE
Apr 11th, 2025
nEye Systems raises $58M for AI chips

Silicon photonics startup nEye Systems raised $58M in funding led by CapitalG, with participation from Microsoft, Micron, Nvidia, and others. The Emeryville-based company is developing optical networking chips for AI data centers, promising faster, more efficient, and cost-effective data transfers. nEye's technology aims to overcome bandwidth and energy limitations of current electrical interconnects. Prototypes are ready, with production samples expected next year. Total funding exceeds $72M.

Aibase
Apr 8th, 2025
Nvidia Acquires Lepton AI for Millions

Nvidia has completed its acquisition of Lepton AI, a startup founded by former Alibaba VP Yangqing Jia, for reportedly hundreds of millions of dollars. Lepton AI, established in 2023, focuses on AI infrastructure and cloud solutions. Co-founders Yangqing Jia and Junjie Bai have joined Nvidia. Jia, a notable AI expert, previously contributed to TensorFlow at Google and led AI R&D at Alibaba.

Yahoo Finance
Apr 7th, 2025
Rescale Raises $115M in Venture Funding

Rescale, a San Francisco-based startup specializing in engineering software for designing race cars and computer chips, secured $115 million in venture financing.