Full-Time

Senior DL Algorithms Engineer

Inference Optimizations

NVIDIA

10,001+ employees

Designs GPUs and AI computing solutions

Automotive & Transportation
Enterprise Software
AI & Machine Learning
Gaming

Compensation Overview

$148k - $287.5k Annually

+ Equity

Senior

Company Historically Provides H1B Sponsorship

Remote in USA

The job is remote, but candidates from Santa Clara, CA and New York, NY are also considered.

Category
Applied Machine Learning
Deep Learning
AI & Machine Learning
Required Skills
Python
Neural Networks
C/C++

Requirements
  • PhD in CS, EE or CSEE or equivalent experience
  • 5+ years of experience
  • Strong background in deep learning and neural networks, in particular inference
  • Deep understanding of computer architecture, and familiarity with the fundamentals of GPU architecture
  • Programming skills in C++ and Python
Responsibilities
  • Deliver hyper-optimized recipes for LLM inference as part of NVIDIA Inference Microservices (NIMs)
  • Analyze, validate and debug performance and accuracy characteristics of optimized models
  • Benchmark state-of-the-art offerings in LLM inference and perform competitive analysis for the NVIDIA SW/HW stack
  • Develop software, tooling and processes across multiple layers of the stack to streamline and scale the delivery of hundreds of optimized LLMs
  • Collaborate heavily with other SW/HW co-design teams to enable the creation of the next generation of AI-powered services
Desired Qualifications
  • Strong fundamentals in algorithms
  • Experience with and good understanding of LLMs and/or VLMs
  • Proven experience with processor and system-level performance modelling
  • Experience with MLOps and DLOps, building CI/CD pipelines
  • GPU programming experience (CUDA or OpenCL) is a strong plus but not required

NVIDIA designs and manufactures graphics processing units (GPUs) and system-on-a-chip units (SoCs) for various markets, including gaming, professional visualization, data centers, and automotive. Its main products are GPUs that enhance gaming experiences and support professional applications, along with AI and high-performance computing platforms tailored for developers and data scientists. NVIDIA stands out from competitors by offering a combination of hardware and software solutions, including cloud-based services like NVIDIA CloudXR and NGC, which enable scalable applications in AI and machine learning. The company's goal is to drive innovation in technology and provide advanced solutions that cater to a wide range of clients, from gamers to enterprises.

Company Stage

IPO

Total Funding

$19.5M

Headquarters

Santa Clara, California

Founded

1993

Growth & Insights
Headcount

6 month growth

0%

1 year growth

0%

2 year growth

-1%

Simplify's Take

What believers are saying

  • Acquisition of VinBrain enhances NVIDIA's AI-driven healthcare solutions.
  • Investment in Nebius Group boosts NVIDIA's AI infrastructure capabilities.
  • Partnership with Serve Robotics aligns with NVIDIA's focus on robotics and AI applications.

What critics are saying

  • Increased competition from AI startups like xAI challenges NVIDIA's market position.
  • Serve Robotics' rapid expansion may lead to financial strain if market growth lags.
  • Integration challenges from VinBrain acquisition may affect NVIDIA's operational efficiency.

What makes NVIDIA unique

  • NVIDIA leads in AI and HPC solutions with cutting-edge GPU technology.
  • The Omniverse platform enhances NVIDIA's capabilities in industrial AI and digital twins.
  • NVIDIA's cloud services, like CloudXR, offer scalable solutions for AI and machine learning.

Benefits

Company Equity

401(k) Company Match