Full-Time

Senior DL Algorithms Engineer

Inference Optimizations

Updated on 3/14/2025

NVIDIA

10,001+ employees

Designs GPUs and AI computing solutions

Compensation Overview

$148k - $287.5k Annually

+ Equity

Senior

Company Historically Provides H1B Sponsorship

Remote in USA

The job is remote, but candidates from Santa Clara, CA and New York, NY are also considered.

Category
Applied Machine Learning
Deep Learning
AI & Machine Learning
Required Skills
Python
Neural Networks
C/C++
Requirements
  • PhD in CS, EE or CSEE or equivalent experience
  • 5+ years of experience
  • Strong background in deep learning and neural networks, in particular inference
  • Deep understanding of computer architecture, and familiarity with the fundamentals of GPU architecture
  • Programming skills in C++ and Python
Responsibilities
  • Deliver hyper-optimized recipes for LLM inference as part of NVIDIA Inference Microservices (NIMs)
  • Analyze, validate and debug performance and accuracy characteristics of optimized models
  • Benchmark state-of-the-art offerings in LLM inference and perform competitive analysis for NVIDIA SW/HW stack
  • Develop software, tooling and processes across multiple layers of the stack to streamline and scale the delivery of hundreds of optimized LLM models
  • Collaborate heavily with other SW/HW co-design teams to enable the creation of the next generation of AI-powered services
Desired Qualifications
  • Strong fundamentals in algorithms
  • Experience with and a good understanding of LLMs and/or VLMs
  • Proven experience with processor and system-level performance modelling
  • Experience with MLOps and DLOps, building CI/CD pipelines
  • GPU programming experience (CUDA or OpenCL) is a strong plus but not required

NVIDIA designs and manufactures graphics processing units (GPUs) and system-on-a-chip units (SoCs) for various markets, including gaming, professional visualization, data centers, and automotive. Its products include GPUs tailored for gaming and professional use, as well as platforms for artificial intelligence (AI) and high-performance computing (HPC). These products help developers, data scientists, and IT administrators perform complex tasks efficiently. NVIDIA stands out from competitors by combining hardware with software solutions, including cloud-based services such as NVIDIA CloudXR and NGC, which support work in AI, machine learning, and computer vision. The company's goal is to drive innovation through continuous research and development, providing advanced solutions to a diverse clientele that includes gamers, researchers, and enterprises.

Company Size

10,001+

Company Stage

IPO

Headquarters

Santa Clara, California

Founded

1993

Simplify's Take

What believers are saying

  • Acquisition of Augtera Networks boosts NVIDIA's networking capabilities and Spectrum-X portfolio.
  • Growing demand for NVIDIA GPUs in AI acceleration suggests potential market growth.
  • NVIDIA's support for AI-driven robotics solutions opens new market opportunities.

What critics are saying

  • Increased competition from Lambda's AI Cloud Platform challenges NVIDIA's market position.
  • Edge AI technologies may reduce demand for NVIDIA's cloud-based AI solutions.
  • Regulatory challenges in robot delivery services could impact NVIDIA's investment returns.

What makes NVIDIA unique

  • NVIDIA leads in AI and HPC with cutting-edge GPU technology.
  • The company excels in diverse markets: gaming, data centers, and autonomous vehicles.
  • NVIDIA's Omniverse platform enhances industrial AI applications and digital twin technology.

Benefits

Company Equity

401(k) Company Match

Growth & Insights and Company News

Headcount

  • 6 month growth: 0%
  • 1 year growth: 0%
  • 2 year growth: -1%

Data Center Dynamics
Mar 4th, 2025
Nvidia quietly acquires AIOps firm Augtera Networks

GPU giant rolls networking monitoring firm into Spectrum-X portfolio

Business Wire
Feb 21st, 2025
Lambda Raises $480M to Expand AI Cloud Platform

Lambda, the AI Developer Cloud, today announced it has raised a $480 million Series D, bringing the total equity capital raised to date to $863 million.

PR Newswire
Feb 20th, 2025
Together AI Secures $305M Series B Funding

Together AI announced a $305 million Series B funding round led by General Catalyst and Prosperity7, valuing the company at $3.3 billion. The investment will enhance its AI Acceleration Cloud, focusing on open source models and NVIDIA Blackwell GPU deployment. Together AI supports over 450,000 developers and partners with major firms like Salesforce and Zoom. The platform offers enterprise-grade AI solutions with advanced infrastructure and research innovations for improved efficiency and cost-effectiveness.

CoinCentral
Feb 19th, 2025
NVIDIA-Backed Edge AI Startup ClustroAI Raises $12M to Bring AI Processing to Local Devices

San Francisco-based ClustroAI raised $12M in Series A funding to advance its edge AI technology that enables local device AI processing without cloud computing.

Alexa Blockchain
Feb 12th, 2025
GamerBoom Raises $9M with NVIDIA Backing

GamerBoom, an AI-powered gaming data analytics protocol on Solana, raised $9M in a funding round, bringing its total funding to over $11M. Investors include Bing Ventures, SKY Ventures, and NVIDIA. The funding will be used to enhance its AI capabilities and scale AI-driven gaming data solutions for Web3. GamerBoom is part of Binance’s MVB Accelerator Program and plans to launch a rewards program and NFT sales.