Full-Time

Senior System Reliability Engineer

Posted on 1/7/2025

NVIDIA

NVIDIA

10,001+ employees

Designs GPUs and AI computing solutions

Compensation Overview

$140k - $264.5kAnnually

+ Equity

Senior

Company Historically Provides H1B Sponsorship

Santa Clara, CA, USA

Category
DevOps & Infrastructure
Site Reliability Engineering
Requirements
  • BS (or equivalent experience) in Engineering, Material Science, Physics, or a related field, MS or PhD preferred.
  • 6+ years in a hardware validation/reliability environment related to PCIE peripherals, graphics cards and servers.
  • Understand power supply, memory, high speed I/O, PCI express, Ethernet and I2C.
  • Hands-on experience in theoretical and practical Reliability concepts as it relates to high-tech electronic enterprise and consumer products.
  • Have a strong command and understanding of statistical concepts/models/analysis and how they relate to product reliability & life analysis.
  • Good verbal and writing skills as well as the ability to communicate at a high level.
  • Self-motivating, independent, and committed to getting things done.
  • Good project management skills and ability to balance multiple simultaneous projects during development and production stages.
Responsibilities
  • Provide expertise in Hardware Reliability Engineering for Electronics/Server Systems (graphics cards, server, rack, cluster) from Concept to End-of-Life phase.
  • Establish, deliver and maintain product reliability standards and metrics for NVIDIA's new system technologies, using existing tools and processes or developing new as required.
  • Participate in product and engineering design reviews, assess the reliability budget of products/designs, and inspire changes that enhance product reliability.
  • Interface and interact with all pertinent engineering groups, suppliers, and partners ensuring the desired reliability is achieved using Design for Reliability (DfR) methods including FMEA and DoE approaches.
  • Define and implement Reliability Plans & Specifications.
  • Provide reliability predictions, along with test plans and methods to access and drive product reliability to the desired levels.
  • Perform and lead appropriate testing with associated failure analysis and recommendations for improving designs and manufacturing.
  • Develop and present methods of correlating reliability test results with actual field performance.
Desired Qualifications
  • MS or PhD preferred.

NVIDIA designs and manufactures graphics processing units (GPUs) and system on a chip units (SoCs) for various markets, including gaming, professional visualization, data centers, and automotive. Their products include GPUs tailored for gaming and professional use, as well as platforms for artificial intelligence (AI) and high-performance computing (HPC) that cater to developers, data scientists, and IT administrators. NVIDIA generates revenue through the sale of hardware, software solutions, and cloud-based services, such as NVIDIA CloudXR and NGC, which enhance experiences in AI, machine learning, and computer vision. What sets NVIDIA apart from competitors is its strong focus on research and development, allowing it to maintain a leadership position in a highly competitive market. The company's goal is to drive innovation and provide advanced solutions that meet the needs of a diverse clientele, including gamers, researchers, and enterprises.

Company Size

10,001+

Company Stage

IPO

Headquarters

Santa Clara, California

Founded

1993

Simplify Jobs

Simplify's Take

What believers are saying

  • Acquisition of Augtera Networks boosts NVIDIA's networking capabilities and Spectrum-X portfolio.
  • Growing demand for NVIDIA GPUs in AI acceleration suggests potential market growth.
  • NVIDIA's support for AI-driven robotics solutions opens new market opportunities.

What critics are saying

  • Increased competition from Lambda's AI Cloud Platform challenges NVIDIA's market position.
  • Edge AI technologies may reduce demand for NVIDIA's cloud-based AI solutions.
  • Regulatory challenges in robot delivery services could impact NVIDIA's investment returns.

What makes NVIDIA unique

  • NVIDIA leads in AI and HPC with cutting-edge GPU technology.
  • The company excels in diverse markets: gaming, data centers, and autonomous vehicles.
  • NVIDIA's Omniverse platform enhances industrial AI applications and digital twin technology.

Help us improve and share your feedback! Did you find this helpful?

Benefits

Company Equity

401(k) Company Match

Growth & Insights and Company News

Headcount

6 month growth

0%

1 year growth

0%

2 year growth

-1%
Data Center Dynamics
Mar 4th, 2025
Nvidia quietly acquires AIOps firm Augtera Networks

GPU giant rolls networking monitoring firm into Spectrum-X portfolio

Business Wire
Feb 21st, 2025
Lambda Raises $480M to Expand AI Cloud Platform

Lambda, the AI Developer Cloud, today announced it has raised a $480 million Series D, bringing the total equity capital raised to date to $863 millio

PR Newswire
Feb 20th, 2025
Together AI Secures $305M Series B Funding

Together AI announced a $305 million Series B funding round led by General Catalyst and Prosperity7, valuing the company at $3.3 billion. The investment will enhance its AI Acceleration Cloud, focusing on open source models and NVIDIA Blackwell GPU deployment. Together AI supports over 450,000 developers and partners with major firms like Salesforce and Zoom. The platform offers enterprise-grade AI solutions with advanced infrastructure and research innovations for improved efficiency and cost-effectiveness.

CoinCentral
Feb 19th, 2025
NVIDIA-Backed Edge AI Startup ClustroAI Raises $12M to Bring AI Processing to Local Devices - CoinCentral

San Francisco-based ClustroAI raised $12M in Series A funding to advance its edge AI technology that enables local device AI processing without cloud computing

Alexa Blockchain
Feb 12th, 2025
GamerBoom Raises $9M with NVIDIA Backing

GamerBoom, an AI-powered gaming data analytics protocol on Solana, raised $9M in a funding round, totaling over $11M. Investors include Bing Ventures, SKY Ventures, and NVIDIA, enhancing its AI capabilities. The funding will scale AI-driven gaming data solutions for Web3. GamerBoom is part of Binance’s MVB Accelerator Program and plans to launch a rewards program and NFT sales.

INACTIVE