Full-Time

Senior System Reliability Engineer

Confirmed live in the last 24 hours

NVIDIA

NVIDIA

10,001+ employees

Designs GPUs and AI computing solutions

Automotive & Transportation
Enterprise Software
AI & Machine Learning
Gaming

Compensation Overview

$140k - $264.5kAnnually

+ Equity

Senior

Company Historically Provides H1B Sponsorship

Santa Clara, CA, USA

Category
DevOps & Infrastructure
Site Reliability Engineering

You match the following NVIDIA's candidate preferences

Employers are more likely to interview you if you match these preferences:

Degree
Experience
Requirements
  • BS (or equivalent experience) in Engineering, Material Science, Physics, or a related field, MS or PhD preferred.
  • 6+ years in a hardware validation/reliability environment related to PCIE peripherals, graphics cards and servers.
  • Understand power supply, memory, high speed I/O, PCI express, Ethernet and I2C.
  • Hands-on experience in theoretical and practical Reliability concepts as it relates to high-tech electronic enterprise and consumer products.
  • Have a strong command and understanding of statistical concepts/models/analysis and how they relate to product reliability & life analysis.
  • Good verbal and writing skills as well as the ability to communicate at a high level.
  • Self-motivating, independent, and committed to getting things done.
  • Good project management skills and ability to balance multiple simultaneous projects during development and production stages.
Responsibilities
  • Provide expertise in Hardware Reliability Engineering for Electronics/Server Systems (graphics cards, server, rack, cluster) from Concept to End-of-Life phase.
  • Establish, deliver and maintain product reliability standards and metrics for NVIDIA's new system technologies, using existing tools and processes or developing new as required.
  • Participate in product and engineering design reviews, assess the reliability budget of products/designs, and inspire changes that enhance product reliability.
  • Interface and interact with all pertinent engineering groups, suppliers, and partners ensuring the desired reliability is achieved using Design for Reliability (DfR) methods including FMEA and DoE approaches.
  • Define and implement Reliability Plans & Specifications.
  • Provide reliability predictions, along with test plans and methods to access and drive product reliability to the desired levels.
  • Perform and lead appropriate testing with associated failure analysis and recommendations for improving designs and manufacturing.
  • Develop and present methods of correlating reliability test results with actual field performance.
Desired Qualifications
  • MS or PhD preferred.

NVIDIA designs and manufactures graphics processing units (GPUs) and system on a chip units (SoCs) for various markets, including gaming, professional visualization, data centers, and automotive. Their products include GPUs tailored for gaming and professional use, as well as platforms for artificial intelligence (AI) and high-performance computing (HPC). These products help developers, data scientists, and IT administrators perform complex tasks efficiently. NVIDIA stands out from competitors by offering a combination of hardware and software solutions, including cloud-based services like NVIDIA CloudXR and NGC, which enhance user experiences in AI, machine learning, and computer vision. The company's goal is to drive innovation through continuous research and development, providing advanced solutions to a diverse clientele that includes gamers, researchers, and enterprises.

Company Stage

IPO

Total Funding

$19.5M

Headquarters

Santa Clara, California

Founded

1993

Growth & Insights
Headcount

6 month growth

0%

1 year growth

0%

2 year growth

-1%
Simplify Jobs

Simplify's Take

What believers are saying

  • Acquisition of VinBrain enhances NVIDIA's AI-driven healthcare solutions.
  • Investment in Nebius Group boosts NVIDIA's AI infrastructure capabilities.
  • Partnership with Serve Robotics aligns with NVIDIA's focus on robotics and AI applications.

What critics are saying

  • Increased competition from AI startups like xAI challenges NVIDIA's market position.
  • Serve Robotics' rapid expansion may lead to financial strain if market growth lags.
  • Integration challenges from VinBrain acquisition may affect NVIDIA's operational efficiency.

What makes NVIDIA unique

  • NVIDIA leads in AI and HPC solutions with cutting-edge GPU technology.
  • The Omniverse platform enhances NVIDIA's capabilities in industrial AI and digital twins.
  • NVIDIA's cloud services, like CloudXR, offer scalable solutions for AI and machine learning.

Help us improve and share your feedback! Did you find this helpful?

Benefits

Company Equity

401(k) Company Match