Full-Time

Senior Site Reliability Engineer

Infrastructure

Updated on 3/14/2025

NVIDIA

NVIDIA

10,001+ employees

Designs GPUs and AI computing solutions

Compensation Overview

$148k - $287.5kAnnually

+ Equity

Senior

Company Historically Provides H1B Sponsorship

Austin, TX, USA + 3 more

More locations: Santa Clara, CA, USA | Durham, NC, USA | Westford, MA, USA

Category
DevOps & Infrastructure
Site Reliability Engineering
Required Skills
Python
Jenkins
Ansible
Linux/Unix
Data Analysis
Requirements
  • Experience with automation workflows such as Ansible and Jenkins.
  • UNIX Systems programming and automation using industry standard languages and familiar with API calls. Python experience preferred.
  • Authoritative level usage of UNIX and UNIX CLI utilities such as sed, awk, grep.
  • Hands on experience with architectural decisions in technologies (storage, networking, compute) our chip engineers depend on.
  • Understanding of distributed UNIX system concepts such as NFS, autofs, DNS, LDAP and/or NIS.
  • Excellent planning and communication skills and a passion for improving the productivity and efficiency of other specialists.
  • Strong experience investigating and debugging complex, multi-discipline problems in a UNIX environment.
  • 5+ years experience in a large, distributed UNIX environment.
  • History of using data analysis principles and influencing data-driven decisions.
  • MS (preferred) or BS in Computer Science, similar degree or equivalent experience.
Responsibilities
  • Develop automation in order to scale infrastructure easily and reliably.
  • Use broad IT infrastructure skills to implement infrastructure innovations which accelerate chip development.
  • Design and implement network architecture, storage solutions, virtualization, and services specific to EDA workflows.
  • Work closely with EDA teams to understand their requirements and translate them into infrastructure solutions.
  • Work in a diverse team performing fast paced investigations to empower engineers to develop at the speed of light.
  • Collaborate to improve how our chip development process utilizes our infrastructure.
  • Directly contribute to the overall quality and improve time to market for our next generation chips.
Desired Qualifications
  • Extensive knowledge with job schedulers (in particular IBM Spectrum LSF and/or SLURM).
  • Experience with perl.
  • Deep understanding of distributed system principles.
  • Experience with chip design workflows, such as front end verification, back end workflows, or mixed signal workflows.
  • Experience in crafting solutions that balance security and productivity for the end user.

NVIDIA designs and manufactures graphics processing units (GPUs) and system on a chip units (SoCs) for various markets, including gaming, professional visualization, data centers, and automotive. Their main products are GPUs that enhance gaming experiences and support professional applications, along with AI and high-performance computing platforms for developers and data scientists. NVIDIA differentiates itself from competitors by focusing on advanced technology and continuous innovation, ensuring their products meet the evolving needs of users. The company's goal is to lead in AI and HPC solutions, providing powerful hardware and software that enable immersive experiences and drive advancements in fields like machine learning and computer vision.

Company Size

10,001+

Company Stage

IPO

Headquarters

Santa Clara, California

Founded

1993

Simplify Jobs

Simplify's Take

What believers are saying

  • Acquisition of Augtera Networks boosts NVIDIA's networking capabilities and Spectrum-X portfolio.
  • Growing demand for NVIDIA GPUs in AI acceleration suggests potential market growth.
  • NVIDIA's support for AI-driven robotics solutions opens new market opportunities.

What critics are saying

  • Increased competition from Lambda's AI Cloud Platform challenges NVIDIA's market position.
  • Edge AI technologies may reduce demand for NVIDIA's cloud-based AI solutions.
  • Regulatory challenges in robot delivery services could impact NVIDIA's investment returns.

What makes NVIDIA unique

  • NVIDIA leads in AI and HPC with cutting-edge GPU technology.
  • The company excels in diverse markets: gaming, data centers, and autonomous vehicles.
  • NVIDIA's Omniverse platform enhances industrial AI applications and digital twin technology.

Help us improve and share your feedback! Did you find this helpful?

Benefits

Company Equity

401(k) Company Match

Growth & Insights and Company News

Headcount

6 month growth

0%

1 year growth

0%

2 year growth

-1%
Data Center Dynamics
Mar 4th, 2025
Nvidia quietly acquires AIOps firm Augtera Networks

GPU giant rolls networking monitoring firm into Spectrum-X portfolio

Business Wire
Feb 21st, 2025
Lambda Raises $480M to Expand AI Cloud Platform

Lambda, the AI Developer Cloud, today announced it has raised a $480 million Series D, bringing the total equity capital raised to date to $863 millio

PR Newswire
Feb 20th, 2025
Together AI Secures $305M Series B Funding

Together AI announced a $305 million Series B funding round led by General Catalyst and Prosperity7, valuing the company at $3.3 billion. The investment will enhance its AI Acceleration Cloud, focusing on open source models and NVIDIA Blackwell GPU deployment. Together AI supports over 450,000 developers and partners with major firms like Salesforce and Zoom. The platform offers enterprise-grade AI solutions with advanced infrastructure and research innovations for improved efficiency and cost-effectiveness.

CoinCentral
Feb 19th, 2025
NVIDIA-Backed Edge AI Startup ClustroAI Raises $12M to Bring AI Processing to Local Devices - CoinCentral

San Francisco-based ClustroAI raised $12M in Series A funding to advance its edge AI technology that enables local device AI processing without cloud computing

Alexa Blockchain
Feb 12th, 2025
GamerBoom Raises $9M with NVIDIA Backing

GamerBoom, an AI-powered gaming data analytics protocol on Solana, raised $9M in a funding round, totaling over $11M. Investors include Bing Ventures, SKY Ventures, and NVIDIA, enhancing its AI capabilities. The funding will scale AI-driven gaming data solutions for Web3. GamerBoom is part of Binance’s MVB Accelerator Program and plans to launch a rewards program and NFT sales.