Full-Time

Senior Solutions Architect

HPC and AI

Confirmed live in the last 24 hours

NVIDIA

NVIDIA

10,001+ employees

Designs GPUs and AI computing solutions

Automotive & Transportation
Enterprise Software
AI & Machine Learning
Gaming

Compensation Overview

$148k - $235.8kAnnually

+ Equity

Senior

Santa Clara, CA, USA

Category
Solution Engineering
Sales & Solution Engineering
Required Skills
Kubernetes
Docker
Development Operations (DevOps)
Linux/Unix
Requirements
  • BS, MS, or PhD in Computer Science, Electrical/Computer Engineering, Physics, Mathematics, or other Engineering fields or equivalent experience.
  • 5+ years of work-related experience in NVIDIA and/or accelerated computing technologies.
  • Platform level understanding of server architecture, PCIe topology, GPUs, NICs, Linux OS and kernel drivers.
  • Networking experience, including knowledge of Ethernet, InfiniBand or other networking protocols.
  • Experience working with DevOps on-prem or in cloud environments, including but not limited to Docker/Containers, cloud APIs, IaaS and Data Center deployments.
  • SLURM, Kubernetes, and/or other job scheduler use, deployment, and debugging skills.
  • Deep understanding of dense data center design, including computing, storage, networking, cloud APIs, and IaaS.
  • Effective time management and capable of balancing multiple tasks.
  • Strong analytical and problem-solving skills.
  • Strong communication skills, both written and verbal, with the ability to collaborate and coordinate efficiently across multi-functional teams in engineering, sales, marketing, product, and program management.
Responsibilities
  • Validate and debug customer cluster performance issues, functional bottlenecks and drive customer technical engagements around NVIDIA products and technologies.
  • Stay up to date on pioneering High Performance Computing, Deep Learning and Machine Learning ecosystems.
  • Help architect and scale high-performance, distributed AI infrastructure on-prem or in the cloud built with the latest NVIDIA GPU supercomputers for new and existing customers.
  • Address and resolve problems starting from the bare metal level, all the way up to the operating system, software stack, and application level.
  • Share knowledge with different teams by delivering demos, assisting with proof-of-concepts, and writing papers and developer blogs.
  • Work directly with developers and hardware architects to debug cluster performance issues, identify new requirements, and improve workflows.
  • Engage with the account team when extra analysis is required in debugging customer issues.
  • Provide additional expertise to enable the account team to be more adaptable to the customer and product engineering to get more actionable data at speed of light making them more efficient.
  • Build custom product demonstrations and POCs for solutions that address critical business needs of our customers.
Desired Qualifications
  • Demonstrated Communication Collectives (NCCL) experience.
  • Excellent customer-facing skills and background.
  • Platform design engineering, coding and proficient debugging skills including experience in C/C++, Linux kernel, virtualization and drivers, profilers/performance analysis tools (NSys).
  • Familiarity with NVIDIA systems/SDKs (e.g. CUDA), NVIDIA Networking technologies (e.g., RoCE, InfiniBand), Switch interconnects and/or ARM CPU solutions through hands-on experience.
  • Understanding of Deep Learning and Machine Learning frameworks (TensorFlow or PyTorch), LLM, MLOps, DevOps, and workflows applying cloud technologies, using Docker/containers, Kubernetes, cloud APIs, and data center deployments, among others.

NVIDIA designs and manufactures graphics processing units (GPUs) and system on a chip units (SoCs) for various markets, including gaming, professional visualization, data centers, and automotive. Their main products are GPUs that enhance gaming experiences and support professional applications, along with AI and high-performance computing platforms tailored for developers and data scientists. NVIDIA differentiates itself from competitors by focusing on advanced technology and continuous innovation, ensuring their products meet the evolving needs of users. The company's goal is to lead in AI and HPC solutions, providing powerful tools and services that enable immersive experiences and drive advancements in multiple industries.

Company Stage

IPO

Total Funding

$19.5M

Headquarters

Santa Clara, California

Founded

1993

Growth & Insights
Headcount

6 month growth

2%

1 year growth

0%

2 year growth

0%
Simplify Jobs

Simplify's Take

What believers are saying

  • Acquisition of VinBrain enhances NVIDIA's AI capabilities in the healthcare sector.
  • Investment in Nebius Group boosts NVIDIA's AI infrastructure and cloud platform offerings.
  • Serve Robotics' expansion, backed by NVIDIA, highlights growth in autonomous delivery services.

What critics are saying

  • Increased competition from AI startups like xAI could challenge NVIDIA's market position.
  • Serve Robotics' expansion may divert resources from NVIDIA's core GPU and AI businesses.
  • Integration of VinBrain may pose challenges and distract from NVIDIA's primary operations.

What makes NVIDIA unique

  • NVIDIA leads in AI and HPC solutions with cutting-edge GPU technology.
  • The company excels in diverse markets, including gaming, data centers, and autonomous vehicles.
  • NVIDIA's cloud services, like CloudXR, offer scalable solutions for AI and machine learning.

Help us improve and share your feedback! Did you find this helpful?

Benefits

Company Equity

401(k) Company Match