Facebook pixel

Deep Learning Performance Analysis Engineer - New College Grad
Posted on 1/22/2022
INACTIVE
Locations
San Jose, CA, USA
Experience Level
Entry
Junior
Mid
Senior
Expert
Desired Skills
CUDA
C/C++/C#
Python
Requirements
  • You are pursuing a PhD or MS or have equivalent in CS, EE or CSEE (or equivalent experience)
  • Background in deep learning and neural networks, in particular training
  • Background in computer architecture, and familiarity with the fundamentals of GPU architecture
  • Experience analyzing and tuning application performance
  • Experience with processor and system-level performance modelling
Responsibilities
  • Understand, analyze, profile, and optimize deep learning training workloads on state-of-the-art hardware and software platforms
  • Understand the big picture of training performance on GPUs, prioritizing and then solving problems across many dozens of state-of-the-art neural networks
  • Implement production-quality software in multiple layers of NVIDIA's deep learning platform stack, from drivers to DL frameworks
  • Implement key DL training workloads in NVIDIA's proprietary processor and system simulators to enable future architecture studies
  • Build tools to automate workload analysis, workload optimization, and other critical workflows
Desired Qualifications
  • Programming skills in C++ and Python. CUDA is a bonus
NVIDIA

10,001+ employees

Designer & manufacturer of computer chips & graphics processors
Company Overview
NVIDIA is on a mission to solve the world's most stimulating technology problems – in industries ranging from gaming to scientific exploration.
Company Values
  • Innovation
  • Speed & Agility
  • Intellectual Honesty
  • Excellence
  • One Team