Principal Workload Performance Architect
Posted on 12/30/2022
INACTIVE
Locations
Santa Clara, CA, USA
Experience Level
Entry
Junior
Mid
Senior
Expert
Desired Skills
Data Analysis
Data Structures & Algorithms
C/C++/C#
Requirements
  • BS/MS/PhD in EE/ECE/CE/CS
  • Strong background in CPU ISA, u-architecture research, and performance benchmarks
  • Familiar with program tracing flows (SIMPOINT, SMART,..) to capture traces for applications
  • Strong understanding of ML/AI algorithms, GCC and LLVM compilers, and OS kernel
  • Proficient in C/C++ programming. Experience in the development of highly efficient C/C++ performance models
Responsibilities
  • Collaborate with the software and platform architecture teams to understand hardware requirements for AI accelerator compiler, OS, video/image/voice processing, security, networking, and virtualization technology. Identify the application performance bottlenecks and functional requirements
  • Perform full-stack workload characterization and performance analysis for AI, HPC, and CPU general-purpose applications. Identify representative benchmarks for the workloads. Perform data-driven analysis based on software profiling, performance model simulation, or analytical models to evaluate software and architecture solutions to PPA
  • Set CPU architecture direction based on the data analysis and work with a cross-functional team to achieve the best hardware/software solutions to meet PPA goals
  • Characterizing real-world workloads, conducting end-to-end system performance analysis and workload decomposition to gather requirements for SoC solutions. Generate representative CPU, accelerators, and SoC traces for the performance model to study PPA impacts and guide architecture decisions
  • Work with Tenstorrent's graph compiler team and LLVM/GCC open source community to drive AI/CPU performance improvements. Identify the compiler optimization and align architecture and the compiler teams for implementing the improvements
  • Drive analysis and correlation of performance feature both pre and post-silicon
Desired Qualifications
  • Understanding SOC fabric, coherency protocols, memory technology, and accelerator technology is a plus
Tenstorrent

51-200 employees

Computer processor architecture manufacturer
Company Overview
Tenstorrent is on a mission to address the rapidly growing compute demands for software 2.0. The company designs processors that are optimized for neural network inference, training and can also execute other types of parallel computation.
Company Core Values
  • Collaboration
  • Curiosity
  • Commitment to solving hard problems