Full-Time

Scale-out Engineer

Posted on 8/19/2024

Tenstorrent

Tenstorrent

501-1,000 employees

Builds advanced computers for AI applications

Hardware
Enterprise Software
AI & Machine Learning

Compensation Overview

$100k - $500kAnnually

Junior, Mid

No H1B Sponsorship

Toronto, ON, Canada + 2 more

More locations: Austin, TX, USA | Santa Clara, CA, USA

This role is hybrid, based out of Santa Clara, CA; Austin, TX; or Toronto, ON.

US Citizenship Required

Category
Applied Machine Learning
Deep Learning
AI & Machine Learning
Required Skills
Tensorflow
Pytorch
C/C++
Requirements
  • Bachelor's or Master’s degree in Computer Science, Electrical Engineering, or a related field.
  • Proven experience in low-level software development.
  • Strong proficiency in programming languages such as C / C++.
  • Experience with MPI or similar distributed computing frameworks.
  • Experience with low-level networking libraries (e.g., libfabric, libibverbs).
  • Knowledge of networking protocols, especially Ethernet and InfiniBand.
  • Knowledge of high-performance interconnects.
  • Familiarity with RDMA programming.
  • Familiarity with large-scale deep learning frameworks (e.g., PyTorch, TensorFlow).
  • Familiarity with network offload engines and SmartNICs.
  • Strong communication skills and the ability to work effectively with cross-functional teams.
  • Passion for technology and a commitment to pushing the boundaries of what is possible in AI.
Responsibilities
  • Design, develop, and maintain TT-fabric, a low-level networking library for Tenstorrent AI processors built on top of Ethernet protocol.
  • Design and implement efficient distributed training systems for large-scale deep learning models.
  • Optimize network communication for multi-node AI processor clusters.
  • Tune system performance for inference and training of key AI models.
  • Work in the TT-Metalium team and integrate scale-out APIs into the Programming Model.
  • Work with AI model builder and researchers to improve both the scale out infrastructure and as well as model design.

Tenstorrent builds advanced computers specifically designed for artificial intelligence applications. Their products include high-performance computing systems that utilize specialized hardware and software solutions, leveraging technologies like ASIC design and RISC-V architecture. Unlike many competitors, Tenstorrent focuses on optimizing their systems for AI workloads, which allows them to cater specifically to clients in the AI and computing sectors. The company's goal is to advance the capabilities of AI computing, making it more efficient and powerful for various applications.

Company Stage

Late Stage VC

Total Funding

$1.3B

Headquarters

Toronto, Canada

Founded

2016

Growth & Insights
Headcount

6 month growth

20%

1 year growth

43%

2 year growth

131%
Simplify Jobs

Simplify's Take

What believers are saying

  • The launch of next-generation Wormhole-based developer kits and workstations could attract a significant developer community, driving innovation and adoption.
  • Collaborations with industry giants like Hyundai and Rapidus indicate strong growth potential and access to advanced manufacturing technologies.
  • The introduction of specialized AI inference acceleration boards like the Grayskull e75 and e150 can capture a niche market in AI and machine learning applications.

What critics are saying

  • The competitive landscape in AI hardware is intense, with major players like NVIDIA and Intel posing significant challenges.
  • Dependence on strategic partnerships for advanced manufacturing and technology development could lead to vulnerabilities if these partnerships falter.

What makes Tenstorrent unique

  • Tenstorrent's use of RISC-V architecture in their AI processors offers a unique alternative to traditional x86 and ARM architectures, providing flexibility and open-source benefits.
  • Their focus on high-performance AI chips and scalable developer kits positions them as a key player in the AI hardware market, particularly for developers seeking robust multi-chip solutions.
  • Strategic partnerships with global entities like Rapidus and C-DAC enhance their capabilities in cutting-edge semiconductor technology and edge AI processing.

Help us improve and share your feedback! Did you find this helpful?

INACTIVE