Full-Time

Neural Network Optimization Engineer

Posted on 11/28/2024

Untether AI

51-200 employees

Enhances AI inference with at-memory computing

Hardware
AI & Machine Learning

Senior, Expert

Toronto, ON, Canada

Hybrid position based in Toronto.

Category
Applied Machine Learning
Deep Learning
AI & Machine Learning
Required Skills
Data Structures & Algorithms
C/C++
Requirements
  • Degree in computer science, engineering, or a related field
  • At least 5 years of software development experience, specifically in AI
  • Knowledge of basic neural network operator algorithms: convolutions, transformers, RNNs
  • Experience with end-to-end software development, including data structures, algorithms, and software architecture patterns
  • Experience working in cross-functional teams
  • Strong bias for teamwork and effective problem solving
  • Experience tuning and optimizing code and neural networks for high performance
  • Strong communication skills and a background in program/project management
  • Superior problem-solving skills, both technical and interpersonal
Responsibilities
  • Use and advance our products in the development and delivery of leading-edge customer neural networks and applications
  • Understand and evaluate neural network performance at the chip, system, and application levels through performance profiling and benchmarking
  • Identify and diagnose challenges in using our products for specific use cases, and define and develop projects to resolve them
  • Engage with the Customer Networks Lead on any escalations to the software and hardware teams
  • Actively engage with customers to ensure they feel supported by Untether AI after sale and delivery of the product
  • Partner with the Products team to ensure they are up to date on client experiences and their feedback so that interactions with customers appear seamless
  • Manage program/project teams to deliver Neural Network Model Garden models, including the identification, design, and implementation of flexible low-level C++ programs (kernels) for various neural network operations
  • When required, develop complex kernels and compiler strategies to address network requirements
  • Communicate performance optimization ideas both to compiler/kernel engineers and to architects working on future product generations

Untether AI enhances the speed and efficiency of AI inference workloads using at-memory computing. This method places the compute element next to memory cells, which boosts compute density and accelerates AI inference for various neural networks, such as those used in vision, natural language processing, and recommendation systems. The company targets businesses that rely on AI technologies and need high-performance computing for inference tasks. Their products include the runAI200® devices and tsunAImi® accelerator cards, which are designed to deliver exceptional performance, with the tsunAImi® card providing over 2 PetaOps. This allows businesses to optimize their AI workloads while maintaining a compact PCI-Express form factor. Untether AI's goal is to provide efficient and cost-effective solutions for companies looking to improve their AI application performance.

Company Stage

Series B

Total Funding

$144.6M

Headquarters

Toronto, Canada

Founded

2018

Growth & Insights
Headcount

6 month growth

10%

1 year growth

20%

2 year growth

23%
Simplify's Take

What believers are saying

  • Untether AI's recent $20 million funding round and total funding of over $200 million CAD indicate strong financial backing and growth potential.
  • The appointment of industry veterans like Chris Walker as CEO and Renxin Xia as VP of Hardware Engineering brings experienced leadership to drive innovation and market expansion.
  • The release of the imAIgine SDK version 22.12 enhances developer velocity, making it easier and faster to deploy neural networks on Untether AI's hardware.

What critics are saying

  • The highly competitive AI hardware market requires continuous innovation to maintain a technological edge.
  • Dependence on hardware sales could be a vulnerability if market demand shifts or new, more efficient technologies emerge.

What makes Untether AI unique

  • Untether AI's at-memory computing architecture significantly enhances compute density and efficiency, setting it apart from traditional AI inference solutions.
  • The company's tsunAImi® accelerator cards deliver over 2 PetaOps per card, offering unparalleled performance in a PCI-Express form factor.
  • Collaborations with industry giants like Arm and J-Squared Technologies highlight Untether AI's commitment to integrating cutting-edge technology into diverse applications, from automotive to defense.
