Full-Time

Inference ML Engineer

Confirmed live in the last 24 hours

Cerebras

Cerebras

201-500 employees

Develops AI accelerators for efficient computing

No salary listed

Senior

Toronto, ON, Canada

Category
Applied Machine Learning
Deep Learning
AI & Machine Learning
Required Skills
Python
Tensorflow
Pytorch
Machine Learning
C/C++
Requirements
  • Bachelor’s, Master’s, or PhD in Computer Science, Computer Engineering, Mathematics, or a related field.
  • 5+ years of experience in large-scale software engineering, with a focus on deep learning or related domains.
  • Proficiency in Python for building and maintaining scalable systems.
  • Advanced proficiency in C++, with an emphasis on multi-threaded programming, performance optimization, and system-level development.
  • Hands-on experience with ML frameworks such as PyTorch, TensorFlow, or JAX, and a strong understanding of their underlying architectures.
  • Solid understanding of software architectural patterns for large-scale, high-performance applications.
  • Proven experience leading and mentoring software or machine learning engineers.
  • In-depth knowledge of machine learning algorithms, theory, and best practices for developing production-ready software.
  • Strong problem-solving skills, with the ability to balance technical depth with practical implementation constraints.
  • Exceptional communication and presentation skills, with the ability to work both independently and collaboratively across multidisciplinary teams.
Responsibilities
  • Lead and provide technical guidance to a team of machine learning engineers working on complex machine learning integration projects.
  • Design and implement scalable and efficient integrations with popular machine learning frameworks, such as PyTorch, while ensuring compatibility with future frameworks.
  • Analyze the characteristics of various ML models to make informed design decisions for scalable, intuitive, and user-friendly APIs.
  • Optimize software to accelerate ML model training and ensure high throughput and low latency during inference.
  • Stay up-to-date with advancements in machine learning and deep learning, and apply state-of-the-art techniques to enhance our solutions.
  • Evaluate trade-offs between different approaches, clearly articulate design choices, and develop detailed proposals for implementing new features.
  • Build and maintain robust automated test suites to ensure software quality, performance, and reliability.
  • Contribute to an agile team environment by delivering high-quality software and adhering to agile development practices.
  • Collaborate with cross-functional teams, including compiler engineers, kernel developers, and system architects, to integrate ML capabilities seamlessly into our products and services.

Cerebras Systems specializes in accelerating artificial intelligence (AI) through its CS-2 system, which is recognized as the fastest AI accelerator available. This system is designed to replace traditional clusters of graphics processing units (GPUs) used in AI computations, simplifying the process by eliminating the need for complex parallel programming and cluster management. Cerebras serves a variety of clients, including major pharmaceutical companies and government research labs, providing them with faster results for critical applications like cancer drug response predictions. The company operates in the high-performance computing and AI markets, generating revenue by selling its proprietary hardware and software solutions, including the CS-2 system and related cloud services. Cerebras aims to enhance the efficiency of AI research and development, enabling clients to achieve quicker results and lower costs.

Company Size

201-500

Company Stage

Series F

Total Funding

$720M

Headquarters

Sunnyvale, California

Founded

2016

Simplify Jobs

Simplify's Take

What believers are saying

  • Growing AI model efficiency demand aligns with Cerebras' energy-efficient accelerators.
  • AI democratization increases need for user-friendly systems like Cerebras' CS-2.
  • Pharmaceutical industry's push for faster drug discovery boosts demand for Cerebras' technology.

What critics are saying

  • Competition from NVIDIA and Graphcore could impact Cerebras' market share.
  • Rapid AI model evolution may necessitate frequent hardware updates, increasing R&D costs.
  • Supply chain vulnerabilities could delay production of Cerebras' hardware.

What makes Cerebras unique

  • Cerebras' Wafer-Scale Engine is the largest chip ever built for AI.
  • The CS-2 system replaces traditional GPU clusters, simplifying AI computations.
  • Cerebras serves diverse industries, including pharmaceuticals and government research labs.

Help us improve and share your feedback! Did you find this helpful?

Benefits

Professional Development Budget

Flexible Work Hours

Remote Work Options

401(k) Company Match

401(k) Retirement Plan

Mental Health Support

Wellness Program

Paid Sick Leave

Paid Holidays

Paid Vacation

Parental Leave

Family Planning Benefits

Fertility Treatment Support

Adoption Assistance

Childcare Support

Elder Care Support

Pet Insurance

Bereavement Leave

Employee Discounts

Company Social Events