Simplify Logo

Full-Time

Distributed ML Systems Engineer-Inference

Confirmed live in the last 24 hours

Together AI

Together AI

51-200 employees

Enterprise Software
AI & Machine Learning

Compensation Overview

$160k - $230kAnnually

+ Equity + Benefits

Mid, Senior

San Francisco, CA, USA

Category
Applied Machine Learning
AI Research
AI & Machine Learning
Required Skills
Microsoft Azure
Python
AWS
Google Cloud Platform
Requirements
  • 3+ years of experience in building large-scale, fault-tolerant, high-performance distributed systems.
  • Strong programming skills in one or more of Python, Go, Rust, or C/C++.
  • Excellent understanding of low-level operating systems concepts including multi-threading, memory management, networking, and storage, performance, and scale.
  • Experience with cloud computing platforms (AWS, GCP, Azure etc.) and large-scale infrastructure.
  • Strong problem-solving skills and ability to work in a fast-paced environment.
Responsibilities
  • Design and build large-scale, distributed machine learning systems that are fault-tolerant and high-performance.
  • Develop and optimize distributed processing frameworks and storage systems.
  • Collaborate with researchers, engineers, and product managers to integrate ML systems into our infrastructure.
  • Conduct architecture and design reviews to ensure best practices in system design.
  • Implement robust monitoring and logging systems to ensure the health and performance of our ML systems.

Company Stage

Series A

Total Funding

$222.3M

Headquarters

Menlo Park, California

Founded

N/A

Growth & Insights
Headcount

6 month growth

74%

1 year growth

135%

2 year growth

683%
Simplify Jobs

Simplify's Take

What believers are saying

  • The $106M funding round led by Salesforce Ventures provides significant capital for growth and innovation.
  • Hiring top talent, such as the head of sales operations from Coinbase, strengthens the company's leadership team.
  • The release of the biological foundational model Evo opens new avenues in biotech, potentially revolutionizing DNA, RNA, and protein sequence analysis.

What critics are saying

  • The competitive landscape in AI hardware optimization is intense, with new entrants like Groq posing potential threats.
  • Dependence on Nvidia's GPUs could be a vulnerability if supply chain issues or technological shifts occur.

What makes Together AI unique

  • Together AI's collaboration with top-tier institutions like Meta, Nvidia, and Princeton University on FlashAttention-3 showcases its cutting-edge research capabilities.
  • The company's focus on optimizing LLMs for Nvidia Hopper GPUs positions it uniquely in the AI hardware optimization space.
  • Together AI's valuation of $1.25 billion and backing from industry giants like Salesforce Ventures and Nvidia highlight its strong market position and investor confidence.