Full-Time

Software Engineer

Model Training Infrastructure

Updated on 1/16/2025

Anyscale

201-500 employees

Platform for scaling AI workloads

Enterprise Software
AI & Machine Learning

Compensation Overview

$170.1k - $237k Annually

Senior

San Francisco, CA, USA

Hybrid position based in San Francisco, CA.

Category
Backend Engineering
FinTech Engineering
Software Engineering
Required Skills
TensorFlow
Data Structures & Algorithms
PyTorch
Requirements
  • 5+ years of experience building, scaling, and maintaining software systems in production environments
  • Strong fundamentals in algorithms, data structures, and system design
  • Proficiency with machine learning frameworks and libraries (e.g., PyTorch, TensorFlow, XGBoost)
  • Experience designing fault-tolerant distributed systems
  • Solid architectural skills
Responsibilities
  • Develop scalable, fault-tolerant distributed machine learning libraries that power leading ML platforms
  • Create an exceptional end-to-end experience for training machine learning models
  • Solve complex architectural challenges and transform them into practical solutions
  • Contribute to and engage with the open-source community, collaborating with ML researchers, engineers, and data scientists to build new scalable machine learning abstractions
  • Share your work and expertise with a broader audience through talks, tutorials, and blog posts
  • Collaborate with a team of experts in distributed systems and machine learning
  • Work directly with end-users to iterate on and enhance the product based on their feedback
  • Partner with engineering and product managers to nurture a talented team of software engineers
  • Play a key role in building and shaping a world-class company
Desired Qualifications
  • Experience with cloud technologies (AWS, GCP, Kubernetes)
  • Hands-on experience building ML training platforms in production
  • Background in managing and maintaining open-source libraries
  • Experience leading small teams to achieve ambitious technical goals
  • Familiarity with Ray

Anyscale provides a platform designed to scale and productionize artificial intelligence (AI) and machine learning (ML) workloads. Its main product, Ray, is an open-source framework that helps users manage and enhance AI applications across various fields, including Generative AI, Large Language Models (LLMs), and computer vision. Companies, including major players like OpenAI and Ant Group, use Ray to train large models and improve the performance and reliability of their ML systems. Anyscale's platform has been reported to significantly enhance scalability, reduce latency, and improve cost-efficiency for large workloads, with some clients reporting improvements of over 90%. The company operates on a software-as-a-service (SaaS) model in which clients subscribe to access Ray and its features, providing a consistent revenue stream. Anyscale's goal is to empower organizations to scale their AI workloads effectively and optimize their operational efficiency.

Company Stage

Series C

Total Funding

$252.5M

Headquarters

San Francisco, California

Founded

2019

Growth & Insights
Headcount

6 month growth

5%

1 year growth

-13%

2 year growth

-22%

Simplify's Take

What believers are saying

  • Anyscale's $100M Series C funding indicates strong investor confidence and growth potential.
  • Partnership with Nvidia enhances performance and cost-efficiency for AI deployments.
  • Anyscale Endpoints offers 10X cost-efficiency for popular open-source LLMs.

What critics are saying

  • ShadowRay vulnerability in Ray framework poses significant security risk with no patch.
  • OctoML's OctoAI service increases competition in AI infrastructure market.
  • Dependency on Nvidia's technology could be risky if Nvidia faces issues.

What makes Anyscale unique

  • Anyscale's Ray framework scales AI applications from laptops to cloud seamlessly.
  • Ray is widely used in Generative AI, LLMs, and computer vision fields.
  • Anyscale's SaaS model provides recurring revenue through subscription fees for Ray platform.

Benefits

Medical, Dental, and Vision insurance

401(k) retirement savings

Flexible time off

FSA and Commuter benefits

Parental and family leave

Office & phone plan reimbursement