Full-Time

Generative AI Inference Solutions Architect

Confirmed live in the last 24 hours

Cerebras

Cerebras

201-500 employees

Develops AI accelerators for efficient computing

Data & Analytics
Enterprise Software
AI & Machine Learning

Senior

Remote in USA

Category
AI & Machine Learning
Solution Engineering
Sales & Solution Engineering
Required Skills
Python
Product Management
Requirements
  • Bachelor’s or Master’s degree in Computer Science, Electrical Engineering, or a related field.
  • 5+ years in customer-facing engineering roles.
  • Strong understanding of Generative AI model architecture, inference optimization, enterprise infrastructure and deployment challenges.
  • Experience with specialized AI accelerators.
  • Solid programming skills in Python and familiarity with distributed computing.
  • Exceptional communication skills with the ability to explain complex technical concepts to both technical and non-technical audiences.
  • Ability to work collaboratively in a fast-paced environment and adapt to changing customer needs.
  • Ability to manage complex technical projects and deliver solutions tailored to customer needs.
  • Strong interpersonal and communication skills, effective in collaborative and fast-paced team settings.
Responsibilities
  • Lead the technical aspects of the sales process
  • Join sales calls to present technical aspects of Cerebras Inference solution, addressing customer questions and demonstrating our value proposition. Provide in-depth explanations of our product features, focusing on performance benefits, scalability, and optimizations that our specialized hardware enables.
  • Understand and gather customer requirements.
  • Design, scope and drive demos, trials and PoCs
  • Design demos to showcase key advantages of our unique product
  • Scope and drive customer trials and proof-of-concept projects, define success metrics, oversee execution and ensure a smooth experience customer satisfaction.
  • Own end-to-end delivery of the solution, provide technical guidance during deployment and post-sales support
  • Work closely with customers to design deployment solutions tailored to their needs
  • Drive end-to-end delivery of the solution from the technical side
  • Build and maintain strong customer relationships to become their go-to technical expert
  • Provide feedback to the internal product and engineering teams
  • Collaborate with internal teams, including R&D and product management, to communicate customer feedback and drive future product improvements.

Cerebras Systems accelerates artificial intelligence (AI) processes with its CS-2 system, which replaces traditional clusters of graphics processing units (GPUs). This system simplifies AI tasks by removing the complexities of parallel programming and cluster management, allowing for faster results in critical applications like cancer drug response prediction. Cerebras serves clients across various industries, including pharmaceuticals and government research labs, and generates revenue through the sale of its hardware and software solutions. The company's goal is to enhance the speed and efficiency of AI training and inference, reducing costs in AI research and development.

Company Stage

Series F

Total Funding

$700.4M

Headquarters

Sunnyvale, California

Founded

2016

Growth & Insights
Headcount

6 month growth

0%

1 year growth

-5%

2 year growth

-10%
Simplify Jobs

Simplify's Take

What believers are saying

  • Growing AI model efficiency demand aligns with Cerebras' energy-efficient accelerators.
  • AI democratization increases need for user-friendly systems like Cerebras' CS-2.
  • Pharmaceutical industry's push for faster drug discovery boosts demand for Cerebras' technology.

What critics are saying

  • Competition from NVIDIA and Graphcore could impact Cerebras' market share.
  • Rapid AI model evolution may necessitate frequent hardware updates, increasing R&D costs.
  • Supply chain vulnerabilities could delay production of Cerebras' hardware.

What makes Cerebras unique

  • Cerebras' Wafer-Scale Engine is the largest chip ever built for AI.
  • The CS-2 system replaces traditional GPU clusters, simplifying AI computations.
  • Cerebras serves diverse industries, including pharmaceuticals and government research labs.

Help us improve and share your feedback! Did you find this helpful?

Benefits

Professional Development Budget

Flexible Work Hours

Remote Work Options

401(k) Company Match

401(k) Retirement Plan

Mental Health Support

Wellness Program

Paid Sick Leave

Paid Holidays

Paid Vacation

Parental Leave

Family Planning Benefits

Fertility Treatment Support

Adoption Assistance

Childcare Support

Elder Care Support

Pet Insurance

Bereavement Leave

Employee Discounts

Company Social Events