Full-Time

Generative AI Inference Solutions Architect

Confirmed live in the last 24 hours

Cerebras

Cerebras

201-500 employees

Develops AI acceleration hardware and software

No salary listed

Senior

Remote in India + 10 more

More locations: Remote in Germany | Remote in USA | Remote in United Arab Emirates | Remote in UK | Remote in Ireland | Remote in Australia | Remote in Spain | Remote in Canada | Remote in Italy | Remote in France

Category
AI & Machine Learning
Sales Engineering
Sales & Solution Engineering
Required Skills
Python
Product Management
Requirements
  • Bachelor’s or Master’s degree in Computer Science, Electrical Engineering, or a related field.
  • 5+ years in customer-facing engineering roles.
  • Strong understanding of Generative AI model architecture, inference optimization, enterprise infrastructure and deployment challenges.
  • Experience with specialized AI accelerators.
  • Solid programming skills in Python and familiarity with distributed computing.
  • Exceptional communication skills with the ability to explain complex technical concepts to both technical and non-technical audiences.
  • Ability to work collaboratively in a fast-paced environment and adapt to changing customer needs.
  • Ability to manage complex technical projects and deliver solutions tailored to customer needs.
  • Strong interpersonal and communication skills, effective in collaborative and fast-paced team settings.
Responsibilities
  • Lead the technical aspects of the sales process
  • Join sales calls to present technical aspects of Cerebras Inference solution, addressing customer questions and demonstrating our value proposition. Provide in-depth explanations of our product features, focusing on performance benefits, scalability, and optimizations that our specialized hardware enables.
  • Understand and gather customer requirements.
  • Design, scope and drive demos, trials and PoCs
  • Design demos to showcase key advantages of our unique product
  • Scope and drive customer trials and proof-of-concept projects, define success metrics, oversee execution and ensure a smooth experience customer satisfaction.
  • Own end-to-end delivery of the solution, provide technical guidance during deployment and post-sales support
  • Work closely with customers to design deployment solutions tailored to their needs
  • Drive end-to-end delivery of the solution from the technical side
  • Build and maintain strong customer relationships to become their go-to technical expert
  • Provide feedback to the internal product and engineering teams
  • Collaborate with internal teams, including R&D and product management, to communicate customer feedback and drive future product improvements.

Cerebras Systems specializes in accelerating artificial intelligence (AI) through its CS-2 system, which is designed to replace traditional clusters of graphics processing units (GPUs) used in AI computations. The CS-2 system simplifies the process of AI work by eliminating the complexities of parallel programming, distributed training, and cluster management, making it more efficient. Cerebras serves a variety of clients, including major pharmaceutical companies and government research labs, providing them with faster results for critical applications like cancer drug response prediction. The company operates in the high-performance computing and AI markets, generating revenue by selling its proprietary hardware and software solutions, including the CS-2 systems and associated cloud services. Cerebras aims to reduce the overall cost of AI research and development while enabling clients to achieve quicker training times and lower latency in AI inference.

Company Size

201-500

Company Stage

Series F

Total Funding

$720M

Headquarters

Sunnyvale, California

Founded

2016

Simplify Jobs

Simplify's Take

What believers are saying

  • Growing AI model efficiency demand aligns with Cerebras' energy-efficient accelerators.
  • AI democratization increases need for user-friendly systems like Cerebras' CS-2.
  • Pharmaceutical industry's push for faster drug discovery boosts demand for Cerebras' technology.

What critics are saying

  • Competition from NVIDIA and Graphcore could impact Cerebras' market share.
  • Rapid AI model evolution may necessitate frequent hardware updates, increasing R&D costs.
  • Supply chain vulnerabilities could delay production of Cerebras' hardware.

What makes Cerebras unique

  • Cerebras' Wafer-Scale Engine is the largest chip ever built for AI.
  • The CS-2 system replaces traditional GPU clusters, simplifying AI computations.
  • Cerebras serves diverse industries, including pharmaceuticals and government research labs.

Help us improve and share your feedback! Did you find this helpful?

Benefits

Professional Development Budget

Flexible Work Hours

Remote Work Options

401(k) Company Match

401(k) Retirement Plan

Mental Health Support

Wellness Program

Paid Sick Leave

Paid Holidays

Paid Vacation

Parental Leave

Family Planning Benefits

Fertility Treatment Support

Adoption Assistance

Childcare Support

Elder Care Support

Pet Insurance

Bereavement Leave

Employee Discounts

Company Social Events