Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs.

Cerebras' current customers include global corporations across multiple industries, national labs, and top-tier healthcare systems. In January, we announced a multi-year, multi-million-dollar partnership with Mayo Clinic, underscoring our commitment to transforming AI applications across various fields. In August, we launched Cerebras Inference, the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services.

About The Role

As a solutions architect for the Cerebras Inference platform, you will provide technical guidance in our sales initiatives, showcase the capabilities of our hardware and software solutions, and drive customer engagements. You will be working with the fastest inference engine in the world and will help our customers understand and realize its potential for existing and completely new business applications.

We are looking for talented AI Solutions Architects with a blend of deep technical expertise, customer-facing soft skills and sales acumen. The ideal candidate will also bring a broad knowledge of various industries.

Responsibilities

Lead the technical aspects of the sales process
- Join sales calls to present technical aspects of the Cerebras Inference solution, addressing customer questions and demonstrating our value proposition. Provide in-depth explanations of our product features, focusing on performance benefits, scalability, and optimizations that our specialized hardware enables.
- Understand and gather customer requirements.
Design, scope and drive demos, trials and PoCs
- Design demos to showcase key advantages of our unique product
- Scope and drive customer trials and proof-of-concept projects, define success metrics, oversee execution, and ensure a smooth experience and customer satisfaction.
Own end-to-end delivery of the solution, provide technical guidance during deployment and post-sales support
- Work closely with customers to design deployment solutions tailored to their needs
- Drive end-to-end delivery of the solution from the technical side
- Build and maintain strong customer relationships to become their go-to technical expert
Provide feedback to the internal product and engineering teams
- Collaborate with internal teams, including R&D and product management, to communicate customer feedback and drive future product improvements.

Requirements

Bachelor’s or Master’s degree in Computer Science, Electrical Engineering, or a related field.
5+ years in customer-facing engineering roles.
Strong understanding of Generative AI model architecture, inference optimization, enterprise infrastructure and deployment challenges.
Experience with specialized AI accelerators.
Solid programming skills in Python and familiarity with distributed computing.
Exceptional communication skills with the ability to explain complex technical concepts to both technical and non-technical audiences.
Ability to work collaboratively in a fast-paced environment and adapt to changing customer needs.
Ability to manage complex technical projects and deliver solutions tailored to customer needs.
Strong interpersonal and communication skills, effective in collaborative and fast-paced team settings.

Why Join Cerebras

People who are serious about software make their own hardware. At Cerebras we have built a breakthrough architecture that is unlocking new opportunities for the AI industry. With dozens of model releases and rapid growth, we’ve reached an inflection point in our business. Members of our team tell us there are five main reasons they joined Cerebras:

Build a breakthrough AI platform beyond the constraints of the GPU.
Publish and open source their cutting-edge AI research.
Work on one of the fastest AI supercomputers in the world.
Enjoy job stability with startup vitality.
Enjoy a simple, non-corporate work culture that respects individual beliefs.

Read our blog: Five Reasons to Join Cerebras in 2025.

Apply today and become part of the forefront of groundbreaking advancements in AI!

Cerebras Systems is committed to creating an equal and diverse environment and is proud to be an equal opportunity employer. We celebrate different backgrounds, perspectives, and skills. We believe inclusive teams build better products and companies. We try every day to build a work environment that empowers people to do their best work through continuous learning, growth and support of those around them.


Generative AI Inference Solutions Architect

Cerebras
