Simplify Logo

Full-Time

Network Engineer

Posted on 3/5/2024

Cerebras

Cerebras

201-500 employees

AI computing hardware with largest chip

Data & Analytics
Hardware
AI & Machine Learning

Mid

Toronto, ON, Canada + 2 more

Category
DevOps & Infrastructure
Network Engineering
Required Skills
Python
Requirements
  • Engineering degree or related technical discipline
  • Expert Knowledge of IB/RDMA/RoCE Networks
  • 4+ years of experience working on networks supporting large scale training workloads
  • Understanding of RDMA congestion control mechanisms on IB and RoCE Networks
  • Hands-on experience with network equipment and vendors like Broadcom, Mellanox
  • Experience coding in Python, C++, Go, etc
  • Experience in network automation software leveraging software-defined networking principles
  • Developed or modified network telemetry and automation tools
  • Experience in designing, deploying, and operating networks at scale
Responsibilities
  • Design, develop, test, and operate networking systems to support large scale AI training/inference jobs
  • Develop and deploy technologies and network topologies to evolve and scale AI networks
  • Work closely with hardware, software, and sourcing teams to develop new networking solutions
  • Define and develop optimized network monitoring systems
  • Software modeling of network architecture and ML applications
  • Be on call to learn from real-world production challenges

Cerebras Systems specializes in developing large-scale, powerful artificial intelligence computers, specifically the CS-2 powered by the Cerebras Wafer Scale Engine. This technology features a record-setting 2.6 trillion transistors, which vastly accelerates the rate of neural network training from typical durations of months to mere minutes. A career at Cerebras Systems represents an opportunity to engage with a team focused on pushing the boundaries of AI hardware, contributing to a culture that thrives on technical excellence and rapid innovation in AI computation, which underscores its position at the cutting edge of AI research and development initiatives.

Company Stage

Series F

Total Funding

$720M

Headquarters

Sunnyvale, California

Founded

2016

Growth & Insights
Headcount

6 month growth

7%

1 year growth

9%

2 year growth

-20%
INACTIVE