Full-Time

Network and Server Test Engineer

Posted on 4/30/2024

Cerebras

Cerebras

201-500 employees

Produces large-scale AI computing systems

Hardware
AI & Machine Learning

Expert

Sunnyvale, CA, USA

Required Skills
Python
Linux/Unix
Requirements
  • Master’s degree or higher in Electrical Engineering, Computer Engineering, Computer Science, or related majors.
  • 5+ years experience in Software Development, Quality Assurance, System Test of Switches and Routers at a Networking equipment vendor.
  • Understanding of RDMA congestion control mechanisms on InfiniBand and RoCE Networks.
  • Must have deep understanding of networking protocols BGP, PFC, ECN, QoS, MLAG, ECMP, and VRF.
  • Experience with computer system architecture, especially on CPU SoC or Platform Architecture, Interconnect Fabric, and Memory sub-system.
  • Experience designing and implementing large switching and routing networks.
  • Strong technical abilities, problem-solving, design, coding, and debugging skills.
  • Expertise in Linux tools such as lspci, ping, traceroute, tcpdump, ifconfig, ip link, ip route, arp, /proc/net, /proc/sys/net, vmstat, netstat, ttcp, iperf, strac, memtest, fio, ozone, and iometer.
  • Must be proficient in python.
  • Proficient in Networking Test Tools like IXIA and Smartbits.
Responsibilities
  • Identify experiments, tools, and methodology to test complex Data Center equipment including Switches, Routers, Server, NICs, Transceivers.
  • Co-work with equipment vendors to evaluate the performance of newly introduced hardware, and to resolve defects.
  • Design and setup test lab, test beds to exercise and evaluate vendor equipment.
  • Work with architects, software engineers to create test cases, write test scripts, execute tests, and document results of evaluation of solution from different vendors.
  • Troubleshoot, isolate, and drive issues to resolution through partnerships with other teams and vendors.
  • Provide solutions for efficient networking design for AI infrastructure.
  • Design, install, configure, and maintain complex Network for AI Infrastructure.
  • Build up and optimize server system benchmarks based on deep understanding of server system architect, and workload characterization.

Cerebras Systems provides cutting-edge technology for artificial intelligence work with its CS-2 AI computer, which features the world's largest chip, the Wafer Scale Engine (WSE-2). This technology fosters a culture of rapid innovation, significantly reducing AI training times and enhancing productivity. Employees at this company are part of a pioneering environment, working with advanced technology that leads the industry in accelerating AI capabilities, making it an excellent place for career growth in the field of AI technology.

Company Stage

Series F

Total Funding

$720M

Headquarters

Sunnyvale, California

Founded

2016

Growth & Insights
Headcount

6 month growth

7%

1 year growth

2%

2 year growth

-12%