Full-Time

Hardware Engineer

GPU Infrastructure

Updated on 5/22/2024

CoreWeave

CoreWeave

201-500 employees

Specialized cloud provider for GPU compute resources

AI & Machine Learning
Data & Analytics
Hardware

Compensation Overview

$160k - $210kAnnually

+ Tuition Reimbursement + Mental Wellness Benefits + Parental Leave + Childcare Support + Catered Lunch + Weekly Massages + Casual Work Environment

Mid, Senior

Livingston, NJ, USA + 1 more

Required Skills
Python
Ansible
Requirements
  • At least 2 years professional experience supporting and troubleshooting data center class GPUs
  • Proficiency in ansible/python and experience with programmatically interacting with server BMCs
  • Experience using, integrating and automating data center class GPU diagnostics and troubleshooting tools
  • In-depth knowledge of server hardware, components, and management technologies
  • Previous experience collaborating with hardware vendors
  • Passion for automation
  • Excellent documentation skills
  • Strong analytical and problem-solving abilities
Responsibilities
  • Troubleshoot complex GPU and PCIe related failures
  • Partner with external vendors on failure analysis
  • Track component RMAs
  • Develop and maintain hardware/firmware management services
  • Automate all aspects of the server hardware lifecycle
  • Serve as the senior point of contact for hardware escalation and troubleshooting
  • Collaborate with cross-functional teams to define hardware requirements, specifications, and system architecture
  • Create and maintain accurate documentation of hardware designs, specifications, test procedures, and results
  • Analyze and optimize the performance of hardware systems
  • Establish processes for internal hardware testing, deployment, and performance optimization

CoreWeave is a specialized cloud provider focusing on GPU compute resources ideal for VFX, rendering, machine learning, and AI, promising up to 35x faster performance and significant cost savings. Working here offers a unique opportunity to contribute to cutting-edge technologies in a rapidly growing industry sector known for high-performance networking and efficient infrastructure management. The culture prioritizes agility and technical superiority, making it an ideal place for innovative minds eager to make a significant impact in high-demand compute scenarios.

Company Stage

Series B

Total Funding

$12B

Headquarters

New York, New York

Founded

2017

Growth & Insights
Headcount

6 month growth

65%

1 year growth

240%

2 year growth

817%