Full-Time

System Engineer

GPU Fleet

FluidStack

FluidStack

51-200 employees

High-performance GPU cloud for AI workloads

Compensation Overview

$200k - $300k/yr

+ Stock Options

San Francisco, CA, USA

In Person

Category
DevOps & Infrastructure (2)
,
Required Skills
Chef
Bash
Python
Puppet
Terraform
Ansible
Linux/Unix
Requirements
  • Bachelor's degree in Computer Science, Engineering, or related technical field (or equivalent practical experience)
  • 3+ years (System Engineer) or 5+ years (Senior System Engineer) in Linux system administration, datacenter operations, or infrastructure engineering
  • Strong Linux/Unix fundamentals including system administration, shell scripting (Bash, Python), troubleshooting, and performance tuning
  • Experience with server hardware architecture, troubleshooting techniques, and understanding of compute, memory, storage, and networking components
  • Experience in automation and configuration management tools (Ansible, Puppet, Chef, Terraform)
  • Strong analytical and problem-solving skills with ability to diagnose complex technical issues under pressure
  • Excellent communication and collaboration skills; ability to work effectively with cross-functional teams
Responsibilities
  • Operate and maintain large-scale GPU server fleet (H100, B200, GB200) supporting AI/ML workloads; monitor system health, performance, and utilization to maximize uptime and ensure SLA compliance
  • Perform hands-on troubleshooting and root cause analysis of complex hardware, firmware, OS, and application issues across GPU clusters; coordinate with vendors and hardware teams to resolve systemic failures
  • Develop and maintain automation scripts for provisioning, configuration management, monitoring, and remediation at scale.
  • Build and improve tooling for GPU health checks, performance diagnostics, driver validation, and automated recovery
  • Execute server provisioning, configuration, firmware updates, and OS installation using automation frameworks; manage lifecycle operations including deployment, maintenance, and decommissioning
  • Participate in 24x7 on-call rotation; respond to production incidents and coordinate resolution with cross-functional teams including datacenter operations, network engineering, and application teams
  • Lead post-incident reviews, document root causes, and drive continuous improvement initiatives focused on automation, reliability, monitoring, and operational efficiency
Desired Qualifications
  • Experience managing large-scale GPU infrastructure (NVIDIA H100, A100, B200, GB200) in production environments supporting AI/ML workloads
  • Deep knowledge of GPU architecture, CUDA toolkit, GPU drivers, monitoring tools (nvidia-smi, DCGM)
  • Experience with HPC cluster management, job schedulers (Slurm, PBS, LSF), and container orchestration (Kubernetes, Docker)
  • Proficiency in out-of-band management protocols (IPMI, Redfish, BMC) and firmware management for server hardware
  • Experience with high-performance networking (InfiniBand, RoCE, RDMA) and network troubleshooting in GPU cluster environments
  • Familiarity with datacenter operations including rack installations, cabling, power management, and thermal considerations

FluidStack provides GPU-based cloud infrastructure for artificial intelligence workloads, delivering large-scale Nvidia GPU clusters through a neocloud model. The platform offers automated provisioning and a centralized orchestration layer that hides hardware complexity, with native support for Kubernetes and Slurm and proprietary monitoring to track power usage and hardware health. It targets AI labs, research institutions, and enterprise tech teams that need scalable, pay-as-you-go access to high-performance compute without owning data centers. The company's goal is to make it easy for organizations to train, develop, and deploy complex machine learning models by providing reliable, scalable GPU resources on demand.

Company Size

51-200

Company Stage

Late Stage VC

Total Funding

$11B

Headquarters

New York City, New York

Founded

2017

Simplify Jobs

Simplify's Take

What believers are saying

  • Anthropic's $50 billion deal builds custom data centers in New York and Texas.
  • Coatue's Next Frontier JV funds 430MW Indiana campus online by December 2026.
  • $750 million raise at $7 billion valuation accelerates US expansion creating 1,000 jobs.

What critics are saying

  • CoreWeave undercuts Fluidstack's pricing, capturing ex-OpenAI researchers within 6-12 months.
  • $1 billion round at $18 billion valuation fails by July 2026, causing liquidity crunch.
  • Google terminates Indiana lease if Fluidstack defaults on $5.7 billion bonds by 2028.

What makes FluidStack unique

  • Fluidstack delivers zero-setup multi-thousand GPU clusters for AI researchers from OpenAI and DeepMind.
  • Lighthouse platform enables proactive monitoring and automated remediation without customer intervention.
  • HIPAA, GDPR, ISO27001, and SOC 2 TYPE 2 compliance secures regulated AI labs and enterprises.

Help us improve and share your feedback! Did you find this helpful?

Benefits

Health Insurance

Dental Insurance

Vision Insurance

401(k) Retirement Plan

Company Equity

Unlimited Paid Time Off

Growth & Insights and Company News

Headcount

6 month growth

0%

1 year growth

-9%

2 year growth

-4%
Bloomberg L.P.
Apr 14th, 2026
Fluidstack Seeks $1 Billion in New Funding at $18 Billion Valuation

The cloud-computing startup Fluidstack Ltd. is holding funding talks with investors to bring in about $1 billion at a target valuation of $18 billion, according to people briefed on the matter.

Yahoo Finance
Apr 6th, 2026
UK data centre startup Fluidstack raises $750M, hits $7B valuation for US AI expansion

Fluidstack, a London-founded data centre startup, has been valued at $7 billion after raising over $750 million in funding. The company, established in 2017 by Gary Wu, Cesar Maklary and James Cox, is building AI infrastructure across America. The startup relocated its headquarters from London to New York in December to focus on US customers, creating over 1,000 jobs. New investors include Situational Awareness, an AI hedge fund founded by former OpenAI employee Leopold Aschenbrenner. Fluidstack is backed by Google, which has provided a $1.8 billion backstop to the company's data centre lease obligations and is reportedly in talks for an equity stake. The company is also working with Anthropic to build up to $50 billion of AI data facilities across New York and Texas.

Telegraph Media Group
Apr 5th, 2026
UK data centre giant raises $750m for US expansion

City sources say Fluidstack could still secure additional funding as start-up hits $7bn valuation

Yahoo Finance
Mar 20th, 2026
Fluidstack scraps $11.5B French data center for US expansion backed by $50B Anthropic deal

Fluidstack has abandoned an $11.5 billion data centre project in northern France to focus on US expansion, according to Bloomberg. The operator is relocating its global headquarters from the UK to New York and exited a secondary facility near Paris used by Mistral. The move could prove beneficial for Bitcoin miners partnering with Fluidstack. Hut 8, TeraWulf and Cipher Mining have signed deals with the firm over the past six months. Hut 8's 15-year agreement to build a 245-megawatt Louisiana site with Fluidstack and Anthropic generates $7 billion in revenue, potentially rising to $17.7 billion with expansion clauses. Fluidstack's US expansion includes a $50 billion master agreement with Anthropic to operate compute clusters across New York, Texas and other states.

Yahoo Finance
Mar 15th, 2026
Google-backed Fluidstack signs $7B, 15-year AI lease with Hut 8 as miner pivots to data centres

Hut 8 Corp has signed a 15-year, $7 billion IT capacity lease with Google-backed Fluidstack at its River Bend campus, marking a strategic shift from pure Bitcoin mining towards AI and data centre infrastructure. The company also sold a 310MW natural gas power plant portfolio to refocus capital. The deal is part of Hut 8's broader push to build 245MW to 2,295MW of AI data centre capacity with blue-chip clients. The company is carving out legacy mining operations into American Bitcoin whilst developing an 8,500MW infrastructure pipeline. Hut 8's narrative projects $767.3 million revenue and $140.6 million earnings by 2028, requiring 76.9% yearly revenue growth. Some analysts expect the company to reach $1.1 billion in revenue by 2028, though execution risks and potential dilution from capital-intensive expansion remain key concerns.