Full-Time

Staff Software Engineer

Applied Training

CoreWeave

CoreWeave

1,001-5,000 employees

GPU-accelerated cloud computing platform

Compensation Overview

$165k - $242k/yr

+ Discretionary Bonus + Equity Awards

No H1B Sponsorship

New York, NY, USA + 1 more

More locations: Sunnyvale, CA, USA

Hybrid

Hybrid role; remote work may be considered for candidates >30 miles from an office; onboarding at a hub within first month.

US Citizenship, US Top Secret Clearance, Canada Citizenship, Canada Top Secret Clearance, UK Citizenship, UK Top Secret Clearance Required

Category
Software Engineering (2)
,
Required Skills
Kubernetes
Python
Pytorch
Docker
Serverless
Reinforcement Learning
Requirements
  • 8-12+ years building distributed systems, ML infrastructure, or developer platforms
  • Real Kubernetes experience: custom controllers, operators, scheduling, CRDs, workload orchestration at scale. Not just deploying things to Kubernetes or cluster administration
  • Excited about rigorous engineering, but enabled by AI based workflows
  • Understand what makes researchers productive. Code distribution matters. Fast iteration cycles matter. Workflows that don't require becoming infrastructure experts matter
  • Familiarity with training: how distributed jobs get scheduled, how ranks initialize, what breaks at scale
  • You've shipped infrastructure that other people rely on daily. Not prototypes. Production systems
  • Good communicator. Can work with customers, translate researcher complaints into system designs
  • Preferred: Experience building internal ML platforms or research clusters at a company doing large-scale training
  • Familiarity with agentic AI: RL training with rollouts, agent evaluation, sandbox isolation for running untrusted code
  • Background with Slurm, Ray, or similar workload orchestration. Opinions on where they fall short
  • Experience with container runtimes, isolation (gVisor, Kata), or serverless platforms
  • OSS contributions to Kubernetes SIGs, Ray, PyTorch, or similar
Responsibilities
  • Contribute to the roadmap for Applied Training. Figure out what actually unlocks new workloads and what's just nice to have. Work directly and closely with customers, and other teams inside of CoreWeave that are building cloud native primitives. Compute, storage, networking, etc.
  • For the research cluster platform: design and build a complete research cluster experience. CLI, job configuration schema, Kubernetes operators, daemons. Solve the problems researchers actually hit: code distribution, checkpoint-triggered evaluation, cross-cluster scheduling, programmatic job control. Replace the patchwork of scripts customers keep building on their own.
  • For sandbox infrastructure: own the Python SDK and work in a very tight loop with the backend team. When an RL training run needs to spawn thousands of isolated containers for agent rollouts, that's this system. When someone wants to run agent benchmarks at scale, that's this system. Make it work with our Kubernetes clusters, storage, and auth so researchers don't have to think about infrastructure.
  • Write the documentation for running popular OSS training frameworks on CoreWeave. The work that can unblock customers and help them succeed.
  • Work with infrastructure teams and customers directly. The customers are large AI labs running thousands of GPUs. Understand how they structure their internal supercomputing stacks. Bring that knowledge back to what we build.
Desired Qualifications
  • Experience building internal ML platforms or research clusters at a company doing large-scale training
  • Familiarity with agentic AI: RL training with rollouts, agent evaluation, sandbox isolation for running untrusted code
  • Background with Slurm, Ray, or similar workload orchestration. Opinions on where they fall short
  • Experience with container runtimes, isolation (gVisor, Kata), or serverless platforms
  • OSS contributions to Kubernetes SIGs, Ray, PyTorch, or similar

CoreWeave provides cloud computing resources tailored for GPU-accelerated workloads. It offers high-performance, pay-as-you-go access to NVIDIA GPU hardware hosted on bare-metal servers managed by Kubernetes, enabling tasks such as Generative AI, machine learning, LLM inference, VFX rendering, and pixel streaming. Users run GPU-intensive workloads on a fully managed, serverless Kubernetes platform without needing to own or manage the underlying hardware. The company differentiates itself by specializing in GPU workloads, offering a wide range of NVIDIA GPUs, and reducing operational burden through its bare-metal, Kubernetes-based infrastructure. CoreWeave’s goal is to deliver scalable, cost-efficient, high-performance infrastructure for AI, HPC, and digital content creation workloads.

Company Size

1,001-5,000

Company Stage

IPO

Headquarters

Livingston, New Jersey

Founded

2017

Simplify Jobs

Simplify's Take

What believers are saying

  • NVIDIA's $2B investment funds 5GW AI infrastructure by 2030 with Vera CPU access.
  • $99.4B Q1 2026 backlog includes $21B Meta deal through 2032.
  • Q1 2026 revenue hits $2.08B, up 112% YoY, beating estimates.

What critics are saying

  • $24.9B debt at 5.2 ratio triggers covenant breaches within 6-12 months.
  • OpenAI renegotiates $11.9B deal after missing targets, collapsing backlog.
  • Lambda Labs' 30% cheaper H100s erode CoreWeave's startup market share.

What makes CoreWeave unique

  • CoreWeave delivers first NVIDIA RTX PRO 6000 Blackwell GPUs at scale on July 9, 2025.
  • Kubernetes-native platform optimizes AI workloads with Mission Control software.
  • NVIDIA BlueField-3 DPUs and Quantum-2 InfiniBand enable superior GPU efficiency.

Help us improve and share your feedback! Did you find this helpful?

Your Connections

People at CoreWeave who can refer or advise you

Benefits

Health Insurance

Dental Insurance

Vision Insurance

Life Insurance

Disability Insurance

Health Savings Account/Flexible Spending Account

Tuition Reimbursement

Mental Health Support

Family Planning Benefits

Paid Parental Leave

Hybrid Work Options

401(k) Company Match

Unlimited Paid Time Off

Catered lunch each day in our office and data center locations

A casual work environment

Growth & Insights and Company News

Headcount

6 month growth

0%

1 year growth

2%

2 year growth

3%
Dealroom.co
Apr 16th, 2026
CoreWeave company information, funding & investors

CoreWeave, a specialized cloud provider, delivering a massive range of gpu compute resources on demand and at scale. Here you'll find information about their funding, investors and team.

Bloomberg L.P.
Apr 15th, 2026
Jane Street Invests $1 Billion in CoreWeave, Boosts Spending Plans

Jane Street Group, a trading firm, has taken an additional $1 billion stake in AI cloud services provider CoreWeave Inc. and plans to spend about $6 billion on the company’s technology offerings.

Yahoo Finance
Apr 14th, 2026
Nebius surges 70% YTD versus CoreWeave's 40% in AI infrastructure race

Two AI infrastructure providers, Nebius Group and CoreWeave, are competing for dominance in the GPU compute leasing market. Nebius has outperformed year-to-date, rising 70% compared to CoreWeave's 40%, though both have surged since their IPOs last March. Nebius reported fourth-quarter revenue of $227.7 million, up 547% year-over-year, and guided 2026 revenue to $33.4 billion. The company secured a $27 billion deal with Meta Platforms and received a $2 billion investment from Nvidia for joint infrastructure development. Nebius targets over 3 gigawatts of contracted power by year-end 2026. CoreWeave posted fiscal 2025 revenue of $5.13 billion with a revenue backlog of $66.8 billion. Analysts project 2026 revenue around $12.5 billion, roughly four times Nebius's estimate, positioning CoreWeave as the larger-scale player.

Yahoo Finance
Apr 14th, 2026
Meta signs $21B AI cloud deal with CoreWeave through 2032

CoreWeave has secured a $21 billion long-term agreement with Meta Platforms to provide AI cloud capacity through December 2032, utilising Nvidia's Vera Rubin platform. This follows an existing $14.2 billion deal with Meta through 2031. Despite recent major contracts, including a $6.5 billion agreement with OpenAI in September 2025, CRWV stock remains 40% below its June 2025 highs. The company posted Q4 2025 revenue of $1.6 billion and full-year revenue of $5.1 billion, but reported a $452 million quarterly net loss. CoreWeave faces financial challenges with $29.82 billion in total debt against just $3.16 billion in cash, resulting in interest costs representing 23.5% of revenue. Whilst the stock has gained 54% year-to-date, its heavy debt reliance raises concerns about sustainability.

YouTube
Apr 11th, 2026
Meeting the Data Center Demand

CoreWeave CTO and Co-founder Peter Salanki talks with TITV Host Akash Pasricha about the current "bottleneck of the day" in AI infrastructure and why reports of data center delays are misunderstood. We also get into the complexities of deploying Nvidia's Blackwell chips and why specialized labor, like master electricians, is becoming the industry's newest constraint. Subscribe: https://www.theinformation.com/subscribe_youtube The Information’s TITV airs weekdays on YouTube, X and LinkedIn at 10AM PT / 1PM ET. Or check us out wherever you get your podcasts. Follow us: X: https://x.com/theinformation IG: https://www.instagram.com/theinformation/ TikTok: https://www.tiktok.com/@titv.theinformation LinkedIn: https://www.linkedin.com/company/theinformation/