Full-Time

Software Engineer

High Performance Computing

Eventual

Eventual

Compensation Overview

$150k - $250k/yr

+ Equity

San Francisco, CA, USA

In Person

On-site 4 days/week in the SF Mission district office.

Category
Software Engineering (1)
Required Skills
Rust
C/C++
Requirements
  • Obsession with systems-level performance.
  • Ability to recite Jeff Dean's "numbers every programmer should know" and comfort with flamegraphs.
  • Strong opinions on io_uring.
  • Proficiency in Rust, C++, or C.
  • Strong familiarity with operating systems including page cache, scheduling, system calls, Non-Uniform Memory Access, memory hierarchies.
  • Understanding of where bytes go: Non-Volatile Memory Express, memory, network, PCI Express, NVLink, and the throughput and latency budgets of each.
Responsibilities
  • Design and build the video-native dataloader: rank-aware, NVMe-cached, random-access into clips, returns tensors directly to the GPU.
  • Profile and optimize the full data path from object store to non-volatile memory cache to page cache to host RAM to device RAM; eliminate every avoidable copy and stall.
  • Saturate the latest hardware (Non-Volatile Memory Express-based accelerators like B200, GB200, NVL72) on real customer training jobs; push toward Vera Rubin bandwidth requirements.
  • Own performance benchmarks against customer baselines (custom DataLoaders, Data Loading Library variants such as DALI, decord, LeRobot) and against historical numbers; regressions caught at pull request time.
  • Partner with researchers at partner laboratories to land the loader in their training stack and measure mean time to utilization end-to-end.
  • Work cross-team with Storage Infrastructure on the index/format boundary and with Visual Understanding on the model-output ingestion path.
Desired Qualifications
  • Experience working with GPUs is a plus, but not required on day one.
  • Experience with SLURM, Kubernetes for GPU workloads, or other high-performance computing schedulers.
  • Hands-on CUDA experience.
  • Deep expertise on memory and caching subsystems—page cache tuning, huge pages, Non-Uniform Memory Access pinning, GPU-Direct Storage.
  • Experience with video decode pipelines (Python bindings such as PyAV, decord, NVDEC) or PyTorch DataLoader internals.
  • Contributed to open-source systems projects in Rust or C++.

Company Size

N/A

Company Stage

N/A

Total Funding

N/A

Headquarters

N/A

Founded

N/A

Simplify Jobs

Simplify's Take

What believers are saying

  • Secured $30M funding: $20M Series A from Felicis, $10M seed from CRV.
  • Y Combinator W22 alum with 18-person San Francisco engineering team.
  • Launched Eventual Cloud waitlist targeting generative AI data explosion.

What critics are saying

  • Databricks forks Daft, eroding moat as enterprises adopt free versions.
  • AWS internalizes Daft, launches competing service in 12-18 months.
  • Snowflake acquires Pinecone, dominates multimodal queries in 12-24 months.

What makes Eventual unique

  • Daft processes petabytes of multimodal data for AWS, Essential AI, and Together Computer.
  • Founders Sammy Sidhu and Jay Chia built Daft from Lyft autonomous vehicle challenges.
  • Eventual Cloud offers production-grade platform for multimodal AI workloads.

Help us improve and share your feedback! Did you find this helpful?

Benefits

Health Insurance

Dental Insurance

Vision Insurance

401(k) Retirement Plan

401(k) Company Match

Unlimited Paid Time Off

Commuter Benefits

Meal Benefits

Company News

SiliconANGLE Media
Jun 25th, 2025
Eventual launches with $30M in funding to streamline multimodal data processing

Eventual launches with $30M in funding to streamline multimodal data processing.