Full-Time

Founding AI Engineer

Tracer Cloud

Tracer Cloud

11-50 employees

Monitors HPC bioinformatics workloads in real-time

Compensation Overview

ÂŁ70k - ÂŁ130k/yr

+ Equity

London, UK

In Person

Category
Software Engineering (2)
,
Required Skills
Python
Observability
Requirements
  • 5+ years (ideally 10+) professional software engineering experience
  • Proven track record of shipping real products at high velocity
  • Strong backend and distributed-systems foundations, ideally with experience in data platforms and production pipeline stacks and incident/observability tooling
  • Experience working at an early-stage startup
  • High ownership and sharp product instincts: you build what matters and cut what doesn’t
Responsibilities
  • Architect and build the core alert, investigation, root cause analysis (RCA) pipeline in Python
  • Design and implement key systems including: Alert ingestion + normalization; Context enrichment + correlation; Problem framing outputs; Hypothesis orchestration engine; Investigation execution runtime; Investigation artifacts + reporting
  • Drive core architecture decisions and ensure the system is observable, auditable, and reliable from day one
  • Partner with founders to ship a small set of high-value alert types that work extremely well, then expand coverage deliberately
  • Build customer-ready integrations across the pipeline stack
  • Educate and guide future engineers, setting a high bar for technical quality, speed, and pragmatism
Desired Qualifications
  • Bonus points for having joined earlier
  • Experience working in incident/observability tooling is beneficial but not required

Tracer Cloud provides monitoring for High Performance Computing used in life sciences. It collects real-time performance metrics and cost data from complex bioinformatics pipelines, and presents them in dashboards that show where time and resources are spent. It also suggests optimization opportunities and offers instant troubleshooting guidance. The product is tailored to scientific workloads, offering fine-grained visibility into each step of a bioinformatic workflow, which traditional monitoring tools may miss. The goal is to help researchers run HPC jobs more efficiently, lower costs, and quickly identify and fix bottlenecks in their workflows.

Company Size

11-50

Company Stage

N/A

Total Funding

N/A

Headquarters

N/A

Founded

N/A

Simplify Jobs

Simplify's Take

What believers are saying

  • eBPF adoption surges in HPC, aligning with Tracer's low-overhead kernel monitoring.
  • Nextflow usage grows with 5,000 public workflows, eased by Tracer's integration.
  • Slurm usage rises 22% YoY, boosted by Tracer's idle process detection.

What critics are saying

  • Datadog's suite diverts customers via eBPF-Nextflow-Slurm integrations in 6-12 months.
  • Nextflow Tower v25 embeds eBPF metrics, capturing 80% users in 6-12 months.
  • New Relic's Grok AI outpaces synthetic logs, eroding share in 12-18 months.

What makes Tracer Cloud unique

  • Tracer uses Rust-based eBPF for per-task CPU/I/O visibility in scientific workloads.
  • One-line Linux agent integrates with Nextflow, Slurm, AWS Batch without code changes.
  • Bring-your-own-cloud model ensures HIPAA compliance by keeping data on-premises.

Help us improve and share your feedback! Did you find this helpful?

Your Connections

People at Tracer Cloud who can refer or advise you

Benefits

Health Insurance

Paid Vacation

Company Social Events