Full-Time

Senior AI Systems Engineer

LLM Inference & Infra Optimization

Posted on 7/26/2025

Sully.ai

Sully.ai

51-200 employees

Autonomous AI agents manage hospital operations

Compensation Overview

$200k - $230k/yr

Remote in USA

Remote

Category
AI & Machine Learning (2)
,
Required Skills
Python
CUDA
Docker
AWS
Terraform
C/C++
DevOps
Google Cloud Platform
Requirements
  • Proficiency in C++, CUDA, and Python with experience in systems or ML infrastructure engineering.
  • Deep understanding of GPU architectures, inference optimization, and large model serving techniques.
  • Hands-on experience with multi-cloud environments (GCP, AWS, etc.) and infrastructure-as-code tools such as Pulumi, Terraform, or similar.
  • Familiarity with ML deployment frameworks (TensorRT, vLLM, DeepSpeed, Hugging Face Transformers, etc.).
  • Comfortable with DevOps workflows, containerization (Docker), CI/CD, and distributed system debugging.
Responsibilities
  • Develop and optimize inference pipelines using quantization, attention caching, speculative decoding, and memory-efficient serving.
  • Build and maintain low-level modules in C++/CUDA/NCCL to squeeze the most out of GPUs and high-throughput architectures.
  • Stand up and manage multi-cloud environments using modern IaC frameworks such as Pulumi or Terraform. Automate infrastructure provisioning, deployment pipelines, and GPU fleet orchestration.
  • Design low-latency streaming and decision-support systems leveraging embedding models, VRAM token caches, and fast interconnects.
  • Build robust tooling, interfaces, and sandbox environments so that other engineers can contribute safely to the ML systems layer.
Desired Qualifications
  • Experience with streaming embeddings, semantic search, or hybrid retrieval architectures.
  • Interest in building tools that democratize high-performance systems for broader engineering teams.

Sully.ai builds a system of autonomous AI agents that handle hospital operations by running in the background to automate daily workflows. Its team of agents—resembling roles such as receptionists, medical consultants, triage nurses, scribes, and coders—collaborates to complete tasks across the hospital, effectively replacing a large amount of manual labor. The product works by coordinating these agents to execute end-to-end workflows, from patient intake and triage to documentation and coding, across departments. Unlike other tools that focus on a single function, Sully.ai provides an integrated set of autonomous agents that work together to manage the entire hospital operation. The goal is to reduce the roughly $800B of labor currently spent on manual hospital work by enabling health systems to run more efficiently with AI-driven automation.

Company Size

51-200

Company Stage

Series A

Total Funding

$21.9M

Headquarters

California

Founded

2023

Simplify Jobs

Simplify's Take

What believers are saying

  • Sully.ai hit seven-figure ARR in ten months, partnering with over 100 healthcare organizations.
  • Secured $20M Series A at $150M valuation to scale engineering and market reach.
  • Clients like Hillside Medical and City Health adopt AI agents, adding 20M workforce minutes yearly.

What critics are saying

  • Epic Systems blocks Sully.ai access via EHR firewalls, halting 60% US hospital deployments in 3-6 months.
  • Nabla Copilot erodes Sully.ai contracts with superior EHR integration by Q2 2026.
  • FDA issues cease-and-desist for Triage Nurse agents as unauthorized devices in 12-24 months.

What makes Sully.ai unique

  • Sully.ai deploys coordinated AI agents as Receptionist, Triage Nurse, Scribe, and Coder for end-to-end hospital operations.
  • Agents integrate with EHR systems like Tebra to automate patient journey from intake to prescriptions.
  • Y Combinator-backed founders Ahmed Nasser and Ahmed Omar built team replacing $800B manual healthcare labor.

Help us improve and share your feedback! Did you find this helpful?

Benefits

Remote Work Options

Flexible Work Hours

Growth & Insights and Company News

Headcount

6 month growth

14%

1 year growth

8%

2 year growth

80%
Sully.ai
Oct 4th, 2025
AI Medical Employees | Privacy Policy - Sully

Learn about Sully’s latest funding round supporting the expansion of AI medical agents that are transforming healthcare documentation.

BizWeekly
Aug 2nd, 2025
AI and HealthTech Startups Fuel Funding Boom in Mid‑January - BizWeekly

The week spanning January 5–11, 2025, marked a significant uptick in startup…

INACTIVE