Full-Time

Inference Technical Lead

OpenAI

OpenAI

5,001-10,000 employees

Develops safe AI models and tools

Compensation Overview

$380k/yr

San Francisco, CA, USA

Hybrid

Relocation assistance available for new employees.

Category
AI & Machine Learning (2)
,
Requirements
  • Have deep expertise in model performance optimization, particularly at the inference layer
  • Have a strong background in kernel-level systems, data movement, and low-level performance tuning
  • Are excited about scaling high-performing AI systems that serve real-world, multimodal workloads
  • Can navigate ambiguity, set technical direction, and drive complex initiatives to completion
Responsibilities
  • Perform engineering efforts focused on improving model serving, inference performance, and system efficiency
  • Drive optimizations from a kernel and data movement perspective to improve system throughput and reliability
  • Partner closely with research and product teams to ensure our models perform effectively at scale
  • Design, build, and improve critical serving infrastructure to support Sora’s growth and reliability needs

OpenAI conducts AI research and deployment to build advanced AI models and tools that help people automate tasks, be more creative, and make better decisions. Its products include ChatGPT, a conversational AI that can write, code, tutor, and assist in interactive tasks, and Sora, which can generate videos from text prompts. OpenAI’s models typically run through cloud-based services and subscriptions, with licensing and partnerships for broader use. The company operates a capped-profit model to balance generating revenue with ensuring safety, ethics, and long-term societal benefits. Its approach emphasizes safety, responsible deployment, and collaboration with researchers, governments, and institutions. The goal is to ensure artificial general intelligence, when it arrives, benefits all of humanity and minimizes risks.

Company Size

5,001-10,000

Company Stage

Late Stage VC

Total Funding

$196B

Headquarters

San Francisco, California

Founded

2015

Simplify Jobs

Simplify's Take

What believers are saying

  • $4B backed Deployment Company accelerates enterprise AI adoption with partners like Bain.
  • Trusted Access grants Deutsche Telekom and BBVA GPT-5.5-Cyber for cybersecurity.
  • $400B valuation from secondary sale enables $30M staff cash-outs, retaining talent.

What critics are saying

  • Anthropic's Mythos secures Pentagon contracts, blocking OpenAI's Daybreak revenue.
  • Doubled GPT-5.5 API pricing to $5/$30 per 1M tokens shifts developers to Claude.
  • EU blacklists GPT-5.5-Cyber access in 3-6 months, halting Trusted Access program.

What makes OpenAI unique

  • GPT-5.5 launches April 2026 with 1M context for complex professional tasks.
  • Daybreak platform integrates GPT-5.5-Cyber and Codex for cyber defense.
  • Deployment Company acquires Tomoro, adding 150 engineers for enterprise AI.

Help us improve and share your feedback! Did you find this helpful?

Benefits

Health insurance

Dental and vision insurance

Flexible spending account for healthcare and dependent care

Mental healthcare service

Fertility treatment coverage

401(k) with generous matching

20-week paid parental leave

Life insurance (complimentary)

AD&D insurance (complimentary)

Short-term/long-term disability insurance (complimentary)

Optional buy-up life insurance

Flexible work hours and unlimited paid time off (we encourage 4+ weeks per year)

Annual learning & development stipend

Regular team happy hours and outings

Daily catered lunch and dinner

Travel to domestic conferences

Growth & Insights and Company News

Headcount

6 month growth

-2%

1 year growth

3%

2 year growth

2%
Daring Fireball
May 8th, 2026
Y Combinator’s Stake in OpenAI

The fact that Paul Graham personally has billions of dollars at stake with OpenAI doesn’t mean that his public opinion on Sam Altman’s trustworthiness and leadership is invalid. But it certainly seems like the sort of thing that ought to be disclosed when quoting Graham as an Altman character reference.

Bloomberg L.P.
Apr 21st, 2026
OpenAI launches ChatGPT Images 2.0 with improved chart and diagram creation

OpenAI is releasing ChatGPT Images 2.0, an updated AI image-generating software designed to create accurate charts and scientific diagrams. The company aims to make its technology more appealing to professionals. Rolling out Tuesday through ChatGPT and Codex AI coding assistant, the new model improves instruction-following and detail incorporation when generating images. It can produce visuals across multiple styles and render text in various languages. The update represents OpenAI's effort to expand its AI capabilities beyond general use cases into professional applications requiring technical precision and accuracy.

Bloomberg L.P.
Apr 17th, 2026
OpenAI loses head of science initiatives and Sora AI video team leader

OpenAI's head of science initiatives and the leader of its Sora AI video team are leaving the company, adding to recent executive departures as the firm reorganises its product portfolio. The exits continue a pattern of senior leadership changes at the artificial intelligence company.

Bloomberg L.P.
Apr 16th, 2026
OpenAI unveils GPT-5.4 to tackle enterprise trust and governance concerns

OpenAI is addressing enterprise adoption challenges with GPT-5.4 "Cyber", focusing on security, trust and governance issues. Erica Brescia, managing director at Redpoint Ventures and OpenAI backer, discussed the development, emphasising that the AI cyber race centres on governance rather than purely technological advancement. The move represents OpenAI's effort to overcome barriers preventing widespread enterprise adoption of its AI systems by prioritising security features in its latest model release.

Bloomberg L.P.
Apr 16th, 2026
OpenAI launches GPT-Rosalind AI model for drug discovery to rival Google

OpenAI has launched GPT-Rosalind, an AI model designed to accelerate drug discovery and life sciences research. The model aims to extract insights from large datasets and help translate scientific studies into healthcare applications. Initially available as a research preview to select business customers, GPT-Rosalind's early users include pharmaceutical company Amgen, vaccine maker Moderna and bioscience research nonprofit the Allen Institute. The launch positions OpenAI alongside other technology companies entering the drug discovery field, as the industry seeks to demonstrate AI's potential for scientific breakthroughs. The ChatGPT maker announced the model's release on Thursday.