Full-Time

Senior Support Engineer

OpenAI

OpenAI

5,001-10,000 employees

Develops safe AI models and tools

Compensation Overview

$234k - $260k/yr

San Francisco, CA, USA

Hybrid

Three days on-site per week required.

Category
DevOps & Infrastructure (1)
Required Skills
Python
Computer Networking
Docker
Observability
Requirements
  • Have a Bachelor’s degree in Computer Science or a related field. A strong software engineering foundation is important for this role’s success.
  • Have 8+ years of experience in technical operations roles such as SRE/NOC, designing monitoring systems and resolving production issues in fast-paced and mission-critical environments. A strong track record of troubleshooting complex technical problems at the systems level.
  • Have deep familiarity with modern monitoring, alerting, and observability practices. Hands-on experience setting up or managing metrics, logging, and tracing for distributed systems (e.g., understanding of SLIs/SLOs, alert tuning, dashboard creation).
  • Have proven experience leading incident response for high-severity outages or service disruptions. Able to perform real-time incident coordination, root cause analysis, and drive follow-ups (post-mortems, action items) to prevent recurrence. Knowledge of industry best practices for incident management and fault diagnosis.
  • Have strong skills in scripting or software engineering (e.g., Python or similar) to automate repetitive tasks and integrate tools.
  • Have solid understanding of cloud infrastructure and distributed systems fundamentals. Comfortable working with cloud services, load balancers, databases, and containerized applications.
  • Are effective at working cross-functionally in a high-trust environment. Strong communication skills to explain technical issues and resolutions to both engineering and non-technical stakeholders. You can coordinate efforts across teams and are comfortable providing updates in the midst of an ongoing incident.
Responsibilities
  • Be among the foremost technical and troubleshooting experts for our API platform at OpenAI. You are the last line of defense before the core Engineering team.
  • Proactively identify and implement opportunities to scale support operations by leveraging automation and advancements in AI technologies. Contribute to shaping the future of technical support in an AI-driven era.
  • Configure and use advanced monitoring and alerting workflows to proactively detect customer impacting issues in real time.
  • In partnership with engineering, contribute to reliability reviews and preparedness for new features, launches, or strategic customer requirement updates. Ensure that operational readiness (monitoring, alerting, and fallback plans) is in place for any such changes.
  • Design and refine incident response processes and documentation across strategic customers, engineering and support teams.
  • Analyze operational metrics and incident RCAs to identify areas for improvement. Proactively recommend and implement enhancements to monitoring dashboards, alert configurations, and support workflows.
  • Provide support coverage during holidays and weekends based on business needs.
Desired Qualifications
  • Have a Bachelor’s degree in Computer Science or a related field. A strong software engineering foundation is important for this role’s success.
  • Have 8+ years of experience in technical operations roles such as SRE/NOC, designing monitoring systems and resolving production issues in fast-paced and mission-critical environments. A strong track record of troubleshooting complex technical problems at the systems level.
  • Have deep familiarity with modern monitoring, alerting, and observability practices. Hands‑on experience setting up or managing metrics, logging, and tracing for distributed systems (e.g., understanding of SLIs/SLOs, alert tuning, dashboard creation).
  • Have proven experience leading incident response for high‑severity outages or service disruptions. Able to perform real‑time incident coordination, root cause analysis, and drive follow‑ups (post‑mortems, action items) to prevent recurrence. Knowledge of industry best practices for incident management and fault diagnosis.
  • Have strong skills in scripting or software engineering (e.g., Python or similar) to automate repetitive tasks and integrate tools.
  • Have solid understanding of cloud infrastructure and distributed systems fundamentals. Comfortable working with cloud services, load balancers, databases, and containerized applications.
  • Are effective at working cross‑functionally in a high‑trust environment. Strong communication skills to explain technical issues and resolutions to both engineering and non‑technical stakeholders. You can coordinate efforts across teams and are comfortable providing updates in the midst of an ongoing incident.

OpenAI conducts AI research and deployment to build advanced AI models and tools that help people automate tasks, be more creative, and make better decisions. Its products include ChatGPT, a conversational AI that can write, code, tutor, and assist in interactive tasks, and Sora, which can generate videos from text prompts. OpenAI’s models typically run through cloud-based services and subscriptions, with licensing and partnerships for broader use. The company operates a capped-profit model to balance generating revenue with ensuring safety, ethics, and long-term societal benefits. Its approach emphasizes safety, responsible deployment, and collaboration with researchers, governments, and institutions. The goal is to ensure artificial general intelligence, when it arrives, benefits all of humanity and minimizes risks.

Company Size

5,001-10,000

Company Stage

Late Stage VC

Total Funding

$196B

Headquarters

San Francisco, California

Founded

2015

Simplify Jobs

Simplify's Take

What believers are saying

  • $122 billion funding at $852 billion valuation closed March 31, 2026.
  • Nvidia's $30 billion investment funds chips and data centers.
  • $4 billion Deployment Company backed by TPG and Bain Capital.

What critics are saying

  • Ilya Sutskever's Safe Superintelligence Inc competes using OpenAI knowledge.
  • $14 billion 2026 losses and doubled GPT-5.5 pricing drive customer churn.
  • Capped-profit model forces equity dilution in next funding round.

What makes OpenAI unique

  • OpenAI's ChatGPT reached $2 billion monthly revenue by April 2026.
  • Deployment Company acquires Tomoro's 150 engineers for enterprise AI integration.
  • GPT-5.5-Cyber gains EU Commission access ahead of Anthropic's Mythos.

Help us improve and share your feedback! Did you find this helpful?

Benefits

Health insurance

Dental and vision insurance

Flexible spending account for healthcare and dependent care

Mental healthcare service

Fertility treatment coverage

401(k) with generous matching

20-week paid parental leave

Life insurance (complimentary)

AD&D insurance (complimentary)

Short-term/long-term disability insurance (complimentary)

Optional buy-up life insurance

Flexible work hours and unlimited paid time off (we encourage 4+ weeks per year)

Annual learning & development stipend

Regular team happy hours and outings

Daily catered lunch and dinner

Travel to domestic conferences

Growth & Insights and Company News

Headcount

6 month growth

-2%

1 year growth

3%

2 year growth

2%
Daring Fireball
May 8th, 2026
Y Combinator’s Stake in OpenAI

The fact that Paul Graham personally has billions of dollars at stake with OpenAI doesn’t mean that his public opinion on Sam Altman’s trustworthiness and leadership is invalid. But it certainly seems like the sort of thing that ought to be disclosed when quoting Graham as an Altman character reference.

Bloomberg L.P.
Apr 21st, 2026
OpenAI launches ChatGPT Images 2.0 with improved chart and diagram creation

OpenAI is releasing ChatGPT Images 2.0, an updated AI image-generating software designed to create accurate charts and scientific diagrams. The company aims to make its technology more appealing to professionals. Rolling out Tuesday through ChatGPT and Codex AI coding assistant, the new model improves instruction-following and detail incorporation when generating images. It can produce visuals across multiple styles and render text in various languages. The update represents OpenAI's effort to expand its AI capabilities beyond general use cases into professional applications requiring technical precision and accuracy.

Bloomberg L.P.
Apr 17th, 2026
OpenAI loses head of science initiatives and Sora AI video team leader

OpenAI's head of science initiatives and the leader of its Sora AI video team are leaving the company, adding to recent executive departures as the firm reorganises its product portfolio. The exits continue a pattern of senior leadership changes at the artificial intelligence company.

Bloomberg L.P.
Apr 16th, 2026
OpenAI unveils GPT-5.4 to tackle enterprise trust and governance concerns

OpenAI is addressing enterprise adoption challenges with GPT-5.4 "Cyber", focusing on security, trust and governance issues. Erica Brescia, managing director at Redpoint Ventures and OpenAI backer, discussed the development, emphasising that the AI cyber race centres on governance rather than purely technological advancement. The move represents OpenAI's effort to overcome barriers preventing widespread enterprise adoption of its AI systems by prioritising security features in its latest model release.

Bloomberg L.P.
Apr 16th, 2026
OpenAI launches GPT-Rosalind AI model for drug discovery to rival Google

OpenAI has launched GPT-Rosalind, an AI model designed to accelerate drug discovery and life sciences research. The model aims to extract insights from large datasets and help translate scientific studies into healthcare applications. Initially available as a research preview to select business customers, GPT-Rosalind's early users include pharmaceutical company Amgen, vaccine maker Moderna and bioscience research nonprofit the Allen Institute. The launch positions OpenAI alongside other technology companies entering the drug discovery field, as the industry seeks to demonstrate AI's potential for scientific breakthroughs. The ChatGPT maker announced the model's release on Thursday.