Full-Time

Engineering Manager

Observability

Posted on 11/8/2024

OpenAI

5,001-10,000 employees

Develops safe and beneficial AI technologies

AI & Machine Learning

Senior, Expert

San Francisco, CA, USA

Category
Engineering Management
Software Development Management
Required Skills
Datadog
Microsoft Azure
Grafana
AWS
Prometheus
Google Cloud Platform

Requirements
  • Experience building and operating an observability stack from scratch, ideally in a cloud-based environment
  • Technical expertise in observability tools and technologies (e.g., Datadog, Prometheus, Grafana, ELK stack)
  • Deep understanding of cloud platforms (e.g., AWS, GCP, Azure) and their role in observability
  • Strong track record of building and maintaining scalable systems in a cloud-based environment
  • Skilled in collaborating with cross-functional teams
Responsibilities
  • Lead and grow a team of observability engineers, fostering a culture of collaboration and innovation
  • Lead a team in building the observability stack, including monitoring, logging, and tracing, ensuring scalability and cost-efficiency
  • Work closely with product and infrastructure teams to integrate observability tools into their development workflows
  • Scale the observability infrastructure to meet the demands of fast-growing products while managing operational costs
  • Ensure system reliability by identifying and addressing performance bottlenecks
  • Set the strategic direction for observability tools, processes, and infrastructure, with a focus on scalability and delightful UX
  • Stay updated with the latest trends in observability and cloud-native technologies, continuously seeking out improvements
  • Build and maintain strong cross-functional relationships, ensuring that all product teams have visibility into their systems and services
Desired Qualifications
  • Comfortable working in a fast-moving startup environment and able to adapt to the pace of rapid growth
  • Understanding of the challenges of building scalable observability backends and appreciation for the importance of a user-friendly interface
  • Humble, coachable attitude and eagerness to learn and grow as a leader

OpenAI develops artificial intelligence technologies aimed at benefiting humanity. The company creates advanced AI models that can perform various tasks, such as automating processes and enhancing creativity. OpenAI's products, like Sora, allow users to generate videos from text descriptions, showcasing the capabilities of its AI systems. Unlike many competitors, OpenAI operates under a capped-profit model, which limits the profits it can make and redistributes excess earnings to maximize the social benefits of AI. This commitment to safety and ethical considerations distinguishes OpenAI in the AI market. The company's goal is to ensure that artificial general intelligence (AGI) is developed responsibly and benefits all of humanity.

Company Stage

Debt Financing

Total Funding

$18.4B

Headquarters

San Francisco, California

Founded

2015

Growth & Insights
Headcount

6 month growth

0%

1 year growth

-7%

2 year growth

-9%

Simplify's Take

What believers are saying

  • OpenAI's involvement in Project Stargate boosts infrastructure and tech partnerships.
  • The 'Operator' AI agent positions OpenAI as a leader in web-based task automation.
  • Increasing inference-time compute could set new industry standards for AI security.

What critics are saying

  • Meta's $65 billion AI investment could surpass OpenAI's capabilities.
  • Apple's AI division enhancement poses a competitive threat to OpenAI.
  • Project Stargate may lead to resource allocation challenges for OpenAI.

What makes OpenAI unique

  • OpenAI's capped-profit model prioritizes ethical AI development over unlimited profit.
  • The 'Operator' AI agent showcases OpenAI's focus on practical, autonomous AI applications.
  • OpenAI's research on inference-time compute enhances AI security against adversarial attacks.

Benefits

Health insurance

Dental and vision insurance

Flexible spending account for healthcare and dependent care

Mental healthcare service

Fertility treatment coverage

401(k) with generous matching

20-week paid parental leave

Life insurance (complimentary)

AD&D insurance (complimentary)

Short-term/long-term disability insurance (complimentary)

Optional buy-up life insurance

Flexible work hours and unlimited paid time off (we encourage 4+ weeks per year)

Annual learning & development stipend

Regular team happy hours and outings

Daily catered lunch and dinner

Travel to domestic conferences

INACTIVE