Full-Time

Staff Engineer

Cloud Infrastructure

Amplitude

501-1,000 employees

Product analytics for user behavior insights

Data & Analytics
Enterprise Software

Senior, Expert

San Francisco, CA, USA

Category
DevOps & Infrastructure
Site Reliability Engineering
Cloud Engineering
Required Skills
Datadog
Kubernetes
Python
Airflow
Apache Kafka
AWS
Go
Prometheus
Jenkins
Terraform
Splunk
Requirements
  • 8+ years of experience in some combination of cloud-native software development, platform engineering, site reliability engineering, and/or cloud infrastructure, with a more recent focus on Kubernetes and the cloud-native ecosystem.
  • Strong expertise in Kubernetes and related CNCF projects (e.g., Argo CD/Workflows, Backstage, Envoy, CoreDNS) and in simplifying complex cloud infrastructure for broader teams.
  • Operational experience at scale with technologies like Kafka and Airflow.
  • Proficient in common infrastructure languages like Golang, Python, and Terraform, with experience developing and operating production systems.
  • Extensive experience with AWS cloud infrastructure, networking, and security.
  • Proven experience with monitoring and observability tools (Datadog, Splunk, Prometheus, Grafana Cloud, etc.) and a strong understanding of system performance tuning.
  • Expertise in building abstractions over Kubernetes to simplify developer interaction with the platform.
  • Excellent communication skills, with the ability to collaborate across teams, build consensus, and drive initiatives in a high-pressure environment.
  • High level of empathy and patience, with a commitment to mentoring and helping others succeed, and the ability to incorporate feedback and turn it into actionable improvements.
  • Experience with infrastructure-as-code and automation (Terraform, Helm, Kustomize, etc.), with a focus on reducing toil and operational overhead.
  • A mindset focused on improving the developer experience and business alignment, with the flexibility to make decisions that may go against ideal technical preferences when necessary.
Responsibilities
  • Lead the design, implementation, and management of our Kubernetes-based platform, focusing on scalability, developer experience, and system reliability.
  • Architect and maintain automation around Kubernetes, ensuring that the platform is easy for developers to use and requires minimal toil to deploy or modify workloads in a self-service model.
  • Collaborate with cross-functional teams (developers, leaders, and other infrastructure teams) to gather requirements, build consensus, and deliver impactful solutions.
  • Integrate observability into the platform, using tools like Datadog, Prometheus, Grafana, New Relic, and Splunk to monitor system health and performance.
  • Drive infrastructure-as-code initiatives using tools like Kubernetes Operators, Helm, Kustomize, and Terraform, promoting automation, repeatability, and reliability.
  • Ensure that the platform integrates seamlessly with CI/CD pipelines (using Argo CD / Workflows / Rollouts, GitHub Actions, Jenkins, or similar) and continuously improve developer workflows.
  • Contribute to the operational excellence of the platform, including on-call responsibilities and incident management, while building self-healing capabilities where possible.
  • Act as a mentor to other engineers on the team, promoting growth and knowledge sharing, ensuring that the team thrives even in the absence of specific individuals.
  • Foster a culture of collaboration, empathy, and trust within the team and across departments, helping to bridge gaps between engineering and other business functions.
  • Take a hands-on approach to problem-solving, sometimes submitting PRs to resolve issues in codebases or providing detailed solutions when teams need assistance.

Amplitude provides tools for product analytics that help businesses understand how users interact with their products. The platform allows companies to track product usage, assess the impact of new features, and analyze user journeys to identify areas for improvement. This helps businesses enhance user experiences and increase customer retention. Amplitude stands out from competitors by offering a subscription-based service with various features tailored to different client needs, along with integrations to major platforms like AWS and Segment. The goal of Amplitude is to empower businesses to make data-driven decisions that foster growth and optimize their product offerings.

Company Stage

IPO

Total Funding

$326.8M

Headquarters

San Francisco, California

Founded

2012

Growth & Insights
Headcount

6 month growth

-3%

1 year growth

6%

2 year growth

4%

Simplify's Take

What believers are saying

  • Growing demand for real-time analytics boosts Amplitude's market potential.
  • Expansion into emerging markets like South Africa offers growth opportunities.
  • New Web Experimentation tool aligns with trends in personalized user experiences.

What critics are saying

  • Departure of CFO Christopher Harms may lead to financial instability.
  • unitQ GPT poses a competitive threat with real-time actionable insights.
  • Web Experimentation tool faces competition from established A/B testing platforms.

What makes Amplitude unique

  • Amplitude offers a comprehensive product intelligence platform for diverse industries.
  • The platform integrates with major tech stacks like AWS and Snowflake.
  • Amplitude provides educational resources to maximize platform value for clients.

Benefits

Parental Leave

Wellness Stipend

Unlimited PTO

Stock Exercise Window