Staff Site Reliability Engineer
Kubernetes
Posted on 3/7/2024
INACTIVE
Peloton

1,001-5,000 employees

Interactive fitness platform with on-demand classes
Company Overview
Peloton Interactive is a global leader in the connected fitness industry, offering a comprehensive fitness ecosystem that combines top-tier equipment, software, and content to make fitness accessible and effective for everyone. The company's culture is centered around fostering social connections and motivation among its 6.7 million members, with a vast library of live and on-demand studio classes available across multiple platforms and devices. Peloton's competitive advantage lies in its unique blend of fitness, technology, and media, offering a variety of membership and payment options, and extending its reach to corporate wellness and commercial sectors.
Consumer Software

Company Stage

N/A

Total Funding

$1.9B

Founded

2011

Headquarters

New York, New York

Growth & Insights
Headcount

6 month growth

-2%

1 year growth

-5%

2 year growth

-25%
Locations
New York, NY, USA
Experience Level
Entry
Junior
Mid
Senior
Expert
Desired Skills
Chef
Kubernetes
Python
Puppet
Git
Java
AWS
Go
Jenkins
Terraform
Ansible
SCRUM
CategoriesNew
DevOps & Infrastructure
DevOps Engineering
Site Reliability Engineering
IT & Security
Requirements
  • Master's degree in Computer Science, Engineering, or a similar field of study or equivalent work experience
  • 8+ years of experience in software engineering, with a proven understanding of Kubernetes and Infrastructure as Code
  • 4+ years of systems configuration and automation experience (e.g. Ansible, Chef, Puppet, Terraform)
  • Extensive knowledge and hands-on experience in AWS Cloud infrastructure and Services, including CI/CD and IaC provisioning tools such as Jenkins, ArgoCD, Scalr, Terraform, and Github Actions
  • Experience with a programming language like Python, Golang, or Java
  • Knowledge of best practices in observability and monitoring for Kubernetes clusters at scale with experience in cost optimization tools like Kubecost, Goldilocks, etc.
  • Knowledge of standard processes in regards to securing a Kubernetes cluster and its deployments at scale
Responsibilities
  • Help others in design, execution, and problem-solving
  • Host a critical infrastructure that ensures that developers have the best experience possible on thousands of Kubernetes pods across multiple clusters
  • Develop and lead the Container Orchestration Platform, leading all aspects of a diverse ecosystem of over 2,000 applications
  • Adhere to standard methodologies in architectural design, testing (unit, integration, visual, and regression), and scrum methodology
  • Evaluate developer platform designs, technical decisions, and code to ensure all are high quality, efficient, and well documented
  • Assist in planning, execution, and updating of technical roadmaps
  • Automate everything, from infrastructure down to day-to-day tasks
  • Drive incident management processes, following industry practices and conducting timely post-mortems of infrastructure incidents
  • Assist with operational security and compliance seek out potential threats to security and reliability and advocate solutions
  • Participate in a rotating on-call duty schedule, providing support and assistance for the services within our team's responsibility