Full-Time

Site Reliability Engineer

Posted on 5/31/2024

Lyft

10,001+ employees

Rideshare, bikeshare, and electric scooter services


Senior

Montreal, QC, Canada

Required Skills
Microsoft Azure
Python
NoSQL
Java
AWS
Jenkins
CircleCI
Linux/Unix
Google Cloud Platform
Requirements
  • 5+ years of software engineering/production infrastructure industry experience
  • Experience designing, debugging and running fault-tolerant large-scale distributed systems
  • Experience with high-level programming languages (Python, Go, Java, etc.)
  • Experience working with public cloud platforms (e.g., AWS, Google Cloud Platform, Microsoft Azure, etc.)
  • Experience bringing software to production at high scale
  • Experience with common CI tools (Jenkins, Buildkite, CircleCI, TeamCity), and proficiency in at least one of them, is an asset
  • Experience working with relational or NoSQL databases is an asset
  • Experience in Linux system administration, or familiarity with managing a fleet of Linux servers, is an asset
  • Must be fluent in spoken and written English and, at a minimum, willing to learn French if required
Responsibilities
  • Help define the team’s roadmap and architecture based on technology and business needs
  • Design and implement effective infrastructure abstractions that increase velocity of our application teams
  • Own the design, development, deployment, monitoring, operation, and maintenance of existing and new elements of our systems infrastructure
  • Build holistic visibility into SLIs, SLOs, SLAs, dependency graphs, and the past performance of software, networks, and systems so that we can continue to scale without increasing operational burden or toil
  • Use the core Site Reliability Engineering principles of change management, monitoring, emergency response, capacity planning, and production readiness reviews to run the platform
  • Step back to observe patterns and develop innovative tools and automation to minimize toil. Use those learnings to drive the best operational practices
  • Partner with the broader Lyft organization to build a culture of rigorously learning from incidents
  • Unblock, support, and effectively communicate across teams to achieve results
  • Understand the tradeoffs made in decisions and be able to explain them
  • Share knowledge by giving brown bags and tech talks and by evangelizing appropriate technology and engineering best practices

Lyft offers a flexible ride-sharing service that allows drivers to set their own schedules and earn on their own terms. The company utilizes shared rides, bikeshare systems, electric scooters, and public transit partnerships to promote transportation equity and offset carbon emissions.

Company Stage

IPO

Total Funding

$4.8B

Headquarters

San Francisco, California

Founded

2012

Growth & Insights
Headcount

6 month growth

3%

1 year growth

5%

2 year growth

3%