Full-Time

Lead Site Reliability Engineer

Confirmed live in the last 24 hours

hims & hers

hims & hers

1,001-5,000 employees

Telehealth platform for personalized medical treatments

Consumer Software

Compensation Overview

$150k - $175kAnnually

+ Equity Grant

Senior, Expert

Remote in USA

Category
DevOps & Infrastructure
Site Reliability Engineering
Required Skills
Kotlin
Datadog
Chef
Kubernetes
Python
Puppet
MySQL
Git
SQL
Java
Postgres
Docker
TypeScript
Prometheus
Terraform
Ansible
Requirements
  • 10+ years as a software engineer, shipping production code
  • 5+ years of experience as a Site Reliability Engineer or Production support Engineer
  • Bachelor's degree in Computer Science, Engineering, or related field, or relevant years of work experience
  • Experience with service-oriented architectures and microservices at scale
  • Strong proficiency with RDBMS databases (PostgreSQL, MySQL, SQL Server, etc.)
  • Strong proficiency in SQL scripting
  • Proficiency developing in one or more languages such as Java, Kotlin, Python, and/or others
  • Ability to use containers and orchestration frameworks (Kubernetes, Docker, Container registries etc.)
  • Knowledge of CDN, typescript frameworks, and GQL.
  • Knowledge and good understanding of any pub/sub / Queue messaging systems
  • Proficiency in Git or other VCS
  • Experience with configuring, customizing, and extending monitoring tools (Datadog, Prometheus, New Relic etc.)
  • Excellent debugging and troubleshooting skills
  • Strong technical competency, with a data-driven analytical approach towards solving complex challenges
  • Have a systematic problem-solving approach, coupled with strong and effective communication skills and a sense of drive
  • Nice-to-have: Experience with Terraform or other IAC tools such as Chef, Puppet or Ansible
Responsibilities
  • Design and implement SRE practices ensuring availability, scalability and observability of production systems with a strong focus on excellent customer experience
  • Actively seek and identify opportunities to improve the availability and performance of the system by applying the learnings from monitoring and observation
  • Use automation extensively to design, configure, manage, and monitor systems in support of our product development teams
  • Understanding of Infrastructure and infra automation (Infrastructure as Code)
  • Manage incidents and emergency response, track outages, ensure data integrity and engineer releases to promote safe, efficient and rapid deployments
  • Handle emergency response either by being on-call or by reacting to symptoms according to monitoring and escalation when needed
  • Improve the codebase by resolving logic issues, deprecating unused code, etc.
  • Implement monitoring, logging, alerting and SLO Reporting
  • Identify Service Level Indicators (SLIs) that will align the team to meet the availability and performance objectives
  • Perform and run blameless RCAs on incidents and outages aggressively looking for answers that will prevent incident reoccurrence
  • Provides reviews on design documents from internal and external teams
  • Performs more-complex tasks using highly-specialized knowledge and advanced business experience
  • Resolves complex tickets in creative manners
  • Develops and leads large and highly-complex cross-functional projects or programs
  • Determines solutions to blockers, identify tasks, and developing solutions as appropriate
  • Responsible for at least for 1 major delivery domain and accountable for all the aspects of SRE for that domain
  • Develops standards, tools, and knowledge requirements for skill and career development

Hims & Hers is recognized for blending telehealth convenience with a wide range of personalized medical services, from sexual health to mental health. The employment environment is backed by a commitment to technical excellence and a progressive approach to healthcare, offering opportunities to work on cutting-edge treatments that address diverse patient needs. Its culture promotes innovation and patient-centric solutions, providing a motivating workspace for professionals looking to impact healthcare accessibility and quality.

Company Stage

IPO

Total Funding

$183.2M

Headquarters

San Francisco, California

Founded

2017

Growth & Insights
Headcount

6 month growth

2%

1 year growth

18%

2 year growth

88%

Benefits

Full healthcare - High-coverage medical, dental & vision coverage for individuals and families

Generous PTO

Retirement planning - Take advantage of our 401(k) plan including contribution matching

WFH stipend

Robust compensation

Employee discount

Utility stipend - An extra $75 each month to cover extra cell phone, internet, or data usage

Spending accounts - Options for additional HSA and FSA plans to help toward healthcare costs