Facebook pixel

Site Reliability Engineer
Posted on 11/12/2022
INACTIVE
Locations
Denver, CO, USA
Experience Level
Entry
Junior
Mid
Senior
Expert
Desired Skills
Bash
Kubernetes
Python
Requirements
  • 3+ years of experience in site reliability or comparable roles working in a modern containerized cloud environment
  • Proficiency in scripting in at least one language (Bash, Python, Go)
  • Experience implementing monitoring tools and alerting systems
  • Excellent Kubernetes troubleshooting skills during incident response events
  • Previous experience developing runbooks and driving process improvement
Responsibilities
  • Build and maintain monitoring systems and processes to ensure product engineers get actionable data for the components they maintain
  • Coordinate with the product teams to enhance the scalability and reliability of our systems through analysis and observability improvements
  • Engage in capacity planning with load testing and auto-scaling strategies
  • Own the incident response process, including, development of sustainable practices, learnings, and ensuring blameless postmortems
  • Work across the engineering team to encourage excellence in incident response and build a culture of site reliability engineering
  • Efficiently troubleshoot issues across our systems and software to determine root causes and impact
  • Learn Virta's system and network architecture to take part in incident response and troubleshooting activities
  • Improve monitoring and observability tooling to enhance visibility into our systems and software
  • Help define and rollout Service Level Objectives and operational readiness within the Virta system
Virta Health

51-200 employees

Diabetes reversal healthcare technology
Benefits
  • Remote-first work environment
  • Flexible work hours & time off policy
  • Health insurance
  • Paid parental leave
  • Free Virta treatment & family discount
  • Internet, home office, learning & development stipends
  • Employee resource groups
  • 401K & ROTH contribution
Company Core Values
  • People first
  • Ownership
  • Impact
  • No ego
  • Transparency
  • Evidence-based
  • Risk taking & rapid iteration