Site Reliability Engineer
Posted on 11/12/2022
INACTIVE
Locations
Denver, CO, USA
Experience Level
Entry
Junior
Mid
Senior
Expert
Desired Skills
Bash
Kubernetes
Python
Requirements
- 3+ years of experience in site reliability or comparable roles working in a modern containerized cloud environment
- Proficiency in scripting in at least one language (Bash, Python, Go)
- Experience implementing monitoring tools and alerting systems
- Excellent Kubernetes troubleshooting skills during incident response events
- Previous experience developing runbooks and driving process improvement
Responsibilities
- Build and maintain monitoring systems and processes to ensure product engineers get actionable data for the components they maintain
- Coordinate with the product teams to enhance the scalability and reliability of our systems through analysis and observability improvements
- Engage in capacity planning with load testing and auto-scaling strategies
- Own the incident response process, including, development of sustainable practices, learnings, and ensuring blameless postmortems
- Work across the engineering team to encourage excellence in incident response and build a culture of site reliability engineering
- Efficiently troubleshoot issues across our systems and software to determine root causes and impact
- Learn Virta's system and network architecture to take part in incident response and troubleshooting activities
- Improve monitoring and observability tooling to enhance visibility into our systems and software
- Help define and rollout Service Level Objectives and operational readiness within the Virta system
Diabetes reversal healthcare technology
Benefits
- Remote-first work environment
- Flexible work hours & time off policy
- Health insurance
- Paid parental leave
- Free Virta treatment & family discount
- Internet, home office, learning & development stipends
- Employee resource groups
- 401K & ROTH contribution
Company Core Values
- People first
- Ownership
- Impact
- No ego
- Transparency
- Evidence-based
- Risk taking & rapid iteration