Staff Site Reliability Engineer
Confirmed live in the last 24 hours
Locations
Remote
Experience Level
Entry
Junior
Mid
Senior
Expert
Desired Skills
AWS
Terraform
Kubernetes
TypeScript
Requirements
  • You have experience working with teams of engineers to mentor and help them understand and improve their application design for reliability and performance purposes
  • You have expertise in a modern programming language (e.g., Engineering at Lattice uses TypeScript for most of its work) and know how to debug, analyze, and improve applications
  • You've got a good understanding of SRE practices, such as measuring application SLOs/SLIs, analyzing availability, utilizing observability tooling, and managing incidents in a productive way
  • You've worked with Kubernetes, AWS, and distributed systems in production workloads
  • You have experience describing infrastructure as code (IaC) for production workloads
  • Proficiency in leveraging CI/CD tools to automate testing and deployment
Responsibilities
  • Partner closely with product engineering teams to influence and promote SRE practices during development
  • Instrument, monitor, and improve service code for reliability and performance, especially in TypeScript
  • Implement, operate, and recommend efficient use of resilient infrastructure for Lattice SaaS applications
  • Lead and grow a team that partners closely with product engineering, ensuring those partner teams can build and maintain their applications reliably
  • Provide expertise in observability and incident management while adhering to SRE principles
  • Contribute to improvements across resilient cloud infrastructure using tools such as AWS, Kubernetes, and Terraform
  • Participate in on-call support rotation with the team
  • You love mentoring and supporting other software engineers who are newer to the industry
  • You have experience taking the lead in planning and executing the development roadmap
  • You make the engineering team more effective through the pragmatic application of useful code tools and patterns
Lattice

501-1,000 employees

People success platform