Senior Site Reliability Engineer
Confirmed live in the last 24 hours
Locations
United Kingdom • United States
Experience Level
Entry
Junior
Mid
Senior
Expert
Desired Skills
AWS
Elasticsearch
Google Cloud Platform
Java
Microsoft Azure
Postgres
Redis
Kubernetes
Python
Requirements
  • Expert in at least one cloud platform: AWS, Azure, or GCP
  • Deep understanding of Kubernetes
  • Hands-on experience with at least one of the following tools: Argo CD, Argo Workflows, Loft vCluster, or Prometheus/Grafana
  • Experience in one or more of the following: Java, Python or Go
  • Working experience with multi-tenancy on Kubernetes
  • Exceptional verbal, presentation, and written communication skills to convey information clearly to different audiences
Responsibilities
  • Debug technical issues in a complex stack involving virtualization, containers, microservices, etc.
  • Take ownership of customer cloud issues and follow up on the status of problems with respective workgroups on behalf of the end customer
  • Proactively take calls and manage service levels and call abandonment rates during business-critical periods
  • Handle major incidents by coordinating with multiple teams
  • Manage outage events proactively and lead from the front while acting as a Priority Incident Manager
  • Generate scripts and templates required for the automatic provisioning of resources
  • Stay well informed about cloud-related failures and complications
  • Be the driving force for correct incident response and blameless postmortems
Desired Qualifications
  • Working experience with Cassandra, Elasticsearch, PostgreSQL or Redis is a plus
Atlan

51-200 employees

Data collaboration workspace
Company Overview
Atlan is on a mission to help democratize enterprise data. The company is building a collaboration platform for data teams—allowing them to truly democratize both internal and external data, while automating repetitive tasks.
Company Core Values
  • Bias for Action
  • Never Be Satisfied
  • Being Straightforward
  • Giving 120%
  • Problem First, Solution Second
  • One Team