SRE/ Site Reliability Engineer – Middle / Senior
Updated on 2/8/2024
Bitquery

11-50 employees

Blockchain data problems solutions company
Company Overview
Bitquery's mission is to provide blockchain data products to businesses for solving real-world problems. Bitquery is an API-first product company dedicated to power and solve blockchain data problems using the ground truth, on-chain data.
Crypto & Web3

Company Stage

Seed

Total Funding

$8.5M

Founded

2019

Headquarters

New York, New York

Growth & Insights
Headcount

6 month growth

65%

1 year growth

200%

2 year growth

450%
Locations
Remote in USA
Experience Level
Entry
Junior
Mid
Senior
Expert
Desired Skills
Kubernetes
Python
Ruby
Docker
Blockchain
Go
Jenkins
Terraform
Ansible
CategoriesNew
DevOps & Infrastructure
Software Engineering
Requirements
  • 5+ years of work experience implementing, troubleshooting, and supporting infrastructure software and distributed systems
  • Support experience software in Golang, python , Ruby
  • Worked with virtualization and containerization technologies (containerd, docker, k8s) for more than 2 years
  • Set up CI of varying complexity (Jenkins) with CD to different environments
  • Experience in creating and maintaining a fault-tolerant system, with log coverage, monitoring, and alerting
  • Understanding the principle of 'infrastructure as code' and the ability to test it (Ansible Terraform)
  • Principles of organizing network security (IPsec, WAF, IPS)
  • Experience with maintenance of blockchain nodes
Responsibilities
  • Ensuring the smooth operation of software, environments and company services
  • Analyzing and improving the performance and availability of products
  • Identification of bottlenecks in the architecture and in the infrastructure
  • Improvement of system alerting and incident management
  • Improvements of the monitoring systems based on SLI (Prometheus, Icinga, Grafana etc.)
  • Formalization of SLI under the main business requirements
  • Formation of SLO for services and infrastructure in general
  • Minimization of system recovery time (RPO and RTO)
  • Analysis of incidents in the prod environment
  • Capacity management
Desired Qualifications
  • Opportunity to work & collaborate with a truly global team spread across 5 countries
  • Work from anywhere in the world
  • Choose your own work hours
  • Yearly trip with Bitquery team to any remote destination
  • A promise to finish the interview processes within 1-2 weeks
  • Flat hierarchy in the organization where individuals are empowered and provided an opportunity to deliver results as per his/her working style