Simplify Logo

Full-Time

Site Reliability Engineering Manager

Confirmed live in the last 24 hours

Aerospike

Aerospike

201-500 employees

High-performance NoSQL database for real-time applications

Data & Analytics
Enterprise Software

Compensation Overview

$220k - $240kAnnually

Senior

Mountain View, CA, USA

Category
DevOps & Infrastructure
Site Reliability Engineering
Engineering Management
Required Skills
Kubernetes
Microsoft Azure
NoSQL
Management
Docker
AWS
Terraform
Ansible
Requirements
  • 5+ years providing 24x7 production support for cloud-based, business-critical systems, with demonstrated leadership in managing operations for enterprise-class organizations during challenging situations (e.g., service incidents, degradations, disaster recovery, etc.)
  • 2+ years of experience in technical leadership or management roles
  • Experience with at least one of the major public cloud providers: AWS, Google, Azure
  • Experience with continuous integration/continuous deployment (CI/CD) pipelines.
  • Experience with automation pipelines for cloud infrastructure and software using technologies such as Terraform, Packer, and Ansible
  • Experience supporting distributed, multitenant, auto-scalable backend services
  • Experience with NoSQL or relational databases, and database fundamentals, including data storage, data replication, data modeling, and data access patterns
  • Experience with maintaining distributed services on both virtual machines and containers (Docker) with orchestration (Kubernetes, EKS, GKE)
  • Experience with documenting complex procedures and architectures, including diagramming
  • Experience with cryptographic fundamentals and best practices
  • Experience assessing security vulnerabilities in code and running systems
Responsibilities
  • Team Building and Management: Recruit, onboard, and develop top talent, fostering a collaborative and inclusive team culture focused on innovation, continuous learning, and excellence. Manage a regional SRE team in collaboration with other regional SRE Managers.
  • Technical Leadership: Provide technical guidance, mentorship, and oversight to a team of site reliability engineers, ensuring the delivery of high-quality, scalable, and reliable solutions. Lead by example in hands-on development reviews and code reviews, ensuring adherence to best practices, coding standards, and quality assurance processes.
  • Cross-Functional Collaboration: Work closely with Aerospike Cloud Engineering teams, Product Management, Product Support, and other teams to ensure seamless integration and delivery of end-to-end solutions. Coordinate with Account Management and Professional Services to ensure successful onboarding of new cloud customers and maintenance activities for existing ones.
  • Operational Expertise: Be an Aerospike expert and understand all supported cloud deployment patterns for the distributed database, failure scenarios, and remediation plans. Contribute to improvements to observability and automation systems to ensure the reliability, availability, and performance of our cloud infrastructure and to ensure that all critical business KPIs are met on behalf of the business and our customers. Understand the nuances of customer requirements such as infrastructural or security needs and ensure our operational practices support any requirements.
  • On-Call Procedures: Ensure on-call procedures follow industry standard best practices, manage schedules and escalation policies in PagerDuty, and participate in the manager on-call duties. Drive incident retrospectives, root cause analyses, and on-call remediation activities.

Aerospike builds high-performance databases that are designed for real-time applications, catering to businesses that need quick and reliable access to large amounts of data. Their main product is a NoSQL database that can process millions of transactions per second with minimal delay, making it suitable for industries like finance, telecommunications, e-commerce, and advertising technology. Aerospike's database can be deployed in various environments, including on-premises, cloud platforms like Amazon EC2 and Google Cloud, and hybrid setups. It also supports container tools such as Kubernetes and Docker for flexible deployment. Unlike many competitors, Aerospike focuses on providing enterprise-grade solutions with features like cross-datacenter replication and strong consistency. The company's goal is to help large enterprises efficiently manage and access their data in real-time, generating revenue through software licenses and professional consulting services.

Company Stage

Series E

Total Funding

$241M

Headquarters

Mountain View, California

Founded

2009

Growth & Insights
Headcount

6 month growth

7%

1 year growth

12%

2 year growth

46%
Simplify Jobs

Simplify's Take

What believers are saying

  • Aerospike's recent $109M Series E funding round positions the company for significant growth and innovation, particularly in the AI sector.
  • Winning the 2024 SEAL Business Sustainability Award highlights Aerospike's commitment to sustainable and impactful product development.
  • The release of Aerospike Database 7 with enhanced in-memory capabilities demonstrates the company's ongoing commitment to technological advancement and enterprise resiliency.

What critics are saying

  • The competitive landscape in the database software market is intense, with major players like Redis and traditional SQL databases posing significant challenges.
  • Rapid expansion and new appointments, such as the recent country managers for Southeast Asia and India, may lead to operational and strategic misalignments.

What makes Aerospike unique

  • Aerospike's NoSQL database is optimized for real-time applications, handling millions of transactions per second with low latency, which sets it apart from traditional databases.
  • The company offers flexible deployment options, including on-premises, cloud-based, and hybrid environments, as well as support for container orchestration tools like Kubernetes and Docker.
  • Aerospike's focus on enterprise-grade features such as cross-datacenter replication and strong consistency makes it particularly appealing to large enterprises in sectors like finance and telecommunications.