Full-Time

Staff Site Reliability Engineer

Very Good Security

201-500 employees

Provides data security and compliance solutions

Cybersecurity
Financial Services

Compensation Overview

$165k - $210k Annually

Senior

No H1B Sponsorship

Remote in USA

Category
DevOps & Infrastructure
Site Reliability Engineering
Required Skills
Datadog
Chef
Bash
Kubernetes
Python
Puppet
Apache Kafka
Java
Docker
CloudFormation
AWS
Go
Prometheus
Terraform
Ansible
Development Operations (DevOps)
CircleCI
Linux/Unix
Requirements
  • Proven experience in SRE or DevOps roles at a staff level, with a track record of managing production systems in complex, large-scale environments.
  • Strong proficiency in AWS, including infrastructure-as-code tooling (Terraform, CloudFormation, etc.).
  • Solid understanding of cloud-native architecture, Linux systems, microservices, infrastructure-as-code (Terraform, CloudFormation, CDK), CI/CD (CircleCI, GitHub Actions, Argo), GitOps, authentication and authorization, APIs and API Gateway, Docker, Kubernetes (EKS), Kafka (MSK), Java, Spring Framework, Python, and AWS services.
  • Strong plus if you are a database wiz.
  • Expertise in monitoring and observability tools such as Prometheus, Grafana, Honeycomb, Datadog, OpenTelemetry, or New Relic to measure system health and performance.
  • Programming and scripting experience in languages such as Python, Go, Bash, or other relevant languages used in automating infrastructure.
  • Solid understanding of networking, security, and load balancing in cloud-native environments.
  • Experience with configuration management tools like Ansible, Chef, or Puppet is a plus.
  • Strong communication and collaboration skills, with the ability to lead cross-functional initiatives and mentor junior team members.
  • Experience with incident management and disaster recovery best practices.
  • Strong written and verbal communication skills.
Responsibilities
  • Architect and maintain scalable, reliable infrastructure: Design and optimize infrastructure for high availability, fault tolerance, and performance across distributed systems.
  • Lead incident management and root cause analysis: Own incident response processes, ensure swift resolution of issues, and drive post-incident improvements to prevent recurrences.
  • Service monitoring and automation: Build and maintain automated monitoring, alerting, and healing systems that improve system health, reduce manual intervention, and minimize downtime.
  • Performance tuning and capacity planning: Identify bottlenecks and optimization opportunities and execute scaling strategies to ensure efficient handling of traffic spikes and growing workloads.
  • Collaborate with cross-functional teams: Work closely with software engineers, product teams, and DevOps to enhance system reliability and delivery pipelines.
  • Improve operational processes: Champion continuous improvement initiatives in deployment, scaling, and performance testing, while advocating for the adoption of SRE best practices across the organization.
  • Mentorship and leadership: Provide technical mentorship to junior engineers, contribute to strategic decisions around infrastructure, and ensure best practices are implemented at scale.
  • Be proactive and innovative: We rely on your feedback to build a world-class product.
  • Be a part of a team that believes in the core values of transparency, collaboration, grit, and humility; in going above and beyond what is required in order to do the right thing for our customers and the company; and in having fun while doing all this!

Very Good Security (VGS) provides security infrastructure that helps businesses protect sensitive data, such as credit card information and Social Security numbers, while allowing them to focus on developing their products. Its main offerings include the VGS Vault for secure data storage and a Tokenization API that replaces sensitive data with unique tokens, so that a client's own systems hold no actual sensitive data if they are breached. This approach helps clients avoid vendor lock-in, reduce transaction fees, and increase transaction volumes. VGS serves a wide range of clients, from small startups to large corporations, and offers scalable pricing that starts free and adjusts as a client's security needs grow. The goal of VGS is to simplify data security and compliance for developers, enabling them to create products without the burden of managing sensitive data.

Company Stage

Series C

Total Funding

$100.7M

Headquarters

San Francisco, California

Founded

2015

Growth & Insights
Headcount

6 month growth

-1%

1 year growth

14%

2 year growth

50%

Simplify's Take

What believers are saying

  • Growing demand for tokenization solutions boosts VGS's market potential.
  • Partnerships with major fintechs expand VGS's reach in emerging markets.
  • VGS's Payment Optimization suite reduces transaction fees and improves approval rates.

What critics are saying

  • DOJ lawsuit against Visa may lead to regulatory changes affecting VGS.
  • Geopolitical risks in Africa and the Middle East could impact VGS's operations.
  • Rapid growth may strain VGS's resources, risking service quality.

What makes Very Good Security unique

  • VGS offers a unique tokenization API for secure data redaction and revelation.
  • Their scalable pricing model supports businesses from startups to large corporations.
  • VGS's partnership with Onafriq enhances security for fintechs in Africa and the Middle East.
