Simplify Logo

Full-Time

Staff Site Reliability Engineer

Remote

Posted on 2/3/2024

SentinelOne

SentinelOne

1,001-5,000 employees

AI-based autonomous endpoint security platform

Hardware
Cybersecurity
AI & Machine Learning

Compensation Overview

$148k - $204kAnnually

Senior, Expert

Remote in USA

Category
DevOps & Infrastructure
DevOps Engineering Management
Site Reliability Engineering
IT & Security
Cloud Engineering
Required Skills
Kubernetes
Python
JavaScript
Communications
Ruby
Java
AWS
Go
Jenkins
Google Cloud Platform
Requirements
  • 7+ years of experience in Site Reliability Engineering
  • 5+ years of production experience with orchestration systems like Kubernetes, Nomad or Mesos
  • Experience with a scripting language, such as Python, Golang, Java, or Ruby
  • Familiarity with running Java and JavaScript applications, including build and deploy
  • AWS experience, and familiarity with other platforms like GCP
  • Experience using Infrastructure as Code (IaC) to setup cloud-native services
  • Familiarity with CI and practical delivery using Jenkins, GHA, ArgoCD, etc. or similar; familiarity with deployment strategies like blue-green, rolling deploys, canary deploys, and best practices around deployment automation
  • Curiosity, fast-learning, and great communication skills
  • Preferred: 2+ years of experience in a FedRAMP environment
  • Ability to work in a diverse and distributed team
  • Self-starter attitude, with passion for new technologies and empathy for legacy systems
  • Ability to learn quickly, and navigate through unfamiliar programming languages, systems, and processes
Responsibilities
  • Support the stability, reliability, and scalability of SentinelOne’s distributed systems through various tasks performed by the Site Reliability Engineering organization including managing Kubernetes, creating IaC, and leading troubleshooting during incident response
  • Identify areas, such as performance issues and availability concerns, as well as perform other technical and architectural reviews to partner with fellow engineering teams to improve overall reliability of SentinelOne systems
  • Design and implement comprehensive monitoring and alerting, as well as concepts such as SLIs/SLOs and critical user journeys to provide deeper insight into the performance and availability of SentinelOne’s systems
  • Analyze systems, identify toil, and develop and implement strategies such as automation to streamline and optimize SRE’s support of critical systems

SentinelOne provides an Autonomous AI Endpoint Protection Platform that utilizes artificial intelligence for real-time defense and automated response capabilities, integrating prevention, detection, and remediation into a unified solution. The platform continuously learns and adapts to new threats, offering a single solution for comprehensive security measures.

Company Stage

IPO

Total Funding

$796.5M

Headquarters

Mountain View, California

Founded

2013

Growth & Insights
Headcount

6 month growth

14%

1 year growth

13%

2 year growth

43%

Benefits

Medical, Vision, Dental, 401(k), Commuter, Health and Dependent FSA

Unlimited PTO

Industry leading gender-neutral parental leave

Paid Company Holidays

Paid Sick Time

Employee stock purchase program

Disability & life insurance

Employee assistance program

Gym membership reimbursement

Cell phone reimbursement

Numerous company-sponsored events

INACTIVE