Full-Time

Staff Site Reliability Engineer

SentinelOne

SentinelOne

1,001-5,000 employees

Autonomous endpoint protection software

Cybersecurity
AI & Machine Learning

$148000 - $204000

Employee Stock Purchase Program

Senior, Expert

Remote in USA

Required Skills
Kubernetes
Python
JavaScript
Communications
Ruby
Java
AWS
Go
Jenkins
Google Cloud Platform
Requirements
  • 7+ years of experience in Site Reliability Engineering
  • 5+ years of production experience with orchestration systems like Kubernetes, Nomad or Mesos
  • Experience with a scripting language, such as Python, Golang, Java, or Ruby
  • Familiarity with running Java and JavaScript applications, including build and deploy
  • AWS experience, and familiarity with other platforms like GCP
  • Experience using Infrastructure as Code (IaC) to setup cloud-native services
  • Familiarity with CI and practical delivery using Jenkins, GHA, ArgoCD, etc. or similar; familiarity with deployment strategies like blue-green, rolling deploys, canary deploys, and best practices around deployment automation
  • Curiosity, fast-learning, and great communication skills
  • Preferred: 2+ years of experience in a FedRAMP environment
  • Ability to work in a diverse and distributed team
  • Self-starter attitude, with passion for new technologies and empathy for legacy systems
  • Ability to learn quickly, and navigate through unfamiliar programming languages, systems, and processes
Responsibilities
  • Support the stability, reliability, and scalability of SentinelOne’s distributed systems through various tasks performed by the Site Reliability Engineering organization including managing Kubernetes, creating IaC, and leading troubleshooting during incident response
  • Identify areas, such as performance issues and availability concerns, as well as perform other technical and architectural reviews to partner with fellow engineering teams to improve overall reliability of SentinelOne systems
  • Design and implement comprehensive monitoring and alerting, as well as concepts such as SLIs/SLOs and critical user journeys to provide deeper insight into the performance and availability of SentinelOne’s systems
  • Analyze systems, identify toil, and develop and implement strategies such as automation to streamline and optimize SRE’s support of critical systems

Company Stage

N/A

Total Funding

$796.5M

Headquarters

Mountain View, California

Founded

2013

Growth & Insights
Headcount

6 month growth

10%

1 year growth

19%

2 year growth

79%

Benefits

Medical, Vision, Dental, 401(k), Commuter, Health and Dependent FSA

Unlimited PTO

Industry leading gender-neutral parental leave

Paid Company Holidays

Paid Sick Time

Employee stock purchase program

Disability & life insurance

Employee assistance program

Gym membership reimbursement

Cell phone reimbursement

Numerous company-sponsored events