Senior Site Reliability Engineer
Remote
Cyware

201-500 employees

Enterprise security operations transformation through threat intelligence
Company Overview
Cyware stands out as a leader in the cybersecurity industry with its unique Cyber Fusion solutions, which enable security teams to proactively counter threats, streamline incident handling, and significantly cut response time. The company's integrated platform, including its award-winning Threat Intelligence Platform (TIP) and Security Orchestration, Automation, and Response (SOAR) products, is customizable to specific enterprise needs. With a focus on reducing analyst burnout and improving security outcomes, Cyware serves a diverse range of clients, from enterprises and government agencies to Managed Security Service Providers (MSSPs), and is a trusted platform for global information sharing communities.
Data & Analytics
Energy
Cybersecurity
AI & Machine Learning
Financial Services
Aerospace
Consumer Goods

Company Stage

Series C

Total Funding

$73M

Founded

2016

Headquarters

Jersey City, New Jersey

Growth & Insights
Headcount

6 month growth

9%

1 year growth

7%

2 year growth

-6%
Locations
Remote in USA
Experience Level
Entry
Junior
Mid
Senior
Expert
Desired Skills
TCP/IP
Bash
Kubernetes
Microsoft Azure
Python
React.js
MySQL
Git
Postgres
Docker
AWS
Jenkins
Terraform
Vue.js
Redis
Nginx
Ansible
Linux/Unix
Django
Google Cloud Platform
Categories
DevOps & Infrastructure
IT & Security
Requirements
  • US Citizenship
  • Bachelor's degree in Computer Science, Engineering, IT, or related discipline
  • 4-7 years of experience as a site reliability engineer
  • Cloud: AWS/Azure/GCP
  • Solid understanding of Linux Systems
  • Scripting: Bash/Python
  • Development Languages and Frameworks: Python/Django, Vue, React, Go
  • Fundamentals: Basic DNS & Networking, TCP/UDP, IP Routing, HA & Load Balancing Concepts
  • Application Protocols: SMTP, HTTP, HTTPS, FTP, IMAP, POP
  • Good-to-have Applications: Database Systems Fundamentals (MySQL/Postgres), Redis, Nginx/Apache, Supervisor (supervisorctl)
  • Tools/Utilities: Nagios, Yum, RPM, Git, Grafana, Prometheus, New Relic, ELK, Docker, Jenkins
  • Certifications: RHCSA/RHCE/AWS (SysOps)
Responsibilities
  • Be on an on-call rotation to respond to incidents
  • Conduct neutral postmortems of issues and events to identify root causes
  • Use the on-call shift schedule to help prevent incidents
  • Lead data-driven production operations to define and measure error budgets, SLIs, SLOs, and SLAs
  • Run infrastructure with tools such as Ansible, Terraform, Jenkins CI/CD, Docker, Kubernetes, and Lambdas
  • Build monitoring capabilities that alert on symptoms and outages
  • Document every action for automation
  • Improve operational processes (deployments and upgrades)
  • Design, build, and maintain core infrastructure
  • Debug production issues across services and levels of the stack
  • Plan the growth of infrastructure
  • Work across time zones
  • Maintain security, integrity, and stability of the Production environment