Full-Time

Senior Operations Engineer

Site Reliability, Elasticsearch and Kubernetes

Posted on 8/22/2024

Recorded Future

1,001-5,000 employees

Provides machine-readable threat intelligence solutions

Cybersecurity

Senior, Expert

Boston, MA, USA; Arlington, VA, USA

Category
DevOps & Infrastructure
Site Reliability Engineering
Cloud Engineering
Required Skills
TCP/IP
Datadog
RabbitMQ
Kubernetes
Python
Apache Spark
Apache Kafka
Postgres
Docker
Elasticsearch
Redis
Development Operations (DevOps)
Data Analysis
Requirements
  • Experience with one or more metrics collection tools (managed and self-hosted): Prometheus, Datadog, InfluxDB, Telegraf, VictoriaMetrics.
  • Experience building, or a desire to build, visualization dashboards.
  • Experience deploying applications in a Kubernetes environment (preferred) or with other containerization tooling (Docker, etc.).
  • Ideally, experience with Kubernetes administration and custom operators.
  • Ideally, experience with Python.
  • Experience with network and vulnerability scanning tools is a huge plus (nmap, zmap, masscan, nuclei, Burp, Nessus, etc.).
  • Familiarity with the network stack and operating system tuning: TCP/IP, conntrack, NAT, etc.
  • Experience with data warehouses (Elasticsearch, ClickHouse, ScyllaDB, PostgreSQL).
  • Experience with data processing pipelines using Apache Spark.
  • Experience with messaging queues such as Apache Kafka, RabbitMQ, and Redis.
Responsibilities
  • Work with Product Management to determine internal Service Level Objectives (SLOs) representing the long-term health of the scanning system (both infrastructure and application).
  • Build and maintain metric collection that can feed Service Level Indicators (SLIs) to guarantee SLOs.
  • Build and maintain dashboards displaying SLIs, SLOs, and other key metrics indicating the health of the application.
  • Investigate cases where the scanning infrastructure or application is not performing as desired, provide guidance to the development team for improvements, and develop additional metric collection/alerting as needed.
  • Identify application and deployment configuration optimizations that make the best use of resources, and flag impending resource gaps.
  • Work closely with DevOps to maintain and improve custom Kubernetes-based application deployment operators.
  • Identify and fix automation gaps to reduce operational toil.
  • Ensure the reliability and performance of the data platform, including data warehouses (Elasticsearch, ClickHouse, ScyllaDB, PostgreSQL).
  • Manage and optimize data processing pipelines using Apache Spark.
  • Work with messaging queues such as Apache Kafka, RabbitMQ, and Redis to ensure seamless data flow and processing.
  • Develop and maintain metric collection, monitoring, and alerting for the data platform.
  • Build and maintain dashboards displaying key metrics for data platform health and performance.
  • Collaborate with data engineering teams to troubleshoot and resolve issues related to data storage, processing, and messaging.
  • Implement best practices for data platform security, scalability, and performance.

Recorded Future provides threat intelligence in the cybersecurity industry by gathering and analyzing information about potential threats. Their intelligence is delivered in a machine-readable format, making it easy for clients like SOCs and CISOs to integrate with their existing security systems. Unlike competitors, Recorded Future focuses on partnerships with Value Added Resellers (VARs) to enhance their offerings and provide comprehensive support. The company's goal is to help organizations lower the risk of cyber attacks through effective threat intelligence.

Company Stage

Series E

Total Funding

$79M

Headquarters

Somerville, Massachusetts

Founded

2009

Simplify's Take

What believers are saying

  • The launch of generative AI tools and Enterprise AI for intelligence positions Recorded Future at the forefront of innovation in threat intelligence.
  • Strategic investments, such as in Hunt.io, demonstrate Recorded Future's commitment to staying ahead in advanced adversary hunting and threat detection.
  • The company's comprehensive support and training for VARs ensure successful implementation and growth, benefiting both partners and clients.

What critics are saying

  • The rapid evolution of cyber threats requires continuous innovation, posing a challenge to maintain a competitive edge.
  • Dependence on VARs for market reach could limit direct customer relationships and feedback, potentially impacting product development.

What makes Recorded Future unique

  • Recorded Future's machine-readable threat intelligence format allows seamless integration with existing security systems, setting it apart from competitors who may offer less compatible solutions.
  • Their partnership model with Value Added Resellers (VARs) ensures a broader market reach and enhanced support, unlike companies that rely solely on direct sales.
  • The company's focus on generative AI and behavioral analytics provides advanced, real-time threat analysis, distinguishing it from traditional threat intelligence providers.

Benefits

Professional development and career advancement

Flexible work environment, be yourself

Generous vacation policy

Wellness programs

Company outings

Competitive compensation and benefits

Free snacks, drinks, and coffee in the office

Parental leave program

Environmentally conscious

INACTIVE