Simplify Logo

Full-Time

Site Reliability Engineer

Sre, & Linux System Admin

Posted on 3/30/2024

Nymbus

Nymbus

201-500 employees

Cloud-native core banking and digital platform

Consulting
Enterprise Software
Fintech
Financial Services

Compensation Overview

$110k - $150kAnnually

+ Cash Bonus + Equity Options

Junior, Mid

Remote in USA

Category
DevOps & Infrastructure
DevOps Engineering Management
Site Reliability Engineering
Cybersecurity
IT & Security
Required Skills
Datadog
Bash
Kubernetes
Microsoft Azure
Python
Communications
Digital Ocean
Java
Docker
AWS
Ansible
Linux/Unix
Google Cloud Platform
Requirements
  • Bachelor's degree in Computer Science, Information Technology, Cybersecurity, or a related field.
  • Linux administration experience and system patch management is a requirement.
  • 3+ years of experience in a site reliability engineering, system administration, or similar role, with specific experience in network operations and cybersecurity.
  • Proven expertise in using Datadog for monitoring, observability, and alerting in a complex network and software environment.
  • Strong understanding of network protocols, infrastructure, and security patching strategies.
  • Proficiency in scripting and automation tools (e.g., Python, Bash, Ansible).
  • With cloud platforms (AWS, Azure, GCP) and containerization technologies (Docker, Kubernetes).
  • Proficiency in Java.
  • Excellent problem-solving skills, with the ability to work under pressure and manage multiple priorities.
  • Strong communication and collaboration skills, capable of working effectively with cross-functional teams.
Responsibilities
  • Implement and manage comprehensive monitoring, observability, and alerting strategies using Datadog for real-time insights into the performance and health of our software applications and network infrastructure.
  • Proactively monitor system performance, identify potential issues, and execute troubleshooting and resolution to minimize downtime and service disruptions.
  • Develop and maintain a robust security patching program, ensuring all network devices, servers, and applications are regularly updated to protect against vulnerabilities and cyber threats.
  • Collaborate with development and operations teams to enhance system reliability by adopting SRE best practices and Datadog's monitoring capabilities.
  • Customize Datadog dashboards and alerts to meet the specific needs of our operations, ensuring critical issues are promptly identified and addressed.
  • Automate routine patching, monitoring, and maintenance tasks to improve operational efficiency and accuracy.
  • Participate in incident response and post-mortem analysis, utilizing Datadog data to identify root causes and implement preventive measures.
  • Keep abreast of the latest trends and technologies in SRE and monitoring tools, particularly Datadog's evolving features and capabilities.

Nymbus provides a cloud-native core banking software and digital platform using advanced API, microservices, and low-code technology, facilitating both traditional and digital-only banking. With a turn-key solution model, this company enables rapid deployment of new digital banks and niche financial brands without the need for core conversion. This approach not only supports scalability but also fosters flexibility and quick adaptation to market demands, positioning the company at the forefront of financial technology services.

Company Stage

Series D

Total Funding

$173.4M

Headquarters

Jacksonville, Florida

Founded

2015

Growth & Insights
Headcount

6 month growth

0%

1 year growth

-4%

2 year growth

0%
INACTIVE