Site Reliability Engineer



201-500 employees

Modern banking solutions for credit unions and banks


$110,000 - $150,000

Cash Bonus, Equity Options

Junior, Mid

Remote in USA

Required Skills
Microsoft Azure
Digital Ocean
Google Cloud Platform
  • Bachelor's degree in Computer Science, Information Technology, Cybersecurity, or a related field.
  • Linux administration experience and system patch management is a requirement.
  • 3+ years of experience in a site reliability engineering, system administration, or similar role, with specific experience in network operations and cybersecurity.
  • Proven expertise in using Datadog for monitoring, observability, and alerting in a complex network and software environment.
  • Strong understanding of network protocols, infrastructure, and security patching strategies.
  • Proficiency in scripting and automation tools (e.g., Python, Bash, Ansible).
  • With cloud platforms (AWS, Azure, GCP) and containerization technologies (Docker, Kubernetes).
  • Proficiency in Java.
  • Excellent problem-solving skills, with the ability to work under pressure and manage multiple priorities.
  • Strong communication and collaboration skills, capable of working effectively with cross-functional teams.
  • Implement and manage comprehensive monitoring, observability, and alerting strategies using Datadog for real-time insights into the performance and health of our software applications and network infrastructure.
  • Proactively monitor system performance, identify potential issues, and execute troubleshooting and resolution to minimize downtime and service disruptions.
  • Develop and maintain a robust security patching program, ensuring all network devices, servers, and applications are regularly updated to protect against vulnerabilities and cyber threats.
  • Collaborate with development and operations teams to enhance system reliability by adopting SRE best practices and Datadog's monitoring capabilities.
  • Customize Datadog dashboards and alerts to meet the specific needs of our operations, ensuring critical issues are promptly identified and addressed.
  • Automate routine patching, monitoring, and maintenance tasks to improve operational efficiency and accuracy.
  • Participate in incident response and post-mortem analysis, utilizing Datadog data to identify root causes and implement preventive measures.
  • Keep abreast of the latest trends and technologies in SRE and monitoring tools, particularly Datadog's evolving features and capabilities.

Nymbus stands out in the financial services industry by providing modern, scalable solutions for banks and credit unions of all sizes, enabling them to tap into new growth opportunities without the need for a core conversion. Their product offerings range from launching full-service digital banks to building niche financial brands, demonstrating their adaptability and commitment to meeting diverse client needs. With an award-winning core platform, Nymbus positions financial institutions for success by helping them stay competitive and reach untapped markets.

Company Stage

Series D

Total Funding



Jacksonville, Florida



Growth & Insights

6 month growth


1 year growth


2 year growth