Simplify Logo

Full-Time

SRE/Devops Engineer

Posted on 1/11/2024

Versana

Versana

51-200 employees

Centralized syndicated loan data platform

Data & Analytics
Fintech
Financial Services

Senior

New York, NY, USA

Category
DevOps & Infrastructure
DevOps Engineering Management
Site Reliability Engineering
Required Skills
Datadog
Kubernetes
Microsoft Azure
Python
JavaScript
Management
Git
Apache Kafka
Java
Docker
AWS
Go
Elasticsearch
Jenkins
Terraform
Development Operations (DevOps)
Linux/Unix
Google Cloud Platform
Requirements
  • 5+ years of experience as a Site Reliability Engineer or similar role
  • 3+ years of experience in at least one coding language such as Java, JavaScript, Python, GoLang, or .NET
  • 3+ years of work experience with public cloud (Azure, AWS or GCP)
  • 3+ years of direct experience with observability tools like Datadog, Elasticsearch, and Grafana Labs, etc.
  • 3+ years of experience with containerization and orchestration technologies like Docker and Kubernetes
  • 2+ years of experience in development and management of CI/CD pipelines (e.g., Azure DevOps, Gitlab CI/CD, Github Actions, Jenkins, etc)
  • 2+ years of experience with Infrastructure-as-code tools like Terraform, Azure Bicep, Cloud Formation, etc
  • 1+ years of experience with site reliability tools like Gremlin, Chaos Mesh, or similar
  • Proven track record leveraging core observability concepts, end-user monitoring, and infrastructure monitoring with SaaS solutions
  • Experience with messaging services like Kafka or Azure Event Hubs
  • Good understanding of the Linux operating system
  • Ability to partner with multi-functional teams and pivot quickly
  • Strong communication, analytical, and problem-solving skills
  • Curiosity and motivation to learn
Responsibilities
  • Design, implement, and maintain observability and event management tools
  • Monitor system performance, create incident response plans, and implement observability practices to gain insights into system behavior
  • Implement and monitor service-level objectives (SLOs) and indicators
  • Improve system reliability and resiliency
  • Conduct post-incident reviews and implement necessary changes to prevent system failures
  • Assist teams in implementing observability tools and leveraging available telemetry data to troubleshoot and resolve incidents and problems
  • Leverage observability and event management to improve key incident management metrics, such as mean time to detect and mean time to restore services
  • Continually optimize systems and workflows by improving architecture, infrastructure, automation, CI/CD, and observability
  • Collaborate with developers to ensure applications are designed with DevOps best practices in mind
  • Participate in weekend support for cloud infrastructure upgrades and/or releases

Versana is a digital data and technology company that excels in transforming the syndicated loan market through its centralized platform. This platform captures data from agent banks in real-time, providing unparalleled transparency and efficiency. The company stands as a credible source for deal information, modernizing an entire asset class and making it a leader in its field. Working here offers an opportunity to be at the forefront of financial technology, engaging in work that drives real change and innovation within the industry.

Company Stage

Seed

Total Funding

$40M

Headquarters

New York City, New York

Founded

2021

Growth & Insights
Headcount

6 month growth

33%

1 year growth

45%

2 year growth

45%
INACTIVE