Sr. Site Reliability Engineer @ Aya Healthcare

Join Aya Healthcare, winner of multiple Top Workplace awards!

Aya Healthcare is seeking a highly skilled and experienced Senior Site Reliability Engineer (SRE) to join our technology organization. The ideal candidate will play a crucial role in ensuring the reliability, scalability, and performance of our critical systems, contributing to the overall success of our mission.

You will be a key member of the SRE team operations across a diverse product spectrum, ensuring our expansive cloud-based infrastructure and services remain robust, efficient, and agile amidst ever-evolving technical challenges. As the leading provider of workforce solutions in the healthcare space we are committed to operational excellence, performance, and ensuring a seamless experience for our users.

If you are a passionate Senior Site Reliability Engineer with a commitment to excellence and a desire to make a significant impact, we invite you to apply and join our dynamic team at Aya Healthcare. Together, we can shape the future of healthcare staffing.

Who We Are:

We’re a $10+ billion, rapidly growing workforce solutions provider in the healthcare industry. We deliver tech-enabled services that help healthcare organizations meet and manage their contingent labor needs. We build and manage tech-enabled marketplaces for national and local healthcare talent and deliver contingent labor management solutions through our proprietary software platform.

At Aya, we’re obsessed with creating exceptional experiences for our clients, clinicians and employees. In fact, we put employee satisfaction above all else. Our team members are responsible for incomparable customer experience and we know that happy employees are critical to maintaining happy clients. We foster an entrepreneurial, high-energy, low-bureaucracy culture and value innovative thinking and creative problem solving. We embrace diversity in thought and backgrounds unified by a commitment to high achievement. When you join Aya, you’ll be surrounded by teammates who care about you as an individual and leaders who will help you grow both personally and professionally.

Responsibilities:

Help lead efforts to improve system reliability through code changes, architectural enhancements, observability development, and infrastructure optimizations
Diagnose and solve problems with our highly available production systems and build solutions and automation to eliminate toil and prevent issues in the future
Drive our "zero error" and "zero downtime" initiatives, underlining our obsessive commitment to flawless service and operational excellence
Continually drive down time-to-detect and time-to-resolve through improved outlier detection and real-time root cause analysis
Spearhead the creation, development, and maintenance of scalable monitoring, alerting, and logging solutions
Promote a culture of learning from outages, continuously improving incident response protocols and tooling
Collaborate with cross-functional teams to identify and address reliability opportunities to ensure optimal performance and reliability
Participate in software releases and deployments
Participate in 24/7 on-call rotations and respond to incidents promptly, providing effective resolutions and root cause analysis

Required Qualifications

Bachelor’s Degree in Computer Science, Information Technology, Engineering or related field, or the equivalent combination of education, training, and experience
8+ years of experience in a combination of Site Reliability Engineering, DevOps, or similar roles
2+ years working specifically with Azure architecture, configuration, and management
2+ years using Infrastructure as Code (IaC) tools to automate infrastructure deployment and configuration – preferably Terraform
Subject matter expert in cloud platforms (AWS, GCP, Azure) and containerization technologies (Docker, Kubernetes, AKS, EKS)
Experience in a technical lead/principal role
Experience using Configuration-as-Code solutions such as Chef, Ansible, Salt Stack, and Puppet.
Experience with cloud-based APM and monitoring tools such as DataDog, NewRelic, AppDynamics, or Dynatrace
Extensive experience with scripting and debugging Linux and Windows environments
Experience with automated Change Management and related methodologies
Experience with advanced project management principles such as Scrum, Agile, sprints, and expertise with Atlassian Jira
Expert analytical/quantitative, problem-solving, and deductive reasoning skills, with demonstrated experience performing advanced troubleshooting and root cause analysis of complex technical issues
Excellent organizational, planning, and time management skills and ability to work either independently or in a team environment to manage competing priorities and meet deadlines
Advanced verbal and written communication skills with the ability to present findings, conclusions, alternatives, and information clearly and concisely

What We Offer:

Free premium medical, dental, life and vision insurance
Generous 401(k) match
Aya also offers other benefits to those that are eligible and where required by applicable law, including reimbursements and discretionary bonuses
Aya provides paid sick leave in accordance with all applicable state, federal, and local laws. Aya’s general sick leave policy is that employees accrue one hour of paid sick leave for every 30 hours worked. However, to the extent any provisions of the statement above conflict with any applicable paid sick leave laws, the applicable paid sick leave laws are controlling
Celebrations! We hit our goals and reward ourselves.
Company-sponsored virtual events, happy hours and team-building activities are always on the horizon — plus, you get a special treat on your birthday!
Unlimited DTO — we believe in time off!
Virtual yoga, meditation or boot camp classes offered daily

Compensation: Aya reasonably anticipates the pay scale for this position to be an annual salary of $170,000 to $195,000.

The pay scale for this position may vary if applicant possesses experience outside of what Aya reasonably anticipates for this position. Bonuses are subject to the role and your manager’s discretion.

Aya is an Equal Opportunity Employer (EEO), including Disability / Vets, and welcomes all to apply. Please click here for our EEO policy.

Compensation Overview

6 month growth

1 year growth

2 year growth