Full-Time

Mgr-Site Reliability Engineering

Confirmed live in the last 24 hours

The Walt Disney Company

The Walt Disney Company

10,001+ employees

Leading producers & providers of entertainment and information

Senior

Orlando, FL, USA

Category
DevOps & Infrastructure
Site Reliability Engineering
Required Skills
Chef
Git
AWS
JIRA
Terraform
Ansible
Confluence
Development Operations (DevOps)
Google Cloud Platform
Requirements
  • Minimum 8 years of related work experience
  • Demonstrated leadership in implementing observability principles across complex systems and environments, fostering a culture of reliability and resilience
  • Extensive experience with modern software delivery tools, including GitHub, GitLab, Harness.io, LaunchDarkly, AWS Code Deploy and Azure DevOps and with optimizing workflows and ensuring seamless deployment processes
  • Proficiency in designing and managing highly scalable and resilient infrastructure using configuration management and orchestration tools such as Terraform, Cloud Formation, Ansible and Chef, driving operational excellence and efficiency
  • Outstanding communication and leadership abilities, to ensure effective growth and development of team
  • A visionary who motivates teams to excel and fosters creativity, consistently driving excellence in all endeavors
  • An advocate for a diverse and inclusive culture that encourages innovation and ensures every team member feels a sense of belonging
  • Bachelor’s degree in Computer Science, Information Systems, Software, Electrical or Electronics Engineering, or comparable field of study, and/or equivalent work experience
Responsibilities
  • Oversee finances and budgets in MyPPM, ensure accurate billing processes, and contribute to forecasting and accrual processes to maintain financial integrity and support organizational objectives
  • Work with the vendor management team to maintain the optimal mix of cast members, contractors and managed services to support the required work
  • Manage the work of your team in Jira and maintain documentation in Confluence
  • Lead the evolution of DevOps practices within the broader team framework, guiding others in leveraging this culture to enhance observability practices
  • Manage the SRE team to deliver monitoring and observability for the development and business users as needed
  • Work with development teams to develop and manage mutually agreeable service levels for all critical business applications
  • Drive teams to consult, design, build, and support development pipelines, automate infrastructure and operations, build telemetry for monitoring, engineer high-reliability and reinforce best-practices to secure company data
  • Lead your team to develop and grow all aspects of technology engineering skills using Amazon Web Services and Google Cloud Platform for container, virtualization and serverless based workloads
  • Develop and advocate strategic directions for reliability, observability and recovery and bring practical knowledge on systems, network, operational excellence and application stability, security, performance, and capacity management
  • Engage in estimation and planning across the organization, voicing recommendations, feedback, and solutions from a technical perspective and aligning to the overall project goals to deliver on-time & in-scope
  • Proactively track and assess new technologies across the industry to inform strategic decision-making and recommendations
The Walt Disney Company

The Walt Disney Company

View

Company Stage

N/A

Total Funding

N/A

Headquarters

N/A

Founded

1923