Full-Time

Manager – Systems Reliability Engineering

Confirmed live in the last 24 hours

The Walt Disney Company

The Walt Disney Company

10,001+ employees

Leading producers & providers of entertainment and information

Compensation Overview

$167.7k - $224.9kAnnually

+ Bonus + Long-term Incentive Units

Senior, Expert

Burbank, CA, USA

Category
DevOps & Infrastructure
Site Reliability Engineering
Required Skills
Chef
Kubernetes
Python
OpenShift
Git
Ruby
AWS
Go
Jenkins
iOS/Swift
Terraform
Ansible
Linux/Unix

You match the following The Walt Disney Company's candidate preferences

Employers are more likely to interview you if you match these preferences:

Degree
Experience
Requirements
  • Experience working in media production environments
  • Hands on experience building and running Linux and Windows platforms
  • Skilled in managing Cloud/IaaS Environments (e.g. AWS, Google Cloud Compute)
  • Knowledge in system management languages (e.g. Chef, Terraform, Ansible)
  • Expertise in Software Development Continuous Integration (CI) Pipeline knowledge (e.g. Jenkins, Gitlab CI) and Source Control Management (e.g. Git)
  • Expertise with Operating Systems, Distributed Systems and Container Platforms (e.g. Kubernetes/GKE, ECS, Openshift, Fargate)
  • Expertise in multiple scripting languages in your toolbox (e.g. Python, GO, Ruby, or Swift), with ability to build test coverage for all code being developed
  • Virtual hosting technologies (e.g. VMWare, KVM)
  • Data center, network, and application architectures
  • Able to evaluate new system and/or infrastructure solutions for technical feasibility against known requirements and standards.
Responsibilities
  • Direct management of a team of Media Systems Engineers, Systems Reliability Engineers, and Storage Engineers that support Studio-run production environments, whether on premise or in the public cloud
  • Develop operational plans and procedures covering change control, maintenance events, intergroup communications, and downtime response
  • Assist and facilitate architecting and engineering of systems solutions that fulfill various business unit requirements
  • Responsible for appropriate application of Corporate, Studio, and individual business unit security practices and standards
  • Manages all critical stakeholder interfaces including: developers, production, post production, client services, Studio Tech groups, and wider Enterprise
  • Responsible for effective collaboration between all Studio Technology operations and engineering groups, including systems reliability engineering, systems, networking, software, and client services
  • Responsible for collaboration and relationships with other proximate Disney groups, including Post Production Services, Enterprise IT, IT outsourcing partners, and the broader Studio Technology groups
  • Make sound technical and business decisions autonomously when confronted with competing operational trade-offs
  • Oversee the performance and deliverables of 3rd party, domestic or offshore technical support suppliers
  • Develop detailed service specifications designed to guide the activities of 3rd party suppliers
  • Assure readily available and secured third party communications
  • Optimize business value of Studio infrastructure operations in a 24x7 shop, and in the most cost-effective way possible
  • Ensure reliable operation of infrastructure and component monitoring, instrumentation and management tools in order to predict, quickly diagnose and resolve abnormal systems behavior
  • Secure and report on all systems and subsystems
  • Manage key vendors and partners, both internal and external
  • Serve as an escalation participant for tickets and issues related to supported infrastructure
  • Manage budgets, forecasts and 5-year plans for the studio systems initiatives
  • Responsible for all equipment/software maintenance contracts and renewals
  • Responsible for all third-party service contracts
Desired Qualifications
  • A seasoned and experienced technical manager, preferably from a dynamic production support environment, able to comfortably oversee highly knowledgeable technical support staff yet also contribute technically to the plan/solution
  • Able to multitask in a highly complex, diverse, systems environment
  • Able to quickly make decisions given incomplete and conflicting knowledge
  • Highly self-directed, being able to both manage and (re)prioritize the multiple concurrent and competing challenges, issues, ambiguities, and contradictions that, inevitably, occur when supporting systems
  • Strong analytical problem-solving skills
  • A team-building leader with good interpersonal, verbal and relationship building skills
  • Ability to construct, manage and oversee a complex budget of annual and 5-yr planning elements
  • A generalist who can perform many different tasks
  • Excellent verbal and written communication skills, and thus able to explain and document the systems to their diverse audiences.
The Walt Disney Company

The Walt Disney Company

View

Company Size

10,001+

Company Stage

N/A

Total Funding

N/A

Headquarters

N/A

Founded

1923