Cloud Platform Manager
Support and Operations, Open to Remote
Confirmed live in the last 24 hours
Locations
Madison, WI, USA • Huntington Beach, CA, USA • Dorchester, Boston, MA...
Experience Level
Entry
Junior
Mid
Senior
Expert
Desired Skills
Agile
ASP.NET
AWS
JavaScript
C/C++/C#
Management
PHP
Splunk
SQL
Terraform
TypeScript
Datadog
Requirements
- Bachelor's degree in Computer Science, Information Technology, or related field
- At least 3-5 years' experience in IT Applications management (Support and Operations), with a focus on incident, release, and request management
- Strong knowledge of ITIL and other industry standards for IT service management
- Excellent leadership, communication, and interpersonal skills
- Ability to manage and prioritize multiple tasks and projects
- Strong analytical and problem-solving skills
- Knowledge on application support with Tech Stack around Microsoft technologies and AWS including but not limited to C#, MVC 5, ASP.Net, Web API, .NET Core and Microservices, SQL Server 2014/ 2016/ 2018 and AWS services
- Experience delivering complex solutions utilizing common programming languages C#, JS, TypeScript, YAML, Terraform, PHP
- Extensive experience with configuring and monitoring via tools such as but not limited to DataDog, ELK, Splunk, AppDynamics
Responsibilities
- Develop and implement processes for incident, release, and request management, ensuring compliance with First American and industry standards and best practices
- Lead the IT Support / Operations team in providing 24/7 support to clients, ensuring timely resolution of incidents and requests
- Drive strategy for compliance management, ensuring that all IT operations are carried out in accordance with relevant regulations and standards
- Manage the performance and availability of IT systems, proactively identifying and resolving potential incidents
- Establish Service Level Objectives to ensure there is appropriate monitoring and metrics available to measure the established objectives
- Familiar with the golden signals of monitoring (latency, traffic, error rate, and resource saturation)
- Develop and implement DevSecOps strategies that align with our goals and objectives, ensuring security is integrated into all aspects of the software development lifecycle
- Work closely with the team to develop and implement SRE strategies that ensure availability, reliability, and scalability of the applications and services
- Drive Incident Management (creating, escalating, managing to resolution) for the application and platform, including leading remediation bridges for complex P1/P2 issues
- Establish and oversee a Knowledge Management repository and suggest automation tasks to eliminate manual processes and reduce toil
- Collaborate with cross-functional teams to plan, execute, and manage releases, ensuring minimal disruption to business operations
- Manage the onboarding process for new clients, ensuring a smooth transition to IT services
- Lead the triage management process, ensuring that incidents and requests are prioritized and addressed in a timely manner
- Lead and manage a diverse team of onshore and offshore professionals (Leads, SME's, and Individual contributors), fostering a collaborative and productive work environment while effecting change within the organization
- Work closely with senior management to develop and implement IT strategies that support the overall business goals
- Build strong relationships with customers, stakeholders, and partners to ensure effective incident reporting and to develop next-generation practices to meet their needs and expectations. This includes opportunities for automation, self-service, and leveraging customer feedback to continuously improve the incident management process
- Ensure that the IT Operations team has the necessary skills, training, and resources to perform their role effectively
- Experience collaborating across multiple functional and/or technical teams to deliver an Agile-based project
- Govern and manage Infrastructure by keeping service accounts, certificates renewals up-to-date
- Manage the delivery of all IT audit requests and provide regular reports on their status
- Should possess a deep understanding and keen interest in supporting DevSecOps activities, particularly in the areas of deployment, infrastructure provisioning, and process automation
- Should be proficient at Cloud concepts & guiding principles on AWS with knowledge on key features, advantages, and disadvantages
Title insurance & professional settlement services
Company Overview
First American is on a mission to provide comprehensive title insurance protection and professional closing/settlement services that produce clear property titles and enable the efficient transfer of real estate.
Benefits
- 401k matching
- Health, vision, dental insurance
- Professional development