Full-Time

Cloud Senior Site Reliability Engineer

Confirmed live in the last 24 hours

Bank of America

Bank of America

10,001+ employees

Provides banking, investment, and asset management services

Fintech
Financial Services

Compensation Overview

$149.8k - $188.9kAnnually

+ Discretionary Incentive

Senior

Richmond, VA, USA + 5 more

More locations: New York, NY, USA | Jacksonville, FL, USA | Chandler, AZ, USA | Atlanta, GA, USA | Lawrence Township, NJ, USA

Category
DevOps & Infrastructure
Site Reliability Engineering
Required Skills
Python
Git
Java
.NET
AWS
Prometheus
Jenkins
Splunk
Linux/Unix
Databricks
Requirements
  • 15 years of combined experience in either SRE, software development, or infrastructure engineering (10 years with an advanced degree in Computer Science or related technical field).
  • 7+ years of hands-on experience building and maintaining cloud platforms on a major cloud service provider.
  • Strong experience in implementing, monitoring, and maintaining a highly scalable and resilient Data Services platform on Amazon Web Services
  • Strong experience with monitoring tools such as Grafana, Prometheus, Splunk, or Dynatrace, as well as AWS native tools like CloudWatch & CloudTrail, Azure Monitor and Log Analytics
  • Proficiency in implementing, monitoring, and maintaining a Databricks, RDS, or OpenAI platform.
  • Proficient in at least one programming language such as Python, Java/Spring Boot, and .Net; 5+ years applied experience in Python/Java
  • Proficiency in implementing CI/CD pipelines with tools such as git and Jenkins, familiarity with using a GitOps model.
  • Advanced knowledge of networking (firewalls, DNS, Load Balancing, Proxies, etc.)
  • Advanced understanding of Linux & Windows operating systems including shell scripting
  • Excellent interpersonal, organizational and communication (written, verbal, and presentation) skills are a must.
  • Proven ability to work independently with minimal supervision and as part of a team with direct responsibilities and an ability to juggle competing priorities and adapt to changes in project scope.
Responsibilities
  • Designs solutions to visualize key production support metrics enabling Operational Readiness and Site Reliability Engineer teams to identify scenarios requiring intervention
  • Develops software solutions and/or improved processes to address work identified as ‘toil’ by collaborating with key partners to identify, track and remediate processes to free time allocated to reliability
  • Partners with Development and Infrastructure teams to create error budget policies prioritizing reliability stories that fall below Service Level Objective (SLO) thresholds and suggests code optimizations, additional instrumentation and/or logging structures to gain service reliability visibility
  • Identifies and plans for capacity bottlenecks, vulnerabilities and opportunities for reliability improvement, such as low level error rates and 'noise', and reduces manual support effort and/or improves system reliability
  • Assesses monitoring for new changes with development partners and works with monitoring tools team to monitor dashboards and enhance application and system monitoring designs
  • Engages as a subject matter expert in incident triage efforts, failure scenario modelling and works with the Problem Manager to diagnose root causes for complex/high impact incident/problem management investigations
  • Collaborates with Development and Infrastructure teams to understand technical solutions and develop Service Level Indicators and SLOs to measure/improve the reliability of the services they support

Bank of America provides a wide range of financial services to individuals, small and medium-sized businesses, and large corporations. Their offerings include banking, investing, asset management, and risk management products. The company serves around 56 million consumer and small business clients in the U.S. and is recognized as a leading wealth management firm. Additionally, Bank of America is a major player in corporate and investment banking, as well as trading. What sets Bank of America apart from its competitors is its extensive client base and comprehensive service offerings that cater to various financial needs. The company's goal is to help clients achieve their financial objectives while managing their investments and risks effectively.

Company Stage

IPO

Total Funding

N/A

Headquarters

Charlotte, North Carolina

Founded

N/A

Growth & Insights
Headcount

6 month growth

0%

1 year growth

0%

2 year growth

0%
Simplify Jobs

Simplify's Take

What believers are saying

  • Working at Bank of America offers exposure to a diverse range of financial products and services, enhancing career development opportunities.
  • The bank's leadership in wealth management and investment banking provides employees with the chance to work on high-impact projects and deals.
  • Bank of America's global presence and strong market position offer stability and growth potential for employees.

What critics are saying

  • The highly competitive nature of the financial services industry requires Bank of America to continuously innovate to maintain its market position.
  • Regulatory changes and economic fluctuations can impact the bank's operations and profitability, posing challenges for employees.

What makes Bank of America unique

  • Bank of America stands out as a global leader in corporate and investment banking, offering a comprehensive suite of financial services that cater to a wide range of clients from individuals to large corporations.
  • The bank's extensive network and relationships with approximately 56 million U.S. consumer and small business clients provide a significant competitive edge in the financial services industry.
  • Bank of America's involvement in high-profile credit facilities, such as Uber's $5B revolving credit, showcases its capability to handle large-scale financial transactions.

Help us improve and share your feedback! Did you find this helpful?