Full-Time

Site Reliability Engineer Lead

Bank of America

Bank of America

10,001+ employees

Global banking, investing, and wealth management

Compensation Overview

$125.3k - $167.9k/yr

+ Discretionary incentive

Tampa, FL, USA + 3 more

More locations: Plano, TX, USA | Charlotte, NC, USA | Lawrence Township, NJ, USA

In Person

Category
DevOps & Infrastructure (1)
Required Skills
Microsoft Azure
Microservices
AWS
Observability
Google Cloud Platform
Requirements
  • 8+ years in technology architecture, reliability engineering, or infrastructure strategy roles
  • Proven track record of delivering stability-focused initiatives in large-scale environments
  • Strong knowledge of distributed systems, cloud architecture (AWS, Azure, GCP), and microservices
  • Experience with reliability engineering, chaos testing, and observability tools
  • Ability to influence cross-functional teams and communicate complex concepts to non-technical stakeholders
Responsibilities
  • Collaborates with Development and Infrastructure teams to understand technical solutions and implement monitoring capabilities outlined in the application and system monitoring designs put forward by the Senior Site Reliability Engineer (SRE)
  • Develops and maintains reliability scripts, tools and libraries and leverages them for common instrumentation, automation, and operational needs, and when mentoring SRE resources on reliability practices and established tools/capabilities
  • Partners to implement code changes to make use of common reliability libraries and tools and helps Application Production Services and Application Development teammates understand how to use them
  • Participates regularly in architecture community of practice meetings and communication via other channels
  • Identifies vulnerabilities and opportunities for reliability improvement, such as investigating low level error rates and 'noise' in monitoring, and defines solutions to reduce manual support effort and/or improve system reliability
  • Engages as a subject matter expert in major incident triage efforts and failure scenario modelling and diagnosis with Problem Manager root causes for major incident/problem management investigations
  • Define and maintain a multi-year stability roadmap aligned with business objectives and technology strategy
  • Identify critical dependencies, risks, and mitigation strategies across infrastructure, applications, and services
  • Work with the architects to develop and adhere to the enterprise architectural patterns and frameworks that enhance system reliability and fault tolerance
  • Ensure designs adhere to best practices for high availability, disaster recovery, and performance optimization
  • Establish stability metrics, KPIs, and compliance standards for technology teams
  • Drive adoption of reliability engineering principles across development and operations
  • Partner with engineering, operations, and product teams to embed stability into the software development lifecycle
  • Act as a trusted advisor to senior leadership on stability-related initiatives and investments
  • Monitor emerging technologies and industry trends to enhance stability strategies
  • Lead post-incident reviews and ensure lessons learned are incorporated into future designs
  • Collaborate with Development and Infrastructure teams to understand technical solutions and to implement the monitoring capabilities outlined in the application and system monitoring designs put forward by the Senior SRE
  • Develop and maintain a catalog of extensible reliability scripts, tools, and libraries that can be leveraged for common instrumentation, automation and operational needs
  • Partner to implement code changes to make use of common reliability libraries and tools and help the Application Production Services (APS) and Application Development teammates understand how to use them
  • Partner with infrastructure engineers and application teams to implement the necessary code changes to make use of common reliability libraries and tools and help the APS and Application Development of teammates to understand how to use them
  • Engage as a subject matter expert (SME) in major incident triage efforts, failure scenario modelling and work with the Problem Manager to diagnose root causes for major incident / problem management investigations
  • Identify vulnerabilities and opportunities for reliability improvement, such as investigating low level error rates and 'noise' in monitoring, and to help define solutions to reduce manual support effort / or improve system reliability
Desired Qualifications
  • SRE Certification

Bank of America provides a full range of financial services to individuals, small businesses, and large corporations, including banking, investing, asset management, and risk management products. Customers access services via branches, online and mobile banking, and advisory and trading capabilities across consumer banking, wealth management, corporate and investment banking. Its breadth, scale, and global reach enable cross-service solutions and large-scale operations that few peers match. Its goal is to be a trusted, full-service financial partner helping customers manage money, grow assets, and navigate risk.

Company Size

10,001+

Company Stage

IPO

Headquarters

Charlotte, North Carolina

Founded

1904

Simplify Jobs

Simplify's Take

What believers are saying

  • Q1 2026 delivers $30.3B revenue and $8.6B net income.
  • Fed Taylor rule deviation implies 4.0% funds rate by late 2026.
  • Strong analyst upgrades on Home Depot and Ecolab boost advisory fees.

What critics are saying

  • Chime and SoFi drive 20% deposit outflows in Q1 2026.
  • JPMorgan overtakes small business lending post-First Republic acquisition.
  • 10-year yields hit 5.2% by Q3 2026, slashing NIM to 2.5%.

What makes Bank of America unique

  • Bank of America leads as No. 1 small business lender per FDIC data.
  • Manages $4.6 trillion in client balances across 56 million relationships.
  • Global leader in corporate investment banking and wealth management.

Help us improve and share your feedback! Did you find this helpful?

Your Connections

People at Bank of America who can refer or advise you

Benefits

Health Insurance

Dental Insurance

Vision Insurance

Life Insurance

Disability Insurance

Paid Vacation

Paid Sick Leave

Flexible Work Hours

Remote Work Options

Professional Development Budget

Conference Attendance Budget

Company News

Bloomberg Law
Apr 15th, 2026
Herbalife raises $800M bond to refinance 12.25% debt with 7.75% notes

Herbalife has raised $800 million through a junk-bond sale to refinance existing high-interest debt. The nutrition-focused multilevel-marketing company sold seven-year senior secured notes at a 7.75% yield, led by Bank of America. The proceeds will be used to repay bonds due in 2029 that carry a 12.25% interest rate, significantly reducing Herbalife's borrowing costs. The successful offering comes a month after the company shelved a loan offering due to market volatility, taking advantage of a recent rebound in investor demand for risky debt.

Yahoo Finance
Apr 14th, 2026
Bank of America reports earnings Wednesday, revenue expected to rise 5.8% year on year

Bank of America will report earnings on Wednesday before market open. Last quarter, the company beat revenue expectations with $28.55 billion, up 7.1% year-on-year, though it only narrowly exceeded earnings per share estimates. Analysts expect revenue to grow 5.8% year-on-year this quarter, an improvement from the 4.7% increase recorded in the same period last year. Analyst estimates have remained largely unchanged over the past 30 days. The bank's shares have risen 13.5% over the last month, outperforming the 9.1% average gain across the banking sector. Analysts have set an average price target of $60.56, compared to the current share price of $53.37. Bank of America historically tends to exceed Wall Street's expectations.

Bitget
Apr 13th, 2026
On April 7, 2026, Marathon Oil Corporation entered into a five-year revolving credit agreement worth 5 billion dollars with Bank of America and several other financial institutions. | Bitget News

This agreement will significantly enhance the company's liquidity management capabilities, providing flexible financial support for its strategic investment | Bitget crypto news!

National Today
Apr 9th, 2026
CCLA Investment Management acquires $124M stake in Bank of America with 2.2M shares

CCLA Investment Management acquired 2,254,107 shares in Bank of America Corporation during the fourth quarter of 2025, valued at approximately $123.95 million, according to a 13F filing submitted on 9 April 2026. The position represents roughly 2% of CCLA's total investment portfolio, making Bank of America the firm's 23rd largest holding. The acquisition signals institutional confidence in Bank of America's long-term prospects and demonstrates continued investor interest in major US financial institutions. The purchase establishes Bank of America as one of CCLA's top 25 holdings, reflecting the investment management firm's belief in the bank's valuation and growth potential despite broader economic uncertainty.

Yahoo Finance
Apr 9th, 2026
BofA lifts Broadcom target to $450 on $35B+ Google, Anthropic supply deals through 2031

Bank of America has reiterated a buy rating for Broadcom stock with a $450 price target following new supply agreements with Google and Anthropic revealed in an 8-K filing. Broadcom shares rose 4.28% to $348.27 on the news. Under the agreements, Broadcom will develop custom Tensor Processing Units for Google through 2031 and supply networking components for AI infrastructure. Anthropic will access approximately 3.5 gigawatts of AI computing capacity starting in 2027, which analysts value at over $35 billion. Bank of America analyst Vivek Arya said the deals solidify Broadcom's position as Google's main TPU design partner and address concerns about insourcing. Analysts expect Broadcom's AI accelerator market share to grow from under 10% in 2025 to approximately 15% by 2027.