Full-Time

Lead Software Engineer

Posted on 5/9/2026

Deadline 5/22/26
Wells Fargo

Wells Fargo

10,001+ employees

Nationwide banking and financial services

No salary listed

Hyderabad, Telangana, India

In Person

Category
Software Engineering (1)
Required Skills
Python
Grafana
Machine Learning
Prometheus
Observability
DevOps
Splunk
Requirements
  • 5+ years of Software Engineering experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education
Responsibilities
  • Lead complex technology initiatives including those that are companywide with broad impact
  • Act as a key participant in developing standards and companywide best practices for engineering complex and large scale technology solutions for technology engineering disciplines
  • Design, code, test, debug, and document for projects and programs
  • Review and analyze complex, large-scale technology solutions for tactical and strategic business objectives, enterprise technological environment, and technical challenges that require in-depth evaluation of multiple factors, including intangibles or unprecedented technical factors
  • Make decisions in developing standard and companywide best practices for engineering and technology solutions requiring understanding of industry best practices and new technologies, influencing and leading technology team to meet deliverables and drive new initiatives
  • Collaborate and consult with key technical experts, senior technology team, and external industry groups to resolve complex technical issues and achieve goals
  • Lead projects, teams, or serve as a peer mentor
  • Reliability & Availability Engineering: Own and improve availability, performance, scalability, and resilience of production systems
  • Reliability & Availability Engineering: Define, monitor, and manage SLIs/SLOs and error budgets to guide reliability investments
  • Reliability & Availability Engineering: Lead capacity planning, performance testing, failover readiness, and disaster-recovery design
  • Observability & Monitoring (Grafana / Prometheus / Splunk): Design and operate a comprehensive observability stack using Prometheus for metrics collection and alerting, Grafana for dashboards, visualization, and SLO tracking, Splunk for log aggregation, troubleshooting, and incident forensics
  • Observability & Monitoring (Grafana / Prometheus / Splunk): Build and maintain golden dashboards and actionable alerts aligned to business impact
  • Observability & Monitoring (Grafana / Prometheus / Splunk): Reduce alert fatigue through signal-based monitoring and correlation of metrics, logs, and traces
  • Observability & Monitoring (Grafana / Prometheus / Splunk): Partner with application teams to define instrumentation standards for metrics and logging
  • Observability & Monitoring (Grafana / Prometheus / Splunk): Use observability data to improve MTTD, MTTR, and reliability outcomes
  • Automation & Python Engineering: Develop Python-based automation for monitoring, alert remediation, deployments, scaling, and recovery
  • Automation & Python Engineering: Build self-healing workflows integrated with Prometheus alerts and Splunk signals
  • Automation & Python Engineering: Create reusable automation frameworks and internal SRE tooling
  • Automation & Python Engineering: Embed automation into CI/CD pipelines to improve deployment safety and reliability
  • AI/ML‑Driven Reliability (AIOps): Apply AI/ML techniques to observability and operations use cases, including anomaly detection on Prometheus metrics, log pattern analysis and correlation in Splunk, predictive capacity and trend forecasting, noise reduction and intelligent alerting
  • AI/ML‑Driven Reliability (AIOps): Partner with data and platform teams to operationalize ML models in production
  • AI/ML‑Driven Reliability (AIOps): Evaluate and integrate AIOps capabilities into the observability ecosystem
  • Incident Management & RCA: Serve as incident commander and senior escalation point for P1/P2 incidents
  • Incident Management & RCA: Lead blameless post-incident reviews (PIRs) backed by Grafana metrics and Splunk evidence
  • Incident Management & RCA: Drive corrective and preventive actions to completion
  • Platform & Application Partnership: Collaborate with platform, application, cloud, and SRE teams to embed reliability and observability by design
  • Platform & Application Partnership: Influence architectural decisions to ensure systems are observable, scalable, and operable
  • Platform & Application Partnership: Provide SRE guidance during major releases, migrations, and modernization initiatives
  • Security, Risk & Compliance: Ensure observability and automation comply with enterprise security and audit requirements
  • Security, Risk & Compliance: Support resilience validation, failover drills, and business continuity testing
  • Technical Leadership: Mentor and guide SRE and software engineers
  • Technical Leadership: Define standards for observability, automation, reliability, and incident response
  • Technical Leadership: Act as the technical authority for complex production and platform issues
Desired Qualifications
  • Experience in Software Engineering, SRE, DevOps, or Platform Engineering
  • Strong proficiency in Python for automation and tooling
  • Hands‑on experience with Grafana, Prometheus, and Splunk in production environments
  • Solid understanding of SLIs, SLOs, dashboards, alerting, and observability best practices
  • Experience applying AI/ML concepts to monitoring, alerting, or operational analytics
  • Strong knowledge of Linux, networking, and distributed systems
  • Experience with Cloud platforms and Kubernetes/OpenShift
  • Proven experience leading incidents, RCAs, and reliability initiatives
  • Experience building custom Prometheus exporters or advanced Grafana dashboards
  • Strong Splunk expertise (search, dashboards, alerts, log pipelines)
  • Experience operationalizing ML models for observability (AIOps)
  • Familiarity with CI/CD, Terraform, Ansible, and enterprise automation platforms
  • Experience supporting large‑scale, regulated, or globally distributed systems
  • Improved reliability and performance against defined SLOs
  • Reduced alert noise and faster detection and recovery of incidents
  • Increased automation and self‑healing adoption using Python and observability signals
  • Strong observability maturity across platforms and applications
  • Improved MTTD and MTTR through effective use of Grafana, Prometheus, and Splunk

Wells Fargo provides banking, investment, and payment services to individuals, businesses, and institutions. Its products include checking and savings accounts, loans, credit cards, wealth management, and payments, accessible through branches, online and mobile platforms, and full payment rails. The company combines a wide national footprint with a long history and a business model that integrates banking, investment, and payments, supported by a large network of branches and ATMs. Its goal is to help customers manage money, grow wealth, and move funds safely and reliably.

Company Size

10,001+

Company Stage

IPO

Headquarters

San Francisco, California

Founded

1851

Simplify Jobs

Simplify's Take

What believers are saying

  • Federal Reserve asset cap removal enables $1T loan expansion post-2025.
  • Q1 2026 net income hits $5.3B with $12.1B net interest income.
  • Jefferies Buy rating targets $100, projects 6.8% revenue growth to 2029.

What critics are saying

  • KGI Securities downgrades to Hold at $88 on April 16, 2026, overvaluation.
  • Q1 2026 net charge-offs surge to $1.1B, eroding credit quality.
  • Chime fintech poaches 70M customers, forcing branch closures in 24-36 months.

What makes Wells Fargo unique

  • Wells Fargo holds Charter No. 1, first national bank charter issued June 20, 1863.
  • Iconic stagecoach brand from 1852 Gold Rush express services persists today.
  • 1998 Norwest merger blends Midwest scale with West Coast franchise dominance.

Help us improve and share your feedback! Did you find this helpful?

Your Connections

People at Wells Fargo who can refer or advise you

Benefits

Health Insurance

401(k) Retirement Plan

Paid Vacation

Paid Sick Leave

Parental Leave

Disability Insurance

Life Insurance

Tuition Reimbursement

Commuter Benefits

Adoption Assistance

Company News

Squire Patton Boggs
Apr 27th, 2026
Squire Patton Boggs Advises ICF International on a $1.45 Billion Amended and Restated Credit Agreement | News | Squire Patton Boggs

Squire Patton Boggs represented ICF International, Inc. in connection with an amendment, restatement and increase to its $1.45 billion senior secured credit agreement with PNC Bank, National Association, as administrative agent, and the lenders party thereto. BOFA Securities, Inc. and Wells Fargo Securities, LLC acted as the joint lead arrangers on the transaction.

CANPACK
Apr 17th, 2026
Announcement of pricing of approximately $1,088 million senior notes

THIS RELEASE CONTAINS INSIDE INFORMATION CANPACK GROUP, INC. CANPACK S.A. (“CANPACK”, the “Company”, or the “Group”) Announcement of pricing of approximately $1,088 million (equivalent in a…

StreetInsider
Apr 14th, 2026
Marathon Petroleum enters $5 billion credit agreement

Marathon Petroleum Corporation (NYSE: MPC) entered into a $5 billion, five-year revolving credit agreement on April 7, 2026, according to a company statement.The agreement involves JPMorgan Chase Bank as administrative...

Simply Wall St
Apr 13th, 2026
Donaldson secures $400M credit facility to fund growth and acquisitions

Donaldson Company has entered into a three-year, unsecured delayed draw term loan credit facility of $400 million with a syndicate of lenders led by Wells Fargo Bank. The facility, signed on 8 April 2026, has no current borrowings and includes covenants on interest coverage and adjusted debt-to-EBITDA ratios. The committed borrowing capacity provides Donaldson with additional financial flexibility to fund future growth initiatives or acquisitions whilst maintaining balance sheet discipline. The announcement follows the appointment of Richard S. Lewis as chief executive officer and director, effective 2 March 2026. Analysts project the filtration company's revenue to reach $4.3 billion and earnings of $564.5 million by 2029, requiring 5% annual revenue growth. However, investors face risks from potential margin pressure due to rising input costs and tariffs.

Yahoo Finance
Apr 13th, 2026
Wells Fargo Q1 earnings: revenue expected to grow 7.6% year on year

Wells Fargo will announce its first-quarter earnings on Tuesday before market hours. Analysts expect the company's revenue to grow 7.6% year on year, reversing the 3.5% decrease recorded in the same quarter last year. Last quarter, Wells Fargo reported revenues of $21.37 billion, up 4.4% year on year, but slightly missed analysts' expectations for both revenue and net interest income. The company has missed Wall Street's revenue estimates multiple times over the past two years. Analysts have largely reconfirmed their estimates over the past 30 days. Wells Fargo shares have risen 12.7% over the last month, outperforming the banking sector's 8.5% average gain. The company will be the first amongst its peers to report earnings this season.