Full-Time

Senior Engineer-AI Inference

Posted on 12/13/2025

Bank of America

Bank of America

10,001+ employees

Global banking, investing, and wealth management

No salary listed

Newark, NJ, USA + 3 more

More locations: Charlotte, NC, USA | Kennesaw, GA, USA | Addison, TX, USA

In Person

Category
AI & Machine Learning (2)
,
Required Skills
Python
Jupyter
Data Structures & Algorithms
Machine Learning
Postgres
RAG
Redis
Observability
Hadoop
DevOps
Oracle
Linux/Unix
Requirements
  • Minimum 8 years of relevant experience required.
  • Experience in Model Ops and design, software development with proven effectiveness in delivering technology in fast-paced, demanding, industry driven environment for AI/ML, and advanced analytics.
  • Hands on experience in both Python development on Linux.
  • Strong understanding of modern open-source data science platform architecture for storage & compute separation, interactive development workbenches, containers, and toolsets such as Jupyter, VSCode etc.
  • Experience of data sources and Vector Store platforms such as Redis, Solar, Postgres DB, FAISS, Teradata, Oracle, SQL Server, Hadoop etc.
  • Experienced in using design patterns and following best software engineering practices.
  • An understanding of fundamental algorithms and ability to optimize existing code.
  • Proficient written and verbal communication skills to support and shape the platform and clearly articulate technical designs and concepts; and to communicate effectively with all levels within the organization.
  • Experience with deploying models using vLLM/Triton Inference Server
  • Performance Tuning those models and deployment to provide higher throughput.
  • Experience with various inference metrics, and related monitoring and observability.
  • Experience with serving multiple tenants/clients with model endpoints with secure boundaries.
  • Experience with Atheization & Authorization, Policy as Code, Systems Integration, and Model Routing
  • Model Evaluation frameworks to evaluate different models and their tradeoffs between efficiency and metrics.
  • Experience building RAG for various knowledge bases, and document types.
  • Model Monitoring – Ability to collect metrics to measure things like Model Drift, KPIs.
  • Self-starter with the ability to challenge conventions, excellent communication skills.
  • Strong analytical skills which enable ability to problem solve, apply reason, take initiative, use judgment, and perform concurrent tasks.
  • Follows Test Driven Development practices including continual integration and clean code principles.
Responsibilities
  • Ensures that the design and engineering approach for complex features are consistent with the larger portfolio solution
  • Define the technology tool stack for the solution and evaluate and adapt new testing tool/framework/practices for team(s)
  • Enables team(s)/applications with Continuous Integration/Continuous Development (CI/CD) capabilities and engages with other technical stakeholders pertaining to efficient functioning of CI-CD pipeline
  • Guides and influences team(s) on design and best practices for high code performance –e.g. pairing, code reviews
  • Provides end-to-end delivery of complex features, including automation, for either a single team or multiple teams, at the program level
  • Conducts research, design prototyping and other exploration activities such as evaluating new toolsets and components for release management, CI/CD, and features
  • Works with stakeholders to establish high-level solution needs and with architects for technical requirements
  • Collaborate with product teams, data analysts and data scientists to design and build solutions.
  • Design and execute the implementation plans to both move forward strategically, while at the same time ensuring the current technology stack is supporting current needs.
  • Manage multiple priorities, and simultaneously engage with multiple teams worldwide.
  • Be vocal and actively participate in all session with business stakeholders and agile teams.
  • Manage next generation of architectural decision for advanced analytics platform, create strategy, roadmaps, present to tech and non-tech leaders.
  • Coach and mentor team members.
Desired Qualifications
  • Experience developing Gen AI training and Inferencing platform with open-source model, Gen AI Model servicing capabilities, designing RAG frameworks, MCP modules for enterprise data systems.
  • Automation
  • Influence
  • Result Orientation
  • Stakeholder Management
  • Technical Strategy Development
  • Application Development
  • Architecture
  • Business Acumen
  • Risk Management
  • Solution Design
  • Agile Practices
  • Analytical Thinking
  • Collaboration
  • Data Management
  • Solution Delivery Process

Bank of America provides a full range of financial services to individuals, small businesses, and large corporations, including banking, investing, asset management, and risk management products. Customers access services via branches, online and mobile banking, and advisory and trading capabilities across consumer banking, wealth management, corporate and investment banking. Its breadth, scale, and global reach enable cross-service solutions and large-scale operations that few peers match. Its goal is to be a trusted, full-service financial partner helping customers manage money, grow assets, and navigate risk.

Company Size

10,001+

Company Stage

IPO

Headquarters

Charlotte, North Carolina

Founded

1904

Simplify Jobs

Simplify's Take

What believers are saying

  • Q1 2026 net income hit $8.6 billion with EPS of $1.11 beating estimates.
  • Digital initiatives drove $30.3 billion revenue up 7-10.7% year-over-year.
  • Efficiency ratio improved to 61.22% with 16% return on tangible equity.

What critics are saying

  • CFPB sues Bank of America December 2024 for Zelle fraud enabling scams.
  • Chime erodes deposits with 4.3% APY versus BAC's 0.01-4.2% rates.
  • Federal Reserve June 2026 cut to 3.5% slashes $2-3 billion NII annually.

What makes Bank of America unique

  • Bank of America serves 56 million U.S. consumer relationships with full banking and wealth management.
  • Leads globally in corporate investment banking and trading services.
  • Merrill Lynch dominates wealth management among top institutions.

Help us improve and share your feedback! Did you find this helpful?

Benefits

Health Insurance

Dental Insurance

Vision Insurance

Life Insurance

Disability Insurance

Paid Vacation

Paid Sick Leave

Flexible Work Hours

Remote Work Options

Professional Development Budget

Conference Attendance Budget

Company News

Bloomberg Law
Apr 15th, 2026
Herbalife raises $800M bond to refinance 12.25% debt with 7.75% notes

Herbalife has raised $800 million through a junk-bond sale to refinance existing high-interest debt. The nutrition-focused multilevel-marketing company sold seven-year senior secured notes at a 7.75% yield, led by Bank of America. The proceeds will be used to repay bonds due in 2029 that carry a 12.25% interest rate, significantly reducing Herbalife's borrowing costs. The successful offering comes a month after the company shelved a loan offering due to market volatility, taking advantage of a recent rebound in investor demand for risky debt.

Yahoo Finance
Apr 14th, 2026
Bank of America reports earnings Wednesday, revenue expected to rise 5.8% year on year

Bank of America will report earnings on Wednesday before market open. Last quarter, the company beat revenue expectations with $28.55 billion, up 7.1% year-on-year, though it only narrowly exceeded earnings per share estimates. Analysts expect revenue to grow 5.8% year-on-year this quarter, an improvement from the 4.7% increase recorded in the same period last year. Analyst estimates have remained largely unchanged over the past 30 days. The bank's shares have risen 13.5% over the last month, outperforming the 9.1% average gain across the banking sector. Analysts have set an average price target of $60.56, compared to the current share price of $53.37. Bank of America historically tends to exceed Wall Street's expectations.

Bitget
Apr 13th, 2026
On April 7, 2026, Marathon Oil Corporation entered into a five-year revolving credit agreement worth 5 billion dollars with Bank of America and several other financial institutions. | Bitget News

This agreement will significantly enhance the company's liquidity management capabilities, providing flexible financial support for its strategic investment | Bitget crypto news!

National Today
Apr 9th, 2026
CCLA Investment Management acquires $124M stake in Bank of America with 2.2M shares

CCLA Investment Management acquired 2,254,107 shares in Bank of America Corporation during the fourth quarter of 2025, valued at approximately $123.95 million, according to a 13F filing submitted on 9 April 2026. The position represents roughly 2% of CCLA's total investment portfolio, making Bank of America the firm's 23rd largest holding. The acquisition signals institutional confidence in Bank of America's long-term prospects and demonstrates continued investor interest in major US financial institutions. The purchase establishes Bank of America as one of CCLA's top 25 holdings, reflecting the investment management firm's belief in the bank's valuation and growth potential despite broader economic uncertainty.

Yahoo Finance
Apr 9th, 2026
BofA lifts Broadcom target to $450 on $35B+ Google, Anthropic supply deals through 2031

Bank of America has reiterated a buy rating for Broadcom stock with a $450 price target following new supply agreements with Google and Anthropic revealed in an 8-K filing. Broadcom shares rose 4.28% to $348.27 on the news. Under the agreements, Broadcom will develop custom Tensor Processing Units for Google through 2031 and supply networking components for AI infrastructure. Anthropic will access approximately 3.5 gigawatts of AI computing capacity starting in 2027, which analysts value at over $35 billion. Bank of America analyst Vivek Arya said the deals solidify Broadcom's position as Google's main TPU design partner and address concerns about insourcing. Analysts expect Broadcom's AI accelerator market share to grow from under 10% in 2025 to approximately 15% by 2027.

INACTIVE