Full-Time

Lead Software Engineer II

Machine Learning

Updated on 5/2/2025

Thousand Eyes

501-1,000 employees

Network infrastructure monitoring and performance analytics

Compensation Overview

$241.3k - $306.7k/yr

+ Bonus

Senior, Expert

San Francisco, CA, USA

Category
Applied Machine Learning
AI & Machine Learning
Software Engineering
Required Skills
LLM
Python
TensorFlow
Neural Networks
PyTorch
Machine Learning
Apache Kafka
Data Analysis
Requirements
  • 8-10 years of software development experience, and direct experience in building and evaluating ML models and delivering large-scale ML products.
  • MS or PhD in a relevant field
  • Proficient in crafting machine learning models, including neural networks, transformer models, Large Language Models, decision trees, and other traditional machine learning models
  • Fluent in machine learning frameworks such as scikit-learn, XGBoost, PyTorch, or TensorFlow
  • Proficient in Python and able to transform abstract machine learning concepts into robust, efficient, and scalable solutions
  • Strong Computer Science fundamentals and object-oriented design skills
  • History of building large-scale data processing systems
  • Background working in a fast-paced development environment
  • Strong team collaboration and communication skills
Responsibilities
  • Collaborate with a team of skilled engineers to design, implement, and maintain large-scale AI/ML pipelines for real-time anomaly detection
  • Train and tune models, and perform model evaluations, across deep learning models, machine learning (AI/ML) models, and Large Language Models
  • Design and implement sophisticated anomaly detection algorithms, such as Isolation Forests, LSTM-based models, and Variational Autoencoders
  • Create robust evaluation frameworks and metrics to assess the performance of these algorithms
  • Implement and optimize stream processing solutions using technologies like Flink and Kafka

ThousandEyes specializes in monitoring network infrastructure and analyzing internet performance. Its platform operates in the cloud, providing businesses with tools to understand and enhance their digital experiences. By offering visibility into the performance of networks and applications, ThousandEyes enables companies to identify issues and improve the reliability of their online services. The platform maps the global structure of wide-area networks and measures performance metrics, ensuring that clients' digital ecosystems run smoothly. Unlike many competitors, ThousandEyes focuses on providing detailed insights and analytics tailored to various industries, including finance, healthcare, and retail. The company's goal is to help enterprises and service providers optimize their digital performance through a subscription-based model that offers real-time monitoring, outage detection, and customer support.

Company Size

501-1,000

Company Stage

Acquired

Total Funding

$1.1B

Headquarters

San Francisco, California

Founded

2010

Simplify Jobs

Simplify's Take

What believers are saying

  • AI-driven network monitoring enhances digital infrastructure automation and performance.
  • DORA enforcement in 2025 boosts demand for IT risk management solutions in Europe.
  • Full-stack observability trend increases need for integrated end-to-end visibility platforms.

What critics are saying

  • Increased competition from Cisco's Agile Services Networking may impact market share.
  • DORA enforcement may impose additional compliance costs and operational challenges.
  • AI-powered capabilities launch could lead to internal integration and management challenges.

What makes Thousand Eyes unique

  • ThousandEyes offers unmatched vantage points throughout the global Internet for superior visibility.
  • The platform provides real-time monitoring and outage detection for digital ecosystems.
  • ThousandEyes serves major brands, including 100+ of the Global 2000 and 60+ Fortune 500.

Benefits

Health Insurance

Dental Insurance

Vision Insurance

Disability Insurance

401(k) Retirement Plan

401(k) Company Match

Paid Holidays

Paid Vacation

Employee Stock Purchase Plan

Growth & Insights and Company News

Headcount

6 month growth

0%

1 year growth

0%

2 year growth

0%
PR Newswire
Mar 3rd, 2025
Cisco @ Mobile World Congress 2025: Accelerating Service Provider Growth In The Age Of AI

News Summary:
  • Cisco launches Agile Services Networking innovations to help service providers compete in the AI marketplace with simplified, resilient, and intelligent networking.
  • New assurance capabilities include real-time visibility into on-network and off-network connectivity, allowing service providers to build resilient, profitable AI-ready home and mobile subscriber experiences.
  • Early adopters are already delivering secure, competitive AI-connected experiences, improving networking, business performance, and customer outcomes.

BARCELONA, March 3, 2025 /PRNewswire/ -- MOBILE WORLD CONGRESS -- Cisco (NASDAQ: CSCO), the worldwide leader in networking and security, today announced networking innovations that empower service providers to introduce differentiated services and deliver assured, AI-connected experiences at scale.

Cisco's Agile Services Networking fundamentally evolves how service providers build and operate their networks, opening new ways to monetize the services needed to compete in the AI marketplace. A blueprint for growth, the architecture combines high-speed, feature-rich Silicon One routing, a unified software experience, and converged IP and optics within a single, seamless network. Early adopters of Cisco Agile Services Networking, including Arelion, Lumen, and Reliance Jio, have already vastly improved the performance of their network to cut costs, generate new business, and improve the customer experience.

"The pace of innovation in AI is astounding. Technical breakthroughs are just beginning to translate into new experiences for consumers and applications for businesses that will reshape how the world works and connects," said Jeetu Patel, Executive Vice President and Chief Product Officer, Cisco.

PYMNTS
Aug 8th, 2024
Banks And Their Tech Suppliers Face More IT Scrutiny In Europe

Banks and their IT providers will soon face tougher scrutiny in the European Union (EU). That’s because of the Digital Operational Resilience Act (DORA), which passed last year but isn’t set to be enforced until January of 2025. A report Thursday (Aug. 8) by CNBC examines the implications of the law, particularly in the wake of last month’s CrowdStrike outage. DORA requires banks to carry out strict IT risk management, digital operational resilience testing, and information and intelligence sharing on cyber threats and vulnerabilities, along with taking measures to manage third-party risks.

CNBC
Jun 4th, 2024
Cisco-owned ThousandEyes launches AI to predict and fix internet outages, teases ChatGPT-style tech

ThousandEyes, the internet monitoring unit of Cisco, unveiled a new set of AI-powered capabilities Tuesday, called Digital Experience Assurance, or DXA.

Auspreneur
Dec 12th, 2023
Coles looks to set up Full-Stack Observability to Enhance IT Service

Cisco tools are already in use: Coles has invested in Cisco technologies like wi-fi Catalyst Center, ThousandEyes for device monitoring, and AppDynamics for application monitoring.

PR Newswire
Jun 6th, 2023
Cisco Showcases Vision To Simplify Networking And Securely Connect The World

News Summary:
  • The company's vision for Cisco Networking Cloud will create a simpler network management platform experience to help customers easily access and manage all Cisco networking products from one place.
  • New innovations include SSO, API key exchange/repository, sustainable data center networking solutions and expanded network assurance with Cisco ThousandEyes.
  • Cisco Networking Cloud will dramatically simplify IT, with a more flexible Cisco Catalyst switch stack, improved visibility into data center power and energy consumption, and new AI data center blueprints to improve performance and visibility for network operators.

LAS VEGAS, June 6, 2023 /PRNewswire/ -- CISCO LIVE -- Cisco (NASDAQ: CSCO) is on a mission to simplify IT, today announcing its vision for Cisco Networking Cloud, an integrated management platform experience for both on-prem and cloud operating models.

Building a Better Future for Cisco Customers and Partners

Managing networks in today's era of connecting everyone, everywhere is hard. According to Cisco's State of Global Innovation report, 85% of IT professionals indicate they value simplicity in their IT systems. Simplicity becomes increasingly important with the advancement of cloud, IoT, Wi-Fi + 5G, AI/ML, and security. With so many technologies and applications coming together, it can be difficult for IT staff to deliver a consistent, unified experience whether in the office, at home, or on the go.

A simplified IT experience influences customer satisfaction, employee retention, and competitive differentiation. Cisco recognizes the struggles with fragmentation, lack of visibility, security threats, and time-consuming integration that get in the way of delivering better experiences. It understands that the journey to simplification is defined by each operator's business objectives, functional needs, and preferred consumption model.