Full-Time

Senior Research Engineer

Speaker Identification

Posted on 5/29/2025

Otter.ai

Otter.ai

201-500 employees

Real-time AI transcription and meeting automation

Compensation Overview

$200k - $220k/yr

Senior

Mountain View, CA, USA

In Person

Category
Speech Recognition
AI & Machine Learning
Required Skills
Python
Tensorflow
Neural Networks
Pytorch
Machine Learning
C/C++
Linux/Unix
Requirements
  • 5+ years of industry experience on DNN based speaker id and diarization system
  • Masters or Ph.D. degree in computer science, machine learning, speech/language processing
  • Strong programming skills with working knowledge of C++ and Python
  • Experience with TensorFlow, PyTorch, or similar frameworks in a Linux environment
  • Experience with end-to-end neural network based diarization and state of the art deep learning
  • Experience building data crawlers and data clean up pipelines
  • Excellent problem-solving skills and the ability to work independently as well as collaboratively in a fast-paced research environment with good communication skills
Responsibilities
  • R&D work on a scalable and efficient state-of-the-art speaker identification system using machine learning, signal processing, and pattern recognition methods capable of processing large volumes of audio data in real-time alongside our multidisciplinary team
  • Collaborate with cross-functional teams to integrate speaker identification technology into existing products or develop new applications.
  • Evaluate the performance of speaker identification models under various conditions and scenarios.
  • Support product development, deployment, and maintenance activities.
  • Document research findings in technical reports, papers, and patents
  • Stay abreast of SID and diarization research and participate in conferences, workshops, and seminars

Otter.ai offers real-time transcription services powered by artificial intelligence, primarily through its AI Meeting Assistant, Otter. This tool transcribes audio, identifies speakers, and generates meeting summaries, making it useful for professionals like journalists and corporate teams. It integrates with popular virtual meeting platforms such as Zoom and Microsoft Teams, and operates on a subscription model with various pricing tiers. Otter.ai's goal is to enhance productivity by automating the documentation of meetings and lectures, setting it apart from competitors.

Company Size

201-500

Company Stage

Series B

Total Funding

$73M

Headquarters

Los Altos, California

Founded

2016

Simplify Jobs

Simplify's Take

What believers are saying

  • Growing demand for AI-driven meeting solutions boosts Otter.ai's market potential.
  • Advancements in NLP and cloud solutions can enhance Otter.ai's offerings.
  • The speech-to-text API market is projected to grow significantly, benefiting Otter.ai.

What critics are saying

  • Fireflies.ai's 'Talk to Fireflies' feature poses a competitive threat to Otter.ai.
  • OpenAI's 'Record Mode' feature challenges Otter.ai's market share.
  • Notion's built-in transcription feature competes directly with Otter.ai.

What makes Otter.ai unique

  • Otter.ai offers real-time transcription with speaker identification and meeting summaries.
  • The platform integrates seamlessly with Zoom, Microsoft Teams, and Google Meet.
  • OtterPilot enhances productivity by capturing slides and extracting action items during meetings.

Help us improve and share your feedback! Did you find this helpful?

Benefits

Competitive salary

Comprehensive stock and equity package

Hybrid Work Options

Company Gatherings

Comprehensive health care package (medical, dental, vision, life, disability)

PTO

401(k) retirement savings program

Growth & Insights and Company News

Headcount

6 month growth

1%

1 year growth

6%

2 year growth

3%
Tech Funding News
Jun 13th, 2025
Fireflies Ignites A $1B Valuation With Real-Time Ai Meeting Assistant

Fireflies.ai, the AI meeting assistant used in 75% of Fortune 500 companies, has reached a $1B valuation following its first tender offer in a move that signals a new era for workplace productivity tools. The announcement coincides with the launch of its latest innovation, Talk to Fireflies, a voice-activated AI meeting assistant with real-time web search capabilities powered by Perplexity.For remote teams juggling constant meetings and tight decision cycles, instant access to reliable information can be a game-changer. Fireflies is betting on this unmet need. Rather than toggling between meeting apps and web browsers or postponing decisions due to missing data, users can now speak directly to Fireflies during meetings and get sourced answers on the spot-freeing up time, reducing friction, and boosting informed collaboration.Voice-led AI meetings meet the power of searchWith its new Talk to Fireflies feature, the San Francisco-based company enables participants to interact using voice or chat in over 60 languages across Zoom, Google Meet, and Microsoft Teams. By integrating Perplexity’s web search technology, Fireflies gives users real-time access to online information, without ever leaving the meeting environment.“Talk to Fireflies lets people search the web in real-time and get answers about the current meeting. For example, a late joiner can ask, ‘Hey Fireflies, what key decisions have been made so far?’ or a team can find information by asking, ‘Hey Fireflies, what are the market growth projections for AI meeting agents?’” said Fireflies co-founder and CEO Krish Ramineni

VentureBeat
Jun 4th, 2025
Openai Hits 3M Business Users And Launches Workplace Tools To Take On Microsoft

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More. OpenAI announced Wednesday that its business user base has surged 50% since February, reaching 3 million paying enterprise customers as the artificial intelligence company unveiled an expansive suite of new workplace tools designed to compete directly with Microsoft’s enterprise AI offerings.The milestone, revealed alongside the launch of several new business-focused features, underscores OpenAI’s aggressive push into corporate markets where reliable, secure AI tools can command premium prices. The company introduced new “connectors” that integrate ChatGPT with popular business applications, a meeting transcription feature called Record Mode, and enhanced versions of its Deep Research and Codex coding tools.“ChatGPT is helping transform businesses by helping employees work with more productivity, efficiency, and more strategically,” an OpenAI spokesperson told VentureBeat. “Over the last few months, we’ve continued evolving ChatGPT into an increasingly impactful platform for work with business products like connectors, record mode with ChatGPT, Codex, image generation, deep research, and more.”The rapid enterprise adoption comes as OpenAI faces intensifying competition from tech giants like Microsoft and Google, which offer deep workplace integrations through existing enterprise relationships. Yet the company appears to be winning customers by positioning itself as the premier destination for cutting-edge AI capabilities.“Customers often choose ChatGPT for direct access to SOTA (state-of-the-art) models and tools, combined with enterprise-grade security and commitments on never training on business data,” the spokesperson said, emphasizing OpenAI’s competitive advantage as an “AI-native” company focused solely on advancing artificial intelligence rather than integrating it into legacy systems.OpenAI’s new workplace connectors challenge Microsoft and Google’s enterprise AI dominanceThe newly announced connectors represent OpenAI’s most direct challenge yet to Microsoft’s workplace AI strategy

Amply
May 20th, 2025
13 of the best free AI tools for business

Otter.ai integrates seamlessly with Zoom, Microsoft Teams, and Google Meet, making it a versatile tool for remote and hybrid teams.

Tech Info PK
May 13th, 2025
Notion Takes on AI Note-Takers Like Granola with Its Own Transcription Feature

In the fast-evolving world of productivity tools, Notion has once again stepped up its game by introducing a built-in transcription feature, directly competing with-taking apps like Granola, Otter.ai, and Fireflies.ai.

PR Newswire
May 12th, 2025
Speech-To-Text Api Market To Reach $5 Billion By 2024 In The Short Term And $21 Billion By 2034 Globally, At 15.2% Cagr: Allied Market Research

The global speech-to-text API market is experiencing rapid growth due to rising demand for voice recognition technology in smart devices and cloud-based services. Businesses are adopting these solutions to enhance productivity, accessibility, and customer experiences, driving further expansion.WILMINGTON, Del. , May 12, 2025 /PRNewswire/ -- Allied Market Research published a report titled, "Speech-to-text API Market - Global Opportunity Analysis and Industry Forecast, 2024-2034," valued at $5 Billion in 2024. The market is expected to grow at a CAGR of 15.2% from 2025 to 2034, reaching $21 Billion by 2034. Key factors fueling this growth include the increasing adoption of AI-powered voice recognition, demand for real-time transcription in healthcare and legal sectors, and the rise of voice-enabled smart devices. In addition, advancements in natural language processing (NLP) and cloud-based solutions are accelerating market expansion.Report Overview:The speech-to-text API market is driven by the rising demand for voice-enabled applications in smart devices, virtual assistants, and customer service automation

INACTIVE