Full-Time

Research Engineer

Machine Learning

Confirmed live in the last 24 hours

Captions

Captions

51-200 employees

Video captioning and translation services

Consumer Software
Entertainment

Mid

New York, NY, USA

Requires in-person presence at NYC HQ located in Union Square.

Category
Deep Learning
Computer Vision
AI & Machine Learning
Required Skills
Tensorflow
Data Structures & Algorithms
Pytorch
Computer Vision
Requirements
  • Masters in computer science or related field and 3+ years of industry experience.
  • Strong academic background with a focus on computer vision and hands-on experience implementing generative models. Specializations can include Diffusion, Video Generation, NeRFs, Gaussian Splatting, GANs, etc.
  • Expertise in Deep Learning: Proficiency in deep learning frameworks such as TensorFlow, PyTorch, or similar, with hands-on experience in training and implementing generative models.
  • Strong understanding of Computer Science fundamentals (algorithms and data structures).
Responsibilities
  • Train, implement, and deploy machine learning models that drive product innovation and solve complex real-world problems.
  • Apply scientific principles to implement state-of-the-art algorithms and solutions for generative computer vision and video technologies.
  • Experiment with and optimize advanced neural network architectures for improved performance and efficiency.
  • Collaborate with cross-functional teams to integrate ML models into scalable systems and services impacting millions of users.
  • Stay current with the latest research and advancements in the field of machine learning and computer vision.

Captions.ai enhances video content by providing captioning and translation services tailored for content creators, social media influencers, marketing agencies, and businesses. Their main offerings include automatic subtitle generation, translation into 28 languages, and video compression to improve performance. These tools simplify the video production process, allowing users to produce professional-quality videos with ease. Unlike many competitors, Captions.ai uses a freemium model, offering basic services for free while charging for advanced features, which helps attract a large user base and convert free users into paying customers. The company's goal is to make high-quality video content accessible to a wider audience, and recent funding will support their growth and product development.

Company Stage

Series C

Total Funding

$82.7M

Headquarters

New York City, New York

Founded

2021

Growth & Insights
Headcount

6 month growth

-3%

1 year growth

-17%

2 year growth

433%
Simplify Jobs

Simplify's Take

What believers are saying

  • The $60 million Series C funding round led by top VCs, including Kleiner Perkins, indicates strong investor confidence and provides substantial resources for growth.
  • Expansion to web and desktop platforms allows Captions.ai to cater to a wider audience, enhancing user experience and engagement.
  • The launch of the Lipdub app, which supports translation and dubbing in 28 languages, showcases Captions.ai's innovative approach to making video content more accessible globally.

What critics are saying

  • The competitive landscape in AI-driven video editing is intense, with numerous startups and established companies vying for market share.
  • Technical challenges, such as the lag between audio and lip movement in the Lipdub app, could affect user satisfaction and adoption.

What makes Captions unique

  • Captions.ai's focus on AI-driven video enhancement, including automatic subtitle generation and multi-language translation, sets it apart from traditional video editing tools.
  • Their freemium model attracts a broad user base, converting free users into paying customers, which is a strategic advantage over competitors with only paid services.
  • The recent $60 million Series C funding and expansion to web and desktop platforms demonstrate their commitment to innovation and scalability.

Help us improve and share your feedback! Did you find this helpful?