Full-Time

Research Engineer

Machine Learning

Posted on 11/29/2024

Captions

Captions

51-200 employees

Video captioning and translation services

Consumer Software
Entertainment

Mid

New York, NY, USA

Requires in-person presence at NYC HQ located in Union Square.

Category
Deep Learning
Computer Vision
AI & Machine Learning
Required Skills
Tensorflow
Data Structures & Algorithms
Pytorch
Computer Vision
Requirements
  • Masters in computer science or related field and 3+ years of industry experience.
  • Strong academic background with a focus on computer vision and hands-on experience implementing generative models. Specializations can include Diffusion, Video Generation, NeRFs, Gaussian Splatting, GANs, etc.
  • Expertise in Deep Learning: Proficiency in deep learning frameworks such as TensorFlow, PyTorch, or similar, with hands-on experience in training and implementing generative models.
  • Strong understanding of Computer Science fundamentals (algorithms and data structures).
Responsibilities
  • Train, implement, and deploy machine learning models that drive product innovation and solve complex real-world problems.
  • Apply scientific principles to implement state-of-the-art algorithms and solutions for generative computer vision and video technologies.
  • Experiment with and optimize advanced neural network architectures for improved performance and efficiency.
  • Collaborate with cross-functional teams to integrate ML models into scalable systems and services impacting millions of users.
  • Stay current with the latest research and advancements in the field of machine learning and computer vision.

Captions.ai enhances video content by providing captioning and translation services tailored for content creators, social media influencers, marketing agencies, and businesses. Their main offerings include automatic subtitle generation, translation of captions into 28 languages, and video compression to improve performance. These tools simplify the video production process, allowing users to produce professional-quality videos with ease and reach a wider audience. Captions.ai differentiates itself from competitors through its freemium model, offering basic services for free while charging for advanced features, which helps attract a large user base and convert free users into paying customers. The company's goal is to make high-quality video content creation accessible to everyone, supported by recent funding to expand their services and market reach.

Company Stage

Series C

Total Funding

$82.7M

Headquarters

New York City, New York

Founded

2021

Growth & Insights
Headcount

6 month growth

31%

1 year growth

-20%

2 year growth

433%
Simplify Jobs

Simplify's Take

What believers are saying

  • Recent $60M Series C funding boosts expansion and technological advancements.
  • Acquisition of AlpacaML enhances creative tools with AI rendering capabilities.
  • Expansion to web and desktop broadens user base and increases engagement.

What critics are saying

  • Integration of AlpacaML may face challenges, delaying product development.
  • Expansion to web and desktop could strain resources, affecting iOS app quality.
  • Increased competition from startups like Beeble AI may challenge market position.

What makes Captions unique

  • Captions offers AI-powered video editing with automatic subtitle generation and language dubbing.
  • The platform supports 28 languages, enhancing accessibility for global content creators.
  • Captions' freemium model attracts a wide user base, converting free users to paid subscribers.

Help us improve and share your feedback! Did you find this helpful?

INACTIVE