Full-Time

Research Scientist in Large Multimodal Models Applications

ByteDance

10,001+ employees

Operates global content platforms and apps

No salary listed

Senior, Expert

Company Does Not Provide H1B Sponsorship

San Diego, CA, USA

Category
Computer Vision
AI Research
AI & Machine Learning
Required Skills
LLM
Computer Vision
Requirements
  • Proficiency with diffusion models, LLMs, and other advanced large multimodal models, including experience with model training, tuning, and application.
  • Familiarity with computer vision (CV) algorithms, including GANs, VAEs, and diffusion models for AIGC.
Responsibilities
  • Contribute to the research and development of multimedia algorithms based on large multimodal models, including but not limited to video understanding, quality assessment, video processing and enhancement, and video compression.
  • Optimize and accelerate the performance of algorithms related to large multimodal models.
  • Explore the implementation of large multimodal models in multimedia applications such as short video streaming, video transcoding, and live streaming.
  • Conduct advanced academic research on large multimodal models and publish findings in top international conferences and journals.
Desired Qualifications
  • Experience with NLP and RL algorithms and knowledge of models such as Transformer, BERT, and GPT are preferred.
  • A history of leading impactful projects in large multimodal models or publishing in top conferences (NeurIPS, ICLR, ICML, etc.) is advantageous.

ByteDance operates various content platforms, including Toutiao for news aggregation and TikTok for short video sharing, catering to a global audience. The company uses advanced algorithms to personalize user experiences, keeping users engaged with user-generated content. ByteDance primarily earns revenue through advertising, allowing brands to target specific audiences effectively. Its goal is to connect people through diverse content while providing effective advertising solutions for businesses.

Company Size

10,001+

Company Stage

Private

Total Funding

$5.6B

Headquarters

Beijing, China

Founded

2012

Simplify Jobs

Simplify's Take

What believers are saying

  • Investment in data centers in Brazil and Thailand boosts data processing capabilities.
  • Seed1.5-VL model enhances AI capabilities, improving content personalization and user engagement.
  • Seedream 3.0 on CapCut's Dreamina platform attracts creators with innovative AI image processing.

What critics are saying

  • TikTok's U.S. eCommerce job cuts may impact operational efficiency and morale.
  • Brazil data center project faces regulatory and political challenges.
  • Open-source AI tools could increase competition and intellectual property risks.

What makes ByteDance unique

  • ByteDance's advanced algorithms personalize user experiences, enhancing engagement and retention.
  • The company leverages open-source AI tools like Agent TARS for task automation.
  • ByteDance's DeerFlow tool combines NLP and data analytics for superior customer acquisition.

Benefits

Hybrid Work Options

Growth & Insights and Company News

Headcount

6 month growth

0%

1 year growth

0%

2 year growth

0%
ArticleX
Jun 5th, 2025
ByteDance's OmniHuman Revolutionizes AI Video Animation With Single Image Technology

While ByteDance has released a technical paper detailing the system's architecture and training methodology, they have not yet announced plans for public release or open-source availability.

Gadgets 360
May 27th, 2025
ByteDance Unveils Bagel Open Source Multimodal AI Model With Support for Generating, Editing Images

PYMNTS
May 21st, 2025
TikTok Shop Job Cuts Loom Amid Ban and Tariff Uncertainty

TikTok’s U.S. eCommerce workers are reportedly facing possible job cuts. That’s according to a Bloomberg News report Wednesday (May 21), citing an internal memo to the social media company’s employees advising them to work from home as they await emails on “difficult decisions.” The Chinese-owned company is examining ways to “create a more efficient operating model,” wrote Mu Qing, who took over TikTok Shop in the U.S. in April.

VC Cafe
May 16th, 2025
Weekly Firgun Newsletter - May 16 2025

ByteDance has introduced Agent TARS, an open-source AI tool designed for automating complex tasks by visually interpreting web content and interacting with system elements.

MarkTechPost
May 15th, 2025
ByteDance Introduces Seed1.5-VL: A Vision-Language Foundation Model Designed to Advance General-Purpose Multimodal Understanding and Reasoning

Researchers at ByteDance have developed Seed1.5-VL, a compact yet powerful vision-language foundation model featuring a 532M-parameter vision encoder and a 20B-parameter Mixture-of-Experts LLM.