Full-Time

Machine Learning Engineer - Model Training Infrastructure

AML-Engine

Posted on 2/13/2025

ByteDance

10,001+ employees

Operates global content platforms and apps

No salary listed

Mid

Company Does Not Provide H1B Sponsorship

San Jose, CA, USA

Category
Applied Machine Learning
Deep Learning
AI & Machine Learning
Required Skills
Python
TensorFlow
CUDA
PyTorch
Machine Learning
C/C++
Requirements
  • Proficient in C/C++, CUDA, and Python, with solid programming skills.
  • Familiar with deep learning frameworks (TensorFlow/PyTorch).
  • Experience in developing and deploying large-scale systems.
  • Ability to work independently and complete projects end to end in a timely manner.
  • Good communication and teamwork skills to clearly communicate technical concepts with other teammates.
  • Experience improving core machine learning infrastructure (TensorFlow, PyTorch, and JAX).
  • Experience contributing to an open-source machine learning framework (TensorFlow/PyTorch).
  • Experience using or designing open-source machine learning lifecycle management systems such as TFX.
Responsibilities
  • Responsible for the design and implementation of a global-scale machine learning system for feeds, ads and search ranking models.
  • Responsible for improving the usability and flexibility of the machine learning infrastructure.
  • Responsible for improving the workflow of model training and serving, data pipelines, storage systems, and resource management for multi-tenant machine learning systems.
  • Responsible for designing and developing key components of ML infrastructure and mentoring interns.

ByteDance operates various content platforms that aim to inform, educate, entertain, and inspire users from different backgrounds. Its main products include Toutiao, a news aggregation app, and Douyin, a popular short video sharing platform in China. Internationally, ByteDance is best known for TikTok, which allows users to create and share short videos. Other products include Helo, a social media platform in India, and Lark, an enterprise collaboration tool. The company uses advanced algorithms to personalize user experiences, keeping users engaged with user-generated content. ByteDance primarily earns revenue through advertising, allowing brands to target specific audiences on its platforms. Unlike competitors such as Facebook and Google, ByteDance focuses heavily on short video content and user engagement through its unique platforms. The goal of ByteDance is to connect people globally through diverse content while providing effective advertising solutions for businesses.

Company Size

10,001+

Company Stage

Private

Total Funding

$5.6B

Headquarters

Beijing, China

Founded

2012

Simplify's Take

What believers are saying

  • Investment in data centers enhances infrastructure and user experience in Southeast Asia.
  • Partnership with Qualcomm positions ByteDance in the growing VR market.
  • Expansion of TikTok Shop into new markets diversifies revenue streams.

What critics are saying

  • VR expansion faces competition from established players like Meta, impacting market share.
  • AI advancements may lead to regulatory challenges in regions with strict AI laws.
  • Data center investments in Brazil and Thailand face geopolitical and regulatory risks.

What makes ByteDance unique

  • ByteDance leverages advanced algorithms for personalized user experiences across its platforms.
  • The company has a diverse product portfolio, including TikTok, Douyin, and Lark.
  • ByteDance's global presence spans major cities like Los Angeles, London, and Tokyo.

Benefits

Hybrid Work Options

Growth & Insights and Company News

Headcount

6 month growth

0%

1 year growth

0%

2 year growth

0%
Channel News Asia
Apr 25th, 2025
Exclusive-TikTok owner weighs data center project in Brazil, sources say

In February, ByteDance announced plans to invest $8.8 billion in data centers in Thailand over five years.

Future Week
Apr 25th, 2025
Week In Review - OpenAI Wants to Buy Chrome, the British Council Partners with Creatopy, and Meta Expands Ads on Threads

In this week's week in review: OpenAI says it wants to buy Google Chrome at the tech giant's antitrust trial, the British Council partners with AI ads tool Creatopy, Meta expands its ads offering on Instagram Threads, and BMW in China works with ByteDance for marketing.

Generative AI
Apr 22nd, 2025
ByteDance's Seedream 3: The GPT-4o Rival Hiding in Plain Sight

Just when I thought the AI image space was settling down, ByteDance quietly released Seedream 3.0 on CapCut's Dreamina platform.

Aibase
Apr 21st, 2025
Trae v1.3.0 Released: AI-Powered Programming Experience Enhanced

ByteDance has officially released Trae v1.3.0, its AI-powered integrated development environment (IDE).

Wanderwell
Apr 13th, 2025
ByteDance Developing AI Smart Glasses to Compete with Meta's Offerings

During the Mobile World Congress (MWC) 2025, ByteDance announced a partnership with Qualcomm to develop a next-generation VR headset aimed at competing with Meta's Oculus Quest series.

INACTIVE