Full-Time

Senior Site Reliability Architect

Security Engineering

Updated on 3/14/2025

ByteDance

ByteDance

10,001+ employees

Operates global content platforms and apps

No salary listed

Senior

Company Does Not Provide H1B Sponsorship

San Jose, CA, USA

Category
Cybersecurity
IT Project Management
IT & Security
Required Skills
Kubernetes
Python
Grafana
Java
Go
Prometheus
Ansible
Spring
Django
Requirements
  • A bachelor's degree or higher in computer science, information security, or a related field.
  • 5 years of relevant experience in developing and maintaining large-scale distributed systems and SRE platform/tool
  • Solid programming skills, proficient in at least one programming language among Go/Java/Python/Shell, familiar with at least one Web framework such as Gin/Django/Spring, and have a considerable understanding of its design principles.
  • Solid background in operating systems, networks, storage, and computer architectures; familiar with cloud-native frameworks like Kubernetes.
  • Experienced in designing highly reliable and available system architectures. Including distributed systems, microservice architectures, load balancing, storage systems, fault-tolerance mechanisms, etc.
  • Proficient in system troubleshooting, capable of identifying potential problems in the system and designing prevention solutions with effective recovery plans.
  • Familiar with various popular and classic SRE technologies and tools (such as Ansible, ELK, Prometheus, and Grafana, etc.) with an insight into new technologies.
Responsibilities
  • Design the roadmap of improvement of reliability and stability of security building blocks and drive the design and implementation of SRE architecture or frameworks for high availability and reliability of existing security products and services
  • Build an SRE framework for system deployment, upgrade, rapid troubleshooting, and disaster recovery, and promote the design and development of SRE infrastructure and maintenance tools for full lifecycle security system development
  • Responsible for the capacity planning of security building blocks; by analyzing past and future business development, assess the changes in the demand for system resources; accurately estimate the usage of resources such as storage, computing, and networking, and proactively expand and optimize resources.
  • Drive the establishment and improvement of the system monitoring framework, and enhance the awareness of the system operation status.
  • Develop and implement incident response processes and contingency plans; improve the team's emergency handling capabilities.
Desired Qualifications
  • A Master or PhD degree in computer science, information security, or a related field is preferred
  • An international degree or international working experience
  • Experience of R&D in software, especially large scale distributed systems

ByteDance operates various content platforms, including Toutiao for news aggregation and TikTok for short video sharing, catering to a global audience. The company uses advanced algorithms to personalize user experiences, which keeps users engaged and returning for more. ByteDance differentiates itself from competitors like Facebook and Google by focusing on user-generated content and effective targeting for advertising. Its goal is to connect users with relevant content while providing businesses with effective advertising solutions.

Company Size

10,001+

Company Stage

Private

Total Funding

$5.6B

Headquarters

Beijing, China

Founded

2012

Simplify Jobs

Simplify's Take

What believers are saying

  • Increased AI focus boosts user retention and advertising revenue on TikTok and Douyin.
  • ByteDance's $400 billion valuation indicates strong investor confidence for future expansion.
  • OmniHuman-1 opens new revenue streams in digital content creation and entertainment.

What critics are saying

  • Ole Obermann's departure may disrupt TikTok's music licensing and partnerships.
  • Cancellation of Broadcom chip project could affect ByteDance's AI hardware innovation.
  • Geopolitical tensions over TikTok may lead to regulatory challenges or ownership changes.

What makes ByteDance unique

  • ByteDance's AI-driven content personalization enhances user engagement on platforms like TikTok and Douyin.
  • AIBrix positions ByteDance as a leader in AI research, attracting partnerships and collaborations.
  • PhotoDoodle AI diversifies ByteDance's offerings, appealing to digital art enthusiasts.

Help us improve and share your feedback! Did you find this helpful?

Benefits

Hybrid Work Options

Growth & Insights and Company News

Headcount

6 month growth

0%

1 year growth

0%

2 year growth

0%
Asian Financial
Mar 12th, 2025
Shengshu Technology Appoints Former ByteDance AI Executive as CEO

AsianFin - Shengshu Technology, an AI video company, has appointed Luo Yihang, a former AI executive at ByteDance and head of the AI unit at Volcano Engine, as its new CEO.

Investing.com
Mar 4th, 2025
ByteDance launches new share repurchase program at higher valuation - Reuters

ByteDance launches new share repurchase program at higher valuation - Reuters.

TweakTown
Mar 2nd, 2025
ByteDance's custom chip made by Broadcom has been canceled, Broadcom to lose $2B to $3B

In a different post on X, @Jukanlosreve explained: "In June last year, ByteDance reportedly partnered with Broadcom to develop a 5nm AI accelerator, a type of ASIC.

U.S. News & World Report
Feb 28th, 2025
Bytedance's TikTok to Invest $8.8 Billion in Thailand Data Centres, Official Says

Bytedance's TikTok to invest $8.8 billion in Thailand data centres, official says.

Aibase
Feb 28th, 2025
ByteDance Launches AIBrix: A New Open-Source Inference System Designed for Large Language Models

ByteDance launches AIBrix: A new open-source inference system designed for large language models.