Full-Time

Siteops Production Operations Engineer

Production Operations

Posted on 9/26/2025

Meta

Meta

10,001+ employees

Global social networks and advertising platform

Compensation Overview

$153.2k - $178.2k/yr

+ Bonus + Equity

Company Historically Provides H1B Sponsorship

Monroe, GA, USA

In Person

Category
Data & Analytics (1)
Required Skills
Bash
PHP
Rust
Python
SQL
Perl
Go
Linux/Unix
Data Analysis
Requirements
  • Requires a Master’s degree (or foreign equivalent) in Computer Science, Computer Software, Computer Engineering, Telecommunications or related field
  • Requires completion of a graduate-level course, research project or internship involving the following: 1. Linux (or equivalent OS) in a complex IT environment with the ability to triage, debug, and troubleshoot complex, systemic issues 2. Server hardware and components, including storage 3. Interdependencies of data center functions and technologies including electrical, cooling, structured cabling, security, and network 4. Managing multiple technical issues concurrently driving to the root cause 5. Participating in or leading technical projects such as process improvement, technology, or automation 6. HTTP, DNS, RAID, and DHCP 7. Providing technical guidance to external vendors 8. Debugging, modifying and developing scripting or programming languages in at least one of these languages: Bash, PHP, Python, SQL, Rust, Go or Perl 9. Out-of-band/lights-out server communication methods, including IPMI and serial console 10. Using data and metrics to drive decisions
Responsibilities
  • Support platform health by successfully resolving and closing complex tickets, while addressing the overall issue (i.e. addressing root cause) including, but not limited to, remote troubleshooting and physical inspection of services in data halls.
  • Perform deep dives and root cause analysis of complex technical issues within the data center, ranging from automated tooling to hardware failures and network issues.
  • Facilitate collaboration with cross-functional teams on projects and initiatives related to topics such as process, hardware and automation.
  • Lead the introduction of new platforms and hardware to the site and geographical area, in collaboration with partners and global resources, accelerating the time it takes to bring these products to sustained mass production.
  • Use tools and data analysis effectively to identify issues that are larger in scope and which impact one or multiple Data Centers.
  • Take actions to communicate with all stakeholders appropriately and manage or escalate as needed.
  • Drive corrective actions of complex hardware issues, work with internal teams and vendors provide an ownership stake, and influence future design changes to ensure ease of serviceability.
  • Solve complex and systemic hardware and/or software issues at scale using scripting, automation, and tooling to drive global resolution.
  • Continuously evaluate and identify areas for improvement in processes, tools, and systems to optimize efficiency and quality of repairs.
  • Use data analytics to drive maximum server up-time and utilization rates, understanding hardware failure rates and service level agreements.
  • Coach and mentor team members to evaluate and identify better ways to resolve issues, and define updates to tools and processes.
  • Provide engineering support and be a go-to technical resource and Subject Matter Expert for the team, leadership, and cross-functional teams in all aspects of operating and maintaining data center servers.
  • Maintain and update documentation i.e. procedures, runbooks and guides.
  • Build cross functional relationships and influence policies and procedures that improve global data center operations.
  • Participate in 24/7 on-call rotation.

Meta Platforms Inc. runs a family of social apps including Facebook, Instagram, and WhatsApp to help people connect, share content, and participate in online communities. It also develops virtual reality hardware and experiences through Oculus and is exploring the metaverse. Most revenue comes from advertising, with tools that let businesses target audiences using data from its large user base, plus VR product sales and digital services. The company differentiates itself by owning multiple major social platforms, offering a scalable cross-platform ad platform, and investing in VR, AR, and AI to expand digital experiences and monetization opportunities.

Company Size

10,001+

Company Stage

IPO

Headquarters

Menlo Park, California

Founded

2004

Simplify Jobs

Simplify's Take

What believers are saying

  • Q1 2026 revenue surges 33% to $56.3 billion from AI-enhanced advertising.
  • Wisconsin approves $1B 220MW data center online in 2027 for AI expansion.
  • Analysts project stock reaching $653–$792 in five years on ad and AI scale.

What critics are saying

  • EU fines Meta 1.2 billion euros in 2023 for Facebook US data transfers violating GDPR.
  • European Commission fines Meta 200 million euros in April 2025 for DMA data combination breaches.
  • Elsevier, Cengage sue Meta on May 5, 2026, for Llama AI copyright infringement demanding damages.

What makes Meta unique

  • Meta leverages AI for precise ad placement, generating $55 billion Q1 2026 revenue.
  • Meta's vast platforms including Facebook, Instagram, WhatsApp enable unmatched audience targeting.
  • Meta invests in Llama AI model, powering unique visual underage detection across platforms.

Help us improve and share your feedback! Did you find this helpful?

Benefits

Stock Options

Company Equity

Mental Health Support

Flexible Work Hours

Company News

NPR
Apr 20th, 2026
Data center backlash becomes key voting issue ahead of US midterms

Opposition to data centres has become a significant issue ahead of the US midterm elections, with voters unseating local politicians who support them. Residents cite concerns over water pollution, noise, power demands and environmental degradation. In Missouri, four city council members lost their seats over supporting a $6 billion data centre. Similar ousters occurred in Independence, Missouri, and rural North Carolina. The backlash crosses party lines, prompting state legislatures nationwide to consider bills ranging from eliminating tax incentives to construction moratoriums. Despite generating substantial tax revenue and construction jobs, communities increasingly resist these developments. Virginia, which has the most data centres, is considering eliminating sales tax exemptions worth $1.9 billion. President Trump has acknowledged affordability concerns whilst supporting development, though his proposals lack enforcement mechanisms.

Ars Technica
Apr 17th, 2026
Meta raises Quest VR headset prices by up to $100 as its own $115B AI spending drives component costs

Meta is raising prices for its Quest VR headsets by $50–$100 (12–20%) from 19 April, citing a global surge in memory chip prices affecting consumer electronics. However, Meta's own spending priorities have contributed to this component shortage. The company plans to spend $115–$135 billion on capital expenditures this year, up from $72 billion in 2025 and $28 billion in 2023, with most investment directed towards AI infrastructure. This includes $21 billion for data centre company CoreWeave and $10 billion for an El Paso data centre. Meta's AI spending forms part of $630 billion in industry-wide AI infrastructure investment pledged for 2026, driving up prices for RAM and GPUs. Meanwhile, Meta is reportedly planning spending cuts of up to 30% for its metaverse division, which has accumulated $73 billion in losses.

Yahoo Finance
Apr 14th, 2026
Meta partners with Broadcom for custom AI chips through 2029

Meta and Broadcom have announced a strategic partnership under which the chipmaker will provide technology supporting Meta's training and inference accelerator chips through 2029. The deal extends Meta's custom AI chip development plans as the social media giant continues to invest in artificial intelligence infrastructure.

CNBC
Apr 14th, 2026
Meta commits to 1 gigawatt of custom AI chips with Broadcom through 2029

Meta and Broadcom have announced an extended partnership through 2029 for designing Meta's custom AI accelerators. Meta has committed to deploying one gigawatt of its training and inference accelerators under the agreement. The deal expands an existing collaboration between the two companies focused on Meta's in-house chip development. As part of the arrangement, Broadcom CEO Hock Tan has agreed to leave Meta's board of directors. Broadcom shares rose 3% in extended trading following the announcement. The partnership underscores Meta's continued investment in custom silicon to power its artificial intelligence infrastructure and reduce reliance on third-party chip suppliers.

The Associated Press
Apr 14th, 2026
Meta and Broadcom partner on industry-first 2nm AI chip with multi-gigawatt rollout

Broadcom and Meta have announced a multi-year strategic partnership to support Meta's AI compute infrastructure through 2029. The collaboration centres on Meta Training and Inference Accelerator (MTIA) chips, with an initial deployment exceeding one gigawatt as part of a sustained multi-gigawatt rollout. The partnership will deliver what the companies call the industry's first 2nm AI compute accelerator. Broadcom will provide its XPU platform for chip co-development and advanced Ethernet technologies for networking across Meta's expanding AI compute clusters. The technology will underpin Meta's deployment of generative AI features across WhatsApp, Instagram and Threads. Meta aims to deliver what it calls "personal superintelligence" to billions of users globally. Broadcom CEO Hock Tan will transition from Meta's board to an advisory role focusing on Meta's custom silicon roadmap.

INACTIVE