Full-Time

Data Center Facility Operations Reliability Engineer

Posted on 11/15/2025

Meta

10,001+ employees

Global social networks and advertising platform

Compensation Overview

$133k - $190k/yr

+ Bonus + Equity + Benefits

Company Historically Provides H1B Sponsorship

Remote in USA

Hybrid

Category
DevOps & Infrastructure
Required Skills
Data Analysis
Requirements
  • Bachelor’s degree in Mechanical, Electrical, or Reliability Engineering, or a similar technical discipline
  • 10+ years of experience in reliability engineering (related to electrical or mechanical cooling equipment)
  • Experienced in Reliability Centered Maintenance (RCM) and Failure Mode and Effects Analysis (FMEA) activities for maintenance, process, and equipment design optimization to meet reliability requirements
  • Proficient in using Enterprise Asset Management (EAM) solutions to extract data and develop meaningful insights
  • Certifications in Maintenance & Reliability such as CMRP, CRL, CRE
  • Knowledge of relevant ISO standards (ISO 14224, ISO 17359, ISO 55000)
  • Experience with Program/Project management and cross-functional team management
Responsibilities
  • Prevent operational gaps in reliability engineering expertise across all asset management activities
  • Proactively review, identify, and mitigate risks of equipment failures, unscheduled downtime, and reactive maintenance
  • Ensure all new assets are methodically and consistently onboarded into Meta’s asset management ecosystem. Maintain rigorous asset onboarding processes to enable accurate tracking and seamless integration into maintenance programs
  • Establish and maintain a robust asset criticality framework to prioritize resources and mitigate risk
  • Lead Failure Mode and Effects Analysis (FMEA) to predict failure modes, prioritize risks, and develop preventive actions. Develop and execute Reliability Centered Maintenance (RCM) programs to balance cost, risk, and performance
  • Assess operational risks associated with asset failures, maintenance strategies, and process deviations
  • Develop, maintain, and update the Global Maintenance Library of plans, procedures, and best practices
  • Govern the review and implementation of changes to maintenance strategies and procedures
  • Ensure all maintenance changes are data-driven, risk assessed, and systematically implemented
  • Support accurate accounting of asset depreciation and amortization through timely asset tracking
  • Serve as a subject matter expert and technical lead for Enterprise Asset Management (EAM) implementation and optimization
  • Create and maintain asset useful life models to forecast replacement needs and optimize total cost of ownership
  • Provide technical leadership for condition-based, time-based, and specialized reliability maintenance initiatives
  • Analyze asset health metrics and KPIs to identify risks, predict failures, and measure reliability improvements
  • Collaborate with Operations and Maintenance to optimize scheduling and execution of maintenance activities
  • Mentor staff in reliability methodologies and foster an environment of proactive asset management
  • Sustain continuous improvement of asset management workstreams and processes
  • 25% to 50% travel domestically and internationally
Desired Qualifications
  • Experience with data center equipment such as critical cooling systems, generators, main switchboards, and network gear
  • Proficient in data analysis techniques such as Process Control, Reliability modeling and prediction, Fault Tree Analysis, Weibull Analysis, and Six Sigma (6σ) Methodology
  • Proficient in developing and executing test plans for assets

Meta Platforms Inc. runs a family of social apps including Facebook, Instagram, and WhatsApp to help people connect, share content, and participate in online communities. It also develops virtual reality hardware and experiences through Oculus and is exploring the metaverse. Most revenue comes from advertising, with tools that let businesses target audiences using data from its large user base, plus VR product sales and digital services. The company differentiates itself by owning multiple major social platforms, offering a scalable cross-platform ad platform, and investing in VR, AR, and AI to expand digital experiences and monetization opportunities.

Company Size

10,001+

Company Stage

IPO

Headquarters

Menlo Park, California

Founded

2004

Simplify's Take

What believers are saying

  • Q1 2026 revenue surged 33% to $56.3B from AI-driven ad placement improvements.
  • $1B Beaver Dam data center approved with 220MW power for 10 years from 2027.
  • Stock trades at 19x forward earnings, a discount to the S&P 500, after a 20% dip.

What critics are saying

  • UK Ofcom imposes $20B in fines under the Online Safety Act, based on 10% of global revenue.
  • Publishers win Llama AI lawsuit, forcing $1B+ damages by mid-2028.
  • TikTok erodes Instagram engagement, slashing ad growth to single digits by May 2027.

What makes Meta unique

  • Meta integrates AI on Threads for real-time trend queries like World Cup discussions.
  • Meta deploys AI to detect underage users via height and bone structure analysis.
  • Meta owns Facebook, Instagram, and WhatsApp, with advertising accounting for 97.8% of revenue in 2023.

Benefits

Stock Options

Company Equity

Mental Health Support

Flexible Work Hours

Company News

NPR
Apr 20th, 2026
Data center backlash becomes key voting issue ahead of US midterms

Opposition to data centres has become a significant issue ahead of the US midterm elections, with voters unseating local politicians who support them. Residents cite concerns over water pollution, noise, power demands and environmental degradation. In Missouri, four city council members lost their seats over supporting a $6 billion data centre. Similar ousters occurred in Independence, Missouri, and rural North Carolina. The backlash crosses party lines, prompting state legislatures nationwide to consider bills ranging from eliminating tax incentives to construction moratoriums. Despite generating substantial tax revenue and construction jobs, communities increasingly resist these developments. Virginia, which has the most data centres, is considering eliminating sales tax exemptions worth $1.9 billion. President Trump has acknowledged affordability concerns whilst supporting development, though his proposals lack enforcement mechanisms.

Ars Technica
Apr 17th, 2026
Meta raises Quest VR headset prices by up to $100 as its own $115B AI spending drives component costs

Meta is raising prices for its Quest VR headsets by $50–$100 (12–20%) from 19 April, citing a global surge in memory chip prices affecting consumer electronics. However, Meta's own spending priorities have contributed to this component shortage. The company plans to spend $115–$135 billion on capital expenditures this year, up from $72 billion in 2025 and $28 billion in 2023, with most investment directed towards AI infrastructure. This includes $21 billion for data centre company CoreWeave and $10 billion for an El Paso data centre. Meta's AI spending forms part of $630 billion in industry-wide AI infrastructure investment pledged for 2026, driving up prices for RAM and GPUs. Meanwhile, Meta is reportedly planning spending cuts of up to 30% for its metaverse division, which has accumulated $73 billion in losses.

Yahoo Finance
Apr 14th, 2026
Meta partners with Broadcom for custom AI chips through 2029

Meta and Broadcom have announced a strategic partnership under which the chipmaker will provide technology supporting Meta's training and inference accelerator chips through 2029. The deal extends Meta's custom AI chip development plans as the social media giant continues to invest in artificial intelligence infrastructure.

CNBC
Apr 14th, 2026
Meta commits to 1 gigawatt of custom AI chips with Broadcom through 2029

Meta and Broadcom have announced an extended partnership through 2029 for designing Meta's custom AI accelerators. Meta has committed to deploying one gigawatt of its training and inference accelerators under the agreement. The deal expands an existing collaboration between the two companies focused on Meta's in-house chip development. As part of the arrangement, Broadcom CEO Hock Tan has agreed to leave Meta's board of directors. Broadcom shares rose 3% in extended trading following the announcement. The partnership underscores Meta's continued investment in custom silicon to power its artificial intelligence infrastructure and reduce reliance on third-party chip suppliers.

The Associated Press
Apr 14th, 2026
Meta and Broadcom partner on industry-first 2nm AI chip with multi-gigawatt rollout

Broadcom and Meta have announced a multi-year strategic partnership to support Meta's AI compute infrastructure through 2029. The collaboration centres on Meta Training and Inference Accelerator (MTIA) chips, with an initial deployment exceeding one gigawatt as part of a sustained multi-gigawatt rollout. The partnership will deliver what the companies call the industry's first 2nm AI compute accelerator. Broadcom will provide its XPU platform for chip co-development and advanced Ethernet technologies for networking across Meta's expanding AI compute clusters. The technology will underpin Meta's deployment of generative AI features across WhatsApp, Instagram and Threads. Meta aims to deliver what it calls "personal superintelligence" to billions of users globally. Broadcom CEO Hock Tan will transition from Meta's board to an advisory role focusing on Meta's custom silicon roadmap.

INACTIVE