Full-Time

Senior Site Reliability Engineer

Tenant Services

GitLab

GitLab

1,001-5,000 employees

Unified DevOps platform for CI/CD

No salary listed

Remote in India

Remote

Category
DevOps & Infrastructure (1)
Required Skills
Chef
Bash
Kubernetes
Python
Grafana
Ruby
Docker
AWS
Go
Prometheus
Terraform
Ansible
Helm
Google Cloud Platform
Requirements
  • Experience operating highly-available distributed systems at scale, ideally in a SaaS environment with customer-facing SLAs
  • Hands-on experience with at least one major cloud provider (e.g., Google Cloud Platform or Amazon Web Services), including networking, storage, and managed services
  • Experience with Kubernetes and its ecosystem (e.g., Helm), including deploying and troubleshooting workloads
  • Experience with infrastructure as code and configuration management tools such as Terraform, Ansible, or Chef
  • Strong programming skills in at least one general-purpose language (preferably Go or Ruby) and proficiency with scripting (e.g., Shell, Python)
  • Experience with observability systems (e.g., Prometheus, Grafana, logging stacks) and using metrics and logs to troubleshoot performance and reliability issues
  • Practical exposure to data replication, backup/restore, or migration scenarios (e.g., database replication, storage replication, or Geo-like technologies) where data integrity and downtime risk must be carefully managed
  • Comfort participating in an on-call rotation, investigating incidents across the stack, and driving follow-through on corrective actions
  • Ability to engage directly with enterprise customers during migrations and incidents, including on live calls and through clear written updates
  • Ability to clearly define problems, propose options, and think beyond immediate fixes to improve systems and processes over time
  • Ability to be a “manager of one”: self-directed, organized, and able to drive work to completion in a remote, asynchronous environment
  • Strong written and verbal communication skills, with a bias toward clear, asynchronous documentation and collaboration
  • Alignment with our company values and a commitment to working in accordance with those values
Responsibilities
  • Execute Dedicated Geo migrations and cutovers end-to-end, including planning, pre-cutover validation, execution, and post-cutover verification and cleanup
  • Join the team’s shift and weekend coverage rotation for Dedicated cutovers across EMEA and US hours, and participate in the SaaS Site Reliability Engineering on-call rotation to respond to incidents that impact GitLab.com availability
  • Operate and improve the Geo operational surface for Dedicated, including: Environment preparation and data hygiene checks prior to migrations; Execution of replication, validation, and cutover procedures; Handling Geo-related escalations from Support and internal partners
  • Design, build, and maintain automation, tooling, and runbooks that make migrations, cutovers, and Geo escalations as “boring” and repeatable as possible
  • Run our infrastructure with tools such as Ansible, Chef, Terraform, GitLab CI/CD, and Kubernetes; contribute improvements back to GitLab’s product and infrastructure where appropriate
  • Build and maintain monitoring, alerting, and dashboards that: Detect symptoms early, not just outages; Track migration and cutover success rates, duration, rollback frequency, and related SLOs
  • Collaborate closely with: The core Geo team on improving Geo features and operability; Dedicated migrations and Support on migration planning, customer communications, and escalation handling; Other Infrastructure teams on capacity planning, disaster recovery, and reliability improvements
  • Contribute to readiness reviews, incident reviews, and root cause analyses, turning learnings into changes in automation, process, or product
  • Document every action, including runbooks, architecture decisions, and post-incident reviews, so your findings turn into repeatable practices and automation
  • Proactively identify and reduce toil by automating repetitive operational work and simplifying migration workflows
Desired Qualifications
  • Experience working with disaster recovery technologies
  • Experience with managed/hosted environments similar to GitLab Dedicated, including regulated or compliance-sensitive customers (e.g., SOC2, ISO)
  • Prior work on large-scale data migrations or cutovers where customer data integrity, performance, and downtime risk had to be carefully balanced
  • Hands-on experience designing and operating database replication, backup/restore, and cutover workflows (for example, PostgreSQL or cloud-managed equivalents such as AWS RDS), including planning and executing low-risk migrations for large datasets
  • Experience with multi-tenant architectures, sharding, or routing strategies in high-traffic SaaS platforms
  • Familiarity with GitLab (self-managed or SaaS), and/or contributions to open source projects

GitLab provides a unified DevOps platform that brings together the tools needed for software development, including code hosting, collaboration, CI/CD, issue tracking, and security, all in one application. It works by offering a single subscription-based platform where teams can plan, write, test, review, and deploy code through automated pipelines, reducing the need to manage separate tools. This differs from many competitors that require using a collection of separate products; GitLab consolidates these capabilities into one integrated solution, helping teams work more efficiently. The company’s goal is to help organizations speed up software delivery and improve collaboration by simplifying the DevOps process and continuously updating the platform with new features and improvements.

Company Size

1,001-5,000

Company Stage

IPO

Headquarters

San Francisco, California

Founded

2014

Your Connections

People at GitLab who can refer or advise you

Simplify Jobs

Simplify's Take

What believers are saying

  • Duo Agent Platform in public beta enables enterprise adoption of automated DevSecOps workflows.
  • GitLab 19.0 added Secrets Manager and supply-chain security, boosting value for regulated customers.
  • GitLab Flex allows flexible seat and credit reallocation, reducing procurement friction for uncertain agent usage.

What critics are saying

  • GitHub Copilot Workspace bundles AI coding and PRs, compressing GitLab differentiation within 12-18 months.
  • Atlassian deepens developer lifecycle integration, pulling planning spend and forcing price pressure in enterprise deals.
  • Agent prompt injection vulnerabilities and infrastructure strain from agentic workloads create high product and trust risks.

What makes GitLab unique

  • GitLab offers a single unified DevOps platform covering plan, build, secure, and deploy stages.
  • Its Duo Agent Platform embeds specialized AI agents for planning, coding, security, and deployment tasks.
  • Built on open source, GitLab collaborates with thousands of developers and supports 30 million+ users globally.

Help us improve and share your feedback! Did you find this helpful?

Benefits

Spending Company Money

Equity Compensation

Life Insurance

Financial Wellness

Paid Time Off

Growth and Development Benefit

GitLab Contribute

Business Travel Accident Policy

Immigration

Employee Assistance Program

Incentives

All-Remote

Part-time contracts

Meal Train

Fertility & Family Planning

Parental Leave

Growth & Insights and Company News

Headcount

6 month growth

0%

1 year growth

0%

2 year growth

0%
The Associated Press
Apr 14th, 2026
GitLab partners with Google Cloud to power AI agents with Vertex AI models

GitLab has expanded its collaboration with Google Cloud to integrate Vertex AI models into its GitLab Duo Agent Platform. Google Cloud customers can now use Vertex AI models within GitLab and count that usage towards existing cloud commitments. The integration allows AI agents in GitLab's platform to access Vertex AI's Model Garden, including Gemini models, whilst maintaining GitLab's governance controls. Agents can draw context from issues, code repositories and CI pipelines without leaving the platform, with all actions subject to existing access controls and audit logging. Self-hosted customers can use GitLab's Bring Your Own Model option to connect approved models. GitLab's AI Gateway runs on Google Cloud infrastructure including GKE and Cloud Run. The partnership aims to provide enterprises with AI agents that combine strong model performance with enterprise governance requirements.

Yahoo Finance
Mar 31st, 2026
GitLab files $207.7M ESOP shelf registration for 10.2M shares amid 54.6% yearly decline

GitLab has filed a $207.7 million shelf registration for 10.2 million Class A shares tied to an employee stock ownership plan, putting potential dilution in investors' focus. The filing comes as GitLab's shares have declined 54.61% over the past year, trading at $21.63. Recent momentum shows improvement, with one-day and seven-day returns of 3.99% and 4.70% respectively, contrasting sharply with a 90-day decline of 42.37%. One valuation narrative suggests GitLab is 85.6% undervalued, citing a fair value of $150 based on open-source technology adoption and its DevSecOps system security platform. However, this outlook faces challenges from the company's recent losses of $55.96 million and uncertainty around cybersecurity spending.

Yahoo Finance
Mar 26th, 2026
GitLab launches version 18.10 with affordable agentic AI access

GitLab has released version 18.10, making its agentic AI capabilities more accessible and affordable across the software development lifecycle. The update allows organisations on the free tier to access the GitLab Duo Agent platform through a monthly GitLab Credits commitment, enabling teams to scale development within budget constraints. GitLab Credits provides developers with visibility into AI agents and flows whilst connecting AI activity to software delivery work. On 9 March, Bernstein SocGen Group reiterated an Outperform rating with a $60 price target, citing strong adoption of GitLab Duo and durable strategic positioning. However, ARK Investment Management reduced its stake by 75% between Q3 and Q4 2025, from 3.44 million shares to 864,000 shares.

Yahoo Finance
Mar 7th, 2026
GitLab launches agentic AI programme and $400M buyback as revenue tops $1B

GitLab has launched an expanded Managed Service Provider Partner Programme focused on agentic AI deployment across the software development lifecycle, whilst introducing its first $400 million share repurchase programme. The company reported annual recurring revenue surpassing $1 billion. The MSP Partner Programme targets regulated and compliance-focused environments, allowing service providers to integrate AI into enterprise software development. The initiative positions GitLab against competitors like Microsoft GitHub and Atlassian in the DevSecOps space. The buyback, funded through cash, short-term investments and operating cash flow, follows fiscal 2027 guidance indicating slower revenue growth and lower earnings than expected. The programme reflects GitLab's strategy to strengthen its DevSecOps platform through AI-driven features whilst deploying capital through repurchases.

Yahoo Finance
Mar 6th, 2026
CrowdStrike posts first GAAP profit of $38.7M while GitLab plunges 9% on slowing growth guidance

CrowdStrike held steady in premarket trading, rising 0.65%, after reporting Q4 revenue of $1.305 billion, up 23% year-over-year, and its first GAAP net income of $38.7 million. The cybersecurity firm's annual recurring revenue reached $5.25 billion, up 24%, with net new ARR of $330.7 million in Q4, up 47%. Meanwhile, GitLab plunged 8.6% as its FY2027 revenue growth guidance of 15% disappointed investors, slowing significantly from FY2026's 25.81% growth rate. CrowdStrike's Falcon Flex platform showed strong momentum, with ending ARR of $1.69 billion, up over 120% year-over-year. The company's FY27 revenue guidance of $5.867 to $5.928 billion met market expectations. Both software stocks had been under pressure heading into earnings, with CrowdStrike down roughly 16.5% year-to-date before the results.