Site Reliability Engineer
Sre
Confirmed live in the last 24 hours
MetroStar

201-500 employees

Digital services and management consulting for public sector
Company Overview
MetroStar stands out as a digital services and management consulting company with a strong focus on user-centric capabilities and a diverse team of coders, creatives, and strategists. The company's competitive advantage lies in its ability to offer high-quality, customizable solutions like the ML Platform Onyx and DevSecOps solution, Quartz, which cater to diverse mission needs and effectively integrate with data workflows. MetroStar's culture of acknowledging and appreciating team efforts fosters a positive work environment, making it an attractive place for potential employees.
Consulting
Government & Public Sector

Company Stage

N/A

Total Funding

$4.4M

Founded

1999

Headquarters

Reston, Virginia

Growth & Insights
Headcount

6 month growth

11%

1 year growth

22%

2 year growth

44%
Locations
Washington, DC, USA
Experience Level
Entry
Junior
Mid
Senior
Expert
Desired Skills
Kubernetes
Microsoft Azure
Git
Development Operations (DevOps)
Splunk
CategoriesNew
DevOps & Infrastructure
DevOps Engineering
Site Reliability Engineering
IT & Security
Cloud Engineering
Requirements
  • Minimum of 8 years of software development experience with a minimum of 2 years with Kubernetes and strong understanding of SRE principles for highly scalable and reliable systems.
  • Experience implementing proactive alert / monitoring workflows and dashboards based on Kubernetes metrics, logs, and traces using Prometheus, Grafana, Loki, Splunk, or similar technologies.
  • Working knowledge of industry best practices with regards to information security.
  • Knowledge of clustering, high-availability, replication, and disaster recovery techniques.
  • Possess a bachelor's degree and an active TS//SCI clearance.
  • Experience working in a DevSecOps environment and with Source Code repositories and CI/CD pipeline solutions such as GitLab, Azure DevOps, GitHub etc.
  • Experience with Infrastructure as Code (IaC), containerization, K8, and CI/CD Automation.
  • Experience with container orchestration tools (Rancher/RKE2, OpenShift, etc.)
  • Ability to work well on a team as well as individually.
  • Ability to work in downtown Washington, DC on client-site 5 days per week.
Responsibilities
  • Monitor platform and containerized applications.
  • Identify performance and availability risks and issues.
  • Work on the core platform to create and optimize all functions needed to establish a strong platform infrastructure.
  • Collaborate with the team and the customer daily