Full-Time

Site Reliability Engineer/Customer Engagement

Posted on 10/31/2025

TalentWerx

TalentWerx

11-50 employees

Talent acquisition services focusing on cost-efficiency and speed

Compensation Overview

$115.2k - $146k/yr

Dayton, OH, USA + 2 more

More locations: Washington, DC, USA | Bedford, MA, USA

In Person

Onsite at Dayton, OH; Hanscom AFB, Bedford, MA; Bolling AFB, Washington, DC; travel/relocation may be required.

US Top Secret Clearance Required

Category
DevOps & Infrastructure (3)
, ,
Requirements
  • Clearance: Active TS/SCI
  • Education and Years of Experience: Bachelor’s degree (or equivalent experience) with 8–10 years of experience, or a Master’s degree with 6–8 years of experience. A minimum of 8 years of specialized experience is required.
  • Must have at least one of the following active certifications: CompTIA Security+, CompTIA CySA+, GICSP, GSEC, SSCP or CCNA Security
  • Proficiency with Amazon Web Services for the purposes of troubleshooting and evaluating availability and resilience
  • Proficiency with one or more cloud-based monitoring and analytics and visualization tools (i.e. Splunk, DataDog, Prometheus, Dynatrace, AppDynamics, Elasticsearch, Grafana, Kibana, Catchpoint, etc.)
  • Proficiency in one or more programming languages such as JavaScript or Python
  • Hands-on experience with containerization technologies and orchestration platforms like Kubernetes (K8s)
  • Experience in software and infrastructure troubleshooting, conducting resilience and availability analysis, monitoring systems using metrics, traces, and logs, and contributing to full-cycle software development
  • Proficiency in querying and managing both SQL and NoSQL databases, with a strong understanding of data structures, performance optimization, and data integrity
  • Experience with system administration installing and managing enterprise applications
  • Familiarity with CI/CD pipeline technologies and DevOps practices
  • Proficiency in root cause analysis techniques and reliability testing frameworks
  • Strong analytical, problem-solving, and technical documentation skills
  • Excellent communication skills to collaborate effectively with cross-functional teams
  • Strong experience using quantitative measure (e.g. metrics collection) and visualization (e.g. charts and graphs) to inform system readiness
  • Familiarity with cybersecurity best practices and software security protocols
  • Experience in system integration and ensuring software scalability
  • Experience providing technical support to operational strategies aligned within your program and initiatives that optimize processes, enhance productivity, and ensure quality across all program functions
  • Strong ability to analyze system performance, develop implementation requirements, and ensure compliance with verification standards
  • Experience in government and defense-related systems engineering
Responsibilities
  • Engage directly with customers to clearly understand their technical requirements and integrate their needs into the onboarding process.
  • Facilitate onboarding of new customers, ensuring seamless integration into an Integrated Digital Environment while achieving high customer satisfaction.
  • Develop governance, policies, and user manuals
  • Evaluate and analyze products and components to predict and address potential failures
  • Review product designs to ensure reliability and dependability
  • Recommend product design changes or alterations to achieve required reliability levels
  • Use tools to monitor production system health, performance and availability in real-time and analyze logs and metrics
  • Analyze Production incident response events and recommend strategies to improve MTTI and MTTR
  • Document findings, including results of root cause analysis, and implement necessary changes to maintain product reliability
  • Define SLIs and SLOs to set specific targets for service performance and availability
  • Work with engineering and development teams to design and implement reliability improvements
  • Determine maintenance requirements and schedules for products
  • Review reliability programs and provide evaluations for decision-making
  • Conduct Continuous Integration/Continuous Delivery (CI/CD) pipeline testing and optimization to ensure robust software delivery
  • Develop and manage container orchestration platforms such as Kubernetes (K8s) to enhance system scalability and reliability
  • Ensure 100% of planned hours are worked and recorded
  • Identify and forward to your leadership any opportunities that could lead to growth within your work area
  • Ensure all contractual deliverables are met/exceeded to the customer's satisfaction
  • Completes personal PDP and attend Staff Meeting and Storytime (with camera on)
  • Within your program, build productive and positive professional relationships with clients
  • Performs other related duties as assigned
Desired Qualifications
  • AWS Solutions Architect (Associate or Professional)
  • AWS Developer (Associate or Professional)
  • AWS CloudOps Engineer (Associate or Professional)
  • DevOps Institute (or similar) Certified SRE Practitioner certification
  • Certified Kubernetes Administrator (CKA) certification
  • Experience with reliability engineering methodologies, including statistical distributions and reliability models
  • Familiarity with root cause analysis techniques and reliability testing frameworks
  • Knowledge of predictive maintenance strategies and tools
  • Ability to assess and refine system architecture for improved performance and reliability

In today's low unemployment... people are everything. The right people hired to join your team will have the greatest impact on current and future state of your organization. Talent Werx was formed to solve the existing problem with traditional talent acquisition firms- lack of focused attention, over-priced fees, and a transactional approach that will have little impact on your hiring needs when it matters most. Our distinct difference is the Speed and Cost savings impact when you partner with the Werx! Your biggest hiring challenges dealing with speed, accuracy, and cost are our opportunity to show you the Talent Werx difference.

Company Size

11-50

Company Stage

N/A

Total Funding

N/A

Headquarters

Nashua, New Hampshire

Founded

2018

Growth & Insights

Headcount

6 month growth

0%

1 year growth

0%

2 year growth

0%
INACTIVE