
Site Reliability Engineer (Remote | Work from Home)
Posted on 2/19/2022
INACTIVE
Locations
Remote
Experience Level
Entry
Junior
Mid
Senior
Expert
Desired Skills
AWS
Bash
Development Operations (DevOps)
Docker
Linux/Unix
Management
Microsoft Azure
Perl
Puppet
Ruby
Terraform
Python
Ansible
Requirements
  • While we understand you might not have everything on the list, to be successful you are likely to have skills such as:
  • Understanding the end-to-end operations of a 'Business System' vs. its components
  • Comprehensive systems hardware and network troubleshooting experience
  • Common Linux distribution platform installation, configuration and performance tuning
  • TCP/IP networking, NIC bonding and network services configuration (DNS, NTP, DHCP, SMTP, etc.)
  • Operation and administration of virtual infrastructure, including experience with at least one hypervisor (VMware, Hyper-V, KVM, etc.)
  • Experience working with at least one major cloud provider (for example AWS, Azure or Google), including infrastructure-as-code deployment with CloudFormation, Terraform, OpsWorks, etc.
  • Ability to describe IaaS, PaaS, SaaS, pros and cons of each, use cases for virtualization and cloud
  • Administration of web servers and supporting technologies, including network load balancers
  • Familiarity with microservices and container technologies (Docker, LXC, etc.)
  • Scripting and automation of administrative tasks using Bash, Perl, Python, Ruby, etc.
  • Experience with the design, development and deployment of at least one major configuration management framework (e.g. Puppet, Ansible, Chef)
  • System and application error investigation, troubleshooting of access/availability issues including deep multi-system root cause analysis
  • Experience managing networking devices, such as switches and firewalls from a variety of vendors
  • Knowledge of DevOps tools, processes, and culture
  • Exposure and operational experience with network monitoring systems (NMS)
  • Ability to pick up new technologies quickly
  • Working Schedule: Friday, Saturday, Sunday & Monday
Responsibilities
  • Operate, maintain and administer solutions that contribute to the operational efficiency, availability and visibility of customer infrastructure
  • Plan maintenance activity, design documentation and standard procedures
  • Provide Root Cause Analysis reports for outages/incidents (ITIL - Problem Management)
  • Observe and provide feedback on the current state of the client's infrastructure, and identify opportunities to improve resiliency, reduce the occurrence of incidents and automate repetitive administrative and operational tasks
  • Contribute to, improve and maintain team documentation about client systems and infrastructure, procedures, policies and schedules
  • Gather and document information about client environments through audit activities, and analyze the information to identify opportunities for improvement and application of best practices
  • Work collaboratively with teammates to contribute to the continuous improvement of our working culture
  • Act as a technology leader for clients, as well as drive client discussions on technology road maps
  • Participate in an on-call rotation in an escalation capacity
Pythian

201-500 employees

IT consulting and managed services provider
Company Overview
As a global IT services company, Pythian helps organizations transform by leveraging data, analytics, and the cloud. Pythian designs, implements, and supports customized solutions for the toughest data challenges. It has delivered thousands of projects to the cloud because it believes the cloud is the very best place to be.
Company Values
  • Stand apart, stand above - A key differentiator for us is that we’re technology agnostic, which allows us to partner with our clients to address their pain points and needs and to offer flexible contract models.