Senior Cloud Infrastructure Engineer
Posted on 9/15/2023
Data management & visualization platform
Company Overview
Splunk's mission is to address the challenges and opportunities of managing massive streams of machine-generated big data. Splunk is the leading software platform for machine data that enables customers to gain real-time Operational Intelligence.
Locations
Remote
Experience Level
Entry
Junior
Mid
Senior
Expert
Desired Skills
Agile
AWS
Bash
Google Cloud Platform
Jenkins
Git
Management
Microsoft Azure
Terraform
Kubernetes
Python
Categories
DevOps & Infrastructure
Software Engineering
Requirements
- Opportunities to develop and grow as an engineer. We are always expanding into new areas, working with open-source projects and contributing back, and exploring new technologies
- A team of incredibly capable and dedicated peers, all the way from engineering to product management and customer support
- Breadth and depth. You are interested in working in an area that dynamically scales to meet the needs of Splunk's cloud offering. You want to go deep into optimizing how we automate every manual process and tedious task we encounter
- Growth and mentorship. We believe in growing engineers through ownership and leadership opportunities. We also believe that mentors help both sides of the equation
- A stable, collaborative, and supportive work environment. We work in an open environment, work together to get things done, and adapt to the changing needs of the team. We keep it real by being open and honest. We are a collaborative team that understands the value of open communication; it's how we interact with our customers
- Balance. We don't expect people to work 12-hour days. We want you to be successful outside of work too. Want to work from home sometimes? No problem. We trust our colleagues to be responsible with their time and commitment, and believe that balance helps cultivate a positive environment
- Fun. We are committed to having every employee want to give it their all, be respectful and a part of the family, and have a smile on their face while doing it
- You have experience deploying critical cloud infrastructure solutions using state-of-the-art engineering principles
- Experience handling SaaS and/or On-prem applications for a large customer base
- Experience with one or more of the public cloud providers, such as AWS, Azure, or GCP
- Experience with scripting for automation using Bash/Python/Go
- Experience with CI/CD (GitLab, Jenkins, etc.) and automation (Terraform, etc.) technologies and tools
- Ability to coach/mentor junior engineers on the team, provide technical direction, perform design/code reviews and champion engineering standard methodologies
- 8+ years of relevant industry experience; Bachelor's degree in Computer Science, Computer Engineering or equivalent work experience
Responsibilities
- Extend and maintain our Kubernetes infrastructure across various Cloud Providers. A major focus of this role is developing and evolving our Kubernetes-based control plane. Our goal is to enable scalable and secure deployments on various cloud computing platforms (Amazon AWS, Google Cloud Platform, Microsoft Azure)
- Deploy software that drives improvements in availability, performance, efficiency, and security. Technologies include DNS management, GSLB/Anycast, Load Balancing (IPVS, L4, L7), API Gateways and Service Meshes, IP address management (IPAM), and TLS infrastructure for securing network communication
- Automation. You are driven by adopting and mastering new technologies to automate routine tasks and free up time for innovation. You will be utilizing a variety of languages used in systems programming, ranging from Go to Python to Terraform
- Distributed systems programming. Experience working on and debugging distributed systems such as CDNs, Kubernetes infrastructure, databases, and data replication
- Technical excellence. You routinely use continuous delivery, testing, and security best practices
- Operational excellence. Diving into data excites you and you make decisions based on numbers rather than assumptions. If an issue arises, you strive to be alerted before our customers notice
- Keeping calm and carrying on. Capable of navigating through a product outage, skilled in identifying performance bottlenecks, spotting anomalous system behavior, and figuring out the root cause of incidents
- Desire to learn and adapt. Our agile team has a lot of projects going on at once, and you will have the opportunity to learn to navigate the code and features. You will constantly be learning new areas and new technologies
- Passion. Our customers are passionate about Splunk, and we want the same from our engineers. In this role you will actively own your work and be excited about your projects