Staff Infrastructure Engineer
Updated on 11/9/2023
Plume

501-1,000 employees

SaaS experience platform for Communications Service Providers
Company Overview
Plume's ambition is to build on their diversity of lived experiences to create more equitable opportunities for underserved communities through community engagement and education.
Data & Analytics
Hardware

Company Stage

Series F

Total Funding

$747.4M

Founded

2015

Headquarters

Palo Alto, California

Growth & Insights
Headcount

6 month growth

2%

1 year growth

-4%

2 year growth

3%
Locations
Palo Alto, CA, USA
Experience Level
Entry
Junior
Mid
Senior
Expert
Desired Skills
Apache Spark
AWS
Development Operations (DevOps)
Linux/Unix
Operating Systems
Terraform
Kubernetes
NoSQL
Categories
DevOps & Infrastructure
Software Engineering
Requirements
  • 10+ years of experience in Programming/Scripting for systems
  • 10+ years of experience with modern cloud infrastructure, preferably AWS
  • Expert experience with modern Linux Operating systems (Enterprise Linux or Debian-based)
  • Expert experience with Production Troubleshooting
  • Kubernetes knowledge (operating clusters)
  • Basic Terraform Knowledge
  • Experience both setting up and utilizing Monitoring and observability tools
  • Ability and desire to perform in a Technical Leadership capacity for a small team
  • Bachelor’s degree in a related field or equivalent experience
Responsibilities
  • Design, implement, and upgrade Cloud Network Infrastructure
  • Design and Implement Cloud Governance for a mid-size organization with various service teams
  • Write automation code for Networking and Role Based Access Control
  • Coordinate team project roadmap with DevOps Leadership
  • Triage support queue for the team
  • Focus on production operations and on-call duties
  • Provision and scale multi-datacenter Kubernetes Infrastructure and Applications (EKS)
  • Deploy Software in multiple Production Environments
  • Own monitoring and alerting for production systems, including improvements and changes
  • Contribute improvements to the current automation
  • Contribute improvements to our on-call process and alerting
Desired Qualifications
  • Previous experience leading other engineers in small teams or mid to long-term projects
  • Passion for organization, communication, and data-driven results
  • Experience troubleshooting production performance, service degradation, or outage issues at scale
  • Experience with Infrastructure Troubleshooting in VMs and/or Bare Metal (SSH/Linux)
  • Advanced Kubernetes knowledge
  • Advanced Terraform knowledge
  • Experience operating NoSQL Databases in Production
  • Experience operating Relational Databases in Production
  • Knowledge of distributed systems: Spark/EMR/batch systems, storage, databases, or streaming
  • General configuration management experience