Simplify Logo


Cloud Operations Engineer-US Remote

Confirmed live in the last 24 hours



201-500 employees

Unified data platform for cloud and on-premises

Data & Analytics


Remote in USA

Quality Control & Compliance
Supply Chain Management
Operations & Logistics
Required Skills
  • Bachelor’s degree in computer science or equivalent experience related to Information Technology
  • 3+ years’ experience as a Cloud Operations Engineer or Site Reliability Engineer managing a SaaS / PaaS / IaaS environment
  • Experience managing Linux and Windows Server
  • Experience with the configuration and automation toolsets such as Terraform, Puppet, Chef and Ansible
  • Experience in monitoring a global Cloud footprint
  • Experience in the design and/or deployment of Public Cloud technologies (AWS, Azure, GCP)
  • Experience in Network Services such as DNS, DHCP, WAN Routing, TCP/IP networking and DNS, LDAP, NFS and SMTP
  • Knowledge of RDBMS systems such as MySQL and SQL Server
  • Experience with containerization and container orchestration especially with Docker, Kubernetes
  • Experience in the deployment and management of microservices
  • Experience maintaining and managing Spark, Kafka, Tomcat, Cassandra, and MySQL based systems
  • Proficient with Python, Bash, SQL or Java
  • Solid understanding of incident management, change management, and problem management
  • Monitor and debug issues across the platforms (applications, networks, databases)
  • Administer, maintain, automate systems to ensure reliability, resiliency, scalability, and security
  • Deploy, maintain, and enhance monitoring solutions and provide technical resolutions and root cause analysis for high severity incidents
  • Work closely with Engineering and Software Development teams to design, deploy, and operate components/services that are automated, resilient, and scalable
  • Ensures that documented SSAE Policies and Procedures are followed and enforced
  • Create, update, and maintain documentation for all configurations for the production environment
  • Maintains and ensures the readiness and availability of disaster recovery environments
  • Develop and deliver timely reports on service metrics including but not limited to availability, capacity, performance, and latency across all production systems
  • Manage a 24x7x365 regional operational team

If you are seeking a workplace that champions robust data management and high-speed analytics, this company presents an appealing opportunity. Its cloud and on-premises data platform emphasizes powerful performance and flexibility, catering to diverse deployment needs while enabling impactful, data-driven business decisions. This focus not only places the company at the forefront of data analysis technology but also fosters a culture of innovation and efficacy, making it an ideal place for professionals keen on shaping the future of data-driven enterprises.

Company Stage

Series E

Total Funding



Round Rock, Texas



Growth & Insights

6 month growth


1 year growth


2 year growth