Devops Engineer
Confirmed live in the last 24 hours
Arize AI

51-200 employees

Machine learning observability platform for troubleshooting AI/ML issues.
Company Overview
Arize AI stands out as a leading machine learning observability platform, providing a simple and lightweight solution for ML practitioners to efficiently monitor and troubleshoot AI/ML model performance over time. The company's emphasis on ML observability, a cornerstone of operational excellence, allows teams to focus on building and deploying models with confidence, knowing they can swiftly address any arising issues. Arize AI's maturity in the market and its commitment to driving accountability further solidify its competitive advantage and industry leadership.
AI & Machine Learning

Company Stage

Series B

Total Funding

$61M

Founded

2020

Headquarters

Berkeley, California

Growth & Insights
Headcount

6 month growth

8%

1 year growth

13%

2 year growth

60%
Locations
Remote
Experience Level
Entry
Junior
Mid
Senior
Expert
Desired Skills
Chef
Bash
Kubernetes
Microsoft Azure
Python
Puppet
AWS
Terraform
Ansible
Development Operations (DevOps)
Linux/Unix
Google Cloud Platform
CategoriesNew
DevOps & Infrastructure
Software Engineering
Requirements
  • 1-2+ years experience in site reliability engineering, DevOps, and system administration
  • CS (preferred) or other technical degree, or equivalent practical experience
  • Experience working with DevOps tools such as Kubernetes, Terraform, Ansible, Puppet and Chef
  • Proficiency with scripting languages such as Python and bash
  • Experience managing cloud infrastructure in AWS, GCP, and/or Azure
  • Expertise in Linux administration, configuration, and networking protocols
Responsibilities
  • Work hands-on with the infrastructure that supports our distributed & highly scalable services in both SaaS and on-prem offerings
  • Gather requirements from customers and adapt manifests and software to support new environments
  • Use and augment monitoring tools to observe platform health, ensure performance and reliability
  • Interact with the product team to test new features and package new on-prem releases
  • Automate and optimize the release pipeline to make it as frictionless as possible
  • Exhibit continuous curiosity for emerging technology that could solve our challenges
  • Regularly have chats with industry experts, researchers, and ethicists across the ecosystem to advance the use of responsible AI
  • Culturally conscious events such as LGBTQ trivia during pride month
  • We have an active Lady Arizers subgroup
Desired Qualifications
  • Experience with on-prem deployment architectures
  • Experience running a 24x7 SaaS platform with defined SLI, SLO, SLA
  • Familiarity with operating machine learning & AI applications