Full-Time

ML Pipeline Architect

Posted on 4/25/2024

Snorkel AI

Snorkel AI

51-200 employees

Data-centric AI platform for enterprises

Data & Analytics
AI & Machine Learning

Mid, Senior

Dallas, TX, USA

Required Skills
Kubernetes
Microsoft Azure
Data Science
Docker
AWS
Google Cloud Platform
Requirements
  • B.S. degree in Computer Science, Engineering, or comparable degree/experience
  • 5+ years experience in a Platform Engineering, ML Operations or similar role
  • Previous experience with cloud infrastructure providers such as Amazon Web Services, Microsoft Azure, or Google Cloud Platform
  • Hands-on experience with common container and container orchestration solutions, e.g., Docker and Kubernetes
  • Solid experience in developing data pipelines, deploying machine learning models as well as building model tracking, monitoring and online learning tools
  • Experience with infrastructure-as-code tools such as terraform
  • Experience with ML operation frameworks like Kubeflow, MLflow, and Airflow
  • Deep technical expertise with a scrappy mindset
Responsibilities
  • Design, prototype, and refine scalable infrastructure for operating Snorkel's machine learning pipeline at scale within customer on-premises and cloud environments
  • Bridge the gap between data science and production, ensuring Snorkel Flow ML pipelines are deployed efficiently, reliably, and securely
  • Work closely with Product Management, Product Engineering, and Support teams to resolve issues that occur during the implementation
  • Guide the team in adopting the best MLOps practices, ensuring a consistent and production-ready approach across all ML developments
  • Assist in troubleshooting and resolving ML operations related issues
  • Submit product feature requests to drive the platform forward
  • Contribute to the Snorkel user documentation, including design, testing, delivery, and training
  • Work closely with Snorkel’s field engineering teams and customers

Snorkel AI provides a data-centric AI platform that enables enterprises to programmatically label, curate, and clean data, fine-tune AI models, and build custom large language models (LLMs) and generative AI applications. The technology leverages weak supervision and programmatic labeling to accelerate AI development 10-100x faster, making AI more accessible and customizable for every enterprise.

Company Stage

Series C

Total Funding

$138.3M

Headquarters

Redwood City, California

Founded

2019

Growth & Insights
Headcount

6 month growth

16%

1 year growth

18%

2 year growth

5%

Benefits

Health - Snorkelers and their dependents are covered by comprehensive medical, dental, and vision plans.

Environment - We provide an allowance for Snorkelers to set up workstations however they want.

Wellness - Snorkelers are given a yearly wellness stipend to be used on anything relating to health and well-being.