Facebook pixel

Machine Learning Operations Software Engineer
Posted on 2/2/2022
INACTIVE
Locations
Mountain View, CA, USA
Experience Level
Entry
Junior
Mid
Senior
Expert
Desired Skills
Bash
Data Structures & Algorithms
C/C++/C#
Management
Python
Responsibilities
  • Engage in capacity planning, quota management, and demand forecasting for ML workflows
  • Collaborate with partner teams to co-develop large-scale Machine Learning infrastructure for robot learning
  • Streamline/improve the scalability and efficiency of large-scale distributed ML workflows by analyzing, understanding, and fixing bottlenecks
  • Automate data upload/processing/training pipelines and develop monitoring and notification tools to track various statistics of ML pipelines
  • Respond to and resolve emergent problems in production and test ML training pipelines; write software and build automation to prevent problem recurrence
  • Keep track of major changes and trends in infrastructure and enable adoption of new frameworks and systems without significant impact on development work
Desired Qualifications
  • Experience analyzing and troubleshooting large-scale distributed systems
  • Experience in data structures, algorithms and complexity analysis
  • Fluency in Python, C++, bash
  • Knowledge of IP networking, network analysis, performance and application issues
Everyday Robots

51-200 employees

Robots for everyday assistance
Company Overview
The Everyday Robot Project is building a new type of learning robot — one that can eventually learn to help everyone, every day. The Everyday Robot Project is developing a general-purpose learning robot that can operate autonomously in unstructured environments.