Sr Staff Data Engineer ML Platform
Posted on 10/19/2022
INACTIVE
Locations
Mountain View, CA, USA
Experience Level
Entry
Junior
Mid
Senior
Expert
Desired Skills
Apache Spark
CUDA
Data Analysis
Data Science
Data Structures & Algorithms
Google Cloud Platform
Git
Keras
Python NLTK
NumPy
Pandas
SQL
TensorFlow
Python
Requirements
- BS, MS, or Ph.D. degree in Computer Science or related field, or equivalent practical experience
- 6+ years of experience
- Knowledge of data science tools and frameworks (e.g., Python, scikit-learn, NLTK, NumPy, Pandas, TensorFlow, Keras, R, Spark)
- Understanding of machine learning principles (training, validation, etc.)
- Knowledge of data query and data processing tools (e.g., SQL)
- Computer science fundamentals: data structures, algorithms, performance complexity, and implications of computer architecture on software performance (e.g., I/O and memory tuning)
- Software engineering fundamentals: version control systems (e.g., Git, GitHub) and workflows, and the ability to write production-ready code
- Experience deploying highly scalable software supporting millions or more users
- Experience with GPU acceleration (e.g., CUDA and cuDNN)
- Experience integrating applications and platforms with cloud technologies (e.g., AWS and GCP)
- Strong oral and written communication skills
- Ability to conduct meetings and make professional presentations, and explain complex concepts and technical material to non-technical users
Responsibilities
- This key role will own the technical direction for the AI and Machine Learning Platform that powers all of Coupang's experiences
- You'll have the opportunity to work on a wide variety of projects that support the ML lifecycle from ideation to production
- Build scalable distributed training systems to power groundbreaking models, leveraging the latest algorithms, techniques and hardware available
- Create robust and scalable real-time ML inference systems that meet demanding requirements for low latency, efficiency, and workflow flexibility
- Build ergonomic ML workflow and experiment-tracking tools to accelerate development and iteration for ~50 ML Engineers & Data Scientists
- Ensure real-time detection of problems afflicting ML models via ML Observability tools
- Create ML Governance tooling to ensure that all ML use cases adhere to high-quality standards and policies
- Utilize OSS ML Platform technologies such as Polyaxon, Kubeflow, MLflow, or Metaflow
- Work side by side with customer teams including Search Relevance, Fulfillment Centers, Product Catalogs, etc. to deliver high-impact product wins