Who We Are
Nuro exists to better everyday life through robotics. Founded in 2016, Nuro has spent eight years developing autonomous driving (AD) technology and commercializing AD applications. The Nuro Driver™ is our world-class autonomous driving system that combines AD hardware with our generalized AI-first self-driving software. Built to learn and improve through data, the Nuro Driver™ is one of the few driverless autonomous technologies on public roads today.
Nuro has raised over $2B in capital from Fidelity, Bailie Gifford, T. Rowe Price, Google, and other leading investors. We’ve partnered with some of the world’s most respected brands including Uber, FedEx, Domino’s, Walmart, Kroger, and 7-Eleven
About the Role
The ML Infra team is growing and we are looking for talented engineers to join us. Nuro is pursuing an ML-first software stack. Our team provides solutions to empower machine learning development in Nuro and optimize on-cloud training and onboard inference. Our solutions include a distributed training platform, ML compiler, model components libraries, e.t.c. We are looking for a tech lead to drive the initiatives in distributed training and inference optimization, the key technology to scale our deep learning models to achieve higher performance.
About the Work
- Lead the exploration of the latest technology of distributed training and inference optimization.
- Dive deep into the pain points of ML teams and understand model development bottlenecks.
- Define and drive the technical strategy for model optimization.
- Collaborate with ML teams to ensure smooth adoption of the model optimization solutions.
- Promote engineering best practices and cultivate a culture of continuous improvement in the team.
About You
- 6+ years of work or research experience in ML Infra, distributed training, model inference or GPU programming.
- Knowledge in using at least one deep learning framework, e.g. Tensorflow, Pytorch, JAX. Be able to understand deep learning algorithms, e.g. computer vision, NLP, behavior planning.
- Proficient in C++/C and Python. Strong coding ability.
- Strong communication and teamwork skills. Be passionate about exploring and promoting cutting edge technology.
Bonus Points
- Experience with CUDA, Cublas, Cudnn or any other Nvidia SDKs.
- Experience with model quantization or pruning.
- Experience with compilers or ML compilers (e.g. TensorRT, Triton, XLA, Clang).
- Experience with AI algorithms and hardware codesign (e.g. Depthwise Conv, Flash Attention).
- Experience with distributed training speedup (e.g. FSDP, DeepSpeed).
At Nuro, your base pay is one part of your total compensation package. For this position, the reasonably expected base pay range is between $202,350 and $303,050 for the level at which this job has been scoped. Your base pay will depend on several factors, including your experience, qualifications, education, location, and skills. In the event that you are considered for a different level, a higher or lower pay range would apply. This position is also eligible for an annual performance bonus, equity, and a competitive benefits package.
At Nuro, we celebrate differences and are committed to a diverse workplace that fosters inclusion and psychological safety for all employees. Nuro is proud to be an equal opportunity employer and expressly prohibits any form of workplace discrimination based on race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, veteran status, or any other legally protected characteristics.