Simplify Logo

Full-Time

Deep Learning Performance Engineer

Posted on 6/21/2024

Genmo

Genmo

1-10 employees

AI-powered video creation services

Senior, Expert

San Francisco, CA, USA

Category
Applied Machine Learning
Deep Learning
AI Research
AI & Machine Learning
Required Skills
CUDA
Pytorch
Requirements
  • Prior experience working on GPUs / CUDA
  • Experience with profiling tools such as the PyTorch profiler
  • Extensive experience in optimizing deep learning models and kernels
  • Knowledge of distributed training strategies and techniques
  • Familiarity with advanced model optimization techniques
  • Strong problem-solving skills and ability to work in a fast-paced environment
  • Passion for artificial intelligence and a drive to push the boundaries of what is possible
Responsibilities
  • Model-Level Performance Optimization: Profile and analyze the performance of deep learning models on the cluster. Identify performance bottlenecks related to arithmetic intensity, memory access patterns, and communication overhead.
  • Kernel Optimization and Tuning: Optimize custom CUDA kernels for specific operations in diffusion models. Utilize profiling tools to guide kernel optimization and achieve maximum GPU utilization. Use graph compilation to perform horizontal/vertical fusion of kernels and kernel rewrites for optimized operators like FlashAttention. Utilize CUDA and Triton for kernel development and optimization.
  • Distributed Training Optimization: Fine-tune distributed training strategies (e.g., sharding, parallelism) for optimal performance on the cluster. Experiment with and implement advanced techniques like model parallelism, pipeline parallelism, and tensor parallelism. Optimize memory footprint of training with methods like rematerialization.

This company stands out as a premier workplace due to its focus on AI-powered video creation, attracting professionals passionate about cutting-edge technology in media production. They offer a dynamic and innovative environment where AI merges with creative video production, positioning themselves at the forefront of industry trends and technological advancements in the digital media space. Employees benefit from working within an intellectually stimulating atmosphere that continuously evolves to incorporate the latest in artificial intelligence applications.

Company Stage

N/A

Total Funding

N/A

Headquarters

N/A

Founded

N/A

Growth & Insights
Headcount

6 month growth

75%

1 year growth

133%

2 year growth

133%
INACTIVE