Full-Time

GPU Performance Engineer

Genmo

Genmo

11-50 employees

AI-powered image and video animation platform

No salary listed

San Francisco, CA, USA

In Person

Category
Software Engineering (2)
,
Required Skills
Python
CUDA
C/C++
Requirements
  • Bachelor's or Master's degree in Computer Science, Electrical Engineering, or related field
  • 5+ years systems programming experience with 3+ years focused on GPU optimization
  • Expert proficiency with GPU profiling tools (Nsight Systems, nvprof)
  • Strong CUDA programming skills with production kernel development
  • Deep understanding of GPU architecture (memory hierarchy, SMs, warps)
  • Track record of achieving significant performance improvements (5-10x)
  • Experience with Python and C++ in production environments
Responsibilities
  • Profile and optimize GPU workloads using Nsight Systems, nvprof, and custom instrumentation
  • Write high-performance CUDA and Triton kernels for critical model operations
  • Optimize cold start latency from seconds to milliseconds for our serving infrastructure
  • Tune memory access patterns, kernel fusion, and GPU utilization
  • Collaborate with ML engineers to optimize model implementations
  • Debug performance issues across the full stack from application to hardware
  • Implement custom memory pooling and allocation strategies
  • Share optimization techniques and build performance culture across teams
Desired Qualifications
  • Experience with Triton kernel development
  • Knowledge of CUTLASS or similar high-performance libraries
  • Background in ML-specific optimizations (attention, transformers)
  • RDMA/InfiniBand optimization experience
  • Contributions to GPU libraries or frameworks
  • Low-level debugging skills (PTX/SASS reading)

Genmo provides AI-powered tools for creating and editing multimedia content, including animated images, generated and edited movies, scripts, trailers, and presentations. Its platform lets users upload an image and animate parts or generate full movies with scene planning, transitions, and overlays. It uses a subscription model with tiered access and may offer one-time purchases, serving individuals and businesses. Genmo differentiates itself by offering end-to-end AI-assisted creation across image animation, video production, scripting, and presentation design in one platform, with precise edit controls and understanding user intent to help users produce high-quality visuals faster.

Company Size

11-50

Company Stage

Early VC

Total Funding

$30M

Headquarters

San Francisco, California

Founded

2022

Simplify Jobs

Simplify's Take

What believers are saying

  • Enterprise video generation demand expanding beyond creators into marketing, training, and communications.
  • Subscription tier expansion potential with only 40,000 users against $58.4M total funding.
  • Multi-investor syndicate including NEA, Gold House Ventures, WndrCo signals strong market validation.

What critics are saying

  • Open-source Apache 2.0 license enables Runway and Kling to integrate and improve Mochi 1 directly.
  • October 2024 server crashes hours after launch exposed inadequate compute scaling and infrastructure.
  • Meta's Llama integration of video capabilities absorbs open-source improvements into free ecosystem-scale model.

What makes Genmo unique

  • Mochi 1 achieves cinematic quality with minimal human intervention from prompt to output.
  • Open-source Apache 2.0 model enables community contributions and third-party integrations at scale.
  • Efficient resource utilization delivers comparable results to competitors with significantly fewer resources deployed.

Help us improve and share your feedback! Did you find this helpful?

Benefits

Relocation Assistance

Company News

Upward Dynamism
Nov 1st, 2024
October 2024: 6 Innovative AI Trends You Shouldn't Miss

Video AI startup Genmo has just launched Mochi-1, an open-source model (Apache 2.0) designed to compete with industry leaders like Runway and Kling.

ClubNation
Oct 23rd, 2024
Genmo launches Mochi 1 Text-to-Video Generation Model, But Server Crashes Within Hours

On October 22, Genmo, the AI based video generating platform, released Mochi 1, a new state-of-the-art open-source text-to-video generation model.

Analytics India Magazine
Oct 23rd, 2024
Genmo launches Mochi 1 Text-to-Video Generation Model, But Server Crashes Within Hours

Genmo launches mochi 1 text-to-video generation model, but server crashes within hours.

VentureBeat
Oct 22nd, 2024
AI video startup Genmo launches Mochi 1, an open source rival to Runway, Kling, and others

In tandem with the Mochi 1 preview, Genmo also announced it has raised a $28.4 million Series A funding round, led by NEA, with additional participation from The House Fund, Gold House Ventures, WndrCo, Eastlink Capital Partners, and Essence VC.

Finextra Research
Nov 26th, 2023
Beyond Imagination: The Rise And Evolution Of Generative Ai Tools

Generative AI has revolutionized the way we create and interact with digital content. Since the launch of Dall-E in July 2022 and ChatGPT in November 2022, the field has seen unprecedented growth. This technology, initially popularized by OpenAI’s ChatGPT, has now been embraced by major tech players like Microsoft and Google, as well as a plethora of innovative startups. These advancements offer solutions for generating a diverse range of outputs including text, images, video, audio, and other media from simple prompts.The consumer now has a vast array of options based on their specific output needs and use cases. From generic, large-scale, multi-modal models like OpenAI’s ChatGPT and Google’s Bard to specialized solutions tailored for specific use cases and sectors like finance and legal advice, the choices are vast and varied. For instance, in the financial sector, tools like BloombergGPT (https://www.bloomberg.com/), FinGPT (https://fin-gpt.org/), StockGPT (https://www.askstockgpt.com/) or BeeBee.AI (https://www.beebee.ai/) or in the domain of legal advise, tools like Law Chat GPT (https://lawchatgpt.com/) or LegalFly (https://www.legalfly.ai/), offer niche solutions with heightened accuracy.To give an idea of what is available on the market, here is a short overview of some notable solutions available:General Chatting and Writing Assistance: : Bard (Google): https://bardai.io/ ChatFlash (Neuroflash AI): https://neuroflash.com/chatflash/ ChatGPT (OpenAI): https://chat.openai.com/chat Claude (Anthropic): https://claude.ai/ Copy AI: https://www.copy.ai Easy Peasy AI Chat: https://easy-peasy.ai Fireflies (https://www.unite.ai/goto/fireflies) GrowthBar: https://growthbarseo.com Jasper AI: https://www.jasper.ai Llama 2 (Meta): https://ai.meta.com/llama/ Notion AI: https://www.notion.so/ Perplexity: https://www.perplexity.ai Quillbot: https://quillbot.com Rytr Chat: https://rytr.me Rytr: https://rytr.me/ Wordtune: https://www.wordtune.com/ Writesonic: https://writesonic.comArt and Picture generation : this can be generating new pictures from a prompt, but also adapting existing pictures (like enhancing, removing parts of a picture, inserting object into a picture…​): Adobe Firefly: https://www.adobe.com/sensei/generative-ai/firefly.htm Artbreeder: https://www.artbreeder.com/ Auto Draw: https://autodraw.com BRIA: https://bria.ai/ Canva: https://www.canva.com/ai-image-generator/ Craiyon: https://www.craiyon.com/ Dall-E 2 (OpenAI): https://openai.com/product/dall-e-2 Deep Dream Generator: https://deepdreamgenerator.com/ DeepAI: https://deepai.org/machine-learning-model/text2img Deepswap: https://www.deepswap.ai Flair AI: https://flairai.io Fotor: https://www.fotor.com/features/ai-image-generator/ Images.ai (unite.ai): https://images.ai/ Jasper Art: https://www.jasper.ai/art Leap AI: https://www.tryleap.ai/ Lensa AI: https://prisma-ai.com/lensa MidJourney: https://www.midjourney.com/home/ Neural Doodle: https://neuraldoodle.com/ NightCafe: https://creator.nightcafe.studio PhotoSonic (Writesonic): https://writesonic.com/photosonic-ai-art-generator Pictory: https://pictory.io Pikazo: https://www.pikazoapp.com/ Pixray: https://replicate.com/pixray/text2image Prodia: https://app.prodia.com/ Remini: https://remini.ai/ RocketAI: https://www.rocketai.io/ Simplified: https://simplified.com/ Stable Diffusion (DreamStudio): https://stablediffusionweb.com/ Stablecog: https://stablecog.com/Video generation : this can be based on "prompts" resulting in video, taking a picture and identifying which part should be moving or providing a set of pictures asking the solution to interpolate the movement between 2 consecutive pictures (thus transforming a set of pictures into a movie)