Full-Time

Research Engineer

CUDA Kernel Engineering

Voltai

No salary listed

Palo Alto, CA, USA

In Person

Category
AI & Machine Learning
Required Skills
CUDA
PyTorch
Requirements
  • Writing and optimizing CUDA kernels for large-scale AI workloads (attention, routing, graph-based operations, physics-inspired operators)
  • Profiling and optimizing GPU performance for custom compute-bound or memory-bound workloads
  • Integrating custom kernels into training and inference frameworks (e.g., PyTorch, Megatron, vLLM, TorchTitan)
  • Working with the latest NVIDIA hardware and software stacks (Hopper, Blackwell, NVLink, NCCL, Triton)
Responsibilities
  • Develop, integrate, and optimize state-of-the-art CUDA kernels to power AI models that accelerate semiconductor design and verification.
  • Enable large-scale model training, inference, and reinforcement learning systems that reason about circuit layouts, generate and validate RTL, and optimize chip architectures — running efficiently across thousands of GPUs.
  • Build tools, performance benchmarks, and integration layers that push the limits of GPU utilization for compute-intensive workloads in AI-driven hardware design.
  • Release kernels and tooling as contributions to the open-source AI and high-performance computing ecosystems.
Desired Qualifications
  • Building GPU-accelerated primitives for graph reasoning, symbolic computation, or hardware simulation tasks
  • Collaborating with AI researchers and semiconductor experts to translate domain-specific workloads into high-performance GPU code
  • Releasing kernels and tooling as open-source contributions to AI and HPC ecosystems

Simplify Jobs

Simplify's Take

What believers are saying

  • Shipping industry faces tightening emission regulations, creating urgent demand for onboard renewable energy solutions.[3]
  • Global maritime sector seeks to reduce fuel costs and emissions simultaneously, positioning Voltai's technology as a cost-competitive alternative.[6]
  • Pre-seed funding of CAD $1.83M from Invest Nova Scotia validates technology and accelerates commercial deployment timeline.[5][6]

What critics are saying

  • Ocean Power Technologies and established competitors deploy proven USV solutions with superior market traction and scalability.[1]
  • Creative Destruction Lab program requires $10M+ follow-on funding by mid-2026 amid cleantech VC funding contraction.[1]
  • Philippine EV battery-swap venture with Aboitiz Power dilutes focus from core marine energy business with uncertain returns.[2]

What makes Voltai unique

  • Proprietary electrostatic generator achieves higher energy density than competing kinetic harvesters with lower cost per kWh.[1][5]
  • Compact, modular system installs on vessels without adding drag, scaling from 25W to several megawatts.[3][6]
  • Converts both ocean waves and vessel vibrations into electricity, addressing dual energy sources competitors cannot efficiently capture.[5][6]
