Internship

Machine Learning Intern

Posted on 4/3/2025

d-Matrix

d-Matrix

201-500 employees

AI compute platform for datacenters

No salary listed

Santa Clara, CA, USA

Hybrid, working onsite at our Santa Clara, CA headquarters 3 days per week.

Category
Applied Machine Learning
AI & Machine Learning
Required Skills
Python
CUDA
Pytorch
Machine Learning
Requirements
  • Currently pursuing a degree in Computer Science, Electrical Engineering, Machine Learning, or a related field.
  • Familiarity with PyTorch and deep learning concepts, particularly regarding model optimization and memory management.
  • Understanding of CUDA programming and hardware-accelerated computation (experience with CUDA is a plus).
  • Strong programming skills in Python, with experience in PyTorch.
  • Analytical mindset with the ability to approach problems creatively.
Responsibilities
  • Research and analyze existing KV-Cache implementations used in LLM inference, particularly those utilizing lists of past-key-values PyTorch tensors.
  • Investigate 'Paged Attention' mechanisms that leverage dedicated CUDA data structures to optimize memory for variable sequence lengths.
  • Design and implement a torch-native dynamic KV-Cache model that can be integrated seamlessly within PyTorch.
  • Model KV-Cache behavior within the PyTorch compute graph to improve compatibility with torch.compile and facilitate the export of the compute graph.
  • Conduct experiments to evaluate memory utilization and inference efficiency on D-Matrix hardware.
Desired Qualifications
  • Experience with deep learning model inference optimization.
  • Knowledge of data structures used in machine learning for memory and compute efficiency.
  • Experience with hardware-specific optimization, especially on custom hardware like D-Matrix, is an advantage.

d-Matrix develops an AI compute platform aimed at improving efficiency for large datacenter customers. Their main product is the digital in-memory compute (DIMC) engine, which combines computing capabilities directly within programmable memory. This design enhances performance while reducing power consumption and data movement. d-Matrix offers scalable and modular AI compute solutions using low-power chiplets that can be tailored for different applications. Unlike competitors, d-Matrix focuses on integrating compute with memory to optimize energy efficiency and performance. The company's goal is to provide high-performance AI inference acceleration for large-scale datacenter operators.

Company Size

201-500

Company Stage

Series B

Total Funding

$154M

Headquarters

Santa Clara, California

Founded

2019

Simplify Jobs

Simplify's Take

What believers are saying

  • Growing interest in brain-inspired computing aligns with d-Matrix's mission and products.
  • The rise of AI-driven chatbots increases demand for specialized AI chips like d-Matrix's.
  • Collaboration with Micron Technology enhances product development and market reach.

What critics are saying

  • Increased competition from Nvidia and startups could pressure d-Matrix's market share.
  • Dependency on Micron Technology may affect d-Matrix's supply chain and innovation pace.
  • Regulatory changes like the EU's AI Act could impose compliance costs on d-Matrix.

What makes d-Matrix unique

  • d-Matrix's DIMC engine integrates compute into programmable memory for enhanced efficiency.
  • The company offers scalable AI solutions through low-power, customizable chiplets.
  • d-Matrix focuses on power-efficient AI inference acceleration for large datacenter operators.

Help us improve and share your feedback! Did you find this helpful?

Benefits

Hybrid Work Options

Growth & Insights and Company News

Headcount

6 month growth

-1%

1 year growth

0%

2 year growth

11%
VC News Network
Jan 2nd, 2025
Corsair: The Future of Generative AI Processing Unveiled

Additionally, Micron Technology is collaborating with d-Matrix to bolster Corsair's development and expansion, ensuring that this innovative processor meets the growing demands of the industry.

DIG Watch
Nov 21st, 2024
d-Matrix debuts AI chip for chatbots

Silicon Valley firm d-Matrix has launched its first AI chip, designed to enhance AI services like chatbots and video generators.

Wall Street Pit
Nov 19th, 2024
Challengers Arise: Can AMD, Intel, and Startups Take on Nvidia's AI Chip Reign?

The design and production of these chips is a highly intricate process, as demonstrated by d-Matrix's recent launch of its AI processor, "Corsair."

VentureBeat
Dec 27th, 2023
Ai Predictions For 2024: What Top Vcs Think

Are you ready to bring more awareness to your brand? Consider becoming a sponsor for The AI Impact Tour. Learn more about the opportunities here. As 2023 draws to a close, it’s a time of reflection on the monumental advances — and ethical debates — surrounding artificial intelligence this past year. The launch of chatbots like Bing Chat and Google Bard showcased impressive natural language abilities, while generative AI models like DALL-E 3 and MidJourney V6 stunned with their creative image generation.However, concerns were also raised about AI’s potential harms. The EU’s landmark AI Act sought to limit certain uses of the technology, and the Biden Administration issued guidelines on its development.With rapid innovation expected to continue, many wonder: What’s next for AI? To find out, we surveyed leading venture capitalists investing in artificial intelligence startups for their boldest predictions on what 2024 may bring. Will we see another “AI winter” as hype meets reality? Or will new breakthroughs accelerate adoption across industries? How will policymakers and the public respond?VCs from top firms including Bain Capital Ventures (BCV), Sapphire Ventures, Madrona, General Catalyst and more offered their outlook on topics ranging from the future of generative AI to GPU shortages, AI regulation, climate change applications, and more

Yahoo Finance
Oct 24th, 2023
A new phase of the AI race is coming - and chip startup d-Matrix could be the winner

With backers like Microsoft, d-Matrix is competing against Nvidia to become the next big thing in AI chips.