Full-Time

AI Systems Solutions Architect

d-Matrix

d-Matrix

51-200 employees

AI inference platform with IMC technology

Data & Analytics
Hardware
AI & Machine Learning

Senior, Expert

Santa Clara, CA, USA

Required Skills
Kubernetes
Tensorflow
Pytorch
Requirements
  • 15+ Years of Industry Experience and Engineering degree in Electrical Engineering, Computer Engineering, or Computer Science with extensive experience.
  • 5+ years of AI Server System experience by working on multiple projects from architecture, development, design including memory, I/O, power delivery, power management, boot process, FW and BMC/hardware management through bring-up and validation and supported through the release to production.
  • 5+ years of experience in a customer-facing role interfacing with OEMs, ODMs and CSPs.
  • Detailed understanding of server industry standard busses, such as DDR, PCIe, CXL and other high-speed IO protocol is required.
  • Ability to work seamlessly across engineering disciplines and geographies to deliver excellent results.
  • Deep understanding of datacenter AI infrastructure requirements and challenge.
  • Preferred: Hands-on understanding of AI/ML infrastructure and hardware accelerators.
  • Experience with leading AI/ML frameworks such as PyTorch, TensorFlow, ONNX, etc. and container orchestration platforms such as Kubernetes.
Responsibilities
  • Design, develop, and deploy scalable GenAI inference solutions with d-Matrix accelerators
  • Work closely with team members across architecture, engineering, product management and business developments to optimize the d-Matrix system solutions for best performance & power balance, feature set and overall system cost.
  • Work closely with Datacenter, OEM and ODM customers at early stage of product concept and planning phase, to enable the system design with partners and industrial ecosystem.
  • Influence and shape the future generations of products and solutions by contributing to the system architecture and technology through the early engagement cycle with customers and industrial partners.
  • Stay abreast of the latest advancements in GenAI hardware and software technologies and assess their suitability for integration into d-Matrix GenAI inference solutions.
  • Establish credibility with both engineering and leadership counterparts at top technology companies, communicate technical results and positions clearly and accurately, and drive alignment on solutions.

Company Stage

Series B

Total Funding

$161.5M

Headquarters

Santa Clara, California

Founded

2019

Growth & Insights
Headcount

6 month growth

0%

1 year growth

47%

2 year growth

334%