Full-Time

AI/ML Infrastructure Engineer

Posted on 11/19/2024

Vultr

Vultr

51-200 employees

Cloud infrastructure provider with global deployment

Data & Analytics
Enterprise Software

Compensation Overview

$120k - $150kAnnually

Mid, Senior

Remote in USA

100% remote work environment with a company-wide virtual get together.

Category
DevOps & Infrastructure
Cloud Engineering
Required Skills
Bash
PHP
Python
Linux/Unix
Requirements
  • Hands-on experience working with current, high-performance GPUs, primarily NVIDIA products (e.g. NVLink, Infiniband, GRID drivers, vGPU and NVAIE)
  • In-depth, hands-on experience working with and automating bare metal internals including BIOS, BMC, firmware, NICs, Redfish/IPMI, PCIe
  • Experience with rail optimization across multiple clusters and architectures
  • Experience with Linux, package management and device drivers
  • Experience with commercial firmware
  • Experience with Python, Bash, and PHP
  • Experience with Machine Learning software
Responsibilities
  • Developing and maintaining infrastructure in bare metal and containerized environments
  • Work directly with our networking team to build scalable and supportable GPU clusters
  • Ensure excellent customer experience by ensuring consistent and reliable provisioning of GPU infrastructure
  • Build and maintain test automation of GPU-based products to ensure fast and reliable provisioning
  • Implement and maintain GPU-based solutions to meet the needs of diverse applications and computational workloads
  • Conduct in-depth benchmarking, performance testing, and troubleshooting of GPU systems to identify and resolve any hardware or software limitations
  • Working with vendors to get all supported drivers and packages
  • Working with vendors on any bugs, performance-related issues, hardware problems, and reference architectures
  • Address any hardware, software, or performance issues promptly, coordinating with vendors, technical support, and internal teams as required

Vultr provides cloud infrastructure services, specializing in high-performance SSD VPS (Solid State Drive Virtual Private Servers) that can be deployed globally in just 60 seconds. Their services include cloud compute instances, storage solutions, and networking capabilities, allowing clients to manage and deploy scalable and reliable cloud infrastructure without the need for physical hardware. Unlike many competitors, Vultr operates on a subscription-based model where clients pay for the resources they use, making it a cost-effective option for businesses of all sizes. With a strong focus on customer support, handling over 35,000 requests monthly, Vultr aims to simplify cloud computing for developers, startups, and enterprises across more than 150 countries.

Company Stage

N/A

Total Funding

N/A

Headquarters

West Palm Beach, Florida

Founded

2014

Growth & Insights
Headcount

6 month growth

0%

1 year growth

0%

2 year growth

0%
Simplify Jobs

Simplify's Take

What believers are saying

  • Vultr's recognition as Dell Technologies' AI Provider of the Year for the Americas highlights its leadership and innovation in AI cloud services.
  • The company's partnerships with industry leaders like NVIDIA and Dell Technologies enhance its capabilities in AI and machine learning, providing clients with cutting-edge technology.
  • Vultr's ability to attract high-profile clients like Athos Therapeutics and Music.AI demonstrates its reliability and effectiveness in supporting advanced AI applications.

What critics are saying

  • The competitive landscape of cloud infrastructure is dominated by giants like AWS and Google Cloud, which could limit Vultr's market share.
  • Rising prices for AI cloud compute instances could deter cost-sensitive clients, impacting Vultr's growth.

What makes Vultr unique

  • Vultr's rapid global deployment of SSD VPS in just 60 seconds sets it apart from competitors who may have longer setup times.
  • The company's focus on high-performance cloud servers and competitive pricing makes it an attractive option for businesses of all sizes.
  • Vultr's strong customer support, handling over 35,000 support requests per month, ensures clients receive timely and effective assistance, a critical differentiator in the cloud services market.

Help us improve and share your feedback! Did you find this helpful?