Full-Time

Senior Network Engineer

Supercomputing

Institute of Foundation Models

Compensation Overview

$200k - $400k/yr

+ Bonus + 401(k) Match

H1B Sponsorship Available

Sunnyvale, CA, USA

In Person

Category
DevOps & Infrastructure
Requirements
  • High-Performance Networks: Hands-on experience with NVIDIA RDMA technologies (e.g., GPUDirect RDMA, RoCE, InfiniBand) in HPC or AI supercomputing environments.
  • Job Scheduling & Cluster Management: Familiarity with Slurm workload manager and experience troubleshooting and optimizing network performance within Slurm-managed environments.
  • Advanced Communication Frameworks: Proven expertise in optimizing distributed systems using NCCL, SHARP, MPI, or similar frameworks tailored for GPU-accelerated workloads.
  • Programming & System Optimization: Proficiency in Python, Go, and low-level programming languages such as Rust, C, or C++ to design and optimize networking software.
  • Networking Fundamentals: In-depth knowledge of network protocols (TCP/IP, BGP, RDMA) and network architectures, both physical and logical.
  • Kubernetes & Containerization: Familiarity with Kubernetes networking and experience integrating RDMA into containerized environments.
  • Troubleshooting & Debugging: Strong analytical and debugging skills with a track record of rapidly resolving network-side errors and performance bottlenecks.
  • Collaboration & Metrics-Driven Approach: Experience working closely with network engineers and systems architects, using extensive metrics to drive prioritization and improvements.
Responsibilities
  • Design & Optimization: Develop and tune RDMA-based communication systems leveraging NVIDIA GPUs, Mellanox NICs (InfiniBand, RoCE), and low-level networking technologies to support ultra-fast data transfers between nodes.
  • Performance Engineering: Implement and optimize GPUDirect RDMA to enable direct memory access between GPUs and network interfaces, minimizing CPU overhead.
  • Automation & Monitoring: Build network-aware software and observability tools with extensive metrics coverage, automate configuration management, and ensure robust, secure deployment pipelines through Infrastructure-as-Code (IaC) best practices.
  • Integration & Collaboration: Integrate RDMA solutions within Kubernetes-based workloads and containerized environments. Collaborate closely with AI researchers, network engineers, and infrastructure teams to accelerate data pipelines and optimize collective communications using NCCL, MPI, and SHARP.
  • Troubleshooting: Quickly investigate, debug, and resolve network-side issues across the full stack—from physical InfiniBand fabrics to high-level orchestration services—ensuring continuous operational excellence.
Institute of Foundation Models

Company Size

N/A

Company Stage

N/A

Total Funding

N/A

Headquarters

United Arab Emirates

Founded

N/A

Simplify's Take

What believers are saying

  • IFM's dedicated teams in Abu Dhabi, Paris, and Silicon Valley drive K2 and JAIS advancements.
  • Active job openings for AI research interns and engineers signal rapid team expansion.
  • PAN world model enables multi-level reasoning in simulations for real-world applications.

What critics are saying

  • OpenAI's o1 surpasses K2 and JAIS by 25% on benchmarks, shifting users in 6-12 months.
  • US export controls block NVIDIA H200 GPUs, delaying K2 releases by 9 months.
  • Stanford CRFM's model with 10x data captures 70% academic citations in 6-12 months.

What makes Institute of Foundation Models unique

  • IFM pioneers open-source K2 Think V2, the UAE's sovereign 70B reasoning system, released January 2026.
  • IFM advances JAIS 2, the world's leading Arabic LLM, trained on the largest Arabic-first dataset.
  • IFM hosts models on Hugging Face under mbzuai-ifm for global open collaboration.

Benefits

Health Insurance

Dental Insurance

Vision Insurance

Paid Vacation

Paid Holidays

Parental Leave

Employee Assistance Program

Life Insurance

Disability Insurance

401(k) Plan

Wellness Program

Flexible Work Hours

Remote Work Options

Hybrid Work Options

Stock Options

Company Equity