Senior AI Network & Compute Consultant

Posted 6 days ago

Job Description

As a Senior AI Network & Compute Consultant at Dell Technologies, you will design HPC/AI cluster architectures, collaborate with clients, and deliver high-performance compute solutions in Europe.

Responsibilities:

  • Collaborate with Dell consultants and client stakeholders to design and implement sophisticated HPC/AI cluster and network architectures.
  • Oversee the deployment of high-performance compute environments, ensuring both technical excellence and customer satisfaction throughout each phase of the engagement.
  • Design and architect scalable, high-performance compute and network infrastructures for HPC/AI clusters.
  • Lead the implementation of advanced networking solutions, including NVIDIA InfiniBand and Ethernet technologies.
  • Deploy and manage orchestration tools such as NVIDIA Base Command Manager for cluster management and monitoring.
  • Provide expert consulting on compute and network infrastructure strategy, planning, and execution.
  • Collaborate with clients to assess technical requirements and deliver customized solutions.
  • Troubleshoot and resolve performance bottlenecks across compute, storage, and network layers.
  • Develop comprehensive documentation, including architecture diagrams, deployment guides, and operational procedures.

Requirements:

  • Proven success in designing and deploying large-scale HPC/AI clusters (NVIDIA, AMD, Intel)
  • Demonstrated expertise in NVIDIA networking technologies: InfiniBand (Quantum), Ethernet (Spectrum-X), MLNX-OS, NVIDIA Cumulus OS, and Enterprise SONiC
  • Proficient in Linux systems administration and scripting
  • Extensive hands-on experience with Base Command Manager or equivalent orchestration tools
  • Experience in consulting roles, with strong communication and documentation abilities
  • Capacity to manage multiple projects independently and deliver results within dynamic environments
  • Certifications in networking and Linux (e.g., CCNP, LFCS, NCP-AIN, NCP-AIO)
  • Experience with NVIDIA DGX systems or similar GPU platforms
  • Familiarity with container orchestration and workload management technologies (e.g., Kubernetes, Docker, Slurm)
  • Knowledge of data centre operations and cloud integration methods
  • Experience with GenAI frameworks and related tools

Dell Technologies is committed to empowering our customers with innovative products and services that enhance their performance and productivity.

Benefits:

  • Health insurance
  • Retirement plans
  • Paid time off
  • Flexible work arrangements
  • Professional development opportunities