AI Computing Infrastructure Engineer – GPU & High-Performance Computing
Role Overview:
We are looking for a highly capable AI Infrastructure Engineer to design,
implement, and optimize GPU-accelerated compute environments that power
advanced AI and machine learning workloads. This role is critical in building
and supporting scalable, high-performance infrastructure across data centers
and hybrid cloud platforms, enabling training, fine-tuning, and inference of
modern AI models.
Key Responsibilities:
• Design and deploy AI infrastructure built on multi-GPU clusters using
NVIDIA or AMD platforms.
• Configure GPU environments using CUDA, DGX systems, and the NVIDIA
Kubernetes device plugin (see the first sketch after this list).
• Deploy and manage containerized environments with Docker,
Kubernetes, and Slurm.
• Support and optimize training, fine-tuning, and inference pipelines
for LLMs and other deep learning models.
• Enable distributed training using DDP, FSDP, and ZeRO, with support
for mixed precision (see the second sketch after this list).
• Tune infrastructure to optimize model performance, throughput, and
GPU utilization (see the third sketch after this list).
• Design and operate high-bandwidth, low-latency networks using
InfiniBand and RoCE v2.
• Integrate GPUDirect Storage and optimize data flow across Lustre,
BeeGFS, and Ceph/S3.
• Support fast data ingestion, ETL pipelines, and large-scale data
staging.
• Leverage NVIDIA’s AI stack including cuDNN, NCCL, TensorRT, and
Triton Inference Server.
• Conduct performance benchmarking with MLPerf and custom test
suites.
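
For a concrete sense of the Kubernetes GPU work above, the first sketch below shows one way to request a GPU through the NVIDIA Kubernetes device plugin using the official kubernetes Python client; the pod name, container image, and namespace are illustrative assumptions, not part of this role's specific stack.

# Minimal sketch (assumed names/image): schedule a one-shot pod that requests
# one GPU via the nvidia.com/gpu resource advertised by the device plugin.
from kubernetes import client, config

config.load_kube_config()  # use load_incluster_config() when running in-cluster

pod = client.V1Pod(
    metadata=client.V1ObjectMeta(name="gpu-smoke-test"),  # hypothetical name
    spec=client.V1PodSpec(
        restart_policy="Never",
        containers=[
            client.V1Container(
                name="cuda",
                image="nvidia/cuda:12.4.1-base-ubuntu22.04",  # placeholder image
                command=["nvidia-smi"],  # print visible GPUs and exit
                resources=client.V1ResourceRequirements(
                    limits={"nvidia.com/gpu": "1"}  # one GPU from the device plugin
                ),
            )
        ],
    ),
)

client.CoreV1Api().create_namespaced_pod(namespace="default", body=pod)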
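The second sketch illustrates the distributed-training bullet: a minimal PyTorch DistributedDataParallel step with automatic mixed precision, assuming launch via torchrun; the model, data, and hyperparameters are placeholders.

import os
import torch
import torch.distributed as dist
from torch.nn.parallel import DistributedDataParallel as DDP

def main():
    # torchrun sets RANK, LOCAL_RANK, and WORLD_SIZE for each worker process.
    dist.init_process_group(backend="nccl")
    local_rank = int(os.environ["LOCAL_RANK"])
    torch.cuda.set_device(local_rank)

    model = torch.nn.Linear(1024, 1024).cuda(local_rank)  # placeholder model
    model = DDP(model, device_ids=[local_rank])
    optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)
    scaler = torch.cuda.amp.GradScaler()  # loss scaling keeps fp16 gradients stable

    for _ in range(10):  # placeholder training loop
        inputs = torch.randn(32, 1024, device=local_rank)
        targets = torch.randn(32, 1024, device=local_rank)
        optimizer.zero_grad(set_to_none=True)
        with torch.cuda.amp.autocast():  # run the forward pass in mixed precision
            loss = torch.nn.functional.mse_loss(model(inputs), targets)
        scaler.scale(loss).backward()  # gradients are all-reduced over NCCL here
        scaler.step(optimizer)
        scaler.update()

    dist.destroy_process_group()

if __name__ == "__main__":
    main()  # launch with: torchrun --nproc_per_node=<gpus> this_script.py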
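The third sketch, for the utilization-tuning bullet, polls per-GPU utilization and memory through NVML via the nvidia-ml-py bindings (import name pynvml); the poll count and interval are arbitrary choices.

import time
import pynvml

pynvml.nvmlInit()
try:
    count = pynvml.nvmlDeviceGetCount()
    for _ in range(5):  # a few polls; a real collector would export metrics instead
        for i in range(count):
            handle = pynvml.nvmlDeviceGetHandleByIndex(i)
            util = pynvml.nvmlDeviceGetUtilizationRates(handle)  # % over last window
            mem = pynvml.nvmlDeviceGetMemoryInfo(handle)
            print(f"GPU {i}: sm={util.gpu}% mem_used={mem.used / 2**30:.1f} GiB")
        time.sleep(1)
finally:
    pynvml.nvmlShutdown()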
Required Skills & Qualifications:
• Bachelor’s or Master’s degree in Computer Science, Engineering, or a
related field.
• 3–6 years of experience in AI/ML infrastructure engineering or
high-performance computing (HPC).
• Solid experience with GPU-based systems, container orchestration, and
AI/ML frameworks.
• Familiarity with distributed systems, performance tuning, and
large-scale deployments.
• Expertise in modern GPU architectures (e.g., NVIDIA A100/H100, AMD
MI300), multi-GPU interconnect and memory technologies (NVLink, PCIe,
HBM), and accelerator scheduling for AI training and inference workloads.
• Good understanding of modern AI model architectures, including LLMs
(e.g., GPT, LLaMA), diffusion models, and multimodal encoder-decoder
frameworks, with awareness of their compute and scaling
requirements.
• Knowledge of leading AI/ML frameworks (e.g., TensorFlow, PyTorch),
NVIDIA’s AI stack (CUDA, cuDNN, TensorRT), and open-source tools like
Hugging Face, ONNX, and MLPerf for model development and
benchmarking.
• Familiarity with AI pipelines for supervised/unsupervised training,
fine-tuning (PEFT/LoRA/QLoRA), and batch or real-time inference, with
expertise in distributed training, checkpointing, gradient strategies,
and mixed-precision optimization (a minimal fine-tuning sketch follows
this list).
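
To make the fine-tuning expectation concrete, a minimal sketch of attaching LoRA adapters to a causal LM with Hugging Face PEFT follows; the checkpoint name, target modules, and hyperparameters are illustrative assumptions.

from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model

model_name = "meta-llama/Llama-2-7b-hf"  # placeholder checkpoint
model = AutoModelForCausalLM.from_pretrained(model_name)
tokenizer = AutoTokenizer.from_pretrained(model_name)

lora_config = LoraConfig(
    r=8,                                  # rank of the low-rank update matrices
    lora_alpha=16,                        # scaling factor applied to the update
    target_modules=["q_proj", "v_proj"],  # attention projections to adapt (model-specific)
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only adapter weights remain trainable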
Preferred Certifications:
• NVIDIA Certified Professional – Data Center AI
• Certified Kubernetes Administrator (CKA)
• CCNP or CCIE Data Center
• Cloud certification (AWS, Azure, or GCP)