GPU Server
General Setup
The GPU server should be used exclusively for programs that use its GPU in some
way, such as AI workloads or data processing. The idea is to expose an API that the app
server calls whenever it needs something done on the GPU, passing any required data
to the GPU server in the request. This lets us run a variety of functions on the GPU
server while maintaining a single point of access.
After a lot of testing, Ollama just runs models better than anything else, and does a better
job of managing multiple models in VRAM than something like HuggingFace. So as much
as I hate to have an API for the API to use, that's how it's being done. Ollama runs in a
container on port 11434. The API for using the GPU server will also run in a Docker
container, listening on port 8080.
Docker will save its data to the RAID storage so large models don't shit up the OS
partition.
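As a sketch of that flow, the app server would send a request like the one below to the GPU API on port 8080, which in turn calls Ollama on 11434. The endpoint path, model name, and payload shape here are illustrative assumptions, not the actual API contract:

```shell
# Hypothetical payload from the app server to the GPU API (port 8080).
# Model name and endpoint path are placeholders, not the real contract.
payload='{"model": "llama3", "prompt": "Summarize this report."}'

# Validate the payload shape locally before sending.
echo "$payload" | python3 -m json.tool

# The actual call would look like this (commented out: needs the server up):
# curl -s http://gpu-server:8080/api/generate -d "$payload"
```

Whatever the final routes look like, keeping the payload self-describing (model plus inputs) is what lets one endpoint front several GPU functions.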
Detailed Setup
During development I've just used the ubuntu user for everything, but we will probably
want to make a compliance user for production.
RAID
Set up partitions on drives (during install)
- Create 100G ext4 partition for OS at /
- Create 2G ext4 partition for boot at /boot
- Create unformatted partition with the remaining space
- Create RAID10 array from unformatted partitions with default name md0
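For reference, if the array ever has to be (re)created after install instead of through the installer, the equivalent mdadm invocation would look roughly like this. The device names and four-drive count are assumptions about the hardware, and mdadm --create is destructive, so the sketch only prints the command:

```shell
# Assumed: four drives, with the third (unformatted) partition on each.
# mdadm --create wipes the member devices, so this only echoes the command.
cmd="sudo mdadm --create /dev/md0 --level=10 --raid-devices=4 \
/dev/sda3 /dev/sdb3 /dev/sdc3 /dev/sdd3"
echo "$cmd"
# Build progress can be watched with: cat /proc/mdstat
```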
Format RAID array to ext4 with sudo mkfs.ext4 /dev/md0
Mount the RAID array with sudo mount /dev/md0 /home/<user>/data
Get UUID for RAID with blkid | grep md0
- If nothing is returned, reboot the server
Edit /etc/fstab to mount the RAID array on startup by adding UUID=<UUID of md0>
/home/<user>/data ext4 defaults 0 0
- After editing /etc/fstab, reload systemd with sudo systemctl daemon-reload so the
change is picked up without a reboot
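Putting that together, the added /etc/fstab line would look like the following, with the array mounted at the same point used in the mount step above. The UUID is a placeholder for the value blkid returned; the trailing 0 0 disables dump and boot-time fsck ordering:

```
UUID=xxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx  /home/<user>/data  ext4  defaults  0  0
```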
Nvidia AI Workbench
The entire purpose of installing this is to get the correct GPU drivers and supporting
tools like nvidia-container-toolkit (which lets Docker use the GPU) in a single
download. We won't use this at all in production.
Install Nvidia AI Workbench with this script
sudo mkdir -p $HOME/nvwb/.nvwb/bin && \
sudo curl -L [Link]cli/$(curl -L -s [Link]cli/LATEST)/nvwb-cli-$(uname)-$(uname -m) \
--output $HOME/nvwb/.nvwb/bin/nvwb-cli && \
sudo chmod +x $HOME/nvwb/.nvwb/bin/nvwb-cli && \
sudo -E $HOME/nvwb/.nvwb/bin/nvwb-cli install
Accept the terms and conditions
Choose to install Docker instead of Podman
Choose to install the GPU drivers
Reboot the system
Docker
Ollama has to download the models somewhere, and they tend to take up a lot of space,
so we have to change Docker's settings to save data in the large RAID partition.
Make a directory for docker in the large RAID directory with sudo mkdir
/home/<user>/data/docker
Edit /etc/docker/daemon.json and add the key "data-root":
"/home/<user>/data/docker" to the top-level dictionary
Restart Docker with sudo service docker restart