0 - Introduction
DeepSeek R1 is an advanced open-weight language model designed for powerful AI applications. If you’re looking for a user-friendly interface for managing and interacting with AI models, this guide will walk you through the setup process step by step. Whether you’re a developer or an AI enthusiast, you’ll have DeepSeek R1 running on Open WebUI in no time!
For this article make sure you have docker installed (learn how to install it in this article) and a decent NVIDIA GPU as we will need nearly 6GBs of VRAM.
1 - Nvidia Container Toolkit
To be able to use our Nvidia GPUs inside the container, we need to install Nvidia Container Toolkit, below you can find the commands for doing it, but if you want to follow the official nvidia instructions, you can find them here.
Start by adding the Nvidia keys to your sources:
curl -fsSL https://nvidia.github.io/libnvidia-container/gpgkey | sudo gpg --dearmor -o /usr/share/keyrings/nvidia-container-toolkit-keyring.gpg \
&& curl -s -L https://nvidia.github.io/libnvidia-container/stable/deb/nvidia-container-toolkit.list | \
sed 's#deb https://#deb [signed-by=/usr/share/keyrings/nvidia-container-toolkit-keyring.gpg] https://#g' | \
sudo tee /etc/apt/sources.list.d/nvidia-container-toolkit.list
Run apt update and install the toolkit:
sudo apt update
sudo apt install nvidia-container-toolkit
Configure nvidia-ctk to use docker:
sudo nvidia-ctk runtime configure --runtime=docker
Restart the docker daemon:
sudo systemctl restart docker
2 - Setup Open-WebUI
With docker ready, let’s now setup our Open-WebUI container, start by creating a folder for it and move into it:
mkdir ollama
cd ollama
Inside that folder, create a file named ‘docker-compose.yml’:
nano docker-compose.yml
In this file, paste the configuration below. This configuration allows us to run Open-WebUI with Ollama models. The ‘deploy’ part of this file is extremelly important as it is the part the tells docker to use all our GPUs, if you want to run this model with your CPU, remove it.
services:
open-webui:
image: 'ghcr.io/open-webui/open-webui:ollama'
restart: always
container_name: open-webui
volumes:
- './data/ui:/app/backend/data'
- './data/ollama:/root/.ollama'
ports:
- '3000:8080'
deploy:
resources:
reservations:
devices:
- driver: nvidia
count: all
capabilities: [ gpu ]
After saving the file (on nano, CTRL+O to save and CTRL+X to exit), run the following command to download the image and start your container:
docker compose up -d
3 - Create Account
Once the container starts up, you can access the WebUI at:
http://yourip:3000
On your browser you should now see the following page, click on ‘Get started’:
data:image/s3,"s3://crabby-images/a3c23/a3c23c1be5cc44b1c5d15d530a9e904485411ea2" alt=""
Type your name, email and password and click on ‘Create Admin Account’:
data:image/s3,"s3://crabby-images/54391/54391ea8dbbc81d3bf45dce4962fbcc460632dc6" alt=""
Now you are ready to use Open-WebUI!
data:image/s3,"s3://crabby-images/5e76d/5e76d544c3f55e86a7c63d31a1059da019ac3f96" alt=""
4 - Download DeepSeek
Now that we are ready to use the WebUI, let’s download the DeepSeek model. Click on your icon and go to ‘Admin Panel’.
data:image/s3,"s3://crabby-images/e4282/e42824ae813f6cb17ba8e2b1e4fd540d925ca6f0" alt=""
Go to ‘Settings’ and then ‘Models’, in this menu click on the download button at the top right:
data:image/s3,"s3://crabby-images/ec0eb/ec0eb7f74f7d85e2b92853dac7e6fbfe834f1447" alt=""
On the Manage Models menu, input the model name from ollama and hit the download button:
data:image/s3,"s3://crabby-images/cf788/cf7886094d9b11c8979be13f79fe805021d503de" alt=""
Because of the size of some models, you might need to wait a while for it to download.
data:image/s3,"s3://crabby-images/7ea38/7ea38a7e2041e7c88de9f7a7a0382f99843d7184" alt=""
Back to the home page, you can now start a chat with the DeepSeek R1 model!
data:image/s3,"s3://crabby-images/adab1/adab10055aa42afbda98111bbdcca9d80ab7dbcf" alt=""
With a tool like nvtop, check if your GPUs are being utilized correctly:
data:image/s3,"s3://crabby-images/81f4e/81f4e45f1dda8a34d391b27ac60e916dac40dd8f" alt=""
If your GPUs are not being used, there could be some causes but the main one might be not having enough VRAM, to fix this you can try to use a smaller model.
But that’s all for this article. Thanks for reading and stay tuned for more tech insights and tutorials. Until next time, and keep exploring the world of tech!