How to get Roboflow Inference working on a desktop GPU?

I have trained a YOLOv11 model for a segmentation task and am using it for inference with the Roboflow Inference application on my Windows desktop. Inference takes too long to run (around 8 seconds). I have an RTX 4080 GPU, but it looks like the Roboflow Inference server is running only on my CPU, which might be the reason for the slowness. Please let me know how I can ensure that the GPU is being used, and share any other troubleshooting tips to investigate the slowness.

Hi @Vivek_Rajasekaran ,

Please confirm if my assumption is correct: are you running inference in Docker under Linux?

If so, you need to pass extra parameters to docker run in order to make the GPU available from within Docker.

For example, if you pulled roboflow/roboflow-inference-server-gpu:0.63.5, you can try the command below to start the inference server with GPU access:

docker run -it --rm --privileged --gpus=all roboflow/roboflow-inference-server-gpu:0.63.5
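
In practice you will also want to publish the server's HTTP port so your client on the host can reach it. A minimal sketch, assuming the server listens on its default port 9001:

docker run -d --rm --privileged --gpus=all -p 9001:9001 roboflow/roboflow-inference-server-gpu:0.63.5

The -d flag runs the container in the background; drop it if you prefer to watch the server logs in the terminal while testing.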

You can quickly verify whether the GPU is visible from within the Docker container by running nvidia-smi:

docker run -it --rm --privileged --gpus=all --entrypoint /bin/bash roboflow/roboflow-inference-server-gpu:0.63.5 -c nvidia-smi
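
If nvidia-smi shows your RTX 4080 but inference is still slow, it is worth checking whether the inference runtime inside the container can actually use CUDA. One way to sketch this check, assuming the GPU image ships ONNX Runtime with the CUDA execution provider (the exact packages inside the image may differ):

docker run -it --rm --privileged --gpus=all --entrypoint /bin/bash roboflow/roboflow-inference-server-gpu:0.63.5 -c "python -c 'import onnxruntime; print(onnxruntime.get_available_providers())'"

If CUDAExecutionProvider appears in the printed list, the server should be able to run your model on the GPU; if you only see CPUExecutionProvider, the container is falling back to CPU. Also note that the first request after startup is typically much slower because the model weights have to be downloaded and loaded, so time a few consecutive requests before concluding the server itself is slow.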

Hope this helps,

Grzegorz
