How to use my GPU with the inference library

How can I use the inference library with my NVIDIA GPU? Is there a way to explicitly initialize the model to use the CUDA device (similar to PyTorch's device argument)?

Note: I have already installed inference and inference-gpu, and I can use my GPU with PyTorch.
For example, how can I change this code to use my NVIDIA GPU?

from inference import get_model

image = "https://media.roboflow.com/inference/people-walking.jpg"
model = get_model(model_id="rfdetr-medium")

while True:  # run inference repeatedly on the same image
    results = model.infer(image)

The documentation is not clear on how to use the GPU: Architecture - Roboflow Inference

Edit: inference appears to have access to the GPU, but it does not work when using the serverless API.

$ inference server start 
GPU detected. Using a GPU image.
...

Edit: the execution providers appear to include CUDA.

(Pdb) p model.onnxruntime_execution_providers
['CUDAExecutionProvider', 'OpenVINOExecutionProvider', 'CoreMLExecutionProvider', 'CPUExecutionProvider']

I figured out you need to:

model = get_model(model_id="rfdetr-medium", onnxruntime_execution_providers=["CUDAExecutionProvider"])
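A minimal sketch of how this fix could be made a little more defensive. The `pick_providers` helper below is hypothetical (not part of the inference library): it filters a preferred provider order against the providers actually available, so the code still runs on machines without CUDA. The `get_model` call is the one from this thread and assumes inference-gpu is installed.

```python
def pick_providers(available, preferred=("CUDAExecutionProvider", "CPUExecutionProvider")):
    """Hypothetical helper: keep only the preferred ONNX Runtime execution
    providers that are actually available, preserving preference order."""
    return [p for p in preferred if p in available]

# Example with the provider list seen in the pdb session above:
providers = pick_providers(
    ["CUDAExecutionProvider", "OpenVINOExecutionProvider",
     "CoreMLExecutionProvider", "CPUExecutionProvider"]
)
# providers is now ["CUDAExecutionProvider", "CPUExecutionProvider"]

# With the inference library (requires inference-gpu; commented out here
# so the sketch stays self-contained):
# from inference import get_model
# model = get_model(model_id="rfdetr-medium",
#                   onnxruntime_execution_providers=providers)
```

Listing CPUExecutionProvider after CUDAExecutionProvider gives ONNX Runtime a fallback if the CUDA provider fails to initialize.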

Hi @Kallinteris-Andreas!
That's fantastic! Glad you were able to resolve the issue!

This topic was automatically closed 7 days after the last reply. New replies are no longer allowed.