This topic is intended to provide instructions for those attempting to run Roboflow inference on Jetson devices. The documentation for Jetson deployments on Roboflow’s website appears to be out-of-date or incomplete.
This guide was tested on a Jetson Xavier AGX running JetPack 5.1.4 with Python 3.8.
1. Inference package
Roboflow’s inference package uses the ONNX CPU runtime and does not use the Jetson’s GPU. Instead, you must install the inference-gpu package. On Jetsons running JetPack 4.6 or 5.1, installing it with pip will fail because the package dependency onnxruntime-gpu is not available from PyPI for these platforms.
2. Install onnxruntime-gpu from Jetson Zoo
The default PyPI package for onnxruntime-gpu does not work with CUDA 11.x.
Go to the Jetson Zoo (https://elinux.org/Jetson_Zoo) and follow the instructions to download and install the correct pip wheel for your Python and JetPack versions.
Note: This did not work with Python 3.10; the available wheels appear to be compiled against specific Python versions (3.8 in my case). See the example below.
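For reference, on JetPack 5.x with Python 3.8 the install looks roughly like this. The URL and wheel filename below are illustrative only — substitute the exact wheel listed on the Jetson Zoo for your JetPack/Python combination:
# download the wheel listed on the Jetson Zoo for your JetPack/Python version
# (URL and filename are placeholders — copy the real ones from the Zoo page)
wget <wheel-url-from-jetson-zoo> -O onnxruntime_gpu-1.12.1-cp38-cp38-linux_aarch64.whl
# install it into the same Python environment you will run inference from
pip install onnxruntime_gpu-1.12.1-cp38-cp38-linux_aarch64.whl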
3. Install inference-gpu with pip
With the Jetson Zoo wheel in place, pip should see the onnxruntime-gpu dependency as already satisfied, so the install should now succeed.
pip install inference-gpu
4. Check that onnxruntime (CPU) is not installed
If both onnxruntime and onnxruntime-gpu are installed, onnxruntime may fail to locate the CUDAExecutionProvider and silently fall back to the CPUExecutionProvider.
# check if onnxruntime is in package list
pip list
# uninstall
pip uninstall onnxruntime
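You can also confirm from Python that the GPU provider is visible. This is a quick sanity check using onnxruntime's standard get_available_providers API:
import onnxruntime as ort

# 'CUDAExecutionProvider' should appear in this list on a correctly
# configured Jetson; if only 'CPUExecutionProvider' shows up, the CPU-only
# onnxruntime package is likely shadowing onnxruntime-gpu
print(ort.get_available_providers())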
5. Enable all cores on Jetson device
Cores on the Jetson device are deactivated in certain power modes. To maximize performance, change the power mode to MODE_30W_ALL (on the AGX Xavier) using the command line or jtop.
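From the command line, something like the following should work. On the AGX Xavier, mode 3 corresponds to MODE_30W_ALL, but mode IDs vary by board and JetPack version, so check /etc/nvpmodel.conf on your device first:
# query the current power mode
sudo nvpmodel -q
# switch to MODE_30W_ALL (mode 3 on AGX Xavier; verify the ID for your board)
sudo nvpmodel -m 3
# optionally lock clocks at their maximum for the selected mode
sudo jetson_clocks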
6. Run inference in your Python code
Congrats, you should now be able to run inference on the Jetson’s GPU. My AGX Xavier ran a Roboflow 3.0 object detection model at about 20 fps.
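As a rough sketch of what that Python code can look like, here is a minimal example based on the inference package's get_model quickstart. The model ID and API key are placeholders, and the exact API may differ between inference versions:
from inference import get_model

# placeholders — substitute your own Roboflow model ID and API key
model = get_model(model_id="your-project/1", api_key="YOUR_API_KEY")

# run object detection on a local image; infer() also accepts numpy arrays
results = model.infer("image.jpg")
print(results)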
Hope this is helpful to those struggling with dependencies!