Custom training a Grounding DINO model

Abishek_H · September 15, 2023, 7:15am

Hi,

I have found resources in order to perform zero-shot inference of most objects using Grounding DINO.

How do I fine-tune this model? For instance, I want to use Grounding DINO to look at an image of a shelf, find how many racks are present, and what are the products present in this rack? Grounding DINO is either not innately trained to recognize these specific products such as clothes, toys, SKUs etc, or does not do well in most cases.

I am seeking a script to fine-tune the base G-DINO model, so that it can recognize these specific objects. I can get a labelled dataset in whatever format is required for training.

Please help.

Thank you,
Abishek

Topic		Replies	Views
Can't find Grounding-DINO model 🤝 Community Help	5	96	December 9, 2025
Could Someone Give me Advice with Fine-Tuning a Model Using Roboflow? 🤝 Community Help bugs , feature-request , export	1	590	December 16, 2024
How to deploy GroundingDINO to mobile? 🤝 Community Help	2	360	September 9, 2023
Auto-Label AI not labelling 🤝 Community Help	6	530	October 31, 2024
Zero-shot clip example works on default DS - but on my photos , accuracy is 0.0 🤝 Community Help	0	145	May 4, 2023

Custom training a Grounding DINO model

Related topics