Summary:
I am building a workflow that runs Florence-2 locally to automatically generate annotations for images. I used the batch processing feature to run the workflow on a dataset of images. I would like to export the resulting annotations together with the corresponding images so they can be uploaded back and used to train an object detection model for a new project.
The Workflow: I have successfully configured a Florence-2 model block within Roboflow Workflows to detect humans. Using the Batch Utility, I am able to generate high-volume inference results.
The Problem: I am unable to “bridge” these VLM results back into a Roboflow project as valid ground-truth annotations. My current outputs are either:
-
Visualized Images: The bounding boxes are “burnt-in” to the image pixels, making them uneditable and unusable for training.
-
JSON Metadata: I have the coordinate data, but I cannot find a way to re-upload this JSON alongside my raw images so that Roboflow recognizes them as actual labels.
The Goal: I need a way to ensure the “reasoning” from the VLM (Florence-2) persists as structured data that can be used to train an RF-DETR model. What should I do to turn these batch outputs into editable labels in the Annotate tab?
Project Type: Object Detection
Operating System & Browser: Windows / Google Chrome
Do you grant Roboflow Support permission to access your Workspace for troubleshooting?: Yes

