Using Slicer with tracking only annotates first frame

akella · October 8, 2024, 7:48am

Hi! I'm new, so I admit I might just be making a simple mistake. Still, I would be grateful for your guidance!
TASK: I want to track birds in the sky.
Here is my input (1920x1080):
Here is how small they are:

I took the tracking example from the repository:
supervision/tree/develop/examples/tracking
Because the objects are small, I used the slicer (sv.InferenceSlicer) and adapted the example like this:

import argparse
import os

from inference.models.utils import get_roboflow_model
from tqdm import tqdm
from inference import get_model

import supervision as sv
model = get_model(model_id="yolov8n-640")

def slicer_callback(slice) -> sv.Detections:
    result = model.infer(slice)[0]
    detections = sv.Detections.from_inference(result)
    return detections


def process_video(
    roboflow_api_key: str,
    model_id: str,
    source_video_path: str,
    target_video_path: str,
    confidence_threshold: float = 0.01,
    iou_threshold: float = 0.5,
) -> None:
    model = get_roboflow_model(model_id=model_id, api_key=roboflow_api_key)

    tracker = sv.ByteTrack()
    box_annotator = sv.BoundingBoxAnnotator()
    label_annotator = sv.LabelAnnotator()
    frame_generator = sv.get_video_frames_generator(source_path=source_video_path)
    video_info = sv.VideoInfo.from_video_path(video_path=source_video_path)
    slicer = sv.InferenceSlicer(
        callback=slicer_callback,
        slice_wh=(256, 256),
        overlap_ratio_wh=(0.2, 0.2),
    )

    with sv.VideoSink(target_path=target_video_path, video_info=video_info) as sink:
        for frame in tqdm(frame_generator, total=video_info.total_frames):
            # results = model.infer(
            #     frame, confidence=confidence_threshold, iou_threshold=iou_threshold
            # )[0]
            # detections = sv.Detections.from_inference(results)
            
            detections = slicer(frame)


            detections = tracker.update_with_detections(detections)

            annotated_frame = box_annotator.annotate(
                scene=frame.copy(), detections=detections
            )

            annotated_labeled_frame = label_annotator.annotate(
                scene=annotated_frame, detections=detections
            )

            sink.write_frame(frame=annotated_labeled_frame)

I tried this code on a video of people walking, and it tracked them across the whole video.
Yet when I try it on my birds video, only the first frame gets annotated, and then nothing.
Is it due to object size? Or could there be another reason?
Here is the annotated result I get (Imgur link).

OS: Mac OS
I ran it with:

python inference_example.py     --roboflow_api_key ROBOFLOW_KEY     --source_video_path input.mp4     --target_video_path tracking_result.mp4

P.S. I'm also not sure whether this counts as a Roboflow app; sorry if not.
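
For a sense of what the slicer settings mean on a 1920x1080 frame, here is a minimal sketch that lays out overlapping tiles. The tiling rule (fixed stride, last tile snapped to the frame edge) and the helper names `tile_offsets` and `tile_boxes` are assumptions made for illustration; this is not necessarily how sv.InferenceSlicer computes its slices internally.

```python
def tile_offsets(length, tile, overlap_ratio):
    """Start offsets of overlapping tiles that cover [0, length)."""
    # Stride is the tile size minus the overlapping part (assumed rule).
    stride = max(1, tile - int(tile * overlap_ratio))
    offsets = list(range(0, max(length - tile, 0) + 1, stride))
    # Add one more tile flush with the edge if the last one falls short.
    if offsets[-1] + tile < length:
        offsets.append(length - tile)
    return offsets


def tile_boxes(frame_wh, slice_wh, overlap_ratio_wh):
    """Return (x1, y1, x2, y2) tiles covering a frame of size frame_wh."""
    frame_w, frame_h = frame_wh
    slice_w, slice_h = slice_wh
    xs = tile_offsets(frame_w, slice_w, overlap_ratio_wh[0])
    ys = tile_offsets(frame_h, slice_h, overlap_ratio_wh[1])
    return [
        (x, y, min(x + slice_w, frame_w), min(y + slice_h, frame_h))
        for y in ys
        for x in xs
    ]


# The two configurations that appear in this thread.
boxes_256 = tile_boxes((1920, 1080), (256, 256), (0.2, 0.2))
boxes_512 = tile_boxes((1920, 1080), (512, 512), (0.5, 0.5))
print(len(boxes_256), len(boxes_512))
```

Each tile corresponds to one call to the slicer callback, so the tile count gives a rough idea of how many inference calls a single frame costs under each setting.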

akella · October 9, 2024, 7:39am

So, I went a bit further and trained a model on similar sky images.
When I put my video into the Visualize interface, it tracks the birds successfully. Or rather, it detects them. Here is the video.
But when I try to do tracking with the same script, it again only annotates the first frame.

def process_video(
    roboflow_api_key: str,
    model_id: str,
    source_video_path: str,
    target_video_path: str,
    confidence_threshold: float = 0.01,
    iou_threshold: float = 0.5,
) -> None:
    model = get_roboflow_model(model_id="birds-nqymw/2", api_key=roboflow_api_key)

    tracker = sv.ByteTrack()
    box_annotator = sv.BoundingBoxAnnotator()
    label_annotator = sv.LabelAnnotator()
    frame_generator = sv.get_video_frames_generator(source_path=source_video_path)
    video_info = sv.VideoInfo.from_video_path(video_path=source_video_path)
    

    with sv.VideoSink(target_path=target_video_path, video_info=video_info) as sink:
        for frame in tqdm(frame_generator, total=video_info.total_frames):
            slicer = sv.InferenceSlicer(
                callback=slicer_callback,
                slice_wh=(512, 512),
                overlap_ratio_wh=(0.5, 0.5),
            )
            detections = slicer(frame)


            detections = tracker.update_with_detections(detections)

            annotated_frame = box_annotator.annotate(
                scene=frame.copy(), detections=detections
            )

            annotated_labeled_frame = label_annotator.annotate(
                scene=annotated_frame, detections=detections
            )

            sink.write_frame(frame=annotated_labeled_frame)

As I'm a beginner at this, I would be grateful for general pointers on what could cause this. Low detection confidence? Objects too small? Some script error?
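
One way to narrow down those three possibilities is to count, per frame, how many detections exist before and after the tracker. Below is a minimal sketch; `count_survivors` is a hypothetical helper, and the fake detector and tracker are stand-ins that only mimic the symptom described here (everything kept on the first frame, nothing afterwards).

```python
def count_survivors(frames, detect, track):
    """Return one (frame_index, n_detected, n_tracked) row per frame."""
    rows = []
    for index, frame in enumerate(frames):
        detections = detect(frame)
        tracked = track(detections)
        rows.append((index, len(detections), len(tracked)))
    return rows


def fake_detect(frame):
    # Stand-in detector: always reports three objects.
    return ["bird_a", "bird_b", "bird_c"]


def make_fake_track():
    # Stand-in tracker: keeps everything on the first call, then nothing.
    state = {"calls": 0}

    def track(detections):
        state["calls"] += 1
        return list(detections) if state["calls"] == 1 else []

    return track


rows = count_survivors(range(3), fake_detect, make_fake_track())
for index, n_detected, n_tracked in rows:
    print(f"frame {index}: detected={n_detected} tracked={n_tracked}")
```

With the script above, `detect` would be the `slicer` object and `track` would be `tracker.update_with_detections`, since both return objects that support `len()`. If `n_detected` drops to zero after the first frame, the detector is the problem; if it stays positive while `n_tracked` is zero, the tracker is discarding the detections. It is also worth checking which model actually runs: as posted, `slicer_callback` calls the module-level `model` (yolov8n-640), while the model loaded inside `process_video` is a local variable that the callback never uses.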

akella · October 9, 2024, 8:49pm

For anyone having the same issue: I found the root of my problem.
It is described in this issue:

github.com/roboflow/supervision

Problems with tracker.update_with_detections(detections)

opened 12:45PM - 21 May 24 UTC

CodingMechineer

bug

### Search before asking

- [X] I have searched the Supervision [issues](https…://github.com/roboflow/supervision/issues) and found no similar bug report.

### Bug

Somehow, I loose predicted bounding boxes in this line: `tracker.update_with_detections(detections)`

In the plot from Ultralytics, everything is fine. Though, after the line above gets executed, I loose some bounding boxes. In this example, I loose two.

That's the plot from Ultralytics, how it should be:

![image](https://github.com/roboflow/supervision/assets/149016088/b715953b-d6af-4d0a-85ef-a18a5479b85e)

That's the plot after the Roboflow labling, some predictions are missing:

![image](https://github.com/roboflow/supervision/assets/149016088/e025214a-b3b4-46bf-ab80-e05d9ea1cbe1)

Can somebody help me with this issue?

### Environment

- Supervision 0.20.0
- Python 3.12.3
- Ultralytics 8.2.18

### Minimal Reproducible Example

### Code:

```
import cv2
import supervision as sv
from ultralytics import YOLO

model_path = "path/to/your/model.pt"
video_path = "path/to/your/video.mp4"

cap = cv2.VideoCapture(video_path)
model = YOLO(model_path)

box_annotator = sv.BoundingBoxAnnotator()
label_annotator = sv.LabelAnnotator()
tracker = sv.ByteTrack()

while True:
    ret, frame = cap.read()

    results = model(frame, verbose=False)[0]
    print(f"CLS_YOLO-model: {results.boxes.cls}")

    results_2 = model.predict(frame,
                              show=True,  # The plot from the Ultralytics library
                              conf = 0.5,
                              save = False,
                              )

    detections = sv.Detections.from_ultralytics(results)
    print(f"ClassID_Supervision_1: {detections.class_id}")  # Between this and the next print, predictions are lost

    detections = tracker.update_with_detections(detections)  # The detections get lost here

    labels = [
        f"{results.names[class_id]} {confidence:0.2f}"
        for confidence, class_id
        in zip(detections.confidence, detections.class_id)
    ]
    print(f"ClassID_Supervision_2: {detections.class_id}")  # Here two predictions from the Ultralytics model are lost

    annotated_frame = frame.copy()
    annotated_frame = box_annotator.annotate(
        annotated_frame,
        detections
    )

    labeled_frame = label_annotator.annotate(
        annotated_frame,
        detections,
        labels
    )
    print(f"ClassID_Supervision_3: {detections.class_id}")
    print(f"{len(detections)} detections, Labels: {labels}", )

    cv2.imshow('Predictions', labeled_frame)  # The with Roboflow generated frame

cap.release()
cv2.destroyAllWindows()
```

### Prints in console:

CLS_YOLO-model: tensor([1., 1., 1., 1.], device='cuda:0') **--> Class ID's from the predicted bounding boxes**
ClassID_Supervision_1: [1 1 1 1] **--> Converted into Supervision**
ClassID_Supervision_2: [1 1] **--> After the tracker method class ID's are lost**
ClassID_Supervision_3: [1 1]
2 detections, Labels: ['Spot 0.87', 'Spot 0.86']

### Additional

_No response_

### Are you willing to submit a PR?

- [ ] Yes I'd like to help by submitting a PR!

Basically, for small, fast objects the bounding boxes in consecutive frames do not overlap, so the tracker fails to match them from one frame to the next and cannot track them continuously. So I either need a slow-motion video, or slow birds, or fat birds =)
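
To make the overlap argument concrete, here is a minimal sketch comparing the intersection over union (IoU) of a box with its own position one frame later, for a small fast object and a large one. The box sizes and the per-frame motion are made-up numbers chosen for illustration, not measurements from the videos in this thread. It also ignores any motion prediction the tracker applies before matching, so treat it as intuition rather than a model of ByteTrack.

```python
def iou(a, b):
    """Intersection over union of two (x1, y1, x2, y2) boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def shifted(box, dx, dy=0.0):
    """Return the same box moved by (dx, dy) pixels."""
    return (box[0] + dx, box[1] + dy, box[2] + dx, box[3] + dy)


# Hypothetical values: a 10x10 px bird, an 80x200 px person,
# both moving 15 px horizontally between two frames.
small_bird = (100, 100, 110, 110)
large_person = (100, 100, 180, 300)
motion_px = 15

iou_bird = iou(small_bird, shifted(small_bird, motion_px))
iou_person = iou(large_person, shifted(large_person, motion_px))
print(f"bird IoU: {iou_bird:.2f}, person IoU: {iou_person:.2f}")
```

Under these assumed numbers the bird's box has moved farther than its own width, so the two positions share no pixels, while the person's box still overlaps heavily with its previous position. That matches the observation that the same script tracks walking people but loses the birds after the first frame.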

system · October 30, 2024, 8:49pm

This topic was automatically closed 21 days after the last reply. New replies are no longer allowed.
