I am working on a model to detect defective gelatin capsules. I use YOLOv8 and annotate capsules in two classes: “normal” and “defective”. My dataset currently has 2,000 images and 10,000 annotations (5,000 per class). However, I have some doubts about this approach.
Defects can be very different: cracks, dents, chips, open or damaged halves. Because of this, the shape of the capsules changes, which may make classification more difficult.
Would it be better to annotate all capsules as one class and train the model separately to detect defects? Or maybe there are other approaches that work better for this task?
I would appreciate any advice!