Discrepancy in CreateML annotations JSON file

Nick_Arner · August 5, 2023, 5:31pm

I’m trying to export a dataset I have created on Roboflow for local training with Apple’s CreateML application to create a CoreML model.

The dataset is quite large; it’s based off this Coco dataset from Universe. The only difference is I’m only using the following classes in the pre-processing step:

backpack
book
cell phone
chair
clock
cup
dining table
handbag
keyboard
laptop
microwave
mouse
person
remote
refrigerator
potted plant
scissors
sofa
suitcase
tv monitor

When I add the data to CreateML, I see that there are 3 classes in the Training Data section, and 20 in the Validation Data section. These should be the same number of classes.

When I look at the respective _annotations.createml.json files, they do reflect this class difference.

Nick_Arner · August 5, 2023, 5:55pm

There are many under represented classes as shown here, but, I’m only using a few compared to the total number. This is again from the original Microsoft coco dataset I linked above.

leo · August 5, 2023, 9:39pm

Hi Nick,

Are you filtering null images as well? Can you confirm that there’s definitely more than three classes tagged in the training dataset images?

I’m sure it’s highly unlikely, but if you didn’t filter out null images, I can imagine a scenario where only the three classes that you filtered (and the other ones that aren’t) made it into your training dataset.

Nick_Arner · August 7, 2023, 3:12pm

@leo It looks like I’m not filtering out null images.

I can confirm, weirdly, that classes that are present in the Valid set are not present in the Train set - see the difference in the two screenshots below:

leo · August 7, 2023, 11:45pm

Hi Nick,

Could you try filtering out null images and rebalancing your dataset to see if that solves your issues?

Nick_Arner · August 7, 2023, 11:57pm

@leo dumb question, but, how do I filter out the null images?

leo · August 8, 2023, 9:58am

Hi @Nick_Arner

No worries at all. Null filtering is a preprocessing feature. Here’s how to enable it:

When generating a new version, during step 3 “Preprocessing”, add a new preprocessing step

Screenshot 2023-08-08 at 6.55.20 PM1018×820 44.6 KB
Select Filter Null and select the percentage of null images you’d like to remove from that version of your dataset.

Screenshot 2023-08-08 at 6.55.37 PM1078×930 152 KB

Nick_Arner · August 8, 2023, 10:16pm

Thank you @leo

I tried both those things, but still getting the same result as before

leo · August 9, 2023, 11:24pm

Hi Nick,

Could you share the project Universe link (if public) or the workspace and project ID? Is it the COCO dataset Universe link you shared? (I can’t seem to click on it)

Was it the same number of classes that were getting added to a specific split, or a different number?

Nick_Arner · August 9, 2023, 11:37pm

Hey @leo; workspace is “Stitch”; project is “stitch-coco”

Was it the same number of classes that were getting added to a specific split, or a different number?
I think the same number

leo · August 10, 2023, 9:26am

Hey @Nick_Arner, let me look into this for you and I’ll let you know when I have an update.

Nick_Arner · August 10, 2023, 2:04pm

Awesome; thank you kindly

Nick_Arner · August 16, 2023, 2:57pm

Hey @leo just wanted to see if you had any news on your end - thank you!

leo · August 20, 2023, 9:03am

Hi @Nick_Arner

We’re still working on it. Can you confirm that you already tried to rebalance your dataset?

Nick_Arner · August 22, 2023, 9:52pm

@leo thank you, Leo
Yes, I did try filtering out null images and rebalancing the dataset

leo · September 6, 2023, 10:11am

Hi @Nick_Arner

Still working on your issue and haven’t found a clear cause or solution yet. In the meantime, I’ve filed a bug report so the team can take a deeper look.

Topic		Replies	Views
Problems with coco annotations Feedback feature-request	2	298	July 4, 2024
Does omitting a class eliminate the images from the dataset? Community Help	5	745	June 29, 2023
Yolo negative annotated examples do not export and not used for training Community Help	3	448	August 22, 2022
How to completely remove an annotation class and their corresponding annotations and images containing those annotations? Community Help	2	837	March 28, 2024
False annotation Community Help	1	260	June 14, 2022

Discrepancy in CreateML annotations JSON file

Related topics