Train/Test Rebalancing Doesn't Randomly Select from All Classes

nickcmorgan · March 27, 2024, 1:14am

The train/test rebalancing feature still does not seem to allocate images randomly across classes into each set. This bug was described years ago here: Need Ability to randomly shuffle "merged dataset" when allocating Train/Val/Test Sets - #4 by Fredrik_T

Is there any update on this behavior? This is a serious limitation when working with a dataset of any decent size.

Edit: Another thread detailing this same bug: After Merging Datasets, Re-balancing (Train/Val/Test) excludes multiple classes in VAL split - #7 by 25benjaminli

nickcmorgan · March 27, 2024, 4:34pm

@brad tagging you directly since you’ve been involved on previous threads.

brad · March 27, 2024, 4:41pm

What’s the underlying problem you’re trying to solve?

nickcmorgan · March 27, 2024, 4:55pm

I am trying to get my train/valid/test splits to have equal representation of each class in my dataset. If I generate a new version and change the splits, the images are not randomly reassigned, so the new splits still do not contain a random sample of every class.

The only way to get a new random distribution at that point is to download the imagery and re-upload it using the web interface. This is time-consuming when dealing with nearly 10k images.
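
For readers who end up doing the split locally before re-uploading, the goal described above (every class represented in every split) is usually called a stratified split. Below is a minimal sketch in plain Python, not the platform's own rebalancing logic. It assumes each image has a single class label; the function name `stratified_split`, the `labels` mapping, and the 70/20/10 default are hypothetical choices for illustration. Multi-label images, as in object detection, would need a different grouping key.

```python
import random
from collections import defaultdict


def stratified_split(labels, fractions=(0.7, 0.2, 0.1), seed=0):
    """Shuffle images within each class, then cut each class into splits.

    labels: dict mapping image id -> one class label (hypothetical input).
    fractions: train/valid/test proportions; the remainder goes to test.
    """
    rng = random.Random(seed)

    # Group image ids by class so each class is split independently.
    by_class = defaultdict(list)
    for image_id, label in sorted(labels.items()):
        by_class[label].append(image_id)

    splits = {"train": [], "valid": [], "test": []}
    for ids in by_class.values():
        rng.shuffle(ids)
        n_train = round(len(ids) * fractions[0])
        n_valid = round(len(ids) * fractions[1])
        splits["train"] += ids[:n_train]
        splits["valid"] += ids[n_train:n_train + n_valid]
        # Very small classes may leave this slice (or valid) empty.
        splits["test"] += ids[n_train + n_valid:]
    return splits
```

Changing `seed` gives a different random assignment while keeping the per-class proportions, which is the behavior the rebalancing step is expected to have.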

system · April 17, 2024, 4:55pm

This topic was automatically closed 21 days after the last reply. New replies are no longer allowed.

Related topics

Topic	Category	Tags	Replies	Views	Activity
After Merging Datasets, Re-balancing (Train/Val/Test) excludes multiple classes in VAL split	Feedback	split-after-upload, bugs	7	2642	March 11, 2024
Need Ability to randomly shuffle "merged dataset" when allocating Train/Val/Test Sets	Feedback	split-after-upload, bugs	3	837	October 9, 2023
Splitting Multi-Class Dataset & Rebalancing create severly unbalanced Val/Test sets at Export	Community Help	split-after-upload, bugs	2	415	April 27, 2022
Problem face when using Train/Test Split	Community Help	split-after-upload, bugs	5	1666	March 10, 2023
Validation/Training set split is not random	Feedback	bugs	0	224	May 16, 2023