Hi everyone, I am posting with a very fundamental question. I am looking through the sort of naive lens of a biologist, and I am hoping for feedback on the feasibility of my proposed project through roboflow and on the specific approach I should be taking. I don’t know if this is the exact proper forum for this kind of question but seemed like the obvious place to start! I understand that resources exist for answering this, but a bunch of it is over my head at the moment… without a proper background in computer vision I am a bit overwhelmed by the breadth of information here. Plus, it’s been difficult for me to tell what pre-trained options already exist and would be applicable for my purpose.
Here’s my goal: to be able to input an image that contains flowers (e.g. here: https://inaturalist-open-data.s3.amazonaws.com/photos/221500346/original.jpg) and for a pipeline to return the pixels correspond specifically to the flowers. For now, at least, all of the flowers are from closely related plants and so there wouldn’t be much diversity in shape/size/color. I would like for this to be able to scale up to tens of thousands of images. I have plenty of sample images to choose from, I could pre-screen them for flower presence (e.g. with an LLM), and I can handle upstream/downstream work in python, etc.
Am I right that this is a segmentation-specific problem? Is manually training a small dataset the most sensible path forward? Should I be annotating my dataset with polygons specifically, rather than bounding boxes? Are there customization options I should have on my radar or is there a specific out-of-the-box model I should be training?
Anything advice helps – thanks so much!!!
Patrick
- Project Type: Segmentation (?)
- Operating System & Browser: macOS, Google Chrome