I’m currently training a keypoint detection model using Roboflow, and I’d like to better understand how the evaluation metrics (precision, recall, OKS, mAP) are computed.
-
Does Roboflow evaluate each keypoint ( joint-by-joint)?
-
Or does it evaluate per detected person/image, aggregating all keypoints together?
And also I would like to know that, I purchased the paid plan but I can’t upload the video for prediction. Doesn’t it support for Keypoint detection model or is it limited to image uploads only?