I’ve quantitatively evaluated binary segmentation models using IOU as my metric.
The reason why I used IoU is because IoU penalizes a single instance of classification error more compared to other metrics such as f1 scores, therefore lowering the value. Here are the steps I took.
- Randomly sample 30 images from the whole dataset
- Calculate All IoU for each images
- Average the Iou for each model