What is the mean average precision in object detection?

What is object detection?

Object detection is a method of recognizing and locating instances of an object from an image or video. Mean average precision (mAP) is a metric used to measure the accuracy of object detection models.

It is a value between $0–1$ , with higher scores representing a more accurate model.

The following formula describes it:

In the formula above, $N$ is the total number of classes, and $AP_i$ is the average precision of class $i$ . In simple terms, mAP is the average of average precisions across all classes.

To understand the calculation of mAP for object detection, we must first explore intersection over union, precision-recall curve, and average precision.

Intersection over union (IOU)

Intersection over union (IOU) is a metric used to measure the accuracy of a bounding boxAn imaginary rectangle that contains an object. predicted by an object detector. The actual bounding box is the desired box that we set ourselves. The following image gives examples of actual and predicted bounding boxes.

The training data is assumed to have some objects present in each image. As a result, true negatives (TN) are not considered.

Precision-recall curve

A precision-recall curve is a graph that shows the tradeoff between precision and recall.

A precision-recall curve can be plotted by following these steps:

Passing dataset of images to the object detection model.
Sorting results based on the received confidence scoresA value between 0–1 indicating the models certainty in its prediction..
Determining if the prediction was TP, FP, or FN.
Calculating ranked precisionPrecision for top k sorted results and ranked recallRecall for top k sorted results.

To understand the calculations in each step, let's consider an object detection model that can classify dogs. Let's suppose that the dataset contains only four images of dogs.

For each image, the model returns a bounding box, a predicted class, and a confidence scoreThe model's certainty in predicting the class for the predicted class. In the case of multiple predictions, we arbitrarily choose the prediction with the highest confidence. The bounding box and predicted class have been omitted for clarity.

True positve (TP)	False positive (FP)	False negative (FN)
The predicted class is correct, and IOU is greater than or equal to the threshold.	The IOU score is less than the threshold (less than 0.5) or multiple bounding boxes have been predicted.	The IOU score was greater than or equal to 0.5 but the prediction was made for the wrong class.

What is the mean average precision in object detection?

What is object detection?

Intersection over union (IOU)

Evaluating Metrics

Precision-recall curve

Average precision

Mean average precision