Contents

Measuring Overlap: Intersection over Union (IoU)

Visualizing Predictions and NMS

NMS Result

Effect of IoU Thresholds

Summary

NMS Explained: Step-by-Step Guide with Python

Understanding how Non-Maximum Suppression refines object detection predictions

Written by Brian Hulela

03 Sep 2025 • 19:27

4 min read

Non Maximum Suppression of the Golden Retriever: Image from the **Stanford Dogs Dataset**

In modern object detection, models often predict multiple overlapping bounding boxes for the same object.

These redundant boxes, if left unchecked, can create confusion for applications like counting objects or tracking.

Non-Maximum Suppression (NMS) is the technique that resolves this by selecting the most confident predictions and discarding overlapping ones.

Generating Bounding Box Predictions

Consider an example where the objective is to detect a Golden Retriever given an image. That is, we want to find a bounding box, $B = [x_{\min}, y_{\min}, x_{\max}, y_{\max}]$ , that best localizes the Golden Retriever.

Your object detection model might predict multiple bounding boxes around the same Dog.

B_i = [x_{\min}, y_{\min}, x_{\max}, y_{\max}]

s_i \in [0, 1]

Where $s_i$ is the confidence score of the predicted bounding box $B_i$ .

Multiple overlapping predictions around the Golden Retriever

These multiple predictions create noise, and make it difficult to use the predictions in any meaningful way.

The goal is to filter these bounding boxes in such a way that the bounding box with the highest confidence score is retained.

Any other bounding box that has an $IoU$ greater that a chosen threshold ( $T$ ) is dicarded.

Even though these boxes are close, only one should ideally represent the object. This is where NMS comes in.

Measuring Overlap: Intersection over Union (IoU)

The core idea is to measure how much two boxes overlap. This is quantified by the Intersection over Union (IoU):

\text{IoU}(B_i, B_j) = \frac{\text{Area of }(B_i \cap B_j)}{\text{Area of }(B_i \cup B_j)}

$IoU = 1$ → boxes perfectly overlap
$IoU = 0$ → no overlap

All code used in this guide can be accessed on this GitHub Repository.

Python

def iou(box1, box2):
    x1 = max(box1[0], box2[0])
    y1 = max(box1[1], box2[1])
    x2 = min(box1[2], box2[2])
    y2 = min(box1[3], box2[3])
    
    inter_area = max(0, x2-x1) * max(0, y2-y1)
    box1_area = (box1[2]-box1[0]) * (box1[3]-box1[1])
    box2_area = (box2[2]-box2[0]) * (box2[3]-box2[1])
    
    return inter_area / (box1_area + box2_area - inter_area)

How NMS Works

Non-Maximum Suppression follows a simple, intuitive logic:

Sort all predicted boxes by confidence scores $s_i$ in descending order.
Select the highest-scoring box $B_{\text{max}}$ and add it to the final detections.
Remove all remaining boxes that have IoU with $B_{\text{max}}$ greater than a threshold $T$ .
Repeat until no boxes remain.

Mathematically, for each box $B_i$ :

\text{keep } B_i \;\Longleftrightarrow\; \forall\, B_j \in \mathcal{K},

\; \operatorname{IoU}(B_i,B_j) \le T

where $\mathcal{K}$ is all the boxes in the image.

Python

def non_max_suppression(boxes, scores, iou_threshold=0.5):
    boxes = np.array(boxes)
    scores = np.array(scores)
    indices = np.argsort(scores)[::-1]
    keep = []
    
    while len(indices) > 0:
        current = indices[0]
        keep.append(current)
        rest = indices[1:]
        ious = np.array([iou(boxes[current], boxes[i]) for i in rest])
        indices = rest[ious <= iou_threshold]
    
    return boxes[keep], scores[keep], [i for i in keep]

Visualizing Predictions and NMS

We can represent predictions visually to see the effect of NMS.

Each box is drawn with a unique color, and confidence scores are annotated:

Python

def plot_boxes(image, boxes, scores=None, title="", filename=None, colors=None):
    fig, ax = plt.subplots(figsize=(8,6))
    ax.imshow(image)
    
    for i, box in enumerate(boxes):
        x1, y1, x2, y2 = box
        rect = patches.Rectangle(
            (x1, y1), x2-x1, y2-y1,
            linewidth=2,
            edgecolor=colors[i] if colors else 'w',
            facecolor='none',
        )
        ax.add_patch(rect)
        if scores is not None:
            ax.text(
                x1, y1-5, f"{scores[i]:.2f}",
                color=colors[i] if colors else 'w',
                fontsize=12
            )
    
    plt.axis('off')
    if filename:
        plt.savefig(filename, bbox_inches="tight", pad_inches=0)
    plt.show()

NMS Result

The result of NMS is the final bounding box that best represents the object as illustrated by the following. A Threshold of 0.5 is chosen for this example.

Python

threshold = 0.5

nms_boxes, nms_scores, keep_indices = non_max_suppression(boxes, scores, iou_threshold=threshold)
kept_colors = [colors[i] for i in keep_indices]
    
plot_boxes(
    image,
    nms_boxes,
    scores=nms_scores,
    title=f"NMS Filtered Boxes (IoU={threshold})",
    filename=f"nms_{int(threshold*100)}.png",
    colors=kept_colors
)

Effect of IoU Thresholds

NMS ensures that each object is represented by a single bounding box, prioritizing the most confident predictions. The IoU threshold $T$ determines how aggressive the suppression is. Choosing the right IoU threshold is crucial:

Low threshold (0.3): even slightly overlapping boxes are removed
High threshold (0.7): only highly overlapping boxes are removed

An understanding of how your detection model works is crucial in determining what the threshold should be.

Summary

Non-Maximum Suppression is a simple yet essential step in object detection pipelines. By combining confidence scores and IoU-based suppression, it reduces redundancy and ensures clearer, more precise predictions.

Confidence score guides which box is preferred.
IoU threshold controls overlap tolerance.
Visualization helps understand the algorithm’s behavior.

FAQs

NMS addresses the issue of multiple overlapping bounding boxes predicted for the same object. Without it, object detectors would output many redundant boxes, making it unclear which one represents the object best. NMS keeps the most confident box while discarding highly overlapping duplicates.

The algorithm sorts boxes by their confidence scores, keeps the highest one, and removes all other boxes that have an Intersection-over-Union (IoU) above a chosen threshold with it. This process repeats until no boxes remain to compare.

The IoU threshold controls how much overlap is tolerated between boxes. A low threshold (e.g., 0.3) aggressively removes boxes, potentially discarding valid detections. A high threshold (e.g., 0.7) is more forgiving, allowing more boxes to remain. Choosing the right threshold depends on the dataset and application.

Yes, if the IoU threshold is too strict, NMS may suppress valid boxes that happen to overlap significantly. This is why threshold tuning is important in practice.

While most common in object detection, NMS is also used in tasks like text detection, keypoint detection, and even some natural language processing problems where overlapping candidates need to be pruned.

Soft-NMS is a variation where overlapping boxes are not completely removed but instead have their confidence scores reduced based on their IoU. This often improves detection performance, especially when objects are close together.

NMS Explained: Step-by-Step Guide with Python

Understanding how Non-Maximum Suppression refines object detection predictions

Generating Bounding Box Predictions

Measuring Overlap: Intersection over Union (IoU)

How NMS Works

Visualizing Predictions and NMS

NMS Result

Effect of IoU Thresholds

Summary

FAQs

What problem does Non-Maximum Suppression solve?

How does NMS decide which boxes to keep?

What is the role of the IoU threshold in NMS?

Can NMS remove true positives?

Is NMS only used in object detection?

What is Soft-NMS and how is it different?

Read More

Intersection over Union and Its Applications in Computer Vision

Basic Computer Vision Operations

Foundations of Computer Vision

How to Get Started with Data Science Using Free Resources