CNN for Object Detection

📖

istilah

YOLO (You Only Look Once)

Real-time object detection algorithm that processes the entire image in a single pass, dividing the image into a grid and simultaneously predicting bounding boxes and probability classes for each cell.

📖

istilah

R-CNN (Region-based CNN)

Pioneering object detection architecture that uses selective region proposals followed by a CNN to extract features, then SVMs to classify each proposed region.

📖

istilah

Fast R-CNN

Improvement of R-CNN that shares computations between region proposals using ROI pooling to efficiently extract features and combines classification and regression in a single network.

📖

istilah

Faster R-CNN

Advanced architecture integrating a Region Proposal Network (RPN) that shares convolutional features with the detection network, eliminating the need for external selective search.

📖

istilah

Anchor Box

Predefined boxes of different dimensions and ratios used as references to predict bounding boxes, serving as anchor points to improve object localization accuracy.

📖

istilah

Bounding Box

Rectangle defined by coordinates (x, y, width, height) that delimits the position of a detected object in an image, used for precise spatial localization of elements.

📖

istilah

Non-Maximum Suppression (NMS)

Post-processing algorithm that eliminates redundant detections by keeping only the boxes with the highest scores and removing those that overlap beyond a defined IoU threshold.

📖

istilah

Region Proposal Network (RPN)

Convolutional neural network that directly generates candidate region proposals using anchor boxes and predicting object probabilities and box adjustments for each location.

📖

istilah

Intersection over Union (IoU)

Evaluation metric measuring the overlap between the predicted box and the ground truth box, calculated as the ratio of the intersection over the union of the two boxes.

📖

istilah

Feature Pyramid Network (FPN)

Architecture combining multi-scale features through top-down and lateral connections, improving the detection of objects at different sizes in the same image.

📖

istilah

Single Shot Detector (SSD)

Unified object detector eliminating region proposals by directly predicting boxes and classes from feature maps at different scales for efficient multi-scale detection.

📖

istilah

Mask R-CNN

Extension of Faster R-CNN adding a segmentation branch predicting binary masks for each object, simultaneously performing detection, classification, and instance segmentation.

📖

istilah

Object Detection

Computer vision task combining localization and classification to identify and delimit multiple objects in an image with bounding boxes and category labels.

📖

istilah

mAP (Mean Average Precision)

Standard evaluation metric in object detection calculating the mean of average precisions across all classes, integrating precision, recall, and IoU thresholds for overall performance.

📖

istilah

Backbone Network

Fundamental CNN network (e.g., ResNet, VGG) extracting hierarchical features from the image, serving as the base for detection heads in modern architectures.

📖

istilah

Strided Convolution

Convolutional operation with stride greater than 1, reducing the spatial dimensions of the feature map while increasing the receptive field to capture wider contexts.

Glosarium AI

YOLO (You Only Look Once)

R-CNN (Region-based CNN)

Fast R-CNN

Faster R-CNN

Anchor Box

Bounding Box

Non-Maximum Suppression (NMS)

Region Proposal Network (RPN)

Intersection over Union (IoU)

Feature Pyramid Network (FPN)

Single Shot Detector (SSD)

Mask R-CNN

Object Detection

mAP (Mean Average Precision)

Backbone Network

Strided Convolution

Tidak ada hasil ditemukan