🏠 Home
Benchmark Hub
📊 All Benchmarks 🦖 Dinosaur v1 🦖 Dinosaur v2 ✅ To-Do List Applications 🎨 Creative Free Pages 🎯 FSACB - Ultimate Showcase 🌍 Translation Benchmark
Models
🏆 Top 10 Models 🆓 Free Models 📋 All Models ⚙️ Kilo Code
Resources
💬 Prompts Library 📖 AI Glossary 🔗 Useful Links

AI Glossary

The complete dictionary of Artificial Intelligence

162
categories
2,032
subcategories
23,060
terms
📖
terms

CF Tree (Clustering Feature Tree)

Tree data structure at the core of BIRCH, storing statistical summaries (Clustering Features) in its nodes to compactly represent subclusters.

📖
terms

Clustering Feature (CF)

A triplet (N, LS, SS) that statistically summarizes a subcluster, where N is the number of points, LS the linear sum of the points, and SS the sum of the squares of the points.

📖
terms

Threshold

BIRCH parameter defining the maximum diameter of a subcluster in a leaf of the CF tree, controlling the granularity of the clustering summary.

📖
terms

Branching Factor

Parameter limiting the number of entries (children) per node in the CF tree, influencing the size and shape of the tree to optimize performance.

📖
terms

Micro-clustering

Initial phase of BIRCH where data points are organized into micro-clusters, represented by the entries in the leaf nodes of the CF tree.

📖
terms

Macro-clustering

Final phase of BIRCH applying a clustering algorithm (like K-Means) on the micro-clusters (leaf nodes of the CF tree) to generate the final clusters.

📖
terms

Incremental summarization

Ability of BIRCH to update the CF tree with new data points without needing a complete recalculation from the beginning, ideal for data streams.

📖
terms

CF Additive Distance

Distance metric used in BIRCH to measure the proximity between two Clustering Features, directly calculable from their statistical summaries without accessing the original points.

📖
terms

Leaf Entry

Element of a leaf of the CF tree representing a micro-cluster, containing a Clustering Feature and a pointer to the next node in the leaf linked list.

📖
terms

Leaf Linked List

Structure in the CF tree linking all leaves for efficient sequential scanning during the macro-clustering phase.

📖
terms

Point Absorption

Process in BIRCH where a new data point is integrated into the nearest micro-cluster if the addition does not exceed the diameter threshold.

📖
terms

Node Splitting

Mechanism triggered in BIRCH when the insertion of a point would exceed the diameter threshold or branching factor, dividing the node to maintain constraints.

📖
terms

Rebuilding Phase

Optional step in BIRCH where the CF tree is rebuilt with a lower diameter threshold to increase clustering precision before the final phase.

📖
terms

Incremental computational cost

Key advantage of BIRCH, where the cost to insert a data point is logarithmic with respect to the number of points, making the algorithm very scalable.

📖
terms

Cluster Summary

Fundamental concept of BIRCH where a group of points is represented by a statistical summary (the CF) rather than by individual points, reducing memory space.

🔍

No results found