K-means clustering - Glossariusz AI

📖

pojęcia

K-means

Unsupervised learning algorithm that partitions a dataset into K clusters by minimizing the sum of intra-cluster distances, where each cluster is represented by its centroid.

📖

pojęcia

Centroid

Geometric point representing the center of a cluster, calculated as the average of all data points belonging to that cluster and serving as a reference for assigning new observations.

📖

pojęcia

Within-cluster inertia

Measure quantifying the dispersion of points within the same cluster, calculated as the sum of squared distances between each point and its respective centroid.

📖

pojęcia

Elbow method

Heuristic technique to determine the optimal number of clusters K by plotting within-cluster inertia against K and identifying the point where the inertia reduction starts to decrease significantly.

📖

pojęcia

K-means++

Initialization variant of K-means that intelligently selects initial centroids using a probability distribution based on distances, improving convergence and clustering quality.

📖

pojęcia

State reached when centroids no longer move significantly between successive iterations or when the assignment of points to clusters becomes stable, indicating that the algorithm has found a solution.

📖

pojęcia

Silhouette score

Clustering evaluation metric measuring intra-cluster cohesion and inter-cluster separation, ranging from -1 (poor clustering) to +1 (optimal clustering).

📖

pojęcia

Local optimum

Clustering solution where no improvement is possible through simple point reassignments, but which does not necessarily represent the best possible global configuration.

📖

pojęcia

Mini-batch K-means

A variant of K-means that uses random subsets of data (mini-batch) to update centroids, offering faster convergence on large datasets.

📖

pojęcia

Inter-cluster inertia

Measure of separation between clusters, calculated as the weighted sum of squared distances between different cluster centroids and the global centroid.

📖

pojęcia

Vector quantization

Application of K-means clustering to compress vector data by representing groups of similar vectors by their common centroid, thus reducing information complexity.

📖

pojęcia

K-means iteration

A complete cycle of the algorithm including the assignment phase of points to the nearest centroid and the update phase of centroids based on the average of assigned points.

📖

pojęcia

Partitioning

Fundamental approach of K-means clustering aimed at creating a disjoint division of data where each observation belongs to exactly one cluster.

📖

pojęcia

Clustering stability

Measure of the consistency of clustering results obtained by K-means across multiple runs with different initializations, evaluating the robustness of the found solution.

Słownik AI

K-means

Centroid

Within-cluster inertia

Elbow method

K-means++

Convergence

Silhouette score

Local optimum

Mini-batch K-means

Inter-cluster inertia

Vector quantization

K-means iteration

Partitioning

Clustering stability

Nie znaleziono wyników