🏠 Home
Benchmark Hub
📊 All Benchmarks 🦖 Dinosaur v1 🦖 Dinosaur v2 ✅ To-Do List Applications 🎨 Creative Free Pages 🎯 FSACB - Ultimate Showcase 🌍 Translation Benchmark
Models
🏆 Top 10 Models 🆓 Free Models 📋 All Models ⚙️ Kilo Code
Resources
💬 Prompts Library 📖 AI Glossary 🔗 Useful Links

AI Glossary

The complete dictionary of Artificial Intelligence

162
categories
2,032
subcategories
23,060
terms
📖
terms

Contamination

Hyperparameter of the Isolation Forest defining the expected proportion of anomalies in the dataset, used to calibrate the decision threshold on anomaly scores.

📖
terms

iTree (Isolation Tree)

Binary decision tree randomly built in an Isolation Forest, where each partition is performed on a randomly selected feature and split value until each point is isolated.

📖
terms

Isolation

Principle stating that anomalies are 'few and different' data points, and are therefore easier to isolate than normal points, requiring fewer partitions in a tree.

📖
terms

Extension Level

Optional hyperparameter in some Isolation Forest implementations that controls the number of dimensions to split simultaneously at each node, allowing handling of very high-dimensional data.

📖
terms

Scoring Function

Mathematical function that transforms the average path length into a normalized anomaly score, typically based on the distribution of path length in a random tree.

📖
terms

Forest Size

Number of iTrees built in the Isolation Forest, a hyperparameter that influences the stability and accuracy of the final anomaly scores.

📖
terms

Node Depth

Depth of a specific node in an iTree, used to calculate the path length and thus the isolation score of a data point.

📖
terms

External Anomalies

Type of anomalies that lie outside regions of high data density, effectively detected by the Isolation Forest due to their rapid isolation.

📖
terms

Computational Complexity

Measure of the efficiency of the Isolation Forest, which is linear with respect to the number of observations and sublinear with respect to the number of features, making it suitable for large datasets.

🔍

No results found