🏠 Home
Benchmark Hub
📊 All Benchmarks 🦖 Dinosaur v1 🦖 Dinosaur v2 ✅ To-Do List Applications 🎨 Creative Free Pages 🎯 FSACB - Ultimate Showcase 🌍 Translation Benchmark
Models
🏆 Top 10 Models 🆓 Free Models 📋 All Models ⚙️ Kilo Code
Resources
💬 Prompts Library 📖 AI Glossary 🔗 Useful Links

AI Glossary

The complete dictionary of Artificial Intelligence

162
categories
2,032
subcategories
23,060
terms
📖
terms

Text classification

NLP task consisting of automatically assigning a textual document to one or more predefined categories based on its semantic content.

📖
terms

Binary classification

Type of classification where the model must choose between two mutually exclusive classes, usually represented as positive/negative or 0/1.

📖
terms

Multi-class classification

Classification problem where each instance must be assigned to exactly one class among three or more, with mutually exclusive classes.

📖
terms

Multi-label classification

Variant of classification where a document can be simultaneously associated with multiple non-exclusive labels or categories.

📖
terms

Naive Bayes

Probabilistic classification algorithm based on Bayes' theorem with a conditional independence assumption between features.

📖
terms

SVM (Support Vector Machine)

Supervised learning algorithm that finds the optimal hyperplane separating classes in high-dimensional space by maximizing the margin.

📖
terms

Bag-of-Words

Text representation that counts word occurrences without considering their order or grammatical context.

📖
terms

TF-IDF

Statistical metric evaluating the importance of a word in a document relative to a corpus, combining term frequency and inverse document frequency.

📖
terms

Word Embeddings

Dense vector representations of words in a continuous space where semantic distances between words are preserved.

📖
terms

Transformers

Neural network architecture based on attention mechanisms that allows capturing long-range dependencies in sequences.

📖
terms

Confusion Matrix

A table for visualizing classifier performance by comparing predictions to true labels by class.

📖
terms

Cross-validation

Robust evaluation technique dividing data into subsets to train and test the model multiple times on different partitions.

📖
terms

Precision

Metric measuring the proportion of correct positive predictions among all positive predictions made by the model.

📖
terms

Recall

Metric evaluating the model's ability to correctly identify all actual positive instances in the dataset.

📖
terms

F1 Score

Harmonic mean of precision and recall, providing a single balanced measure of classification performance.

📖
terms

Overfitting

Phenomenon where the model learns training data too specifically and poorly generalizes to new unseen data.

📖
terms

Tokenization

Process of segmenting text into elementary units (tokens) such as words, subwords, or characters for analysis.

📖
terms

Stemming

Text normalization technique that reduces words to their morphological root by removing suffixes and prefixes.

📖
terms

Lemmatization

Linguistic process that reduces words to their canonical form (lemma) using morphological analysis and a dictionary.

🔍

No results found