Feature Selection

📖

termer

Process of automatically selecting the most relevant variables to build an optimal predictive model, reducing dimensionality and improving generalization.

📖

termer

Filter Methods

Feature selection techniques independent of the model, evaluating each variable individually according to statistical criteria before training.

📖

termer

Wrapper Methods

Selection approaches using the predictive model to evaluate feature subsets, often more accurate but computationally intensive.

📖

termer

Embedded Methods

Strategies combining selection and learning, where the selection process is directly integrated into the model training algorithm.

📖

termer

Recursive Feature Elimination

Iterative algorithm progressively removing the least important features by retraining the model at each step until reaching the optimal number of variables.

📖

termer

Mutual Information

Measure quantifying the statistical dependence between two variables, used to evaluate feature relevance relative to the target variable.

📖

termer

Variance Threshold

Basic filtering technique eliminating features with variance below a predefined threshold, considered uninformative.

📖

termer

Chi-square Test

Statistical test evaluating independence between categorical variables, used to measure the relevance of qualitative features relative to the target.

📖

termer

ANOVA F-Test

Statistical test comparing variances between groups to evaluate the relationship between numerical features and categorical target variables.

📖

termer

Correlation Coefficient

Statistical measure quantifying the strength and direction of the linear relationship between two variables, used to detect multicollinearity.

📖

termer

Sequential Selection

Greedy method sequentially adding (forward) or removing (backward) features to optimize a model performance metric.

📖

termer

Boruta Algorithm

Wrapper method based on random forests identifying all relevant features by comparing their importance to random shadow variables.

📖

termer

Permutation Importance

Model-agnostic technique evaluating feature importance by measuring performance degradation after random permutation of their values.

📖

termer

Relief Algorithm

Filter method assessing feature relevance by measuring their ability to distinguish neighboring instances of different classes.

AI-ordlista