AI Glossary

The complete dictionary of Artificial Intelligence

162 categories, 2,032 subcategories, 23,060 terms

Distributional Correction

Technique that corrects the mismatch between the distribution of state-action pairs visited in the offline dataset and the distribution induced by the learned policy during online transfer.
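
One common way to perform such a correction is importance weighting, sketched below for a discrete setting. The entry itself does not name a specific method, so this is an illustrative assumption: the function name `importance_weights`, the toy dataset, and the choice to estimate the behavior policy from empirical counts are all hypothetical.

```python
from collections import Counter

def importance_weights(dataset, target_probs):
    """Per-sample weights w = pi(a|s) / mu_hat(a|s).

    dataset: list of (state, action) pairs logged offline.
    target_probs: dict mapping (state, action) to its probability
        under the learned policy pi.
    mu_hat is the behavior policy estimated from counts (an assumption;
    real systems may log the behavior probabilities directly).
    """
    pair_counts = Counter(dataset)
    state_counts = Counter(s for s, _ in dataset)
    weights = []
    for s, a in dataset:
        mu_hat = pair_counts[(s, a)] / state_counts[s]
        weights.append(target_probs.get((s, a), 0.0) / mu_hat)
    return weights

# Toy data: the logging policy preferred "left", the learned policy is uniform.
dataset = [("s0", "left"), ("s0", "left"), ("s0", "left"), ("s0", "right")]
target_probs = {("s0", "left"): 0.5, ("s0", "right"): 0.5}
weights = importance_weights(dataset, target_probs)
```

Samples that the learned policy would choose more often than the logging policy did (here, "right") receive weights above 1, and over-represented samples receive weights below 1.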

Fitted Q-Iteration

Iterative offline learning algorithm that approximates the optimal Q-function by repeatedly fitting a regressor to Bellman targets computed from a fixed batch of experimental data.
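
The loop can be sketched as follows. This is a minimal illustration, not a reference implementation: the regressor is replaced by a hypothetical lookup table (`fit_table`) that averages targets per state-action pair, and the four-transition batch is invented for the example. In practice the regressor would typically be a tree ensemble or a neural network.

```python
from collections import defaultdict

def fit_table(inputs, targets):
    """Stand-in regressor: predicts the mean target seen for each (state, action)."""
    sums, counts = defaultdict(float), defaultdict(int)
    for key, y in zip(inputs, targets):
        sums[key] += y
        counts[key] += 1
    return {key: sums[key] / counts[key] for key in sums}

def fitted_q_iteration(transitions, actions, gamma=0.9, n_iters=50):
    """transitions: list of (state, action, reward, next_state, done)."""
    q = {}  # Q_0 is zero everywhere (missing keys read as 0.0)
    inputs = [(s, a) for s, a, _, _, _ in transitions]
    for _ in range(n_iters):
        targets = []
        for s, a, r, s_next, done in transitions:
            # Bellman target: r + gamma * max_a' Q_k(s', a'), no bootstrap at terminal states
            best_next = 0.0 if done else max(q.get((s_next, b), 0.0) for b in actions)
            targets.append(r + gamma * best_next)
        q = fit_table(inputs, targets)  # Q_{k+1} is the regression fit to the targets
    return q

# Toy chain: action 1 moves right, action 0 stays; reaching state 2 pays reward 1.
batch = [
    (0, 1, 0.0, 1, False),
    (1, 1, 1.0, 2, True),
    (0, 0, 0.0, 0, False),
    (1, 0, 0.0, 1, False),
]
q = fitted_q_iteration(batch, actions=[0, 1])
```

No new environment interaction happens inside the loop; every iteration reuses the same fixed batch.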

Safe Policy Transfer

Strategy ensuring that policies transferred from offline to online maintain a minimum level of performance during the initial adaptation phase.

Dataset Aggregation

Iterative method that collects and aggregates successive batches of offline data to progressively improve policy performance before online deployment.

Offline Policy Evaluation

Estimation of a policy's performance from logged data, without direct interaction with the environment; crucial for selecting the best policies to transfer online.
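
As an illustration, the sketch below uses ordinary trajectory-wise importance sampling, which is one of several possible estimators and is not prescribed by the entry. The function name `is_estimate`, the trajectory format, and the logged probabilities are assumptions made for the example.

```python
def is_estimate(trajectories, gamma=1.0):
    """Ordinary importance-sampling estimate of the target policy's expected return.

    trajectories: list of trajectories; each trajectory is a list of
        (pi_prob, mu_prob, reward) tuples, where pi_prob is the probability
        of the logged action under the policy being evaluated and mu_prob
        its probability under the logging (behavior) policy.
    """
    total = 0.0
    for traj in trajectories:
        ratio, ret = 1.0, 0.0
        for t, (pi_prob, mu_prob, reward) in enumerate(traj):
            ratio *= pi_prob / mu_prob      # cumulative likelihood ratio
            ret += (gamma ** t) * reward    # discounted return of the logged trajectory
        total += ratio * ret
    return total / len(trajectories)

# Two one-step logs from a uniform behavior policy (probability 0.5 per action).
# The evaluated policy picks the rewarded action with probability 0.8.
logs = [
    [(0.8, 0.5, 1.0)],  # rewarded action was logged
    [(0.2, 0.5, 0.0)],  # unrewarded action was logged
]
value = is_estimate(logs)
```

The estimator is unbiased when the logged probabilities are correct, but its variance grows quickly with trajectory length because the likelihood ratios multiply.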

Transfer Learning Gap

Quantitative measure of the difference between the performance expected of an offline-trained policy and its initial performance in an online environment.
