Słownik AI
Kompletny słownik sztucznej inteligencji
Learning without Forgetting (LwF)
Approach that uses knowledge distillation to preserve the model's responses on old data while learning a new task. The original model serves as a teacher to guide the updated model, thus avoiding performance degradation on previous tasks.
Orthogonal Gradient Descent (OGD)
Method that projects the gradient of the new task onto the space orthogonal to the gradient subspaces of previous tasks. This projection guarantees that learning new tasks does not interfere with directions important for past performance.
Dynamical Expandable Networks (DEN)
Framework that dynamically expands the network by adding new units and connections when necessary, while selectively reactivating or deactivating existing connections. DEN adapts model capacity to new requirements without degrading previous performance.
PackNet
Regularization technique that assigns specific neural subnetworks to each task via fixed binary masks and sparsity constraints. PackNet allows stacking multiple tasks in the same network without interference by compartmentalizing resources.
HAT (Hard Attention to the Task)
Method that learns binary attention masks per task to select active network weights, thus creating dedicated paths for each task. HAT uses regularization to encourage the use of different weight subsets for different tasks.
CWR (Copy Weight with Reinit)
Strategy that duplicates model weights after learning each task and selectively reinitializes certain weights for learning the new task. CWR maintains a copy of important weights while allowing adaptation for new knowledge.
PathNet
Evolutionary architecture where neuron paths are selected and optimized for each specific task, using genetic algorithms to find the best combinations. PathNet allows module reuse while isolating parameters by task.
Sup-Sup (Superposition of Superpositions)
Technique that combines weight superposition with task superposition to maximize network parameter utilization. Sup-Sup allows a compact network to store and execute multiple tasks simultaneously without forgetting.