Parameter Regularization - Glossariusz AI

📖

pojęcia

Learning without Forgetting (LwF)

Approach that uses knowledge distillation to preserve the model's responses on old data while learning a new task. The original model serves as a teacher to guide the updated model, thus avoiding performance degradation on previous tasks.

📖

pojęcia

Orthogonal Gradient Descent (OGD)

Method that projects the gradient of the new task onto the space orthogonal to the gradient subspaces of previous tasks. This projection guarantees that learning new tasks does not interfere with directions important for past performance.

📖

pojęcia

Dynamical Expandable Networks (DEN)

Framework that dynamically expands the network by adding new units and connections when necessary, while selectively reactivating or deactivating existing connections. DEN adapts model capacity to new requirements without degrading previous performance.

📖

pojęcia

PackNet

Regularization technique that assigns specific neural subnetworks to each task via fixed binary masks and sparsity constraints. PackNet allows stacking multiple tasks in the same network without interference by compartmentalizing resources.

📖

pojęcia

HAT (Hard Attention to the Task)

Method that learns binary attention masks per task to select active network weights, thus creating dedicated paths for each task. HAT uses regularization to encourage the use of different weight subsets for different tasks.

📖

pojęcia

CWR (Copy Weight with Reinit)

Strategy that duplicates model weights after learning each task and selectively reinitializes certain weights for learning the new task. CWR maintains a copy of important weights while allowing adaptation for new knowledge.

📖

pojęcia

PathNet

Evolutionary architecture where neuron paths are selected and optimized for each specific task, using genetic algorithms to find the best combinations. PathNet allows module reuse while isolating parameters by task.

📖

pojęcia

Sup-Sup (Superposition of Superpositions)

Technique that combines weight superposition with task superposition to maximize network parameter utilization. Sup-Sup allows a compact network to store and execute multiple tasks simultaneously without forgetting.

Słownik AI

Learning without Forgetting (LwF)

Orthogonal Gradient Descent (OGD)

Dynamical Expandable Networks (DEN)

PackNet

HAT (Hard Attention to the Task)

CWR (Copy Weight with Reinit)

PathNet

Sup-Sup (Superposition of Superpositions)

Nie znaleziono wyników