AI Dictionary
A complete dictionary of artificial intelligence
Options
Temporal abstraction units consisting of an intra-option policy, an initiation set, and a termination condition, enabling variable-duration action sequences in hierarchical reinforcement learning.
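The three components above can be sketched as a small data structure; this is a minimal illustration of the (I, π, β) triple, with integer states and string actions chosen purely for the example.

```python
from dataclasses import dataclass
from typing import Callable, Set

# Minimal sketch of an option as the triple (I, pi, beta):
# initiation set I, intra-option policy pi, termination function beta.
# State and action types here are illustrative (ints and strings).
@dataclass
class Option:
    initiation_set: Set[int]             # states where the option may start
    policy: Callable[[int], str]         # maps a state to a primitive action
    termination: Callable[[int], float]  # P(terminate | state)

    def can_start(self, state: int) -> bool:
        return state in self.initiation_set

# Example: an option that walks "right" until it reaches state 5.
walk_right = Option(
    initiation_set={0, 1, 2, 3, 4},
    policy=lambda s: "right",
    termination=lambda s: 1.0 if s >= 5 else 0.0,
)
```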
Semi-Markov Decision Process (SMDP)
Extension of the Markov decision process where actions can have variable and stochastically distributed durations, mathematically formalizing temporal abstraction in option hierarchies.
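The variable duration matters because an option that runs for τ steps is discounted by γ^τ rather than γ, as in this small sketch (the reward sequence and numbers are illustrative):

```python
# Sketch of SMDP-style discounting: an option that runs for tau steps
# and accrues rewards r_1..r_tau is treated as one decision whose
# return is discounted by gamma**tau, not by gamma**1.
def smdp_return(rewards, next_value, gamma=0.9):
    """Discounted return of one variable-duration option execution."""
    g = sum(gamma**t * r for t, r in enumerate(rewards))
    return g + gamma**len(rewards) * next_value

# A 3-step option followed by a state worth 10:
# 1 + 0.9*1 + 0.81*1 + 0.729*10
print(smdp_return([1, 1, 1], 10.0))
```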
Primitive Actions
Basic atomic actions at the lowest level of the hierarchy, executed in a single time step without internal abstraction, serving as foundations for higher-level options.
Intra-Option Policies
Policies defining behavior within an option, specifying how to choose primitive actions or sub-options during option execution until its termination condition.
Option Termination Conditions
Probabilistic functions determining when an option should cease execution, allowing flexible control over the duration of temporal abstractions based on the current state.
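Such a condition can be executed as a Bernoulli draw at every step of the option; in this sketch the termination function β and the state labels are illustrative assumptions.

```python
import random

# Sketch: evaluating a probabilistic termination condition beta(s)
# as a Bernoulli draw each step during option execution.
def should_terminate(beta, state, rng=random.random):
    return rng() < beta(state)

# Certain termination at the goal, a 10% chance elsewhere:
beta = lambda s: 1.0 if s == "goal" else 0.1
```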
HAM (Hierarchy of Abstract Machines)
Formal framework using hierarchical finite state machines to structure policies, where each machine defines subtasks and decision points at different temporal scales.
MAXQ Decomposition
Hierarchical task decomposition method where the value function is decomposed over a directed acyclic graph of reusable subtasks, sharing temporal abstractions between different parts of the problem.
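Such a subtask graph can be written down as a plain adjacency dict; this sketch borrows the task names from Dietterich's classic taxi example, and the traversal helper is a hypothetical illustration.

```python
# Sketch of a MAXQ-style task graph: each task lists the subtasks
# (or primitive actions) it may invoke. Note that "Navigate" is
# shared by both "Get" and "Put" -- the reuse MAXQ is built for.
task_graph = {
    "Root":     ["Get", "Put"],
    "Get":      ["Navigate", "Pickup"],
    "Put":      ["Navigate", "Putdown"],
    "Navigate": ["North", "South", "East", "West"],  # primitives
}

def primitives(task, graph):
    """Collect the primitive actions reachable from a task (DAG walk)."""
    children = graph.get(task)
    if children is None:          # not in the graph => primitive action
        return {task}
    out = set()
    for child in children:
        out |= primitives(child, graph)
    return out

print(sorted(primitives("Root", task_graph)))
```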
Goal-Oriented Reinforcement Learning
Hierarchical approach where subtasks are defined as achieving specific sub-goals, naturally creating temporal abstractions aligned with the semantic structure of the problem.
Feudal Reinforcement Learning (FRL)
Hierarchical architecture inspired by feudalism where high-level managers set goals for low-level workers, creating multi-scale temporal abstraction through command delegation.
Abstract States
Simplified representations of the environment that group multiple concrete states into a single abstract state, enabling decisions at larger temporal scales without considering each individual state.
Multi-scale Temporal Learning
Paradigm where different system components operate and learn at distinct temporal scales simultaneously, optimizing short-term and long-term decisions in a coordinated manner.
Option Discovery
Automated process of identifying and constructing useful options from experience, detecting temporal regularities and recurring subtasks in the environment.
Subgoal-based Learning
Methodology where hierarchical learning is structured around achieving intermediate subgoals, creating natural temporal abstractions aligned with progress toward the final goal.
Hierarchical Temporal Memory
Information storage and retrieval system organized across multiple temporal levels, enabling agents to maintain representations at different time scales for informed decisions.
Termination Function
Mathematical component of an option defining the probability of terminating the option in each state, explicitly controlling the duration and pace of temporal abstractions.
Initiation Set
Set of states in which an option can be initiated, defining the temporal and contextual validity domain of each abstraction in the decision hierarchy.
Option Policy
High-level policy that selects from available options rather than primitive actions, operating at a coarser temporal scale to plan over longer horizons.
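At each decision point such a policy considers only the options whose initiation set contains the current state; in this sketch the option names, states, and the greedy Q-table are hypothetical.

```python
# Sketch of a policy over options: filter by initiation set, then
# pick greedily from an (illustrative) Q-table indexed by
# (state, option) pairs.
def select_option(state, options, q):
    eligible = [o for o in options if state in options[o]["initiation_set"]]
    return max(eligible, key=lambda o: q.get((state, o), 0.0))

options = {
    "go_to_door": {"initiation_set": {"room_a", "room_b"}},
    "open_door":  {"initiation_set": {"doorway"}},
}
q = {("room_a", "go_to_door"): 1.0}
print(select_option("room_a", options, q))
```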
Option-Critic Architecture
Algorithmic framework combining hierarchical reinforcement learning with actor-critic methods, where the actor selects options and the critic evaluates their value at different temporal scales.
Nested Options
Hierarchical structure where high-level options can call other lower-level options, creating recursive temporal abstractions to handle problems of arbitrary complexity.