AI Glossary
A Complete Dictionary of Artificial Intelligence
Sequence Modeling
Approach that formalizes reinforcement learning as a sequence modeling problem, where states, actions, and rewards are treated as tokens in a temporal sequence.
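A minimal sketch of this idea, assuming a Decision Transformer-style token layout (the interleaving order and the use of return-to-go tokens are assumptions, not prescribed by the definition above): an episode is flattened into one sequence of (return-to-go, state, action) tokens.

```python
# Minimal sketch: flatten an episode into one token sequence by interleaving
# return-to-go, state, and action tokens (assumed layout, for illustration).

def to_token_sequence(states, actions, rewards):
    """Interleave (return-to-go, state, action) triples into a flat sequence."""
    # Return-to-go at step t is the sum of rewards from t to the episode end.
    rtg, total = [], 0.0
    for r in reversed(rewards):
        total += r
        rtg.append(total)
    rtg.reverse()
    tokens = []
    for g, s, a in zip(rtg, states, actions):
        tokens.extend([("rtg", g), ("state", s), ("action", a)])
    return tokens

# Example: a 3-step episode.
seq = to_token_sequence(states=[0, 1, 2], actions=["up", "up", "down"],
                        rewards=[1.0, 0.0, 2.0])
# seq[0] == ("rtg", 3.0): the first token carries the full episode return.
```

A sequence model trained on such token streams can then be queried like a language model, predicting the next action given the trajectory so far.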
Temporal Difference Transformer
Transformer variant that incorporates temporal-difference principles into the attention architecture, combining sequence modeling with bootstrapped updates of value estimates.
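To make the "bootstrap updating" part concrete, here is a plain TD(0) value update, shown independently of any transformer architecture (the function name and table-based values are illustrative assumptions):

```python
# Illustrative sketch of a bootstrapped TD(0) update: the value estimate for a
# state is moved toward a target that itself uses the current value estimate
# of the next state (that self-reference is the "bootstrap").

def td0_update(V, s, r, s_next, alpha=0.1, gamma=0.99):
    """Move V[s] toward the bootstrapped target r + gamma * V[s_next]."""
    target = r + gamma * V[s_next]
    V[s] += alpha * (target - V[s])
    return V

V = {"A": 0.0, "B": 1.0}
td0_update(V, s="A", r=0.5, s_next="B")
# V["A"] moved from 0.0 toward 0.5 + 0.99 * 1.0 = 1.49
```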
Trajectory Conditioning
Technique where the trajectory generator is conditioned on partial trajectory segments or specific goals, enabling precise control of the generated behavior.
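A sketch of how such conditioning is typically wired in (the goal-token prompt format and function name are hypothetical): the generator's context is a goal token followed by a partial trajectory prefix, and generation continues from there.

```python
# Sketch of trajectory conditioning via a prompt: a goal token is prepended to
# a partial trajectory, so the generator's continuation is steered toward the
# goal (interface is a simplified assumption, not a real library API).

def build_conditioned_prompt(goal, partial_trajectory):
    """Prepend a goal token so generation is conditioned on that goal."""
    return [("goal", goal)] + list(partial_trajectory)

prompt = build_conditioned_prompt(
    goal="reach_exit",
    partial_trajectory=[("state", "room_1"), ("action", "open_door")],
)
# prompt[0] == ("goal", "reach_exit")
```

Swapping the goal token changes the generated behavior without retraining, which is what gives this technique its precise control.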
Multi-step Prediction
Capability of transformer models to predict multiple future steps of a trajectory simultaneously, improving long-term consistency of generated state-action-reward sequences.
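The contrast with step-by-step generation can be sketched as follows (the toy model and both function names are assumptions for illustration): one call that returns k future states versus k autoregressive calls to a one-step model.

```python
# Sketch contrasting multi-step prediction (one call returns k future states)
# with a one-step autoregressive rollout (k sequential calls). The "model"
# here is a deterministic toy, standing in for a trained transformer.

def predict_k_steps(model, state, k):
    """Ask the model for k future states in a single forward call."""
    return model(state, k)

def rollout_one_step(model, state, k):
    """Baseline: apply the one-step model k times autoregressively."""
    out = []
    for _ in range(k):
        state = model(state, 1)[0]
        out.append(state)
    return out

toy = lambda s, k: [s + i + 1 for i in range(k)]  # toy dynamics: add 1 per step
assert predict_k_steps(toy, 0, 3) == rollout_one_step(toy, 0, 3) == [1, 2, 3]
```

Predicting all k steps jointly lets the model keep the generated state-action-reward sequence globally consistent, rather than compounding one-step errors.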
Distributional RL
Extension of reinforcement learning that models the complete distribution of returns rather than just their expectation, capturing uncertainty in trajectory predictions.
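A small sketch of the distributional idea (the categorical, fixed-atom support is an assumption in the style of C51-like methods): the return is represented as probability mass over a fixed set of values, from which both the expectation and the spread can be read off.

```python
# Sketch of a categorical return distribution: probabilities over fixed
# "atoms" of possible return values, instead of a single expected return.

atoms = [0.0, 1.0, 2.0]          # fixed support of possible returns
probs = [0.2, 0.5, 0.3]          # probability mass assigned to each atom

# The usual RL quantity, the expected return, is just the distribution's mean:
expected_return = sum(p * z for p, z in zip(probs, atoms))

# But the full distribution also exposes uncertainty, e.g. via the variance:
variance = sum(p * (z - expected_return) ** 2 for p, z in zip(probs, atoms))
```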
Attention-based Trajectory Embedding
Vector representation of trajectories obtained through attention mechanisms, capturing complex temporal dependencies between successive states, actions, and rewards.
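One common way to obtain such an embedding is attention pooling, sketched below (the dot-product scoring against a single query vector is a simplified assumption): per-step vectors are weighted by softmax attention scores and summed into one trajectory vector.

```python
# Sketch of attention pooling: per-step vectors are combined into a single
# trajectory embedding, with weights given by softmax over query-step scores.
import math

def attention_pool(step_vectors, query):
    """Weight step vectors by softmax(dot(query, step)) and sum them."""
    scores = [sum(q * x for q, x in zip(query, v)) for v in step_vectors]
    m = max(scores)                       # subtract max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    z = sum(exps)
    weights = [e / z for e in exps]
    dim = len(step_vectors[0])
    return [sum(w * v[i] for w, v in zip(weights, step_vectors))
            for i in range(dim)]

steps = [[1.0, 0.0], [0.0, 1.0]]          # two per-step embeddings
emb = attention_pool(steps, query=[1.0, 0.0])
# emb is a convex combination of the step vectors, weighted toward [1, 0]
```

Because the weights depend on the content of each step, the pooled vector can emphasize the states, actions, or rewards most relevant to the query.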