🏠 홈
벤치마크
📊 모든 벤치마크 🦖 공룡 v1 🦖 공룡 v2 ✅ 할 일 목록 앱 🎨 창의적인 자유 페이지 🎯 FSACB - 궁극의 쇼케이스 🌍 번역 벤치마크
모델
🏆 톱 10 모델 🆓 무료 모델 📋 모든 모델 ⚙️ 킬로 코드 모드
리소스
💬 프롬프트 라이브러리 📖 AI 용어 사전 🔗 유용한 링크

AI 용어집

인공지능 완전 사전

162
카테고리
2,032
하위 카테고리
23,060
용어
📖
용어

Transformer for Time Series

Deep neural network architecture, initially designed for NLP, adapted to model complex and long-term dependencies in temporal sequential data through its attention mechanisms.

📖
용어

Self-Attention

Process where each element in a time sequence interacts with all other elements in the same sequence to compute a contextual representation, essential for understanding internal dependencies.

📖
용어

PatchTST

Transformer model that segments the time series into subsequences (patches) before processing them, reducing computational complexity and improving the ability to model local and global dependencies.

📖
용어

Informer

Enhanced Transformer architecture introducing a sparse attention mechanism and distillation to effectively reduce complexity and mitigate the problem of forecast degradation over long horizons.

📖
용어

Sparse Attention

Variant of the attention mechanism where each token only focuses on a selected subset of other tokens, drastically reducing computational cost from O(n²) to O(n log n) or O(n).

📖
용어

Long-Horizon Forecasting

Prediction task involving estimating time series values over an extended time horizon, a major challenge where Transformers excel due to their handling of long-term dependencies.

📖
용어

Long-Term Temporal Dependency

Statistical relationship between an observation and values far in the past, which traditional models like RNNs struggle to capture but which Transformers model effectively.

📖
용어

Multi-Head Attention

Layer composed of multiple attention heads in parallel, concatenating their outputs to allow the model to focus on different positions and extract richer features.

📖
용어

Bottleneck Transformer

Architecture variant that compresses the input sequence into a lower-dimensional latent space before applying attention mechanisms, to efficiently handle very long time series.

📖
용어

Time Series Tokenization

Process of discretizing or segmenting a continuous time series into a sequence of discrete 'tokens', which serve as input to the Transformer's processing layers.

📖
용어

Wash-out Effect

Phenomenon where relevant information from old time steps is lost or 'washed out' during propagation through multiple layers of a model, a problem that attention mechanisms aim to solve.

📖
용어

Quadratic Complexity

Computational cost of O(n²) for standard attention, where n is the sequence length, which constitutes the main limitation of Transformers for very long time series.

📖
용어

Contextual Representation

Embedding vector for a given time step that is computed based on all other time steps in the sequence, thus capturing its meaning and importance in the global context.

📖
용어

Encoder-Decoder Layers

Transformer structure where the encoder processes the input sequence (history) to create a representation, and the decoder uses this representation to generate the output sequence (forecasts) step by step.

🔍

결과를 찾을 수 없습니다