Multi-Head Attention
Linear Projection
Linear transformation applied to input embeddings to generate Query, Key and Value spaces in each multi-head attention head.
← 뒤로Linear transformation applied to input embeddings to generate Query, Key and Value spaces in each multi-head attention head.
← 뒤로