🏠 Trang chủ
Benchmark
📊 Tất cả benchmark 🦖 Khủng long v1 🦖 Khủng long v2 ✅ Ứng dụng To-Do List 🎨 Trang tự do sáng tạo 🎯 FSACB - Trình diễn cuối cùng 🌍 Benchmark dịch thuật
Mô hình
🏆 Top 10 mô hình 🆓 Mô hình miễn phí 📋 Tất cả mô hình ⚙️ Kilo Code
Tài nguyên
💬 Thư viện prompt 📖 Thuật ngữ AI 🔗 Liên kết hữu ích

Thuật ngữ AI

Từ điển đầy đủ về Trí tuệ nhân tạo

162
danh mục
2.032
danh mục con
23.060
thuật ngữ
📖
thuật ngữ

Multimodal Model

Artificial intelligence architecture capable of simultaneously processing and integrating multiple types of data such as text, images, audio, and video within a unified framework.

📖
thuật ngữ

Early Fusion

Multimodal integration strategy where different modalities are combined at the raw feature level before processing by the main model.

📖
thuật ngữ

Late Fusion

Multimodal approach where each modality is processed independently until the final layers of the model, before merging the representations for the final decision.

📖
thuật ngữ

Cross-modal Alignment

Learning process aimed at establishing semantic correspondences between different modalities in a common representation space.

📖
thuật ngữ

Vision-Language Encoding

Mechanism that simultaneously transforms visual and textual inputs into compatible vector representations for joint processing.

📖
thuật ngữ

Cross-modal Attention

Attention mechanism allowing the model to dynamically weight the importance of information from one modality relative to another.

📖
thuật ngữ

Multimodal Embeddings

Dense vector representations that encode information from multiple modalities in a shared semantic space.

📖
thuật ngữ

Multimodal Zero-shot Learning

Ability of a multimodal model to generalize to new tasks or modality combinations without specific training examples.

📖
thuật ngữ

Multimodal Tokenization

Process of converting different modalities (image, audio, video) into token sequences compatible with Transformer architecture.

📖
thuật ngữ

Multimodal Contrastive Pre-training

Self-supervised method that maximizes similarity between positive multimodal pairs while minimizing that of negative pairs.

📖
thuật ngữ

Common Latent Space Projection

Linear or non-linear transformation aligning representation spaces of different modalities into a unified vector space.

📖
thuật ngữ

Hybrid Encoder-Decoder Architecture

Structure combining specialized encoders per modality with a unified decoder for generating multimodal outputs.

📖
thuật ngữ

Multimodal Fine-tuning

Process of adapting a pre-trained multimodal model to specific tasks while preserving its cross-modal processing capabilities.

📖
thuật ngữ

Multimodal Prompt Engineering

Technique for optimizing inputs combining text and other modalities to effectively guide multimodal models toward desired outputs.

📖
thuật ngữ

Multimodal Chain-of-Thought Reasoning

Model's ability to generate explicit reasoning steps by integrating evidence from multiple modalities.

📖
thuật ngữ

Multimodal Conditioned Generation

Process of creating content in a target modality based on conditions or constraints provided in other modalities.

📖
thuật ngữ

Intermediate Fusion

Multimodal integration strategy where modalities are merged at multiple intermediate levels of the neural network.

📖
thuật ngữ

Multimodal Transformers

Extension of the Transformer architecture capable of simultaneously processing sequences from different modalities with adapted attention mechanisms.

🔍

Không tìm thấy kết quả