🏠 Trang chủ
Benchmark
📊 Tất cả benchmark 🦖 Khủng long v1 🦖 Khủng long v2 ✅ Ứng dụng To-Do List 🎨 Trang tự do sáng tạo 🎯 FSACB - Trình diễn cuối cùng 🌍 Benchmark dịch thuật
Mô hình
🏆 Top 10 mô hình 🆓 Mô hình miễn phí 📋 Tất cả mô hình ⚙️ Kilo Code
Tài nguyên
💬 Thư viện prompt 📖 Thuật ngữ AI 🔗 Liên kết hữu ích

Thuật ngữ AI

Từ điển đầy đủ về Trí tuệ nhân tạo

162
danh mục
2.032
danh mục con
23.060
thuật ngữ
📖
thuật ngữ

Vision-Language Pre-training

Self-supervised learning approach where models are pre-trained on large corpora of images and associated texts. Establishes fundamental mappings between visual concepts and linguistic descriptions before fine-tuning.

📖
thuật ngữ

Joint Representation Learning

Process of simultaneously learning shared features between multiple modalities to create a unified representation. Captures inter-modal correlations and complementarities in a single vector.

📖
thuật ngữ

Modal Fusion

Strategic integration of information from different modalities to create an enriched and coherent representation. Effectively combines the respective strengths of each modality in a unified output.

📖
thuật ngữ

Grounding

Process of associating abstract concepts (often textual) with concrete elements in another modality (typically visual). Establishes direct links between words and specific regions or objects in images.

📖
thuật ngữ

Alignment Loss

Loss function specifically designed to optimize semantic matching between elements of different modalities. Guides learning toward optimal alignment in the shared representation space.

📖
thuật ngữ

Semantic Consistency

Principle ensuring that multimodal representations preserve consistent meaning across different modalities. Ensures that semantically equivalent elements share similar characteristics.

📖
thuật ngữ

Multimodal Pre-training

Initialization phase of a multimodal model's weights on massive unannotated data. Develops fundamental alignment capabilities before adaptation to specific tasks.

📖
thuật ngữ

Modal Alignment Metrics

Quantitative indicators evaluating the quality of correspondence between representations of different modalities. Measure the accuracy and semantic consistency of learned alignments.

📖
thuật ngữ

Weakly Supervised Alignment

Learning approach using partial or noisy annotations to align modalities. Reduces dependency on labeled data while maintaining reasonable alignment performance.

📖
thuật ngữ

Self-supervised Multimodal Learning

Paradigm where the model automatically learns alignments by exploiting natural correlations between unannotated modalities. Generates intrinsic learning signals from the multimodal structure of the data.

🔍

Không tìm thấy kết quả