🏠 Trang chủ
Benchmark
📊 Tất cả benchmark 🦖 Khủng long v1 🦖 Khủng long v2 ✅ Ứng dụng To-Do List 🎨 Trang tự do sáng tạo 🎯 FSACB - Trình diễn cuối cùng 🌍 Benchmark dịch thuật
Mô hình
🏆 Top 10 mô hình 🆓 Mô hình miễn phí 📋 Tất cả mô hình ⚙️ Kilo Code
Tài nguyên
💬 Thư viện prompt 📖 Thuật ngữ AI 🔗 Liên kết hữu ích

Thuật ngữ AI

Từ điển đầy đủ về Trí tuệ nhân tạo

162
danh mục
2.032
danh mục con
23.060
thuật ngữ
📖
thuật ngữ

Precision@K

Metric measuring the proportion of relevant items among the top K recommendations, essential for evaluating the quality of top-ranked results.

📖
thuật ngữ

Recall@K

Indicator calculating the ratio of relevant items actually present in the top K recommendations compared to the total available relevant items.

📖
thuật ngữ

Mean Average Precision (MAP)

Aggregated metric calculating the average of precisions at each relevant position, weighted by the rank of each relevant item in the recommendation list.

📖
thuật ngữ

NDCG (Normalized Discounted Cumulative Gain)

Normalized score evaluating ranking quality by penalizing relevant items placed far from the top of the list, ideal for recommendations with graded relevance.

📖
thuật ngữ

RMSE (Root Mean Square Error)

Root mean square error used to evaluate rating prediction accuracy by measuring the difference between predicted and actual values.

📖
thuật ngữ

Hit Rate (HR)

Percentage of sessions where at least one relevant item appears in the top N recommendations, measuring the overall effectiveness of the system.

📖
thuật ngữ

Catalog Coverage

Percentage of unique catalog items that can be recommended by the system, crucial to avoid concentration on a limited subset of items.

📖
thuật ngữ

Intra-List Diversity

Measure of average dissimilarity between items in the same recommendation list, essential to avoid redundancy and enhance user experience.

📖
thuật ngữ

Novelty

Degree of unknown of recommended items for the user, calculated as the inverse of their global popularity in the catalog.

📖
thuật ngữ

Serendipity

Ability of the system to recommend relevant but unexpected items that positively surprise the user beyond simple predictions.

📖
thuật ngữ

A/B Testing

Experimental methodology comparing the performance of two versions of the system on real user segments to measure business impact.

📖
thuật ngữ

Leave-One-Out Cross-Validation

Robust evaluation technique where each user interaction is alternately used as test data while others serve for training.

📖
thuật ngữ

Offline vs Online Evaluation

Dual approach evaluating performance on historical data (offline) and with real interactions (online) to validate the complete effectiveness of the system.

📖
thuật ngữ

Temporal Generalization

Ability of the system to maintain its performance on future data, evaluated sequentially on temporal splits rather than random ones.

📖
thuật ngữ

Business Metrics Correlation

Analysis of the relationship between algorithmic metrics (NDCG, Precision) and business indicators (conversion, retention) to validate business relevance.

📖
thuật ngữ

Cataract Metric

Composite score balancing precision, diversity, novelty, and coverage to holistically evaluate the overall quality of recommendations.

📖
thuật ngữ

Expected Reciprocal Rank (ERR)

Probabilistic model based on user behavior assuming cessation of examination after the first click, heavily weighting the first positions.

📖
thuật ngữ

User Coverage

Percentage of users for whom the system can generate recommendations, critical for measuring the universal applicability of the system.

📖
thuật ngữ

Fairness Metrics

Indicators evaluating the equity of recommendation distribution among different demographic groups to avoid algorithmic biases.

📖
thuật ngữ

Exposure Bias Measurement

Quantification of the exposure disparity between popular and long-tail items, essential for evaluating recommendation balance.

🔍

Không tìm thấy kết quả