AI Glossary

The complete glossary of Artificial Intelligence

162 categories, 2,032 subcategories, 23,060 terms

LinUCB

Contextual bandit algorithm that models each arm's expected reward as a linear function of the context (fit by ridge regression) and adds an Upper Confidence Bound to that estimate to balance exploration and exploitation in continuous context spaces.
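
As an illustration only, the sketch below shows one common way to realize this idea: a "disjoint" variant with a separate ridge-regression model per arm. The class name `LinUCB`, the method names, and the default `alpha=1.0` are assumptions made for this example, not part of the glossary entry. The inverse matrix is maintained with the Sherman-Morrison identity so that no linear-algebra library is needed.

```python
import math


class LinUCB:
    """Illustrative disjoint LinUCB: one ridge-regression model per arm."""

    def __init__(self, n_arms, n_features, alpha=1.0):
        self.alpha = alpha  # exploration coefficient (assumed default)
        d = n_features
        # A_inv[a] is the inverse of (I + sum of x x^T seen for arm a).
        self.A_inv = [
            [[1.0 if i == j else 0.0 for j in range(d)] for i in range(d)]
            for _ in range(n_arms)
        ]
        # b[a] is the sum of reward * x seen for arm a.
        self.b = [[0.0] * d for _ in range(n_arms)]

    @staticmethod
    def _matvec(M, v):
        return [sum(m * x for m, x in zip(row, v)) for row in M]

    def score(self, arm, x):
        """Estimated reward plus confidence width for one arm."""
        theta = self._matvec(self.A_inv[arm], self.b[arm])
        a_inv_x = self._matvec(self.A_inv[arm], x)
        mean = sum(t * xi for t, xi in zip(theta, x))
        width = math.sqrt(sum(xi * v for xi, v in zip(x, a_inv_x)))
        return mean + self.alpha * width

    def select(self, x):
        """Pick the arm with the highest upper confidence bound."""
        scores = [self.score(a, x) for a in range(len(self.b))]
        return max(range(len(scores)), key=scores.__getitem__)

    def update(self, arm, x, reward):
        """Rank-one update of the chosen arm's model."""
        A_inv = self.A_inv[arm]
        u = self._matvec(A_inv, x)  # A^-1 x (A^-1 is symmetric)
        denom = 1.0 + sum(xi * ui for xi, ui in zip(x, u))
        # Sherman-Morrison: (A + x x^T)^-1 = A^-1 - u u^T / (1 + x^T u)
        for i in range(len(x)):
            for j in range(len(x)):
                A_inv[i][j] -= u[i] * u[j] / denom
        self.b[arm] = [bi + reward * xi for bi, xi in zip(self.b[arm], x)]
```

In a loop, one would call `select(x)` for each incoming context, observe the reward of the chosen arm, and pass it to `update`. A larger `alpha` widens the confidence bonus and therefore explores more.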

Regret

Performance measure quantifying the difference between the optimal cumulative reward and that obtained by the algorithm, essential for evaluating the effectiveness of contextual bandit strategies.
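
A minimal sketch of how this quantity could be tracked in a simulation, where the reward of the best action is known for every round. The function name `cumulative_regret` and its two arguments are hypothetical; in a real deployment the optimal reward is usually unknown and regret is studied in expectation.

```python
def cumulative_regret(optimal_rewards, obtained_rewards):
    """Return the running regret after each round.

    optimal_rewards[t]  -- reward the best action would have earned in round t
    obtained_rewards[t] -- reward the algorithm actually earned in round t
    """
    total = 0.0
    curve = []
    for best, got in zip(optimal_rewards, obtained_rewards):
        total += best - got
        curve.append(total)
    return curve
```

A curve that flattens over time suggests the algorithm has stopped making costly mistakes, whereas a curve that keeps growing linearly suggests it is not learning.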

Context

Set of observable features that influence the optimal decision at a given time, serving as the basis for personalized action selection in contextual bandits.

Off-policy Evaluation

Evaluation technique that estimates the performance of a new policy using data collected by an existing policy, without requiring direct deployment.
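
One widely used estimator of this kind is inverse propensity scoring (IPS), sketched below as an example. It assumes each logged record stores the probability with which the existing policy chose the logged action. The function name `ips_estimate`, the record layout, and the `target_prob` callback are assumptions made for this illustration.

```python
def ips_estimate(logs, target_prob):
    """Inverse propensity scoring estimate of a new policy's average reward.

    logs        -- iterable of (context, action, reward, logging_prob) tuples,
                   where logging_prob is the probability the existing policy
                   assigned to the logged action (must be > 0)
    target_prob -- function (context, action) -> probability that the new
                   policy would choose that action in that context
    """
    total = 0.0
    n = 0
    for context, action, reward, logging_prob in logs:
        # Reweight each logged reward by how much more (or less) likely
        # the new policy is to take the logged action.
        weight = target_prob(context, action) / logging_prob
        total += weight * reward
        n += 1
    if n == 0:
        raise ValueError("no logged interactions to evaluate on")
    return total / n
```

The estimate is unbiased only if the logging policy gives nonzero probability to every action the new policy might take, and its variance can be high when the logged probabilities are small.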

Hyperparameters

Configuration parameters of contextual bandit algorithms (such as the exploration coefficient or minibatch size) that influence convergence and performance.

Binary Reward

Type of feedback in contextual bandits where the outcome is limited to success (1) or failure (0), common in recommendation and advertising applications.

Logistic Bandit

Contextual bandit variant using logistic regression to model the probability of binary reward based on context, particularly suited to classification problems.

Neural Bandit

Contextual bandit approach using neural networks to model the complex relationship between context and reward, capable of capturing nonlinearities in the data.

Policy Gradient

Direct policy optimization method in contextual bandits that adjusts parameters to directly maximize expected reward rather than first estimating values.
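
As a rough illustration, the sketch below applies a REINFORCE-style update to a softmax policy whose action scores are linear in the context. The function names, the learning rate `lr=0.1`, and the `reward_fn` callback are assumptions for this example. No baseline is subtracted, which a practical implementation would usually add to reduce variance.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    z = sum(exps)
    return [e / z for e in exps]


def policy_gradient_step(weights, x, reward_fn, lr=0.1, rng=random):
    """One REINFORCE update for a linear-softmax policy (modifies weights).

    weights   -- list of per-action weight vectors (lists of floats)
    x         -- context feature vector
    reward_fn -- hypothetical environment callback: (x, action) -> reward
    """
    scores = [sum(w_i * x_i for w_i, x_i in zip(w, x)) for w in weights]
    probs = softmax(scores)
    # Sample an action from the current policy and observe its reward.
    action = rng.choices(range(len(weights)), weights=probs)[0]
    reward = reward_fn(x, action)
    # Gradient of log pi(action | x) with respect to weights[a] is
    # (1[a == action] - probs[a]) * x; step along reward * gradient.
    for a, w in enumerate(weights):
        coeff = (1.0 if a == action else 0.0) - probs[a]
        for i, x_i in enumerate(x):
            w[i] += lr * reward * coeff * x_i
    return action, reward
```

Repeated over many rounds, the update shifts probability toward actions that earned reward, without ever fitting an explicit value estimate for each action.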

Contextual UCB

Family of algorithms that combine UCB principles with contextual reward models and come with theoretical upper bounds on regret.
