🏠 Trang chủ
Benchmark
📊 Tất cả benchmark 🦖 Khủng long v1 🦖 Khủng long v2 ✅ Ứng dụng To-Do List 🎨 Trang tự do sáng tạo 🎯 FSACB - Trình diễn cuối cùng 🌍 Benchmark dịch thuật
Mô hình
🏆 Top 10 mô hình 🆓 Mô hình miễn phí 📋 Tất cả mô hình ⚙️ Kilo Code
Tài nguyên
💬 Thư viện prompt 📖 Thuật ngữ AI 🔗 Liên kết hữu ích

Thuật ngữ AI

Từ điển đầy đủ về Trí tuệ nhân tạo

162
danh mục
2.032
danh mục con
23.060
thuật ngữ
📖
thuật ngữ

Bandit Algorithm

Family of online learning algorithms where the agent must sequentially select actions with uncertain rewards to maximize cumulative gain.

📖
thuật ngữ

Follow the Leader (FTL)

Online optimization strategy where the algorithm chooses at each step the action that would have been optimal on the observed past data up to that point.

📖
thuật ngữ

Follow the Regularized Leader (FTRL)

Variant of FTL incorporating regularization to stabilize sequential decisions and guarantee better regret bounds in adversarial environments.

📖
thuật ngữ

Online Gradient Descent

Optimization algorithm that updates model parameters in the direction opposite to the gradient of the loss function computed on each new observation.

📖
thuật ngữ

Multiplicative Weights Update

Online optimization method that exponentially adjusts weights assigned to experts based on their past performance to combine their predictions.

📖
thuật ngữ

Expert Advice

Online learning framework where the algorithm must aggregate recommendations from multiple experts to minimize regret relative to the best expert.

📖
thuật ngữ

Online Convex Optimization

Mathematical theory studying sequential optimization of convex functions where loss functions are gradually revealed over time.

📖
thuật ngữ

Adversarial Online Learning

Online learning scenario where data is generated by a potentially malicious adversary seeking to maximize the algorithm's regret.

📖
thuật ngữ

Exploration-Exploitation Trade-off

Fundamental dilemma in online learning between exploring new actions to discover their rewards and exploiting actions known to be high-performing.

📖
thuật ngữ

Online Mirror Descent

Generalization of gradient descent using a Bregman function to project updates into a constrained space, offering superior flexibility in optimization.

📖
thuật ngữ

Learning with Partial Information

Paradigm where the algorithm only receives information about the chosen action (bandit) rather than all possible actions (full information).

📖
thuật ngữ

Adaptive Learning Rate

Mechanism dynamically adjusting the learning step based on local properties of the loss landscape to optimize convergence in non-stationary environments.

📖
thuật ngữ

Hedge Algorithm

Expert aggregation algorithm using multiplicative weight updates to guarantee a logarithmic regret bound relative to the best expert.

📖
thuật ngữ

Regret Bound

Theoretical upper limit on the cumulative regret an algorithm may suffer, allowing comparison and performance guarantees for online optimization methods.

📖
thuật ngữ

Stochastic Online Learning

Learning framework where data follows a fixed but unknown probability distribution, enabling performance guarantees in expectation rather than worst-case.

🔍

Không tìm thấy kết quả