
AI Glossary

The complete glossary of artificial intelligence

162 categories
2,032 subcategories
23,060 terms
Subcategories

Deep Q-Networks (DQN)

Pioneering algorithm that combines Q-learning with deep neural networks to approximate the Q-value function in large or high-dimensional state spaces.

18 terms
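
As a rough illustration of the Q-learning update that DQN builds on, the sketch below computes a one-step temporal-difference target and a squared-error loss. The names `td_target` and `td_loss` are hypothetical, and plain Python lists stand in for the outputs of a neural network; a real DQN would also use a replay buffer and a separate target network.

```python
def td_target(reward, next_q_values, gamma=0.99, done=False):
    """One-step Q-learning target: r + gamma * max_a' Q(s', a').

    next_q_values is a list of Q-value estimates for the next state
    (assumed here to come from a target network).
    """
    if done:
        # No bootstrapping past a terminal state.
        return reward
    return reward + gamma * max(next_q_values)


def td_loss(q_value, target):
    """Squared TD error for a single transition."""
    return 0.5 * (q_value - target) ** 2


if __name__ == "__main__":
    y = td_target(reward=1.0, next_q_values=[0.5, 2.0], gamma=0.9)
    print(y, td_loss(q_value=2.0, target=y))
```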

Policy Gradient Methods

Reinforcement learning approaches that directly optimize the policy by following the gradient of the expected return with respect to the policy parameters.

18 terms
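
To make the idea of following the gradient concrete, here is a minimal sketch of a REINFORCE-style update for a softmax policy over a few discrete actions. The function names are hypothetical, and the policy is just a list of logits rather than a neural network.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def reinforce_gradient(logits, action, ret):
    """Score-function gradient: ret * d/dlogits log pi(action).

    For a softmax policy, d log pi(a) / d logit_i = 1[i == a] - pi(i).
    """
    probs = softmax(logits)
    return [((1.0 if i == action else 0.0) - p) * ret
            for i, p in enumerate(probs)]


def ascent_step(logits, grad, lr=0.1):
    """Gradient ascent on the expected return."""
    return [x + lr * g for x, g in zip(logits, grad)]


if __name__ == "__main__":
    logits = [0.0, 0.0]
    grad = reinforce_gradient(logits, action=0, ret=2.0)
    print(softmax(ascent_step(logits, grad)))
```

With a positive return, the step raises the probability of the sampled action; a negative return would lower it.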

Actor-Critic Methods

Hybrid architecture combining an actor that learns the policy and a critic that evaluates the value of states or actions.

8 terms

Deep Deterministic Policy Gradient (DDPG)

Off-policy actor-critic algorithm for environments with continuous action spaces using deep neural networks.

9 terms

Proximal Policy Optimization (PPO)

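Policy optimization method that keeps each update close to the previous policy, typically through a clipped surrogate objective that approximates a trust-region constraint, to improve learning stability.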

11 terms
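
One common way PPO limits the size of a policy update is the clipped surrogate objective. The sketch below evaluates it for a single sample; `clipped_surrogate` is a hypothetical name, `ratio` is assumed to be the new-to-old policy probability ratio for the sampled action, and `eps=0.2` is only an illustrative default.

```python
def clipped_surrogate(ratio, advantage, eps=0.2):
    """Per-sample PPO clipped objective.

    Returns min(ratio * A, clip(ratio, 1 - eps, 1 + eps) * A), so the
    objective stops rewarding moves of the ratio outside [1 - eps, 1 + eps].
    """
    clipped_ratio = max(1.0 - eps, min(ratio, 1.0 + eps))
    return min(ratio * advantage, clipped_ratio * advantage)


if __name__ == "__main__":
    # Positive advantage: gains are capped once the ratio exceeds 1 + eps.
    print(clipped_surrogate(1.5, 1.0))
    # Negative advantage: the pessimistic (lower) value is kept.
    print(clipped_surrogate(0.5, -1.0))
```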

Trust Region Policy Optimization (TRPO)

Constrained optimization algorithm that bounds the KL divergence between the new and old policies so that each update does not deviate too far from the previous policy.

8 terms

Multi-Agent Deep RL

Extension of deep RL in which multiple agents learn simultaneously, cooperating or competing in a shared environment.

20 terms

Hierarchical Reinforcement Learning

Approach that structures learning into hierarchical levels, with meta-policies controlling specialized sub-policies.

20 terms

Model-Based Deep RL

Technique in which the agent learns a model of the environment and uses it to plan and make more efficient decisions.

19 terms

Distributional RL

Paradigm that learns the full distribution of returns rather than only their expectation, which can improve robustness.

18 terms
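
A small sketch can show why the full return distribution carries more information than its expectation. It assumes a categorical representation, with a fixed list of return values (`atoms`) and their probabilities; the function name and the numbers are purely illustrative.

```python
def mean_and_variance(atoms, probs):
    """Mean and variance of a categorical return distribution.

    atoms: possible return values; probs: their probabilities (sum to 1).
    """
    mean = sum(z * p for z, p in zip(atoms, probs))
    var = sum(p * (z - mean) ** 2 for z, p in zip(atoms, probs))
    return mean, var


if __name__ == "__main__":
    # Two hypothetical actions with the same expected return but very
    # different risk; an expectation-only method cannot tell them apart.
    safe = mean_and_variance([1.0], [1.0])
    risky = mean_and_variance([-9.0, 11.0], [0.5, 0.5])
    print(safe, risky)
```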

Curiosity-Driven RL

Approach in which the agent receives intrinsic, curiosity-based rewards that encourage it to explore the environment efficiently.

16 terms

Meta-Learning in RL

Technique that allows agents to learn how to learn, so that they adapt quickly to new tasks from only a few experiences.

18 terms