BenchVibe AI Ecosystem

VIP 👤

🏠 Hem

Benchmarkar

📊 Alla benchmarkar 🦖 Dinosaur v1 🦖 Dinosaur v2 ✅ To-Do List-applikationer 🎨 Kreativa fria sidor 🎯 FSACB - Ultimata uppvisningen 🌍 Översättningsbenchmark

Modeller

🏆 Topp 10 modeller 🆓 Gratis modeller 📋 Alla modeller ⚙️ Kilo Code

Resurser

💬 Promptbibliotek 📖 AI-ordlista 🔗 Användbara länkar

📖

Batch Constrained Q-learning (BCQ)

Distribution Shift

Phenomenon where the distribution of state-actions visited by the learned policy significantly differs from the distribution of the offline dataset. This shift can lead to biased value estimates and degraded performance during deployment.

← Tillbaka