🏠 홈
벤치마크
📊 모든 벤치마크 🦖 공룡 v1 🦖 공룡 v2 ✅ 할 일 목록 앱 🎨 창의적인 자유 페이지 🎯 FSACB - 궁극의 쇼케이스 🌍 번역 벤치마크
모델
🏆 톱 10 모델 🆓 무료 모델 📋 모든 모델 ⚙️ 킬로 코드 모드
리소스
💬 프롬프트 라이브러리 📖 AI 용어 사전 🔗 유용한 링크

AI 용어집

인공지능 완전 사전

162
카테고리
2,032
하위 카테고리
23,060
용어
📖
용어

Pareto Optimum

Optimal solution in a multi-objective context that cannot be improved on any objective without degrading performance on at least one other objective.

📖
용어

Multi-Objective Reinforcement Learning

Extension of reinforcement learning where the agent simultaneously optimizes multiple often conflicting objectives with vector reward functions.

📖
용어

Vector Reward Function

Function that returns a vector of rewards instead of a scalar value, allowing simultaneous consideration of multiple performance criteria.

📖
용어

Objective Weighting

Scalarization technique where each objective receives a weight to combine multiple rewards into a single scalarized value to optimize.

📖
용어

Linear Scalarization

Method that transforms a multi-objective problem into a scalar problem through linear weighted combination of objectives to generate different Pareto-optimal solutions.

📖
용어

Pareto Elitism

Strategy that preserves non-dominated solutions between generations to ensure convergence to the Pareto front in evolutionary algorithms.

📖
용어

Expected Vector Return

Generalization of expected return in reinforcement learning, calculating the expectation of future cumulative reward vectors for each policy.

📖
용어

Pareto-optimal Policy

Action policy whose vector return belongs to the Pareto front, representing an optimal trade-off between different objectives.

📖
용어

Convergence Pareto

Propriété d'un algorithme garantissant que les solutions générées tendent asymptotiquement vers le véritable front de Pareto du problème.

📖
용어

Agrégation Tchebychev

Méthode de scalarisation utilisant la norme Tchebychev pour combiner les objectifs, capable de générer toutes les solutions Pareto-optimales convexes et non-convexes.

🔍

결과를 찾을 수 없습니다