🏠 홈
벤치마크
📊 모든 벤치마크 🦖 공룡 v1 🦖 공룡 v2 ✅ 할 일 목록 앱 🎨 창의적인 자유 페이지 🎯 FSACB - 궁극의 쇼케이스 🌍 번역 벤치마크
모델
🏆 톱 10 모델 🆓 무료 모델 📋 모든 모델 ⚙️ 킬로 코드 모드
리소스
💬 프롬프트 라이브러리 📖 AI 용어 사전 🔗 유용한 링크

AI 용어집

인공지능 완전 사전

162
카테고리
2,032
하위 카테고리
23,060
용어
📖
용어

Active Reinforcement Learning

Hybrid methodology combining active learning and reinforcement learning principles to optimize sample selection for annotation.

📖
용어

Sample Selection Policy

Deterministic or stochastic strategy defining which data to request for annotation to maximize model improvement under budget constraints.

📖
용어

Reinforcement Learning Agent

Algorithmic entity that learns to make optimal sample selection decisions through interaction with the annotation environment.

📖
용어

Reward Function

Signal quantifying the utility of each sample selection action, typically based on model performance improvement.

📖
용어

State-Action-Value

Q(s,a) function estimating the expected cumulative reward when selecting action a from state s and following the optimal policy.

📖
용어

Deep Reinforcement Learning

Extension of reinforcement learning using deep neural networks to approximate value functions or policies.

📖
용어

Uncertainty-Based Active Learning

Strategy where the agent preferentially selects samples for which the model exhibits the highest predictive uncertainty.

📖
용어

Strategic Sample Selection

Optimized decision-making process aiming to identify data subsets maximizing information gain per annotation cost.

📖
용어

Off-Policy Reinforcement Learning

Method enabling the learning of an optimal policy while following a different behavior policy, useful for flexible exploration.

📖
용어

Online Reinforcement Learning

Paradigm where the agent learns and selects samples simultaneously during annotation, dynamically adapting its strategy.

📖
용어

Learning-Annotation Trade-off

Optimization of the balance between time spent on intelligent selection and potential gains in model performance.

📖
용어

Data Acquisition Strategy

Systematic action plan for identifying and collecting the most relevant data to annotate according to predefined criteria.

📖
용어

Multi-Agent Reinforcement Learning

Extension where multiple agents collaborate or compete to jointly optimize the sample selection strategy.

📖
용어

Active Q-Learning Algorithm

Variant of Q-learning adapted to active learning, where actions correspond to selecting samples to annotate.

📖
용어

Guided Exploration Policy

Exploration strategy oriented towards regions of the data space potentially most informative for the model.

📖
용어

Bayesian Reinforcement Learning

Method integrating uncertainty into value function estimation for more robust decision-making in sample selection.

🔍

결과를 찾을 수 없습니다