AI Glossary
A Complete Dictionary of Artificial Intelligence
Zero-sum game
Competitive situation where the total gain of participants is constant, with one agent's gains exactly corresponding to the losses of other agents.
Nash equilibrium
Game state where no agent can improve their strategy by unilaterally changing their actions, given that other agents' strategies remain fixed.
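This condition can be checked mechanically: a cell of a bimatrix game is a pure-strategy Nash equilibrium when neither player gains by deviating alone. A minimal sketch in Python; the payoff matrices below are a hypothetical prisoner's-dilemma instance chosen for illustration:

```python
# Sketch: find pure-strategy Nash equilibria of a 2-player bimatrix game.
# Rows index player 1's actions, columns index player 2's actions.

def pure_nash_equilibria(payoff1, payoff2):
    """Return (row, col) cells where neither player can improve by deviating alone."""
    n_rows, n_cols = len(payoff1), len(payoff1[0])
    equilibria = []
    for r in range(n_rows):
        for c in range(n_cols):
            # Player 1 cannot do better by switching rows...
            best_row = all(payoff1[r][c] >= payoff1[r2][c] for r2 in range(n_rows))
            # ...and player 2 cannot do better by switching columns.
            best_col = all(payoff2[r][c] >= payoff2[r][c2] for c2 in range(n_cols))
            if best_row and best_col:
                equilibria.append((r, c))
    return equilibria

# Prisoner's dilemma (illustrative payoffs): mutual defection (1, 1)
# is the unique pure-strategy equilibrium.
p1 = [[-1, -3], [0, -2]]
p2 = [[-1, 0], [-3, -2]]
print(pure_nash_equilibria(p1, p2))  # [(1, 1)]
```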
Minimax theorem
Fundamental result, due to von Neumann, establishing that every finite two-player zero-sum game has a value: the payoff a player can guarantee by maximizing their minimum possible gain equals the payoff the opponent can hold them to by minimizing that player's maximum gain.
Game tree
Tree structure representing all possible sequences of moves and resulting states of a game, used for decision analysis.
Alpha-beta pruning
Optimization technique eliminating unnecessary branches of the game tree by maintaining alpha and beta bounds on possible values.
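The bookkeeping described above can be sketched as follows. `alphabeta` returns the same root value as plain minimax while skipping sibling branches that cannot change the result; the tree values are illustrative:

```python
import math

def alphabeta(node, alpha=-math.inf, beta=math.inf, maximizing=True):
    """Minimax with alpha (best guaranteed for max) and beta (best for min) bounds."""
    if isinstance(node, (int, float)):  # leaf: terminal payoff
        return node
    if maximizing:
        value = -math.inf
        for child in node:
            value = max(value, alphabeta(child, alpha, beta, False))
            alpha = max(alpha, value)
            if alpha >= beta:  # remaining siblings cannot affect the root value
                break
        return value
    else:
        value = math.inf
        for child in node:
            value = min(value, alphabeta(child, alpha, beta, True))
            beta = min(beta, value)
            if alpha >= beta:  # cut off: min already falls below max's guarantee
                break
        return value

tree = [[3, 12, 8], [2, 4, 6], [14, 5, 2]]
print(alphabeta(tree))  # 3, same as plain minimax but with fewer leaves visited
```

On this tree, once the second min node yields 2 (below the alpha bound of 3 secured by the first branch), its remaining leaves are never examined.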
Pure strategy
Deterministic strategy in which an agent always chooses the same action in a given situation, without any randomization.
Mixed strategy
Probabilistic approach where an agent randomly selects among several actions according to a predefined probability distribution.
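Playing a mixed strategy amounts to sampling an action from its distribution each round. A minimal sketch using the standard library; the matching-pennies setting and the 50/50 distribution are illustrative:

```python
import random

def sample_mixed_strategy(actions, probs, rng=random):
    """Draw one action according to the strategy's probability distribution."""
    return rng.choices(actions, weights=probs, k=1)[0]

# Matching pennies: the equilibrium mixed strategy plays each side
# with probability 1/2.
random.seed(0)
plays = [sample_mixed_strategy(["heads", "tails"], [0.5, 0.5])
         for _ in range(1000)]
print(plays.count("heads") / 1000)  # empirical frequency, close to 0.5
```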
Payoff matrix
Two-dimensional table representing each agent's payoffs for all possible combinations of players' actions.
Saddle point
Equilibrium position in the payoff matrix where the value is simultaneously the maximum of its column and the minimum of its row.
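The definition translates directly into a search over the matrix. A minimal sketch; the example matrix (row player's payoffs) is hypothetical:

```python
def saddle_points(matrix):
    """Cells that are the minimum of their row and the maximum of their column."""
    points = []
    for r, row in enumerate(matrix):
        for c, v in enumerate(row):
            column = [matrix[i][c] for i in range(len(matrix))]
            if v == min(row) and v == max(column):
                points.append((r, c))
    return points

# Row player's payoffs; the cell (1, 1) holds the game's value, 5:
# it is the smallest entry of its row and the largest of its column.
m = [[4, 2, 7],
     [6, 5, 8],
     [3, 1, 9]]
print(saddle_points(m))  # [(1, 1)]
```

When a saddle point exists, the corresponding pure strategies are optimal for both players and the cell's value is the value of the game.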
Duality in game theory
Mathematical relationship between a player's maximization problem and their opponent's minimization problem in zero-sum games.
Backpropagation evaluation
Method of propagating values from the leaves to the root of the game tree to assess the quality of each position.
Iterative deepening search
Algorithm that explores the game tree to increasing depth limits, one level per iteration, so that a best-move-so-far is always available when the time budget expires.
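A minimal sketch of the scheme, reusing the nested-list tree representation. The zero returned at the depth cutoff stands in for a heuristic evaluation and is an assumption of this sketch, as are the toy tree and the time budget:

```python
import time

def tree_height(node):
    """Number of plies from this node down to the deepest leaf."""
    if isinstance(node, (int, float)):
        return 0
    return 1 + max(tree_height(child) for child in node)

def depth_limited(node, depth, maximizing=True):
    """Minimax cut off at `depth`; unexpanded subtrees get a placeholder estimate."""
    if isinstance(node, (int, float)):
        return node
    if depth == 0:
        return 0  # hypothetical heuristic value for an unexplored subtree
    values = [depth_limited(child, depth - 1, not maximizing) for child in node]
    return max(values) if maximizing else min(values)

def iterative_deepening(node, time_budget=0.05):
    """Deepen one ply per pass, so a valid answer exists whenever time runs out."""
    deadline = time.monotonic() + time_budget
    depth = 1
    best = depth_limited(node, depth)
    while time.monotonic() < deadline and depth < tree_height(node):
        depth += 1
        best = depth_limited(node, depth)  # deeper pass replaces the estimate
    return best, depth

tree = [[3, 12, 8], [2, 4, 6], [14, 5, 2]]
print(iterative_deepening(tree))  # (3, 2) once the full height is reached
```

Because shallow passes revisit only a small fraction of the nodes the deepest pass explores, the repeated work costs little relative to the anytime guarantee it buys.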
Heuristic evaluation function
Function that estimates the value of a game position without completely exploring the tree, used when exhaustive search is impossible.
Value of a state
Quantitative measure representing the optimal expected payoff from a given state assuming perfect play by all agents.
Adversarial learning
Process where agents mutually improve by competing against each other, with each agent developing strategies to counter the others.
Extensive form
Detailed representation of a game including the temporal sequence of decisions, the information available at each stage, and the final payoffs.
Expected utility theory
Theoretical framework evaluating decisions based on outcomes weighted by their probabilities in a competitive context.
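The computation itself is a probability-weighted sum of utilities. A minimal sketch; the lottery's probabilities and utilities are hypothetical:

```python
def expected_utility(outcomes):
    """Weight each outcome's utility by its probability and sum the results."""
    return sum(prob * utility for prob, utility in outcomes)

# Hypothetical gamble: 30% chance of utility 10, 70% chance of utility -2.
lottery = [(0.3, 10.0), (0.7, -2.0)]
print(expected_utility(lottery))  # approximately 1.6
```

An agent comparing two risky options under this framework simply prefers the one with the larger expected utility.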
Subgame-perfect equilibrium
Strategy constituting a Nash equilibrium for every subgame of the extensive form, ensuring temporal consistency of decisions.
Perfect information
Condition where each agent knows the complete history of previous actions and the current state of the game before making their decision.
Backward induction principle
Method for solving sequential games by analyzing decisions from the end of the game towards the beginning to determine optimal strategies.