AI 용어집
인공지능 완전 사전
Missing Completely At Random (MCAR)
Mechanism where the probability that an observation is missing depends neither on the observed data nor on the missing data. The missing data represent a simple random subset of the complete dataset.
Missing At Random (MAR)
Mechanism where the probability of missing data depends only on the observed values but not on the missing values themselves. This pattern allows for correction by imputation conditional on the observed variables.
Missing Not At Random (MNAR)
Mechanism where the probability of missing data depends directly on the missing values themselves. Standard imputation methods can introduce significant bias in this case.
Little's MCAR Test
Statistical hypothesis test with null hypothesis that the data are missing completely at random (MCAR). Based on comparing means and covariances between complete and incomplete cases.
Pattern Matrix
Binary matrix indicating the presence (1) or absence (0) of data for each observation and variable. Allows for visual identification of complex missing data structures.
Missing Data Mechanism
Underlying process that generates missing data in a dataset. Includes three main mechanisms: MCAR, MAR and MNAR, each requiring different treatment approaches.
Complete Case Analysis
Analysis method using only observations without any missing values. Simple to implement but can lead to significant data loss and biases if MCAR is not verified.
Available Case Analysis
Approach using all available data for each statistical calculation, allowing different sample sizes according to variables. Includes pairwise and listwise deletion methods.
Monotone Missing Pattern
Structure where if a variable is missing for an observation, all subsequent variables in a predefined order are also missing. Greatly simplifies multiple imputation methods.
Missingness Correlation
Measure of the association between missing data patterns of different variables. A strong correlation may indicate an MAR mechanism or suggest structural relationships in the data.
Missing Data Visualization
Set of graphical techniques (heatmaps, barplots, pattern plots) to explore and communicate the structure and extent of missing data. Essential for preliminary diagnosis.
Response Rate Analysis
Systematic evaluation of response rates by variable, by group, or over time. Allows identification of factors associated with non-response and detection of potential biases.
Missing Data Diagnostics
Comprehensive process of evaluating the characteristics, patterns, and mechanisms of missing data before imputation. Combines statistical analyses and visualizations to guide appropriate treatment.
Arbitrary Missing Pattern
Missing data structure without particular organization where absences can occur anywhere in the dataset. Requires more sophisticated imputation methods like MICE.
Missing Data Profile
Synthetic report describing the distribution, patterns, and characteristics of missing data. Includes descriptive statistics and visualizations for overall assessment.