Prediction of a Function of Misclassified Binary Data
Prediction of a Function of Misclassified Binary Data
Author(s): Noriah M. Al-Kandari, Partha LahiriSubject(s): Economy
Published by: Główny Urząd Statystyczny
Keywords: binary classification;double sampling;finite population sampling;misclassification;linkage error;sampling design
Summary/Abstract: We consider the problem of predicting a function of misclassified binary variables.We make an interesting observation that the naive predictor, which ignores the misclassificationerrors, is unbiased even if the total misclassification error is high aslong as the probabilities of false positives and false negatives are identical. Otherthan this case, the bias of the naive predictor depends on the misclassification distributionand the magnitude of the bias can be high in certain cases. We correct thebias of the naive predictor using a double sampling idea where both inaccurate andaccurate measurements are taken on the binary variable for all the units of a sampledrawn from the original data using a probability sampling scheme. Using thisadditional information and design-based sample survey theory, we derive a biascorrectedpredictor. We examine the cases where the new bias-corrected predictorscan also improve over the naive predictor in terms of mean square error (MSE).
Journal: Statistics in Transition. New Series
- Issue Year: 17/2016
- Issue No: 3
- Page Range: 429-447
- Page Count: 20
- Language: English