Events

Maison des Sciences Humaines (MSH), Belval/Esch-sur-Alzette
13 July 2017

Models for Imputing Missing Data, including methods for assessing sensitivity of conclusions to them

LISER and the Luxembourg Statistical Society have the pleasure to host a Keynote by Prof. Donald Rubin, Professor of Statistics at the Harvard University.

Prof. Rubin’s research interests lie on causal inference in experiments and observational studies, inference in sample surveys with nonresponse and in missing data problems as well as in application of Bayesian techniques.

Abstract:
There are two relatively standard approaches for dealing with missing data in statistics, one based on “selection models” and one based on “pattern-mixture" models.  The former is focused on formulating a model for complete data and then effectively imputing missing data so that the combined observed and missing data fit the assumed model for the complete data.  In contrast, the latter effectively fits a different model for each pattern of observed and missing data, thereby directly revealing sensitivity of conclusions to assumptions about distributions for which there are no actual observed data available for estimation.  A third class of models, which have remained mostly recondite, is  based on “Gibbs” factorizations; although these may not imply a valid joint distribution, they  have enjoyed success in applications because of their ease of use when implemented by MCMC computer software for multiple imputation, such as in SAS, STATA, and MICE.  The consideration of sensitivity of conclusions to assumptions unassailable by observed data, whether implicit, as with selection models, or explicit, as with pattern-mixture models, is a critical ingredient of satisfactory analyses of data sets with missing values.  Graphical displays, such as “enhanced tipping point analyses” implemented using modern computing, are critical ingredients for this enterprise.