Statistical Inference in the Presence of Imputed Survey Data Through Regression Trees and Random Forests

When and Where

Thursday, September 28, 2023 3:30 pm to 4:30 pm

9014

Ontario Power Generation

700 University Ave, Toronto, ON M5G 1Z5

Speakers

David Haziza, University of Ottawa

Description

In recent years, machine learning procedures have attracted much attention in National Statistical Offices. Item nonresponse in surveys is usually handled through some form of single imputation, and random forests are currently being scrutinized as an alternative to traditional imputation procedures because they provide flexible tools for obtaining a set of imputed values. As non-parametric methods, random forests can capture nonlinear trends in the data and tend to be robust to the omission of interactions or of predictors that account for curvature. In this presentation, we will discuss the properties of imputed estimators based on random forests. To the best of our knowledge, the question of how to estimate the variance while accounting for both sampling and nonresponse has not been addressed in the literature. We propose a novel variance estimator based on the so-called reverse approach to variance estimation. We will present the results of a simulation study that assesses the proposed methods in terms of bias and efficiency. Finally, we will discuss the choice of hyper-parameters.

Co-authors: Mehdi Dagdoug (McGill University) and Camelia Goga (Université de Bourgogne Franche Comté).

Please join the event.

About David Haziza

David Haziza is a Professor in the Department of Mathematics and Statistics at the University of Ottawa. His research interests include statistical inference in the presence of missing data and influential units, resampling methods, and machine learning methods.
