Calibration and XGBoost reweighting to reduce coverage and non‑response biases in overlapping panel surveys: application to the Healthcare and Social Survey
Metadatos
Mostrar el registro completo del ítemAutor
Castro Martín, Luis; Rueda García, María Del Mar; Sánchez-Cantalejo, Carmen; Ferri García, Ramón; Cabrera León, AndrésEditorial
BioMed Central Ltd
Materia
Public health COVID-19 Panel surveys
Fecha
2024-02-15Referencia bibliográfica
Castro, L., Rueda, M., Sánchez-Cantalejo, C. et al. Calibration and XGBoost reweighting to reduce coverage and non-response biases in overlapping panel surveys: application to the Healthcare and Social Survey. BMC Med Res Methodol 24, 36 (2024). https://doi.org/10.1186/s12874-024-02171-z
Patrocinador
Spanish Ministerio de Ciencia and Innovation MCIN/AEI/10.13039/501100011033; Spanish Ministerio de Ciencia and Innovation PI20/00855; Ministerio de Economía y Competitividad [grant PID2019-106861RB-I00]; ESSA funding from the Andalusian Institute of Statistics and Cartography (IECA), the Andalusian School of Public Health (EASP), and the competitive tenders of the Santander Universities SUPERA COVID-19 Fund (SAUN), the Conference of Rectors of Spanish Universities (CRUE), and the Higher Council for Scientific Research (CSIC), in addition to the COVID Competitive Aid Program -19 from Pfizer Global Medical GrantsResumen
Background Surveys have been used worldwide to provide information on the COVID-19 pandemic impact so as to
prepare and deliver an effective Public Health response. Overlapping panel surveys allow longitudinal estimates
and more accurate cross-sectional estimates to be obtained thanks to the larger sample size. However, the problem
of non-response is particularly aggravated in the case of panel surveys due to population fatigue with repeated
surveys.
Objective To develop a new reweighting method for overlapping panel surveys affected by non-response.
Methods We chose the Healthcare and Social Survey which has an overlapping panel survey design with measurements
throughout 2020 and 2021, and random samplings stratified by province and degree of urbanization. Each
measurement comprises two samples: a longitudinal sample taken from previous measurements and a new sample
taken at each measurement.
Results Our reweighting methodological approach is the result of a two-step process: the original sampling design
weights are corrected by modelling non-response with respect to the longitudinal sample obtained in a previous
measurement using machine learning techniques, followed by calibration using the auxiliary information available
at the population level. It is applied to the estimation of totals, proportions, ratios, and differences between measurements,
and to gender gaps in the variable of self-perceived general health.
Conclusion The proposed method produces suitable estimators for both cross-sectional and longitudinal samples.
For addressing future health crises such as COVID-19, it is therefore necessary to reduce potential coverage and nonresponse
biases in surveys by means of utilizing reweighting techniques as proposed in this study.