On the Use of Gradient Boosting Methods to Improve the Estimation with Data Obtained with Self-Selection Procedures Castro Martín, Luis Rueda García, María Del Mar Ferri García, Ramón Hernando Tamayo, César Nonprobability surveys Machine learning techniques Propensity score adjustment Survey sampling In the last years, web surveys have established themselves as one of the main methods in empirical research. However, the effect of coverage and selection bias in such surveys has undercut their utility for statistical inference in finite populations. To compensate for these biases, researchers have employed a variety of statistical techniques to adjust nonprobability samples so that they more closely match the population. In this study, we test the potential of the XGBoost algorithm in the most important methods for estimation that integrate data from a probability survey and a nonprobability survey. At the same time, a comparison is made of the effectiveness of these methods for the elimination of biases. The results show that the four proposed estimators based on gradient boosting frameworks can improve survey representativity with respect to other classic prediction methods. The proposed methodology is also used to analyze a real nonprobability survey sample on the social effects of COVID-19. 2021-11-23T10:11:54Z 2021-11-23T10:11:54Z 2021-11-23 journal article Castro-Martín, L.; Rueda, M.d.M.; Ferri-García, R.; Hernando-Tamayo, C. On the Use of Gradient Boosting Methods to Improve the Estimation with Data Obtained with Self-Selection Procedures. Mathematics 2021, 9, 2991. https://doi.org/10.3390/math9232991 http://hdl.handle.net/10481/71686 10.3390/math9232991 eng http://creativecommons.org/licenses/by-nc-nd/3.0/es/ open access Atribución-NoComercial-SinDerivadas 3.0 España MDPI