
dc.contributor.author: Sáez Muñoz, José Antonio
dc.contributor.author: Romero Béjar, José Luis
dc.date.accessioned: 2022-09-12T10:40:45Z
dc.date.available: 2022-09-12T10:40:45Z
dc.date.issued: 2022-07-21
dc.identifier.citation: Sáez, J.A.; Romero-Béjar, J.L. Impact of Regressand Stratification in Dataset Shift Caused by Cross-Validation. Mathematics 2022, 10, 2538. [https://doi.org/10.3390/math10142538] [lang: es_ES]
dc.identifier.uri: http://hdl.handle.net/10481/76646
dc.description.abstract: Data that have not been modeled cannot be correctly predicted. Under this assumption, this research studies how k-fold cross-validation can introduce dataset shift in regression problems. Such shift means that the data distributions of the training and test sets differ, which degrades the estimation of model performance. Although stratification of the output variable is widely used in classification to reduce the impact of dataset shift induced by cross-validation, its use in regression is not widespread in the literature. This paper analyzes how including different regressand stratification schemes in cross-validation affects dataset shift in regression data. The results show that these schemes produce more similar training and test sets, reducing the dataset shift related to cross-validation. The bias and deviation of the performance estimates obtained by regression algorithms improve when the largest numbers of strata are used, as does the number of cross-validation repetitions needed to obtain these better results. [lang: es_ES]
dc.description.sponsorship: MCIU/AEI/ERDF, UE PGC2018098860-B-I00 [lang: es_ES]
dc.description.sponsorship: ERDF Operational Programme 2014-2020 [lang: es_ES]
dc.description.sponsorship: Economy and Knowledge Council of the Regional Government of Andalusia, Spain MCIN/AEI CEX2020-001105-M A-FQM-345-UGR18 [lang: es_ES]
dc.language.iso: eng [lang: es_ES]
dc.publisher: MDPI [lang: es_ES]
dc.rights: Attribution 4.0 International [lang: *]
dc.rights.uri: http://creativecommons.org/licenses/by/4.0/ [lang: *]
dc.subject: Cross-validation [lang: es_ES]
dc.subject: Dataset shift [lang: es_ES]
dc.subject: Target shift [lang: es_ES]
dc.subject: Stratification [lang: es_ES]
dc.subject: Regression [lang: es_ES]
dc.title: Impact of Regressand Stratification in Dataset Shift Caused by Cross-Validation [lang: es_ES]
dc.type: info:eu-repo/semantics/article [lang: es_ES]
dc.rights.accessRights: info:eu-repo/semantics/openAccess [lang: es_ES]
dc.identifier.doi: 10.3390/math10142538
dc.type.hasVersion: info:eu-repo/semantics/publishedVersion [lang: es_ES]
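
As an illustration of the regressand stratification mentioned in the abstract, the following Python sketch shows one simple way to build cross-validation folds that each cover the whole range of the output variable. It is not taken from the article: the function name `stratified_regression_folds`, the equal-frequency binning of the target, and the parameters `n_folds`, `n_strata` and `seed` are all assumptions made for this example, and the schemes evaluated by the authors may differ.

```python
import numpy as np

def stratified_regression_folds(y, n_folds=5, n_strata=10, seed=0):
    """Assign each sample to a fold so that every fold spans the range of y.

    Hypothetical helper (not from the article): the target is cut into
    equal-frequency strata, and the members of each stratum are dealt
    across the folds in random order.
    """
    y = np.asarray(y, dtype=float)
    rng = np.random.default_rng(seed)
    order = np.argsort(y, kind="stable")      # sample indices sorted by target
    strata = np.array_split(order, n_strata)  # equal-frequency strata
    fold_of = np.empty(len(y), dtype=int)
    offset = 0
    for stratum in strata:
        shuffled = rng.permutation(stratum)
        # Round-robin assignment; the running offset keeps fold sizes balanced.
        fold_of[shuffled] = (np.arange(len(shuffled)) + offset) % n_folds
        offset += len(shuffled)
    return fold_of

# Usage on synthetic, skewed targets (assumed data, for illustration only).
rng = np.random.default_rng(42)
y_demo = rng.lognormal(size=200)
folds = stratified_regression_folds(y_demo, n_folds=5, n_strata=20)
for f in range(5):
    test = folds == f
    # Compare the target mean of each test fold with its training complement.
    print(f, int(test.sum()), y_demo[test].mean(), y_demo[~test].mean())
```

With more strata, each fold receives samples from narrower slices of the target range, so the target distributions of the training and test partitions tend to be closer; with `n_strata=1` the function reduces to an ordinary shuffled k-fold split.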


Files in this item

[PDF]

This item appears in the following collection(s)


Except where otherwise noted, this item's license is described as Attribution 4.0 International