Mostrar el registro sencillo del ítem

dc.contributor.authorTriguero, Isaac
dc.contributor.authorPeralta, Daniel
dc.contributor.authorBacardit, Jaume
dc.contributor.authorGarcía López, Salvador 
dc.contributor.authorHerrera Triguero, Francisco 
dc.date.accessioned2021-01-21T07:59:12Z
dc.date.available2021-01-21T07:59:12Z
dc.date.issued2014-03-03
dc.identifier.citationPublished version: Triguero, I., Peralta, D., Bacardit, J., García, S., & Herrera, F. (2015). MRPR: a MapReduce solution for prototype reduction in big data classification. neurocomputing, 150, 331-345. [https://doi.org/10.1016/j.neucom.2014.04.078]es_ES
dc.identifier.urihttp://hdl.handle.net/10481/65872
dc.descriptionSupported by the Research Projects TIN2011-28488, P10-TIC-6858 and P11-TIC-7765. D. Peralta holds an FPU scholarship from the Spanish Ministry of Education and Science (FPU12/04902).es_ES
dc.description.abstractIn the era of big data, analyzing and extracting knowledge from large-scale data sets is a very interesting and challenging task. The application of standard data mining tools in such data sets is not straightforward. Hence, a new class of scalable mining method that embraces the huge storage and processing capacity of cloud platforms is required. In this work, we propose a novel distributed partitioning methodology for prototype reduction techniques in nearest neighbor classification. These methods aim at representing original training data sets as a reduced number of instances. Their main purposes are to speed up the classification process and reduce the storage requirements and sensitivity to noise of the nearest neighbor rule. However, the standard prototype reduction methods cannot cope with very large data sets. To overcome this limitation, we develop a MapReduce-based framework to distribute the functioning of these algorithms through a cluster of computing elements, proposing several algorithmic strategies to integrate multiple partial solutions (reduced sets of prototypes) into a single one. The proposed model enables prototype reduction algorithms to be applied over big data classification problems without significant accuracy loss. We test the speeding up capabilities of our model with data sets up to 5.7 millions of instances. The results show that this model is a suitable tool to enhance the performance of the nearest neighbor classifier with big data.es_ES
dc.description.sponsorshipGerman Research Foundation (DFG) FPU12/04902es_ES
dc.description.sponsorshipTIN2011-28488es_ES
dc.description.sponsorshipP10-TIC-6858es_ES
dc.description.sponsorshipP11-TIC-7765es_ES
dc.language.isoenges_ES
dc.publisherElsevieres_ES
dc.rightsAtribución-NoComercial-SinDerivadas 3.0 España*
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/es/*
dc.subjectBig Dataes_ES
dc.subjectMahoutes_ES
dc.subjectHadoopes_ES
dc.subjectPrototype reductiones_ES
dc.subjectPrototype generationes_ES
dc.subjectNearest neighbor classificationes_ES
dc.titleMRPR: A MapReduce Solution for Prototype Reduction in Big Data Classificationes_ES
dc.typeinfo:eu-repo/semantics/articlees_ES
dc.rights.accessRightsinfo:eu-repo/semantics/openAccesses_ES
dc.identifier.doi10.1016/j.neucom.2014.04.078
dc.type.hasVersioninfo:eu-repo/semantics/submittedVersiones_ES


Ficheros en el ítem

[PDF]

Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem

Atribución-NoComercial-SinDerivadas 3.0 España
Excepto si se señala otra cosa, la licencia del ítem se describe como Atribución-NoComercial-SinDerivadas 3.0 España