
dc.contributor.author: García Gil, Diego Jesús
dc.contributor.author: Alcalde Barros, Alejandro
dc.contributor.author: Luengo Martín, Julián
dc.contributor.author: García López, Salvador
dc.contributor.author: Herrera Triguero, Francisco
dc.date.accessioned: 2020-12-02T11:07:28Z
dc.date.available: 2020-12-02T11:07:28Z
dc.date.issued: 2019
dc.identifier.citation: García-Gil, D., Alcalde-Barros, A., Luengo, J., García, S., & Herrera, F. (2019). Big Data Preprocessing as the Bridge between Big Data and Smart Data: BigDaPSpark and BigDaPFlink Libraries. In IoTBDS (pp. 324-331). [DOI: 10.5220/0007738503240331] [es_ES]
dc.identifier.uri: http://hdl.handle.net/10481/64590
dc.description.abstract: With the advent of Big Data, terabytes of data are generated and stored every second. This raw data is far from perfect: it contains many imperfections (noise, missing values, etc.) and is not suitable for analysis, as it would lead to wrong conclusions. Data preprocessing is the set of techniques devoted to polishing, cleaning, fixing, and improving that raw data. With this preprocessed data, we are able to find more patterns in it and to better explain the underlying distribution of the data. This is what is called Smart Data: raw data that has been preprocessed and is ready to be analyzed, data that contains valuable information that will lead to knowledge. In this work, we present two Big Data libraries for achieving Smart Data from Big Data, BigDaPSpark and BigDaPFlink. They are built on top of two Big Data frameworks, Apache Spark and Apache Flink. Both libraries contain a series of algorithms for Big Data preprocessing, ranging from noise cleaning to discretization and data reduction, among many others. Additionally, we illustrate the usage of the libraries with two use cases. [es_ES]
dc.description.sponsorship: Spanish National Research Project TIN2017-89517-P [es_ES]
dc.language.iso: eng [es_ES]
dc.publisher: SciTePress [es_ES]
dc.rights: Attribution-NonCommercial-NoDerivs 3.0 Spain [*]
dc.rights.uri: http://creativecommons.org/licenses/by-nc-nd/3.0/es/ [*]
dc.subject: Big Data [es_ES]
dc.subject: Apache Spark [es_ES]
dc.subject: Data Preprocessing [es_ES]
dc.subject: Smart Data [es_ES]
dc.subject: Imbalanced [es_ES]
dc.subject: Classification [es_ES]
dc.title: Big Data Preprocessing as the Bridge between Big Data and Smart Data: BigDaPSpark and BigDaPFlink Libraries [es_ES]
dc.type: info:eu-repo/semantics/article [es_ES]
dc.rights.accessRights: info:eu-repo/semantics/openAccess [es_ES]
dc.identifier.doi: 10.5220/0007738503240331
dc.type.hasVersion: info:eu-repo/semantics/publishedVersion [es_ES]



Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivs 3.0 Spain