TY - GEN AU - García Gil, Diego Jesús AU - Alcalde Barros, Alejandro AU - Luengo Martín, Julián AU - García López, Salvador AU - Herrera Triguero, Francisco PY - 2019 UR - http://hdl.handle.net/10481/64590 AB - With the advent of Big Data, terabytes of data are generated and stored every second. This raw data is far from being perfect, it contains many imperfections (noise, missing values, etc.) and is not suitable for analysis, as it will led to wrong... LA - eng PB - ScitePress KW - Big Data KW - Apache spark KW - Data Preprocessing KW - Smart Data KW - Imbalanced KW - Classification TI - Big Data Preprocessing as the Bridge between Big Data and Smart Data: BigDaPSpark and BigDaPFlink Libraries DO - 10.5220/0007738503240331 ER -