TY - JOUR AU - Maillo Hidalgo, Jesús AU - Triguero, Isaac AU - Herrera Triguero, Francisco PY - 2020 UR - http://hdl.handle.net/10481/62787 AB - It is recognized the importance of knowing the descriptive properties of a dataset when tackling a data science problem. Having information about the redundancy, complexity and density of a problem allows us to make decisions as to which data... LA - eng PB - Institute of Electrical and Electronics Engineers (IEEE) KW - Big data KW - Smart Data KW - Classification KW - Redundancy KW - Complexity KW - Apache spark TI - Redundancy and Complexity Metrics for Big Data Classification: Towards Smart Data DO - 10.1109/ACCESS.2020.2991800 ER -