A View on Fuzzy Systems for Big Data: Progress and Opportunities

Fernández Hilario, Alberto Luis; Carmona, Cristóbal José; Jesús Díaz, María José del; Herrera Triguero, Francisco

doi:10.1080/18756891.2016.1180820

Identifiers

URI: http://hdl.handle.net/10481/49267

DOI: 10.1080/18756891.2016.1180820

ISSN: 1875-6883

Publisher

Atlantis Press

Subject

Big data

Fuzzy rule based classification systems

Clustering

MapReduce

Hadoop

Spark

Flink

Date

2016

Bibliographic reference

Fernández Hilario, A.; et al. A View on Fuzzy Systems for Big Data: Progress and Opportunities. International Journal of Computational Intelligence Systems, 9(1): 69-80 (2016). [http://hdl.handle.net/10481/49267]

Sponsor

This work has been partially supported by the Spanish Ministry of Science and Technology under project TIN2014-57251-P; the Andalusian Research Plan P11-TIC-7765; and both the University of Jaén and Caja Rural Provincial de Jaén under project UJA2014/06/15.

Abstract

We are currently witnessing a growing trend in the study and application of problems in the framework of Big Data. This is mainly due to the great advantages that come from extracting knowledge from a high volume of information. For this reason, we observe a migration of standard Data Mining systems towards a new functional paradigm that allows working with Big Data. By means of the MapReduce model and its different extensions, scalability can be successfully addressed while maintaining good fault tolerance during the execution of the algorithms. Among the different approaches used in Data Mining, models based on fuzzy systems stand out for many applications. Among their advantages, we must stress the use of a representation close to natural language. Additionally, they use an inference model that adapts well to different scenarios, especially those with a given degree of uncertainty. Despite the success of this type of system, its migration to the Big Data environment in the different learning areas is still at a preliminary stage. In this paper, we give an overview of the main existing proposals on the topic, analyzing the design of these models. Additionally, we discuss the problems related to the data distribution and parallelization of the current algorithms, and their relationship with the fuzzy representation of the information. Finally, we provide our view on future expectations in this framework regarding the design of methods based on fuzzy sets, as well as the open challenges on the topic.

Collections

DCCIA - Articles

Except where otherwise noted, this item's license is described as Creative Commons Attribution-NonCommercial-NoDerivs 3.0 License