Mostrar el registro sencillo del ítem

dc.contributor.authorFernández Basso, Carlos Jesús 
dc.contributor.authorRuiz Jiménez, María Dolores 
dc.contributor.authorMartín Bautista, María José 
dc.date.accessioned2023-05-31T07:21:48Z
dc.date.available2023-05-31T07:21:48Z
dc.date.issued2023-04-30
dc.identifier.citationFernandez-Basso, C. et al. New Spark solutions for distributed frequent itemset and association rule mining algorithms. Cluster Computing. [https://doi.org/10.1007/s10586-023-04014-w]es_ES
dc.identifier.urihttps://hdl.handle.net/10481/82042
dc.descriptionFunding for open access publishing: Universidad de Gran- ada/CBUA. The research reported in this paper was partially sup- ported by the BIGDATAMED project, which has received funding from the Andalusian Government (Junta de Andalucı ́a) under grant agreement No P18-RT-1765, by Grants PID2021-123960OB-I00 and Grant TED2021-129402B-C21 funded by Ministerio de Ciencia e Innovacio ́n and, by ERDF A way of making Europe and by the European Union NextGenerationEU. In addition, this work has been partially supported by the Ministry of Universities through the EU- funded Margarita Salas programme NextGenerationEU. Funding for open access charge: Universidad de Granada/CBUAes_ES
dc.description.abstractThe large amount of data generated every day makes necessary the re-implementation of new methods capable of handle with massive data efficiently. This is the case of Association Rules, an unsupervised data mining tool capable of extracting information in the form of IF-THEN patterns. Although several methods have been proposed for the extraction of frequent itemsets (previous phase before mining association rules) in very large databases, the high computational cost and lack of memory remains a major problem to be solved when processing large data. Therefore, the aim of this paper is three fold: (1) to review existent algorithms for frequent itemset and association rule mining, (2)to develop new efficient frequent itemset Big Data algorithms using distributive computation, as well as a new association rule mining algorithm in Spark, and (3) to compare the proposed algorithms with the existent proposals varying the number of transactions and the number of items. To this purpose, we have used the Spark platform which has been demonstrated to outperform existing distributive algorithmic implementations.es_ES
dc.description.sponsorshipUniversidad de Granada/CBUAes_ES
dc.description.sponsorshipJunta de Andalucia P18-RT-1765es_ES
dc.description.sponsorshipMinistry of Science and Innovation, Spain (MICINN) Instituto de Salud Carlos III Spanish Government PID2021-123960OB-I00, TED2021-129402B-C21es_ES
dc.description.sponsorshipERDF A way of making Europees_ES
dc.description.sponsorshipEuropean Union NextGenerationEUes_ES
dc.description.sponsorshipMinistry of Universities through the EUes_ES
dc.language.isoenges_ES
dc.publisherSpringeres_ES
dc.rightsAtribución 4.0 Internacional*
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/*
dc.subjectBig Dataes_ES
dc.subjectData Mininges_ES
dc.subjectAssociation Rulees_ES
dc.subjectFrequent Itemsetes_ES
dc.subjectDistributed computinges_ES
dc.subjectSparkes_ES
dc.titleNew Spark solutions for distributed frequent itemset and association rule mining algorithmses_ES
dc.typejournal articlees_ES
dc.rights.accessRightsopen accesses_ES
dc.identifier.doi10.1007/s10586-023-04014-w
dc.type.hasVersionVoRes_ES


Ficheros en el ítem

[PDF]

Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem

Atribución 4.0 Internacional
Excepto si se señala otra cosa, la licencia del ítem se describe como Atribución 4.0 Internacional