An evolutionary algorithm for mining rare association rules: A Big Data approach
Created by W.Langdon from
gp-bibliography.bib Revision:1.8010
- @InProceedings{PadilloLV17,
-
author = "Francisco Padillo and Jose Maria Luna and
Sebastian Ventura",
-
booktitle = "Proceedings of the 2017 {IEEE} Congress on
Evolutionary Computation",
-
title = "An evolutionary algorithm for mining rare association
rules: {A} Big Data approach",
-
year = "2017",
-
editor = "Jose A. Lozano",
-
pages = "2007--2014",
-
address = "Donostia, San Sebastian, Spain",
-
publisher = "IEEE",
-
isbn13 = "978-1-5090-4601-0",
-
abstract = "Association rule mining is one of the most well known
techniques to discover interesting relations between
items in data. To date, this task has been mainly
focused on the discovery of frequent relationships.
However, it is often interesting to focus on those that
do not occur frequently. Rare association rule mining
is an alluring field aiming at describing rare cases or
unexpected behaviour. This field is really useful over
Big Data where abnormal endeavour are more curious than
common behaviour. In this sense, our aim is to propose
a new evolutionary algorithm based on grammars to
obtain rare association rules on Big Data. The novelty
of our work is that it is eminently designed to be
parallel, enabling its use over emerging technologies
as Spark and Flink. Furthermore, while other algorithms
focus on maximizing a couple of quality measure
ignoring the rest, our fitness function has been
precisely designed to obtain a trade-off while
maximizing a set of well-known quality measures. The
experimental study includes more than 70 datasets
revealing alluring results in efficiency when more than
300 million of instances and file sizes up to 250
GBytes are considered, and proving that it is able to
run efficiently in huge volumes of data.",
-
keywords = "genetic algorithms, genetic programming, data mining,
Big Data, Flink, Spark, association rules mining,
evolutionary algorithm, grammars, quality measure,
Algorithm design and analysis, Grammar, Proposals,
Sparks, Syntactics",
-
isbn13 = "978-1-5090-4601-0",
-
DOI = "doi:10.1109/CEC.2017.7969547",
-
month = "5-8 " # jun,
-
notes = "IEEE Catalog Number: CFP17ICE-ART Also known as
\cite{7969547} \cite{padillo:2017:CEC}",
- }
Genetic Programming entries for
Francisco Padillo
Jose Maria Luna
Sebastian Ventura
Citations