TY - GEN
T1 - Beyond homemade artificial data sets
AU - MacIà, Núria
AU - Orriols-Puig, Albert
AU - Bernadó-Mansilla, Ester
PY - 2009
Y1 - 2009
N2 - One of the most important challenges in supervised learning is how to evaluate the quality of the models evolved by different machine learning techniques. Up to now, we have relied on measures obtained by running the methods on a wide test bed composed of real-world problems. Nevertheless, the unknown inherent characteristics of these problems and the bias of learners may lead to inconclusive results. This paper discusses the need to work under a controlled scenario and bets on artificial data set generation. A list of ingredients and some ideas about how to guide such generation are provided, and promising results of an evolutionary multi-objective approach which incorporates the use of data complexity estimates are presented.
AB - One of the most important challenges in supervised learning is how to evaluate the quality of the models evolved by different machine learning techniques. Up to now, we have relied on measures obtained by running the methods on a wide test bed composed of real-world problems. Nevertheless, the unknown inherent characteristics of these problems and the bias of learners may lead to inconclusive results. This paper discusses the need to work under a controlled scenario and bets on artificial data set generation. A list of ingredients and some ideas about how to guide such generation are provided, and promising results of an evolutionary multi-objective approach which incorporates the use of data complexity estimates are presented.
KW - Artificial data sets
KW - Data complexity
KW - Machine learning
UR - http://www.scopus.com/inward/record.url?scp=70350632465&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-02319-4_73
DO - 10.1007/978-3-642-02319-4_73
M3 - Conference contribution
AN - SCOPUS:70350632465
SN - 3642023185
SN - 9783642023187
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 605
EP - 612
BT - Hybrid Artificial Intelligence Systems - 4th International Conference, HAIS 2009, Proceedings
T2 - 4th International Conference on Hybrid Artificial Intelligence Systems, HAIS 2009
Y2 - 10 June 2009 through 12 June 2009
ER -