TY - GEN
T1 - In search of targeted-complexity problems
AU - Macià, Núria
AU - Orriols-Puig, Albert
AU - Bernadó-Mansilla, Ester
PY - 2010
Y1 - 2010
N2 - Currently available real-world problems do not cover the whole complexity space and, therefore, do not allow us to thoroughly test learner behavior on the border of its domain of competence. Thus, the necessity of developing a more suitable testing scenario arises. With this in mind, data complexity analysis has shown promise in characterizing difficulty of classification problems through a set of complexity descriptors which used in artificial data sets generation could supply the required framework to refine and design learners. This paper, then, proposes the use of instance selection based on an evolutionary multiobjective technique to generate data sets that meet specific characteristics established by such complexity descriptors. These artificial targeted-complexity problems, which capture the essence of real-world structures, may help to define a set of benchmarks that contributes to test the properties of learners and to improve them.
AB - Currently available real-world problems do not cover the whole complexity space and, therefore, do not allow us to thoroughly test learner behavior on the border of its domain of competence. Thus, the necessity of developing a more suitable testing scenario arises. With this in mind, data complexity analysis has shown promise in characterizing difficulty of classification problems through a set of complexity descriptors which used in artificial data sets generation could supply the required framework to refine and design learners. This paper, then, proposes the use of instance selection based on an evolutionary multiobjective technique to generate data sets that meet specific characteristics established by such complexity descriptors. These artificial targeted-complexity problems, which capture the essence of real-world structures, may help to define a set of benchmarks that contributes to test the properties of learners and to improve them.
KW - Artificial data sets
KW - Data complexity
KW - Evolutionary multiobjective optimization
UR - http://www.scopus.com/inward/record.url?scp=77955860550&partnerID=8YFLogxK
U2 - 10.1145/1830483.1830674
DO - 10.1145/1830483.1830674
M3 - Conference contribution
AN - SCOPUS:77955860550
SN - 9781450300728
T3 - Proceedings of the 12th Annual Genetic and Evolutionary Computation Conference, GECCO '10
SP - 1055
EP - 1062
BT - Proceedings of the 12th Annual Genetic and Evolutionary Computation Conference, GECCO '10
T2 - 12th Annual Genetic and Evolutionary Computation Conference, GECCO-2010
Y2 - 7 July 2010 through 11 July 2010
ER -