TY - GEN
T1 - Modeling problem transformations based on data complexity
AU - Bernadó-Mansilla, Ester
AU - MacIà-Antolínez, Núria
PY - 2007
Y1 - 2007
N2 - This paper presents a methodology to transform a problem to make it suitable for classification methods, while reducing its complexity so that the classification models extracted are more accurate. The problem is represented by a dataset, where each instance consists of a variable number of descriptors and a class label. We study dataset transformations in order to describe each instance by a single descriptor with its corresponding features and a class label. To analyze the suitability of each transformation, we rely on measures that approximate the geometrical complexity of the dataset. We search for the best transformation minimizing the geometrical complexity. By using complexity measures, we are able to estimate the intrinsic complexity of the dataset without being tied to any particular classifier.
AB - This paper presents a methodology to transform a problem to make it suitable for classification methods, while reducing its complexity so that the classification models extracted are more accurate. The problem is represented by a dataset, where each instance consists of a variable number of descriptors and a class label. We study dataset transformations in order to describe each instance by a single descriptor with its corresponding features and a class label. To analyze the suitability of each transformation, we rely on measures that approximate the geometrical complexity of the dataset. We search for the best transformation minimizing the geometrical complexity. By using complexity measures, we are able to estimate the intrinsic complexity of the dataset without being tied to any particular classifier.
KW - Breast cancer diagnosis
KW - Classification
KW - Data complexity
UR - http://www.scopus.com/inward/record.url?scp=84878029411&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84878029411
SN - 9781586037987
T3 - Frontiers in Artificial Intelligence and Applications
SP - 133
EP - 140
BT - Artificial Intelligence Research and Development
PB - IOS Press BV
T2 - 10th International Conference of the Catalan Association for Artificial Intelligence, CCIA 2007
Y2 - 25 October 2007 through 26 October 2007
ER -