Learner excellence biased by data set selection: A case for data characterisation and artificial data sets

Núria MacIà*, Ester Bernadó-Mansilla, Albert Orriols-Puig, Tin Kam Ho

*Autor corresponent d’aquest treball

    Producció científica: Article en revista indexadaArticleAvaluat per experts

    32 Cites (Scopus)

    Resum

    The excellence of a given learner is usually claimed through a performance comparison with other learners over a collection of data sets. Too often, researchers are not aware of the impact of their data selection on the results. Their test beds are small, and the selection of the data sets is not supported by any previous data analysis. Conclusions drawn on such test beds cannot be generalised, because particular data characteristics may favour certain learners unnoticeably. This work raises these issues and proposes the characterisation of data sets using complexity measures, which can be helpful for both guiding experimental design and explaining the behaviour of learners.

    Idioma originalAnglès
    Pàgines (de-a)1054-1066
    Nombre de pàgines13
    RevistaPattern Recognition
    Volum46
    Número3
    DOIs
    Estat de la publicacióPublicada - de març 2013

    Fingerprint

    Navegar pels temes de recerca de 'Learner excellence biased by data set selection: A case for data characterisation and artificial data sets'. Junts formen un fingerprint únic.

    Com citar-ho