Improving data partition schemes in Smart Grids via clustering data streams

Andreu Sancho-Asensio, Joan Navarro, Itziar Arrieta-Salinas, José Enrique Armendáriz-Íñigo, Virginia Jiménez-Ruano, Agustín Zaballos, Elisabet Golobardes

Producción científica: Artículo en revista indizadaArtículorevisión exhaustiva

28 Citas (Scopus)

Resumen

Data mining techniques are traditionally divided into two distinct disciplines depending on the task to be performed by the algorithm: supervised learning and unsupervised learning. While the former aims at making accurate predictions after deeming an underlying structure in data - which requires the presence of a teacher during the learning phase - the latter aims at discovering regular-occurring patterns beneath the data without making any a priori assumptions concerning their underlying structure. The pure supervised model can construct a very accurate predictive model from data streams. However, in many real-world problems this paradigm may be ill-suited due to (1) the dearth of training examples and (2) the costs of labeling the required information to train the system. A sound use case of this concern is found when defining data replication and partitioning policies to store data emerged in the Smart Grids domain in order to adapt electric networks to current application demands (e.g.; real time consumption, network self adapting). As opposed to classic electrical architectures, Smart Grids encompass a fully distributed scheme with several diverse data generation sources. Current data storage and replication systems fail at both coping with such overwhelming amount of heterogeneous data and at satisfying the stringent requirements posed by this technology (i.e.; dynamic nature of the physical resources, continuous flow of information and autonomous behavior demands). The purpose of this paper is to apply unsupervised learning techniques to enhance the performance of data storage in Smart Grids. More specifically we have improved the eXtended Classifier System for Clustering (XCSc) algorithm to present a hybrid system that mixes data replication and partitioning policies by means of an online clustering approach. Conducted experiments show that the proposed system outperforms previous proposals and truly fits with the Smart Grid premises.

Idioma originalInglés
Páginas (desde-hasta)5832-5842
Número de páginas11
PublicaciónExpert Systems with Applications
Volumen41
N.º13
DOI
EstadoPublicada - 1 oct 2014

Huella

Profundice en los temas de investigación de 'Improving data partition schemes in Smart Grids via clustering data streams'. En conjunto forman una huella única.

Citar esto