TY - GEN
T1 - Improving microaggregation for complex record anonymization
AU - Pont-Tuset, Jordi
AU - Nin, J.
AU - Medrano-Gracia, Pau
AU - Larriba-Pey, Josep Ll
AU - Muntés-Mulero, Victor
PY - 2008
Y1 - 2008
N2 - Microaggregation is one of the most commonly employed microdata protection methods. This method builds clusters of at least k original records and replaces the records in each cluster with the centroid of the cluster. Usually, when records are complex, i.e., the number of attributes of the data set is large, this data set is split into smaller blocks of attributes and microaggregation is applied to each block, successively and independently. In this way, the information loss when collapsing several values to the centroid of their group is reduced, at the cost of losing the k-anonymity property when at least two attributes of different blocks are known by the intruder. In this work, we present a new microaggregation method called One dimension microaggregation (Mic1D - k). This method gathers all the values of the data set into a single sorted vector, independently of the attribute they belong to. Then, it microaggregates all the mixed values together. Our experiments show that, using real data, our proposal obtains lower disclosure risk than previous approaches whereas the information loss is preserved.
AB - Microaggregation is one of the most commonly employed microdata protection methods. This method builds clusters of at least k original records and replaces the records in each cluster with the centroid of the cluster. Usually, when records are complex, i.e., the number of attributes of the data set is large, this data set is split into smaller blocks of attributes and microaggregation is applied to each block, successively and independently. In this way, the information loss when collapsing several values to the centroid of their group is reduced, at the cost of losing the k-anonymity property when at least two attributes of different blocks are known by the intruder. In this work, we present a new microaggregation method called One dimension microaggregation (Mic1D - k). This method gathers all the values of the data set into a single sorted vector, independently of the attribute they belong to. Then, it microaggregates all the mixed values together. Our experiments show that, using real data, our proposal obtains lower disclosure risk than previous approaches whereas the information loss is preserved.
KW - K-anonymity
KW - Microaggregation
KW - Privacy in Statistical Databases
UR - http://www.scopus.com/inward/record.url?scp=58049097034&partnerID=8YFLogxK
U2 - 10.1007/978-3-540-88269-5_20
DO - 10.1007/978-3-540-88269-5_20
M3 - Conference contribution
AN - SCOPUS:58049097034
SN - 3540882685
SN - 9783540882688
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 215
EP - 226
BT - Modeling Decisions for Artificial Intelligence - 5th International Conference, MDAI 2008, Proceedings
PB - Springer Verlag
T2 - 5th International Conference on Modeling Decisions for Artificial Intelligence, MDAI 2008
Y2 - 30 October 2008 through 31 October 2008
ER -