Beyond multivariate microaggregation for large record anonymization

J. Nin*

*Corresponding author for this work

Research output: Book chapter › Conference contribution › peer-review

Abstract

Microaggregation is one of the most commonly employed microdata protection methods. Its basic idea is to anonymize data by aggregating the original records into small groups of at least k elements, thereby preserving k-anonymity. When records are large, i.e., when the data set has many attributes, the data set is usually split into smaller blocks of attributes to limit information loss, and microaggregation is applied to each block successively and independently. This is called multivariate microaggregation. This technique reduces the information lost when several values are collapsed to the centroid of their group. Unfortunately, with multivariate microaggregation the k-anonymity property is lost when the intruder knows at least two attributes from different blocks, which might be the usual case. In this work, we present a new microaggregation method called one dimension microaggregation (Mic1D-k). Mic1D-k mitigates the loss of k-anonymity by first mixing all the values of the original microdata file into a single non-attributed data set, using a set of simple pre-processing steps, and then microaggregating all the mixed values together. Our experiments on real data show that our proposal obtains lower disclosure risk than previous approaches while the level of information loss is preserved.
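
As an illustration of the pooling idea described in the abstract, the following Python sketch mixes all attribute values into one list, microaggregates that list in groups of at least k, and maps the results back to their original positions. It is a minimal sketch and not the published Mic1D-k algorithm: the abstract does not say which pre-processing steps are used, so per-attribute standardization is assumed here, and the function names `microaggregate_1d` and `mic1d_sketch` are hypothetical.

```python
from statistics import mean, pstdev


def microaggregate_1d(values, k):
    """Sort the values, cut them into consecutive groups of at least k,
    and replace every value by the centroid (mean) of its group."""
    if k < 1 or len(values) < k:
        raise ValueError("need k >= 1 and at least k values")
    # Indices of the values in ascending order of value.
    order = sorted(range(len(values)), key=lambda i: values[i])
    n_groups = len(values) // k
    out = [0.0] * len(values)
    for g in range(n_groups):
        start = g * k
        # The last group absorbs the remainder, so it holds between k and 2k-1 values.
        end = (g + 1) * k if g < n_groups - 1 else len(values)
        idx = order[start:end]
        centroid = mean(values[i] for i in idx)
        for i in idx:
            out[i] = centroid
    return out


def mic1d_sketch(records, k):
    """Pool all attribute values, microaggregate them together, map them back.

    Assumption (not taken from the abstract): each attribute is standardized
    before pooling so that values of different attributes are comparable.
    """
    n_attrs = len(records[0])
    cols = list(zip(*records))
    mus = [mean(c) for c in cols]
    # Guard against constant attributes, whose standard deviation is zero.
    sigmas = [pstdev(c) or 1.0 for c in cols]
    # One flat, "non-attributed" list holding every standardized value.
    pooled = [(v - mus[j]) / sigmas[j] for row in records for j, v in enumerate(row)]
    protected = microaggregate_1d(pooled, k)
    # Undo the standardization and restore the record layout.
    return [
        [protected[r * n_attrs + j] * sigmas[j] + mus[j] for j in range(n_attrs)]
        for r in range(len(records))
    ]
```

For example, `mic1d_sketch([[1.0, 100.0], [2.0, 110.0], [3.0, 120.0], [4.0, 130.0]], 2)` returns four protected records with two attributes each. In the pooled, standardized list every protected value is shared by at least k positions. The sketch makes no claim about disclosure risk or information loss, which the paper evaluates experimentally.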

Original language: English
Title of host publication: Citizen in Sensor Networks - 2nd International Workshop, CitiSens 2013, Revised Selected Papers
Editors: Jordi Nin, Daniel Villatoro
Publisher: Springer Verlag
Pages: 87-107
Number of pages: 21
ISBN (Electronic): 9783319041773
Publication status: Published - 2014
Externally published: Yes
Event: 2nd International Workshop on Citizen in Sensor Networks, CitiSens 2013 - Barcelona, Spain
Duration: 19 Sept 2013 - 19 Sept 2013

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 8313
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 2nd International Workshop on Citizen in Sensor Networks, CitiSens 2013
Country/Territory: Spain
City: Barcelona
Period: 19/09/13 - 19/09/13

Keywords

  • K-anonymity
  • Microaggregation
  • Privacy in statistical databases
