Cloning for privacy protection in multiple independent data publications
Date
2011
Authors
Baig, M.M.
Li, J.
Liu, J.
Wang, H.
Type
Conference paper
Citation
International Conference on Information and Knowledge Management Proceedings, 2011, pp.885-894
Conference Name
CIKM '11: International Conference on Information and Knowledge Management (24 Oct 2011 - 28 Oct 2011 : Glasgow, UK)
Abstract
Data anonymization has become a major technique in privacy-preserving data publishing. Many methods have been proposed to anonymize a single dataset or a series of datasets released by one data owner. However, no method has been proposed for anonymizing data across multiple independent data publications, where a data owner publishes a dataset whose population overlaps with datasets published by other, independent data owners. In this paper we analyze the privacy risk in this scenario and the vulnerability of partition-based anonymization methods. We show that no partition-based anonymization method can protect privacy under arbitrary data distributions, and we identify a case in which privacy can be protected in this scenario. We propose a new generalization principle, -cloning, to protect privacy for multiple independent data publications. We also develop an effective algorithm to achieve -cloning. We show experimentally that the proposed algorithm anonymizes data so that it satisfies the privacy requirement while preserving good data utility.
Rights
Copyright 2011 Association for Computing Machinery