Cloning for privacy protection in multiple independent data publications

Date

2011

Authors

Baig, M.M.
Li, J.
Liu, J.
Wang, H.

Editors

Advisors

Journal Title

Journal ISSN

Volume Title

Type:

Conference paper

Citation

International Conference on Information and Knowledge Management Proceedings, 2011, pp.885-894

Statement of Responsibility

Conference Name

CIKM '11: International Conference on Information and Knowledge Management (24 Oct 2011 - 28 Oct 2011 : Glasgow, UK)

Abstract

Data anonymization has become a major technique in privacy preserving data publishing. Many methods have been proposed to anonymize one dataset and a series of datasets of a data owner. However, no method has been proposed for the anonymization of data of multiple independent data publications. A data owner publishes a dataset, which contains overlapping population with other datasets published by other independent data owners. In this paper we analyze the privacy risk in the such scenario and vulnerability of partitioned based anonymization methods. We show that no partitioned based anonymization methods can protect privacy in arbitrary data distributions, and identify a case that the privacy can be protected in the scenario. We propose a new generalization principle -cloning to protect privacy for multiple independent data publications. We also develop an effective algorithm to achieve the -cloning. We experimentally show that the proposed algorithm anonymizes data to satisfy the privacy requirement and preserves good data utility.

School/Discipline

Dissertation Note

Provenance

Description

Access Status

Rights

Copyright 2011 Association for Computing Machinery

License

Grant ID

Call number

Persistent link to this record