Please use this identifier to cite or link to this item: http://hdl.handle.net/2440/59019
Type: Conference paper
Title: Equi-width data swapping for private data publication
Author: Li, Y.
Shen, H.
Citation: Proceedings: 2009 International Conference on Parallel and Distributed Computing, Applications and Technologies, PDCAT 2009, 8-11 December 2009, Higashi Hiroshima, Japan: pp.231-238
Publisher: IEEE
Publisher Place: USA
Issue Date: 2009
ISBN: 9780769539140
Conference Name: International Conference on Parallel and Distributed Computing, Applications and Technologies (10th : 2009 : Hiroshima, Japan)
Statement of Responsibility: Yidong Li and Hong Shen
Abstract: Data swapping is a popular value-invariant data perturbation technique. The quality of a data swapping method is measured by how well it preserves data privacy and data utility. As swapping data globally is computationally impractical, appropriate localization schemes are often applied in advance to guarantee its performance on these metrics. Equi-depth partitioning is preferred by most existing data perturbation techniques because it provides uniform privacy protection for each data tuple. However, this method performs poorly for two types of applications: one is maintaining statistics based on equi-width partitioning, such as the multivariate histogram with equal bin width, and the other is preserving parametric statistics, such as covariance, in sparse data with a non-uniform distribution. As a natural solution for these applications, this paper explores the possibility of using data swapping with equi-width partitioning for private data publication, which has been little used in data perturbation because of the difficulty of preserving data privacy. With extensive theoretical analysis and experimental results, we show that Equi-Width Swapping (EWS) can achieve privacy preservation similar to that of Equi-Depth Swapping (EDS) if the number of partitions is sufficiently large (e.g., ≥ √N, where N is the size of the dataset). Our experimental results on both synthetic and real-world data validate our theoretical analysis.
Keywords: Privacy preserving data mining; data publication; data swapping; equi-width partitioning
Rights: Copyright © 2009 by The Institute of Electrical and Electronics Engineers
RMID: 0020097594
DOI: 10.1109/PDCAT.2009.69
Appears in Collections: Computer Science publications

Files in This Item:
There are no files associated with this item.

