Utility aware clustering for publishing transactional data

dc.contributor.authorBewong, M.
dc.contributor.authorLiu, J.
dc.contributor.authorLiu, L.
dc.contributor.authorLi, J.
dc.contributor.conference21st Pacific-Asia conference on knowledge discovery and data mining (23 May 2017 - 26 May 2017 : Jeju, South Korea)
dc.contributor.editorKim, J.
dc.contributor.editorShim, K.
dc.contributor.editorCao, L.
dc.contributor.editorLee, J.G.
dc.contributor.editorLin, X.
dc.contributor.editorMoon, Y.S.
dc.date.issued2017
dc.description.abstractThis work aims to maximise the utility of published data for the partition-based anonymisation of transactional data. We make an observation that, by optimising the clustering i.e. horizontal partitioning, the utility of published data can significantly be improved without affecting the privacy guarantees. We present a new clustering method with a specially designed distance function that considers the effect of sensitive terms in the privacy goal as part of the clustering process. In this way, when the clustering minimises the total intra-cluster distances of the partition, the utility loss is also minimised. We present two algorithms DocClust and DetK for clustering transactions and determining the best number of clusters respectively.
dc.identifier.citationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2017 / Kim, J., Shim, K., Cao, L., Lee, J.G., Lin, X., Moon, Y.S. (ed./s), vol.10235 LNAI, pp.481-494
dc.identifier.doi10.1007/978-3-319-57529-2_38
dc.identifier.isbn978-3-319-57528-5
dc.identifier.issn0302-9743
dc.identifier.issn1611-3349
dc.identifier.orcidLiu, J. [0000-0002-0794-0404]
dc.identifier.urihttps://hdl.handle.net/11541.2/127028
dc.language.isoen
dc.publisherSPRINGER INTERNATIONAL PUBLISHING AG
dc.publisher.placeSwitzerland
dc.relation.ispartofseriesLecture Notes in Artificial Intelligence
dc.rightsCopyright 2017 Springer International Publishing
dc.source.urihttps://doi.org/10.1007/978-3-319-57529-2_38
dc.subjectutility aware clustering
dc.subjectpublishing
dc.subjecttransactional data
dc.titleUtility aware clustering for publishing transactional data
dc.typeConference paper
pubs.publication-statusPublished
ror.mmsid9916130870401831

Files

Collections