Please use this identifier to cite or link to this item: https://hdl.handle.net/2440/115616
Citations
Scopus Web of Science® Altmetric
?
?
Full metadata record
DC FieldValueLanguage
dc.contributor.authorJiang, H.-
dc.contributor.authorZhou, R.-
dc.contributor.authorZhang, L.-
dc.contributor.authorWang, H.-
dc.contributor.authorZhang, Y.-
dc.date.issued2017-
dc.identifier.citationProceedings of the ACM Conference on Information and Knowledge Management (CIKM 2017), 2017, vol.Part F131841, pp.1489-1498-
dc.identifier.isbn9781450349185-
dc.identifier.urihttp://hdl.handle.net/2440/115616-
dc.descriptionSession 8B: Text Analysis-
dc.description.abstractDetermining appropriate statistical distributions for modeling text corpora is important for accurate estimation of numerical charac- teristics. Based on the validity of the test on a claim that the data conforms to Poisson distribution we propose Poisson decomposi- tion model (PDM), a statistical model for modeling count data of text corpora, which can straightly capture each document’s mul- tidimensional numerical characteristics on topics. In PDM, each topic is represented as a parameter vector with multidimensional Poisson distribution, which can be easily normalized to multino- mial term probabilities and each document is represented as mea- surements on topics and thereby reduced to a measurement vec- tor on topics. We use gradient descent methods and sampling al- gorithm for parameter estimation. We carry out extensive experi- ments on the topics produced by our models. The results demon- strate our approach can extract more coherent topics and is com- petitive in document clustering by using the PDM-based features, compared to PLSI and LDA.-
dc.description.statementofresponsibilityHaixin Jiang, Rui Zhou, Limeng Zhang, Hua Wang, Yanchun Zhang-
dc.language.isoen-
dc.publisherAssociation for Computing Machinery-
dc.rights© 2017 Association for Computing Machinery.-
dc.source.urihttp://dx.doi.org/10.1145/3132847.3132942-
dc.subjectTopic model; Poisson decomposition; statistical testing; text classi- fication; topic coherence-
dc.titleA topic model based on poisson decomposition-
dc.typeConference paper-
dc.contributor.conferenceACM Conference on Information and Knowledge Management (CIKM 2017) (6 Nov 2017 - 10 Nov 2017 : Singapore, SINGAPORE)-
dc.identifier.doi10.1145/3132847.3132942-
dc.publisher.placeNew York, NY, USA-
dc.relation.granthttp://purl.org/au-research/grants/arc/DP170104747-
pubs.publication-statusPublished-
dc.identifier.orcidZhou, R. [0000-0001-6807-4362]-
Appears in Collections:Aurora harvest 3
Computer Science publications

Files in This Item:
There are no files associated with this item.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.