Top-k keyword search over probabilistic XML data

Li, J.; Liu, C.; Zhou, R.; Wang, W.

doi:10.1109/ICDE.2011.5767875

Top-k keyword search over probabilistic XML data

Files

RA_hdl_110473.pdf (245.12 KB)

(Restricted Access)

Date

2011

Authors

Li, J.

Liu, C.

Zhou, R.

Wang, W.

Type:

Conference paper

Citation

Proceedings of the International Conference on Data Engineering, 2011, pp.673-684

Statement of Responsibility

Jianxin Li, Chengfei Liu, Rui Zhou, Wei Wang

Conference Name

2011 IEEE 27th International Conference on Data Engineering (ICDE 2011) (11 Apr 2011 - 16 Apr 2011 : Hannover)

DOI

10.1109/ICDE.2011.5767875

Abstract

Despite the proliferation of work on XML keyword query, it remains open to support keyword query over probabilistic XML data. Compared with traditional keyword search, it is far more expensive to answer a keyword query over probabilistic XML data due to the consideration of possible world semantics. In this paper, we firstly define the new problem of studying top-k keyword search over probabilistic XML data, which is to retrieve k SLCA results with the k highest probabilities of existence. And then we propose two efficient algorithms. The first algorithm PrStack can find k SLCA results with the k highest probabilities by scanning the relevant keyword nodes only once. To further improve the efficiency, we propose a second algorithm EagerTopK based on a set of pruning properties which can quickly prune unsatisfied SLCA candidates. Finally, we implement the two algorithms and compare their performance with analysis of extensive experimental results.

Rights

Grant ID

http://purl.org/au-research/grants/arc/DP110102407
http://purl.org/au-research/grants/arc/DP0878405
http://purl.org/au-research/grants/arc/DP0987273
http://purl.org/au-research/grants/arc/DP0881779
http://purl.org/au-research/grants/arc/DP0878405

Published Version

https://doi.org/10.1109/icde.2011.5767875

Persistent link to this record

http://hdl.handle.net/2440/110473

Full item page

Top-k keyword search over probabilistic XML data

Files

Date

Authors

Editors

Advisors

Journal Title

Journal ISSN

Volume Title

Type:

Citation

Statement of Responsibility

Conference Name

DOI

Abstract

School/Discipline

Dissertation Note

Provenance

Description

Access Status

Rights

License

Grant ID

Published Version

Call number

Persistent link to this record