Computing structural similarity of source XML schemas against domain XML schema

Li, J.; Liu, J.; Liu, C.; Wang, G.; Yu, J.; Yang, C.

Date

2008

Authors

Li, J.

Liu, J.

Liu, C.

Wang, G.

Yu, J.

Yang, C.

Editors

Fekete, F.
Alan, A.

Type

Conference paper

Citation

Conferences in Research and Practice in Information Technology Series, 2008 / Fekete, F., Alan, A. (eds.), vol. 75, pp. 155-164

Conference Name

ADC '08 Proceedings of the nineteenth conference on Australasian database (22 Jan 2008 - 25 Jan 2008 : Wollongong, Australia)

Abstract

In this paper, we study the problem of measuring the structural similarity of a large number of source schemas against a single domain schema, which is useful for enhancing the quality of searching and ranking large volumes of source documents on the Web with the help of structural information. After analyzing why existing edit-distance-based methods are unsuitable, we propose a new similarity measure model that caters for the requirements of the problem. Given the asymmetric nature of comparing source schemas with a domain schema, similarity-preserving rules and an algorithm are designed to filter out uninteresting elements in source schemas in order to optimize the similarity computation. Based on the model, a basic algorithm and an improved algorithm are developed for structural similarity computation. The improved algorithm makes full use of a new coding scheme that is devised to reduce the number of comparisons. The complexities of both algorithms are analyzed, and extensive experiments show the significant performance gain achieved by the improved algorithm. © 2008, Australian Computer Society, Inc.

Persistent link to this record

https://hdl.handle.net/1959.8/66136
