X-NIndex: A High Performance Stable and Large XML Document Query Approach and Experience in TOP500 List Data

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Volume 3032, Issue null, 2004, Pages 1043-1046.

Cited by: 0|Bibtex|Views12|Links
WOS SCOPUS EI
Keywords:
xml document
Weibo:
We proposed a new coordinate-index approach called X-NIndex, the features of our work include the pre-analysis of stable XML document and the coordinate installation which helps to increase the efficiency a lot in the query afterward

Abstract:

This article describes X-NIndex, a novel approach for large XML documents with stable structure. The definition for the large XML document with stable structure is given while the concept of XML document tree coordi- nate(X-DTC) is introduced. The significant advantage of X-NIndex to other XML query schemas is shown and the experimental r...More

Code:

Data:

0
Introduction
  • Extensive Markup Language (XML) [1] is emerged as the dominant standard for representing and exchanging data over Internet.
  • As an organization of data with various semistructures, it is more difficult to store and query various XML documents, especially those consisted with a large amount of data.
  • Several models have been advanced include XML-QL [8], XML-GL [3], Quilt [7], XPath [4], X-Rel [9], XQuery [6], and XML Indexed Structure with RRC [5]
  • They process XML data by changing XML document into different data schemas and do query on the schemas.
  • The authors found that much of these approaches are not very efficient in operation of large XML documents, and to solve such problem, the authors propose a new query structure named X-NIndex (XML Node Index Structure), which is proved to be able to improve query performance greatly
Highlights
  • Extensive Markup Language (XML) [1] is emerged as the dominant standard for representing and exchanging data over Internet
  • We found that much of these approaches are not very efficient in operation of large XML documents, and to solve such problem, we propose a new query structure named X-NIndex (XML Node Index Structure), which is proved to be able to improve query performance greatly
  • To get the location of each node of XML document quickly, we introduce a new concept called XML Document Tree Coordinate (XML Document Tree Coordinate)
  • We proposed a new coordinate-index approach called X-NIndex, the features of our work include the pre-analysis of stable XML document and the coordinate installation which helps to increase the efficiency a lot in the query afterward
Conclusion
  • Conclusion and Future Work

    In this paper, the authors proposed a new coordinate-index approach called X-NIndex, the features of the work include the pre-analysis of stable XML document and the coordinate installation which helps to increase the efficiency a lot in the query afterward.

    As some problems still exist with X-NIndex, for instance, the instability of it with increase of result set, in the future work, the authors will investigate the improvement with XNIndex in such cases and conduct some new methods or concepts into this problem.
Summary
  • Introduction:

    Extensive Markup Language (XML) [1] is emerged as the dominant standard for representing and exchanging data over Internet.
  • As an organization of data with various semistructures, it is more difficult to store and query various XML documents, especially those consisted with a large amount of data.
  • Several models have been advanced include XML-QL [8], XML-GL [3], Quilt [7], XPath [4], X-Rel [9], XQuery [6], and XML Indexed Structure with RRC [5]
  • They process XML data by changing XML document into different data schemas and do query on the schemas.
  • The authors found that much of these approaches are not very efficient in operation of large XML documents, and to solve such problem, the authors propose a new query structure named X-NIndex (XML Node Index Structure), which is proved to be able to improve query performance greatly
  • Conclusion:

    Conclusion and Future Work

    In this paper, the authors proposed a new coordinate-index approach called X-NIndex, the features of the work include the pre-analysis of stable XML document and the coordinate installation which helps to increase the efficiency a lot in the query afterward.

    As some problems still exist with X-NIndex, for instance, the instability of it with increase of result set, in the future work, the authors will investigate the improvement with XNIndex in such cases and conduct some new methods or concepts into this problem.
Reference
  • W3C Extensible Markup Language XML1.1 in www.w3.org/TR/xml11/. 2. W3C XML Schema 1.1, in www.w3c.org/xml/schema.
    Findings
  • 3. Ceri, S., Comai, S., Damiani, E., and Fraternali, P. (1999). XML-GL: a graphical language querying and restructuring XML documents 1. in WWW8.
    Google ScholarFindings
  • 4. Clark, J. and DeRose, S., 1999, XML Path Language (XPath).
    Google ScholarFindings
  • 5. Dao Dinh Kha, Masatoshi Yoshikawa, Shunsuke Uemura, An XML Indexing Structure with Relative Region Coordinate in ICDE’01.
    Google ScholarFindings
  • 6. D. Chamberlin, XQuery: An XML query language in IBM System Journal, 2002.
    Google ScholarLocate open access versionFindings
  • 7. Don Chamberlin, Jonathan Robie, Daniela Florescu, Quilt: An XML Query Language for Heterogeneous Data Source.
    Google ScholarFindings
  • 8. Deutsch, A., Fernandez, M., Suciu, D.(1999b).Storing Semistructured Data in Relations.
    Google ScholarFindings
  • 9. M. Yoshikawa, T. Amagasa, T. Shimura, S. Uemura,. XRel: A Path-Based Approach to Storage and Retrieval of XML Documents Using Relational Databases, 2001.
    Google ScholarFindings
Your rating :
0

 

Tags
Comments