Query optimization in information integration

Chen, Dongfeng; Chirkova, Rada; Sadri, Fereidoon; Salo, Tiia
June 2013
Acta Informatica;Jun2013, Vol. 50 Issue 4, p257
Academic Journal
The problem of decentralized data sharing, which is relevant to a wide range of applications, is still a source of major theoretical and practical challenges, in spite of many years of sustained research. In this paper we focus on the challenge of efficiency of query evaluation in information integration systems that use the global-as-view approach, with the objective of developing query-processing strategies that would be widely applicable and easy to implement in real-life applications. Our algorithms take into account important features of today's data sharing applications: XML as likely interface or representation for data sources; the potential for information overlap across data sources; and the need for inter-source processing, as in joins of data across sources. The focus of this paper is on performance-related characteristics of several alternative approaches that we propose for efficient query processing in information integration, including an approach that uses materialized restructured views. We use synthetic and real-life datasets in our implementation of an information integration system shell to provide experimental results that demonstrate that our algorithms are efficient and competitive in the information integration setting. In addition, our experimental results allow us to make context-specific recommendations on selecting query-processing approaches from our proposed alternatives. As such, our approaches could form a basis for scalable query processing in information integration and interoperability in many practical settings.


Related Articles

  • Query optimization in information integration. Chen, Dongfeng; Chirkova, Rada; Sadri, Fereidoon; Salo, Tiia // Acta Informatica;Jun2013, Vol. 50 Issue 4, p257 

    The problem of decentralized data sharing, which is relevant to a wide range of applications, is still a source of major theoretical and practical challenges, in spite of many years of sustained research. In this paper we focus on the challenge of efficiency of query evaluation in information...

  • An Enhanced Way of Labelling Nodes in Dynamic XML. Paramasivam, Jayanthi; Angamuthu, Tamilarasi // European Journal of Scientific Research;7/1/2011, Vol. 55 Issue 3, p348 

    In this era, XML is used as a standard in various businesses, researches, etc. It is necessary to manipulate data and evaluating the queries over the data in the XML document. Number of schemes is used for this purpose. The labelling is one such process in which the nodes of the XML documents...

  • Structural Query Optimization in Native XML Databases: A Hybrid Approach. Su-Cheng Haw; Chien-Sing Lee // Journal of Applied Sciences;2007, Vol. 7 Issue 20, p2934 

    As XML (eXtensible Mark-up Language) is gaining its popularity in data exchange over the Web, querying XML data has become an important issue to be addressed. In native XML databases (NXD), XML documents are usually modeled as trees and XML queries are typically specified in path expression. The...

  • Theory and Practices in Xml Query Optimization. Asre, Asmita P.; Ali, M. S. // International Journal of Advanced Research in Computer Science;May/Jun2012, Vol. 3 Issue 3, p715 

    As computers and technology continue to become more commonplace and essential to everyday life, more data is captured, stored, and analyzed by a variety of institutions. As this amount of data grows, so does the need for efficient methodologies and tools used to store, retrieve, and transform...

  • A survey on XML streaming evaluation techniques. Wu, Xiaoying; Theodoratos, Dimitri // VLDB Journal International Journal on Very Large Data Bases;Apr2013, Vol. 22 Issue 2, p177 

    XML is currently the most popular format for exchanging and representing data on the web. It is used in various applications and for different types of data including structured, semistructured, and unstructured heterogeneous data types. During the period, XML was establishing itself, data...

  • XML in the Publishing World. Poe, Stephen // AIIM E-DOC;Sep/Oct2003, Vol. 17 Issue 5, p14 

    Reports on the importance of Extensible Markup Language (XML) in solving problems of creating electronic documents in the U.S. Features of XML that benefits the end users; Provision of instructions for creating documents; Benefits of XML for electronic printing and publishing.

  • XML Five Years On: Simplicity Gives Way to Complexity.  // Seybold Report: Analyzing Publishing Technologies;3/11/2002, Vol. 1 Issue 23, p3 

    Focuses on issues relating to XML, a document markup language. Adoption of XML for data-processing; Analysis on the differences among the XML schema; Investigation on the technological innovation in XML editing tools.

  • Embedded XML for Device Management. Gordon, Charles // ECN: Electronic Component News;Mar2002 Part 1 of 2, Vol. 46 Issue 6, p78 

    Focuses on the importance of the Extensible Markup Language in accomplishing the level of interoperability of the network appliance. Specifications of the format of data records; Elimination of the error of data interpretation; Simplification of encoding/decoding data records.

  • Some Notes on Declarative Specification of Semantic Data Integration Task in a Peer-to-peer Agent System. Brzykcy, Gra┼╝yna // New Generation Computing;Jan2010, Vol. 28 Issue 1, p95 

    In the paper, a problem of semantic integration of XML data in a peer-to-peer agent system is analysed upon a channel theory. In the system, each agent manages its local data, and communication and cooperation actions, executed by the agents, consist of asking and answering queries. The abstract...


Read the Article


Sorry, but this item is not currently available from your library.

Try another library?
Sign out of this library

Other Topics