Extracting partition statistics from semistructured data

Tools

Wilson, John N. and Gourlay, Richard and Japp, Robert and Neumüller, Mathias (2006) Extracting partition statistics from semistructured data. In: 17th International Workshop on Database and Expert Systems Applications (DEXA 2006), 2006-09-04 - 2006-09-08.

Preview

Text. Filename: strathprints002387.pdf
Accepted Author Manuscript
Download (163kB)| Preview

Abstract

The effective grouping, or partitioning, of semistructured data is of fundamental importance when providing support for queries. Partitions allow items within the data set that share common structural properties to be identified efficiently. This allows queries that make use of these properties, such as branching path expressions, to be accelerated. Here, we evaluate the effectiveness of several partitioning techniques by establishing the number of partitions that each scheme can identify over a given data set. In particular, we explore the use of parameterised indexes, based upon the notion of forward and backward bisimilarity, as a means of partitioning semistructured data; demonstrating that even restricted instances of such indexes can be used to identify the majority of relevant partitions in the data.

ORCID iDs

Wilson, John N.

, Gourlay, Richard

, Japp, Robert and Neumüller, Mathias;

Share and Export

Item metadata

Item type:	Conference or Workshop Item(Paper)
ID code:	2387
Dates:	Date Event 4 September 2006 Published
Subjects:	Science > Mathematics > Electronic computers. Computer science
Department:	Faculty of Science > Computer and Information Sciences
Depositing user:	Strathprints Administrator
Date deposited:	22 Jan 2007
Last modified:	09 Apr 2025 20:31
URI:	https://strathprints.strath.ac.uk/id/eprint/2387

CORE (COnnecting REpositories)