Dongwon Lee, Murali Mani, Makoto Murata.

Technical Report, IBM Almaden Research Center, RJ# 10197, Log# 95071*, November 16, 2000

Abstract

A mathematical framework using formal language theory to describe and compare XML schema languages is presented. Our framework uses the work in two related areas - regular tree languages [CDG+97] and ambiguity in regular expressions [BEGO71, BKW98]. Using these work as well as the content in two classical references [HU79, AU79], we present the following results: (1) a normal form representation for regular tree grammars, (2) a framework of marked regular expressions and model groups, and their ambiguities, (3) five subclasses of regular tree grammars and their corresponding languages to describe XML content models: regular tree languages, TD(1) (top-down input scan with 1-vertical lookahead), single-type constraint languages, TDLL(1) (top-down and left-right input scan with 1-vertical and 1-horizontal lookaheads), and local tree languages, (4) the closure properties of the five language classes under boolean set operations, (5) a classification and comparison of a few XML schema proposals and type systems: DTD, XML-Schema, DSD, XDuce, RELAX, and (6) properties of the grammar classes under two common operations: XML document validity checking and type resolution (i.e., XML document interpretation).

Download

http://citeseer.nj.nec.com/lee00reasoning.html

-- MartinBravenboer - 30 May 2002

Revision: r1.1 - 30 May 2002 - 20:35 - MartinBravenboer
Transform > XDuce > SchemaLanguageForXML > TaxonomyOfXMLSchemaLanguagesUsingFormalLanguageTheory > ReasoningAboutXMLSchemaLanguagesUsingFormalLanguageTheory
Copyright © 1999-2020 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding TWiki? Send feedback