DTD-Miner: A tool for mining DTDs from XML documents

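XML documents are semistructured, and their structure is embedded in the tags. Although XML documents can be accompanied by a DTD that defines their structure, a DTD is not mandatory. Deriving a DTD for XML documents is difficult because DTDs use a different syntax from XML and because prior knowledge of the documents' structure is required. In this paper, we introduce DTD-Miner, an automatic structure-mining tool for XML documents. Using a Web-based interface, the user can submit a set of similarly structured XML documents, and the system automatically suggests a DTD. The user can then refine the generated DTD to reduce its complexity by relaxing some of the rules used in the system.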
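
To make the idea of suggesting a DTD from sample documents concrete, here is a minimal Python sketch. It is not DTD-Miner's algorithm, which this record does not describe. The function names `infer_dtd` and `content_model` are hypothetical, and the rules are assumptions chosen for illustration: a child seen in every instance is required, a child seen more than once in some instance is repeatable, and any ordering conflict falls back to a loose choice model.

```python
import xml.etree.ElementTree as ET


def content_model(sequences, text):
    """Build a DTD content model from the child sequences seen for one element."""
    # Child names in first-seen order across all observed instances.
    names = []
    for seq in sequences:
        for name in seq:
            if name not in names:
                names.append(name)
    if not names:
        return "(#PCDATA)" if text else "EMPTY"
    if text:
        # Mixed content: DTDs only allow the (#PCDATA|a|b)* form.
        return "(#PCDATA|%s)*" % "|".join(names)

    def in_order(seq):
        positions = [names.index(name) for name in seq]
        return positions == sorted(positions)

    if not all(in_order(seq) for seq in sequences):
        # Assumption: give up on ordering and accept any mix of the children.
        return "(%s)*" % "|".join(names)

    parts = []
    for name in names:
        counts = [seq.count(name) for seq in sequences]
        optional = min(counts) == 0
        repeated = max(counts) > 1
        suffix = "*" if optional and repeated else "?" if optional else "+" if repeated else ""
        parts.append(name + suffix)
    return "(%s)" % ", ".join(parts)


def infer_dtd(xml_strings):
    """Suggest a naive DTD for a set of similarly structured XML documents."""
    observed = {}  # element name -> list of child-name sequences, one per instance
    has_text = {}  # element name -> True if any instance carries non-blank text

    def visit(elem):
        observed.setdefault(elem.tag, []).append([child.tag for child in elem])
        pieces = [elem.text or ""] + [child.tail or "" for child in elem]
        has_text[elem.tag] = has_text.get(elem.tag, False) or any(p.strip() for p in pieces)
        for child in elem:
            visit(child)

    for source in xml_strings:
        visit(ET.fromstring(source))

    return "\n".join(
        "<!ELEMENT %s %s>" % (tag, content_model(seqs, has_text[tag]))
        for tag, seqs in observed.items()
    )


docs = [
    "<book><title>A</title><author>X</author><author>Y</author></book>",
    "<book><title>B</title><author>Z</author><year>2000</year></book>",
]
print(infer_dtd(docs))
# <!ELEMENT book (title, author+, year?)>
# <!ELEMENT title (#PCDATA)>
# <!ELEMENT author (#PCDATA)>
# <!ELEMENT year (#PCDATA)>
```

The sketch ignores attributes and merges all elements that share a name, so it only hints at why the abstract mentions refining the generated DTD: stricter rules yield more complex content models, and relaxing them (as the choice fallback does here) yields simpler but more permissive ones.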

Bibliographic Details
Main Authors: HUE, Moh Chuang, LIM, Ee Peng, NG, Wee-Keong
Format: text
Language: English
Published: Institutional Knowledge at Singapore Management University 2000
Subjects: Databases and Information Systems; Numerical Analysis and Scientific Computing
Online Access: https://ink.library.smu.edu.sg/sis_research/990
https://ink.library.smu.edu.sg/context/sis_research/article/1989/viewcontent/d24f4dbbc34b23c537a57962a89c91183a15.pdf
Institution: Singapore Management University
id sg-smu-ink.sis_research-1989
record_format dspace
spelling sg-smu-ink.sis_research-1989 2018-06-18T04:28:07Z DTD-Miner: A tool for mining DTDs from XML documents HUE, Moh Chuang LIM, Ee Peng NG, Wee-Keong 2000-06-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/990 https://ink.library.smu.edu.sg/context/sis_research/article/1989/viewcontent/d24f4dbbc34b23c537a57962a89c91183a15.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Databases and Information Systems Numerical Analysis and Scientific Computing
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Databases and Information Systems
Numerical Analysis and Scientific Computing
description XML documents are semistructured, and their structure is embedded in the tags. Although XML documents can be accompanied by a DTD that defines their structure, a DTD is not mandatory. Deriving a DTD for XML documents is difficult because DTDs use a different syntax from XML and because prior knowledge of the documents' structure is required. In this paper, we introduce DTD-Miner, an automatic structure-mining tool for XML documents. Using a Web-based interface, the user can submit a set of similarly structured XML documents, and the system automatically suggests a DTD. The user can then refine the generated DTD to reduce its complexity by relaxing some of the rules used in the system.
format text
author HUE, Moh Chuang
LIM, Ee Peng
NG, Wee-Keong
title DTD-Miner: A tool for mining DTDs from XML documents
publisher Institutional Knowledge at Singapore Management University
publishDate 2000
url https://ink.library.smu.edu.sg/sis_research/990
https://ink.library.smu.edu.sg/context/sis_research/article/1989/viewcontent/d24f4dbbc34b23c537a57962a89c91183a15.pdf