Structured storage and retrieval of SGML documents using Grove

Hak Gyoon Kim, Sung-Bae Cho

Research output: Contribution to journalArticle

5 Citations (Scopus)

Abstract

SGML standardized in ISO 8879 [International Organization for Standardization (1986)] has been proliferated because it can provide various styles and transform documents on different platforms. The SGML document has logical structure information in addition to the contents. As SGML documents are widely used, there is an increasing demand for a storage and retrieval system to use the logical structure of documents efficiently. However, traditional retrieval systems based on document indexes cannot exploit the logical structure appropriately. In this paper, we have developed a document storage and retrieval system based on structure information, where the SGML document is transformed into Grove, which is the document model for DSSSL and HyTime, and stored at an element level by an object-oriented DBMS, Object Store. It supports structured documents and provides a query interface to retrieve information contained in the structures.

Original languageEnglish
Pages (from-to)643-657
Number of pages15
JournalInformation Processing and Management
Volume36
Issue number4
DOIs
Publication statusPublished - 2000 Jul 1

Fingerprint

SGML
Standardization
Logic
international organization
Information structure

All Science Journal Classification (ASJC) codes

  • Information Systems
  • Media Technology
  • Computer Science Applications
  • Management Science and Operations Research
  • Library and Information Sciences

Cite this

@article{279cfe94c1d142e1a8352673e9910dce,
title = "Structured storage and retrieval of SGML documents using Grove",
abstract = "SGML standardized in ISO 8879 [International Organization for Standardization (1986)] has been proliferated because it can provide various styles and transform documents on different platforms. The SGML document has logical structure information in addition to the contents. As SGML documents are widely used, there is an increasing demand for a storage and retrieval system to use the logical structure of documents efficiently. However, traditional retrieval systems based on document indexes cannot exploit the logical structure appropriately. In this paper, we have developed a document storage and retrieval system based on structure information, where the SGML document is transformed into Grove, which is the document model for DSSSL and HyTime, and stored at an element level by an object-oriented DBMS, Object Store. It supports structured documents and provides a query interface to retrieve information contained in the structures.",
author = "Kim, {Hak Gyoon} and Sung-Bae Cho",
year = "2000",
month = "7",
day = "1",
doi = "10.1016/S0306-4573(99)00075-8",
language = "English",
volume = "36",
pages = "643--657",
journal = "Information Processing and Management",
issn = "0306-4573",
publisher = "Elsevier Limited",
number = "4",

}

Structured storage and retrieval of SGML documents using Grove. / Kim, Hak Gyoon; Cho, Sung-Bae.

In: Information Processing and Management, Vol. 36, No. 4, 01.07.2000, p. 643-657.

Research output: Contribution to journalArticle

TY - JOUR

T1 - Structured storage and retrieval of SGML documents using Grove

AU - Kim, Hak Gyoon

AU - Cho, Sung-Bae

PY - 2000/7/1

Y1 - 2000/7/1

N2 - SGML standardized in ISO 8879 [International Organization for Standardization (1986)] has been proliferated because it can provide various styles and transform documents on different platforms. The SGML document has logical structure information in addition to the contents. As SGML documents are widely used, there is an increasing demand for a storage and retrieval system to use the logical structure of documents efficiently. However, traditional retrieval systems based on document indexes cannot exploit the logical structure appropriately. In this paper, we have developed a document storage and retrieval system based on structure information, where the SGML document is transformed into Grove, which is the document model for DSSSL and HyTime, and stored at an element level by an object-oriented DBMS, Object Store. It supports structured documents and provides a query interface to retrieve information contained in the structures.

AB - SGML standardized in ISO 8879 [International Organization for Standardization (1986)] has been proliferated because it can provide various styles and transform documents on different platforms. The SGML document has logical structure information in addition to the contents. As SGML documents are widely used, there is an increasing demand for a storage and retrieval system to use the logical structure of documents efficiently. However, traditional retrieval systems based on document indexes cannot exploit the logical structure appropriately. In this paper, we have developed a document storage and retrieval system based on structure information, where the SGML document is transformed into Grove, which is the document model for DSSSL and HyTime, and stored at an element level by an object-oriented DBMS, Object Store. It supports structured documents and provides a query interface to retrieve information contained in the structures.

UR - http://www.scopus.com/inward/record.url?scp=0033872485&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0033872485&partnerID=8YFLogxK

U2 - 10.1016/S0306-4573(99)00075-8

DO - 10.1016/S0306-4573(99)00075-8

M3 - Article

AN - SCOPUS:0033872485

VL - 36

SP - 643

EP - 657

JO - Information Processing and Management

JF - Information Processing and Management

SN - 0306-4573

IS - 4

ER -