TY - JOUR
T1 - Automatic extraction of apparent semantic structure from text contents of a structural calculation document
AU - Kim, Bong Geun
AU - Park, Sang Il
AU - Kim, Hyo Jin
AU - Lee, Sang Ho
PY - 2010
Y1 - 2010
N2 - A generic method for the automatic extraction of apparent semantic document structure from a structural calculation document was proposed in this paper. The method consists of two processes: extracting subtitles and classifying depth levels of the subtitles. The subtitles become tree nodes of the apparent semantic structure. A context model of technical documents was built for the subtitle extraction from plain text information. In addition, a formal classification method for the determination of depth levels of the subtitles was developed and used to build a document tree with sequentially ordered subtitles. An application module of the proposed method, which transforms a plain text document into a semistructured XML document, was implemented. Performance of the developed application module was also evaluated with 40 test documents including structural calculation documents, technical reports, and theses.
AB - A generic method for the automatic extraction of apparent semantic document structure from a structural calculation document was proposed in this paper. The method consists of two processes: extracting subtitles and classifying depth levels of the subtitles. The subtitles become tree nodes of the apparent semantic structure. A context model of technical documents was built for the subtitle extraction from plain text information. In addition, a formal classification method for the determination of depth levels of the subtitles was developed and used to build a document tree with sequentially ordered subtitles. An application module of the proposed method, which transforms a plain text document into a semistructured XML document, was implemented. Performance of the developed application module was also evaluated with 40 test documents including structural calculation documents, technical reports, and theses.
UR - http://www.scopus.com/inward/record.url?scp=77951212598&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=77951212598&partnerID=8YFLogxK
U2 - 10.1061/(ASCE)CP.1943-5487.0000047
DO - 10.1061/(ASCE)CP.1943-5487.0000047
M3 - Article
AN - SCOPUS:77951212598
SN - 0887-3801
VL - 24
SP - 313
EP - 324
JO - Journal of Computing in Civil Engineering
JF - Journal of Computing in Civil Engineering
IS - 3
ER -