Automatic Extraction of Apparent Semantic Structure from Text Contents of a Structural Calculation Document

Bong-Geun Kim; Sang II Park; Hyo-Jin Kim; Sang-Ho Lee

Abstract

A generic method for the automatic extraction of apparent semantic document structure from a structural calculation document was proposed in this paper. The method consists of two processes: extracting subtitles and classifying the depth levels of the subtitles. The subtitles become tree nodes of the apparent semantic structure. A context model of technical documents was built for subtitle extraction from plain text information. In addition, a formal classification method for determining the depth levels of the subtitles was developed and used to build a document tree with sequentially ordered subtitles. An application module of the proposed method, which transforms a plain text document into a semi-structured XML document, was implemented. The performance of the developed application module was also evaluated with 40 test documents, including structural calculation documents, technical reports, and theses.
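
The two processes named in the abstract (subtitle extraction and depth-level classification) can be illustrated with a small sketch. The abstract does not describe the paper's context model or its formal classification method, so the code below is not the authors' method: it swaps in a simple heuristic that assumes subtitles carry decimal numbering such as "1.2" and takes the depth from the number of components. The names `HEADING`, `extract_subtitles`, `build_tree`, and the `document`/`section` element names are hypothetical.

```python
import re
import xml.etree.ElementTree as ET

# Assumption: a subtitle is a short line that starts with a dotted decimal
# number ("1.", "1.2", "2.3.1") and does not end like a sentence.
HEADING = re.compile(r"^\s*(\d+(?:\.\d+)*)\.?\s+(\S.{0,78})$")


def extract_subtitles(text):
    """Return (depth, number, title) for each line that looks like a subtitle."""
    subtitles = []
    for line in text.splitlines():
        match = HEADING.match(line)
        if match and not line.rstrip().endswith((".", ",", ";")):
            number, title = match.group(1), match.group(2).strip()
            # Depth heuristic: "1" -> 1, "1.2" -> 2, "1.2.3" -> 3.
            subtitles.append((number.count(".") + 1, number, title))
    return subtitles


def build_tree(subtitles):
    """Nest sequentially ordered subtitles into an XML tree by depth."""
    root = ET.Element("document")
    stack = [(0, root)]  # (depth, element); the root sits at depth 0
    for depth, number, title in subtitles:
        # Climb back up until the top of the stack is a shallower node.
        while stack[-1][0] >= depth:
            stack.pop()
        node = ET.SubElement(stack[-1][1], "section", number=number, title=title)
        stack.append((depth, node))
    return root


sample = """1. Design Conditions
The bridge is a two-span continuous girder.
1.1 Materials
Concrete strength is 27 MPa.
1.2 Loads
2. Section Analysis
2.1 Bending Moment
"""

root = build_tree(extract_subtitles(sample))
print(ET.tostring(root, encoding="unicode"))
```

The stack keeps the path from the root to the most recent subtitle, so each new subtitle attaches to the nearest preceding subtitle of smaller depth. Documents whose subtitles are unnumbered would need a richer model than this numbering heuristic.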


Bibliographic details

Source: Journal of Computing in Civil Engineering, 2010, No. 3, pp. 313-324 (12 pages)
Authors: Bong-Geun Kim; Sang II Park; Hyo-Jin Kim; Sang-Ho Lee
Affiliation (all four authors): Dept. of Civil and Environmental Engineering, Yonsei Univ., Seoul 120-749, Korea
Indexed in: Science Citation Index (SCI); Engineering Index (EI)
Original format: PDF
Language: English
Keywords: data collection; documentation; structures; computer applications

