Ontology-based sequence labelling for automated information extraction for supporting bridge data analytics

机译：基于本体的序列标记，用于支持桥梁数据分析的自动信息提取

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The massive amount of data/information buried in textual bridge inspection reports open opportunities to leverage big data analytics for advanced information-rich bridge deterioration prediction. However, utilizing textual data for bridge deterioration prediction is challenging because of its inherently unstructured nature. To this end, this paper proposes an ontology-based information extraction (IE) framework that automatically recognizes and extracts key data/information from unstructured textual reports, and represents the extracted data/information in a structured way that is ready for data analytics. The proposed IE framework is composed of two primary components: (1) ontology-based sequence labelling for term identification, and (2) ontology-based dependency grammar for relationship association. This paper focuses on presenting the proposed sequence labelling methodology. The methodology utilizes ontology-based begin, inside, and outside (BIO) encoding for phrase-level segmentation and Conditional Random Field (CRF) for ontology-based labelling in both token and phrase levels. The experimental results showed that the proposed methodology has a precision of 97% and a recall of 91 %.

机译：数据/信息埋在文本桥梁检测的巨量报告公开的机会，以先进的信息丰富的桥恶化预测利用大数据分析。然而，对于桥梁恶化预测利用文本数据，因为其固有的非结构化性质的挑战。为此，提出了一种基于本体的信息提取（IE）的框架，可以自动识别和从非结构化文本报告中提取密钥数据/信息，和表示在该准备用于数据分析以结构化方式所提取的数据/信息。所提出的IE框架由两个主要部分组成：（1）项识别基于本体的序列标签，和（2）本体的基于依赖性的语法为关系关联。本文着重介绍拟议序列标注方法。该方法利用基于本体开始，里面，和外侧（BIO）编码词组级分割和条件随机场（CRF），用于在两个令牌和短语级别基于本体的标记。实验结果表明，该方法具有97％的精确度和91％的召回。

著录项

来源
《International Conference on Sustainable Design, Engineering and Construction》|2016年|2 v.|共7页
会议地点
作者
Kaijian Liu; Nora El-Gohary;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类建筑科学;
关键词
Ontology; Sequence labelling; Information extraction; Infrastructure system data analytics; Bridge deterioration prediction;

机译：本体;序列标签;信息提取;基础设施系统数据分析;桥梁劣化预测;

相似文献

外文文献
中文文献
专利

1. Ontology-based semi-supervised conditional random fields for automated information extraction from bridge inspection reports [J] . Liu Kaijian, El-Gohary Nora Automation in construction . 2017,第sepa期

机译：基于本体的半监督条件随机字段，用于从桥梁检查报告中自动提取信息
2. A REVIEW ON ONTOLOGY-BASED LABEL EXTRACTION FROM IMAGE DATA [J] . AHMAD ADEL ABU-SHAREHA, MANDAVA RAJESWARI Journal of Theoretical and Applied Information Technology . 2015,第2期

机译：图像数据中基于本体的标签提取综述
3. Context-aware sequence labeling for condition information extraction from historical bridge inspection reports [J] . Tianshu Li, Mohamad Alipour, Devin K. Harris Advanced engineering informatics . 2021,第Auga期

机译：来自历史桥式检查报告的条件信息提取的上下文感知序列标记
4. Ontology-based sequence labelling for automated information extraction for supporting bridge data analytics [C] . Kaijian Liu, Nora El-Gohary International Conference on Sustainable Design, Engineering and Construction . 2016

机译：基于本体的序列标记，用于支持桥梁数据分析的自动信息提取
5. Web support for automated analysis of DNA sequences. [D] . Hassan, Wael A. 2000

机译：Web支持，可自动分析DNA序列。
6. Multimodal Teaching Analytics: Automated Extraction of Orchestration Graphs from Wearable Sensor Data [O] . Luis P. Prieto, Kshitij Sharma, Łukasz Kidzinski, -1

机译：多模式教学分析：从可穿戴式传感器数据中自动提取业务流程图
7. Ontology-based Sequence Labelling for Automated Information Extraction for Supporting Bridge Data Analytics [O] . Liu Kaijian, El-Gohary Nora 2016

机译：基于本体的序列标记，可自动提取信息以支持桥梁数据分析
8. Automated Extraction and Characterisation of Social Network Data from Unstructured Sources -- An Ontology-Based Approach. [R] . Martineau, E., Lecocq, R. 2013

机译：非结构化源社交网络数据的自动提取与表征 - 基于本体论的方法。

Ontology-based sequence labelling for automated information extraction for supporting bridge data analytics

摘要

著录项

相似文献

相关主题

期刊订阅