首页> 外文期刊>Journal of Computing in Civil Engineering >Semantic Text Classification for Supporting Automated Compliance Checking in Construction
【24h】

Semantic Text Classification for Supporting Automated Compliance Checking in Construction

机译:语义文本分类以支持施工中的自动合规性检查

获取原文
获取原文并翻译 | 示例
       

摘要

Automated regulatory and contractual compliance checking requires automated rule extraction from regulatory and contractual textual documents (e.g., contract specifications). Automated rule extraction is a challenging task that requires complex processing of text. In the proposed automated compliance checking (ACC) approach, the first step in automating the rule extraction process is automatically classifying the different documents and parts of documents (e.g., contract clauses) into predefined categories (environmental, safety, health, etc.) for preparing it for further text analysis and rule extraction. These categories are defined in a semantic model for normative reasoning. This paper presents a semantic, machine learning-based text classification algorithm for classifying clauses and subclauses of general conditions for supporting ACC in construction. The multilabel classification problem was transformed into a set of binary classification problems. Different machine learning algorithms, text preprocessing techniques, methods of text feature scoring, methods of feature weighting, and feature sizes were implemented and evaluated at different thresholds. The developed classifier achieved 100 and 96% recall and precision, respectively, on the testing data. (C) 2014 American Society of Civil Engineers.
机译:自动化的法规和合同合规性检查要求从法规和合同文本文档(例如,合同规范)中自动提取规则。规则自动提取是一项艰巨的任务,需要复杂的文本处理。在提议的自动合规性检查(ACC)方法中,自动化规则提取过程的第一步是将不同的文档和文档的某些部分(例如合同条款)自动分类为预定义的类别(环境,安全,健康等),以用于为进一步的文本分析和规则提取做准备。这些类别是在语义模型中为规范推理定义的。本文提出了一种基于语义,基于机器学习的文本分类算法,用于对支持ACC的一般条件条款和子句进行分类。多标签分类问题转化为一组二进制分类问题。在不同的阈值下实现并评估了不同的机器学习算法,文本预处理技术,文本特征评分方法,特征加权方法以及特征大小。开发的分类器在测试数据上分别实现了100%和96%的查全率和精确度。 (C)2014年美国土木工程师学会。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号