首页> 外文会议>International conference on information engineering and applications >Chinese Standard Comparative Sentence Recognition and Extraction Research
【24h】

Chinese Standard Comparative Sentence Recognition and Extraction Research

机译:中国标准比较句子识别与提取研究

获取原文

摘要

Information extraction is the first and foremost important task of Standard Knowledge Mining. The paper focuses on comparative sentence recognition and extraction. There are three steps, respectively Comparative Sentence Recognition, Technical Index Parameter Recognition and Technical Index Extraction. At first, we search the standard by keywords from a feature set lists in order to categorize the documents into its specific class. In addition, we build the regular expression to spot technical index parameter. Lastly, we treat the technical index extraction as a sequence labelling problem and treat the keyword, noun phrases, and theirs position as features training by CRF model. The final experiments show that the result performs very well in standard document comparative sentence recognition and extraction.
机译:信息提取是标准知识挖掘的第一个和最重要的任务。本文侧重于比较句子识别和提取。有三个步骤,分别比较句子识别,技术指标参数识别和技术指标提取。首先,我们通过来自功能集列表的标准通过关键字搜索标准,以便将文档分类为其特定类。此外,我们将正则表达式构建到现货技术索引参数。最后,我们将技术索引提取视为序列标记问题,并将关键字,名词短语及其位置视为CRF模型的特征培训。最后的实验表明,结果在标准文件比较句子识别和提取中表现得非常好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号