首页> 外文期刊>Information Processing & Management >Mining knowledge from natural language texts using fuzzy associated concept mapping
【24h】

Mining knowledge from natural language texts using fuzzy associated concept mapping

机译:使用模糊关联概念映射从自然语言文本中挖掘知识

获取原文
获取原文并翻译 | 示例
           

摘要

Natural Language Processing (NLP) techniques have been successfully used to automatically extract information from unstructured text through a detailed analysis of their content, often to satisfy particular information needs. In this paper, an automatic concept map construction technique, Fuzzy Association Concept Mapping (FACM), is proposed for the conversion of abstracted short texts into concept maps. The approach consists of a linguistic module and a recommendation module. The linguistic module is a text mining method that does not require the use to have any prior knowledge about using NLP techniques. It incorporates rule-based reasoning (RBR) and case based reasoning (CBR) for anaphoric resolution. It aims at extracting the propositions in text so as to construct a concept map automatically. The recommendation module is arrived at by adopting fuzzy set theories. It is an interactive process which provides suggestions of propositions for further human refinement of the automatically generated concept maps. The suggested propositions are relationships among the concepts which are not explicitly found in the paragraphs. This technique helps to stimulate individual reflection and generate new knowledge. Evaluation was carried out by using the Science Citation Index (SCI) abstract database and CNET News as test data, which are well known databases and the quality of the text is assured. Experimental results show that the automatically generated concept maps conform to the outputs generated manually by domain experts, since the degree of difference between them is proportionally small. The method provides users with the ability to convert scientific and short texts into a structured format which can be easily processed by computer. Moreover, it provides knowledge workers with extra time to rethink their written text and to view their knowledge from another angle.
机译:自然语言处理(NLP)技术已成功用于通过对非结构化文本的内容进行详细分析来自动从非结构化文本中提取信息,从而经常满足特定的信息需求。本文提出了一种自动概念图构建技术,即模糊关联概念图(FACM),用于将抽象的短文本转换为概念图。该方法包括语言模块和推荐模块。语言模块是一种文本挖掘方法,不需要使用它就具有使用NLP技术的任何先验知识。它结合了基于规则的推理(RBR)和基于案例的推理(CBR)来进行隐喻解析。它旨在提取文本中的命题,以便自动构建概念图。推荐模块是采用模糊集理论得出的。它是一个交互过程,为进一步人工完善自动生成的概念图提供建议。建议的建议是各段落中未明确找到的概念之间的关系。这种技术有助于激发个人反思并产生新知识。通过使用科学引文索引(SCI)抽象数据库和CNET新闻作为测试数据进行评估,这些数据是众所周知的数据库,并且可以确保文本的质量。实验结果表明,自动生成的概念图与领域专家手动生成的输出相符,因为它们之间的差异程度成比例地小。该方法为用户提供了将科学文本和短文本转换为可以由计算机轻松处理的结构化格式的能力。此外,它为知识工作者提供了更多的时间来重新思考他们的书面文字并从另一个角度查看他们的知识。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号