首页> 外文会议>IEEE International Conference on Tools with Artificial Intelligence >An Unsupervised Approach for Precise Context Identification from Unstructured Text Documents
【24h】

An Unsupervised Approach for Precise Context Identification from Unstructured Text Documents

机译:从非结构化文本文件中精确上下文识别的无监督方法

获取原文

摘要

The majority of the documents produced and exchanged through medias and social networks are unstructured. Due to the amount of these unstructured documents on the Web, their exploitation represents a tedious or even impossible task for human beings without assistance by dedicated algorithms and specialized computer systems in document classification or information extraction. To be efficient and relevant, such systems have to understand the content of these unstructured documents. The context (or topic) of a document is one of the basic information essential for the understanding of its content, and the more precise the context of a document, the more relevant its understanding will be. This paper presents a precise context identification approach that is evaluated quantitatively and qualitatively on several reference corpora and compared to other context identification systems. The contexts identified by our model are much more precise than those identified by these others systems.
机译:通过媒体和社交网络制作和交换的大部分文件都是非结构化的。由于网络上的这些非结构化文件的数量,他们的开发代表了人类的乏味甚至不可能的任务,没有专用算法和文档分类或信息提取的专业计算机系统的帮助。为了高效和相关,这些系统必须了解这些非结构化文件的内容。文档的上下文(或主题)是对其内容的理解所必需的基本信息之一,并且文档的上下文更准确,其理解越多。本文提出了一种精确的上下文识别方法,其在几个参考数集上定量和定性地评估并与其他上下文识别系统进行比较。我们模型所识别的上下文比这些其他系统识别的更精确。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号