...
首页> 外文期刊>Expert Systems >Conceptual feature generation for textual information using a conceptual network constructed from Wikipedia
【24h】

Conceptual feature generation for textual information using a conceptual network constructed from Wikipedia

机译:使用从Wikipedia构建的概念网络生成文本信息的概念特征

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

A proper semantic representation of textual information underlies many natural language processing tasks. In this paper, a novel semantic annotator is presented to generate conceptual features for text documents. A comprehensive conceptual network is automatically constructed with the aid of Wikipedia that has been represented as a Markov chain. Furthermore, semantic annotator gets a fragment of natural language text and initiates a random walk to generate conceptual features that represent topical semantic of the input text. The generated conceptual features are applicable to many natural language processing tasks where the input is textual information and the output is a decision based on its context. Consequently, the effectiveness of the generated features is evaluated in the task of document clustering and classification. Empirical results demonstrate that representing text using conceptual features and considering the relations between concepts can significantly improve not only the bag of words representation but also other state-of-the-art approaches.
机译:文本信息的正确语义表示法是许多自然语言处理任务的基础。本文提出了一种新颖的语义注释器来生成文本文档的概念特征。借助于表示为马尔可夫链的Wikipedia,可以自动构建一个全面的概念网络。此外,语义注释器获取自然语言文本的一部分,并发起随机游走以生成表示输入文本主题语义的概念特征。生成的概念特征适用于许多自然语言处理任务,其中输入是文本信息,而输出是基于上下文的决策。因此,在文档聚类和分类任务中评估了生成特征的有效性。实证结果表明,使用概念特征表示文本并考虑概念之间的关系不仅可以显着改善单词表示的方式,而且可以显着改善其他最新方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号