首页> 外文会议>Recent advances in signal processing, robotics and automation >A new approach for better document retrieval and classification performance using supervised WSD and Concept Graph
【24h】

A new approach for better document retrieval and classification performance using supervised WSD and Concept Graph

机译:使用监督的WSD和Concept Graph改善文档检索和分类性能的新方法

获取原文
获取原文并翻译 | 示例

摘要

Word Sense Disambiguation (WSD) is main task in the area of natural language processing (NLP). Supervised WSD methods are shown to be more effective than other WSD methods with the limitation of the size of manual annotated learning set. On the other hand, Concept graph is a weighted graph with each of its edges representing the relationships between concepts (relevancy of each pair of concepts). In this paper, we propose a method to improve the retrieval and classification performance of documents from different sources by means of concept graph. In our method, some features are initially selected from a training set by applying a well-known feature selection algorithm. Then, by injecting suggested relevant words for each class from the concept graph, a more enriched feature set is produced to apply to the test set. Our experimental results exhibit an improvement of 14.6% and 18.4% (few and more term injection evaluations, respectfully) in classification and also some improvements in retrieval performance.
机译:词义消歧(WSD)是自然语言处理(NLP)领域的主要任务。有监督的WSD方法显示出比其他WSD方法更有效的方法,但该方法受到手动注释学习集大小的限制。另一方面,概念图是一个加权图,其每个边缘表示概念之间的关系(每对概念的相关性)。本文提出了一种利用概念图的方法来提高不同来源文档的检索和分类性能。在我们的方法中,首先通过应用众所周知的特征选择算法从训练集中选择一些特征。然后,通过从概念图中为每个类别注入建议的相关单词,可以生成更加丰富的功能集以应用于测试集。我们的实验结果在分类方面显示了14.6%和18.4%的改进(分别进行了少量和更长期的注射评估),并且在检索性能方面也有一些改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号