Proceedings of the Twenty-Third International Joint Conference on Artificial Intelligence

Computing Text Semantic Relatedness using the Contents and Links of a Hypertext Encyclopedia: Extended Abstract




We propose methods for computing semantic relatedness between words or texts by using knowledge from hypertext encyclopedias such as Wikipedia. A network of concepts is built by filtering the encyclopedia’s articles, each concept corresponding to an article. A random walk model based on the notion of Visiting Probability (VP) is employed to compute the distance between nodes, and then between sets of nodes. To transfer learning from the network of concepts to text analysis tasks, we develop two common representation approaches. In the first approach, the shared representation space is the set of concepts in the network, and every text is represented in this space. In the second approach, a latent space is used as the shared representation, and a transformation from words to the latent space is trained over VP scores. We applied our methods to four important tasks in natural language processing: word similarity, document similarity, document clustering and classification, and ranking in information retrieval. The performance is state-of-the-art or close to it for each task, thus demonstrating the generality of the proposed knowledge resource and the associated methods.
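To make the idea of a visiting probability concrete, the following is a minimal Python sketch, not the authors' implementation. It assumes one plausible reading of VP: the probability that a random walk starting at one node reaches a target node within a fixed number of steps, with the target treated as absorbing. The uniform transition over outgoing links, the `max_steps` truncation, the function name `visiting_probability`, and the toy `links` graph are all assumptions made for illustration; the abstract does not specify any of them.

```python
def visiting_probability(adj, start, target, max_steps=10):
    """Probability that a random walk from `start` visits `target`
    within `max_steps` steps (hypothetical, truncated definition).

    adj maps each node to a list of nodes it links to; each outgoing
    link is followed with equal probability (an assumption).
    """
    if start == target:
        return 1.0
    # mass[n]: probability the walk is at n and has not yet hit target
    mass = {start: 1.0}
    visited = 0.0
    for _ in range(max_steps):
        next_mass = {}
        for node, p in mass.items():
            neighbors = adj.get(node, [])
            if not neighbors:
                continue  # dead end: this mass can never reach target
            share = p / len(neighbors)
            for nb in neighbors:
                if nb == target:
                    visited += share  # walk is absorbed at the target
                else:
                    next_mass[nb] = next_mass.get(nb, 0.0) + share
        mass = next_mass
    return visited


# Toy concept network (hypothetical): article -> linked articles
links = {"A": ["B", "C"], "B": ["C"], "C": ["A"], "D": ["A"]}

vp_ac = visiting_probability(links, "A", "C", max_steps=2)
vp_ab = visiting_probability(links, "A", "B", max_steps=3)
vp_ad = visiting_probability(links, "A", "D", max_steps=5)
```

In this sketch a higher value means the target concept is easier to reach from the start concept, so a distance could be derived from it in several ways; extending the same idea from single nodes to sets of nodes, as the abstract describes, would start the walk from a distribution over the source set rather than from one node.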




