BRAT: A Random Walk through the Semantic Spaces of the Blogosphere

Abstract

Semantic space models, such as Latent Semantic Analysis (LSA), Hyperspace Analog to Language (HAL), and Random Indexing (RI), offer convenient methods for representing semantic relations between words and concepts, abstracted from a distribution of documents. The distribution of documents determines the local co-occurrence patterns between words across the corpus and, in turn, the semantics abstracted from those patterns. Such methods are therefore sensitive to the statistical properties of the distribution of words over documents. For instance, the semantics of the word "table" abstracted from a scientific corpus may differ from those abstracted from a general corpus. In the first case, since "table" may occur in contexts such as "table of correlation" or "table of results", it would be considered associated with the word "correlation", whereas in the second case, because it may co-occur with "kitchen" or "living-room", it would instead be considered similar to "chair". Nevertheless, the formal relation between the properties of the distribution of word co-occurrences and the final semantics produced by semantic space methods has not been described until now. In the case of a mixed 'scientific and general' corpus, what makes the semantics of "table" become more similar to "chair" than to "Spearman", and vice versa?
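The corpus sensitivity described above can be illustrated with a minimal sketch. It is not LSA, HAL, or RI, and it is not the report's method: it compares raw word-by-document count vectors with cosine similarity, which is only the simplest stand-in for a semantic space. The two toy corpora and the helper names (`word_vectors`, `cosine`, `similarity`) are hypothetical and chosen purely for illustration.

```python
import math
from collections import Counter


def word_vectors(docs):
    """Map each word to its vector of occurrence counts, one entry per document."""
    counts = [Counter(doc.split()) for doc in docs]
    vocab = sorted({word for doc in docs for word in doc.split()})
    return {word: [c[word] for c in counts] for word in vocab}


def cosine(u, v):
    """Cosine similarity between two equal-length count vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def similarity(docs, w1, w2):
    """Similarity of two words based on the documents they occur in."""
    vecs = word_vectors(docs)
    return cosine(vecs[w1], vecs[w2])


# Hypothetical toy corpora (assumptions, not data from the report).
scientific = [
    "table correlation results",
    "table correlation coefficient",
    "chair committee",
]
general = [
    "table chair kitchen",
    "table chair living room",
    "correlation statistics",
]

# A mixed corpus in which general documents outnumber scientific ones 2:1.
mixed = scientific + 2 * general

for name, docs in [("scientific", scientific), ("general", general), ("mixed", mixed)]:
    print(name,
          "table~correlation:", round(similarity(docs, "table", "correlation"), 3),
          "table~chair:", round(similarity(docs, "table", "chair"), 3))
```

In this sketch, changing the proportion of scientific to general documents in `mixed` shifts which neighbor of "table" wins, which is the kind of dependence on the document distribution that the abstract asks about.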
