首页> 外文期刊>International journal on digital libraries >Named entity evolution recognition on the Blogosphere
【24h】

Named entity evolution recognition on the Blogosphere

机译:在Blogosphere上命名为实体进化识别

获取原文
获取原文并翻译 | 示例
       

摘要

Advancements in technology and culture lead to changes in our language. These changes create a gap between the language known by users and the language stored in digital archives. It affects user's possibility to firstly find content and secondly interpret that content. In a previous work, we introduced our approach for named entity evolution recognition (NEER) in newspaper collections. Lately, increasing efforts in Web preservation have led to increased availability of Web archives covering longer time spans. However, language on the Web is more dynamic than in traditional media and many of the basic assumptions from the newspaper domain do not hold for Web data. In this paper we discuss the limitations of existing methodology for NEER. We approach these by adapting an existing NEER method to work on noisy data like the Web and the Blogosphere in particular. We develop novel filters that reduce the noise and make use of Semantic Web resources to obtain more information about terms. Our evaluation shows the potentials of the proposed approach.
机译:科技和文化的进步导致我们语言的变化。这些更改在用户已知的语言和数字档案中存储的语言之间造成了鸿沟。它影响了用户首先找到内容然后解释该内容的可能性。在先前的工作中,我们介绍了在报纸收藏中使用命名实体演化识别(NEER)的方法。近来,在Web保存方面的更多努力导致覆盖更长时间范围的Web档案的可用性增加。但是,与传统媒体相比,网络上的语言更具动态性,报纸领域的许多基本假设都不适用于网络数据。在本文中,我们讨论了NEER现有方法的局限性。我们通过调整现有的NEER方法来处理嘈杂的数据(例如Web和Blogosphere),从而解决这些问题。我们开发了减少噪音的新颖过滤器,并利用语义Web资源来获取有关术语的更多信息。我们的评估表明了该方法的潜力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号