Conference: International conference on current trends in theory and practice of computer science

Improving Relevance of Keyword Extraction from the Web Utilizing Visual Style Information

Abstract

Information is growing faster than ever before. We need to provide advanced services that facilitate information "consumption" (e.g., recommendation, personalized navigation). Such services require at least lightweight semantics. Nowadays, the keyword paradigm is widely used and seems to achieve satisfactory results in fields such as social bookmarking and ontology learning. In this paper, we explore the impact of a web site's visual style on the extraction of relevant keywords. We propose a method for extracting relevant keywords from web pages that combines traditional automatic term recognition algorithms with processing of the web site's visual style. We focus in particular on Cascading Style Sheets. An evaluation conducted on 200 "wild" Web documents from 12 different web sites showed that our method increases the relevance of the extracted keywords.
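
The abstract does not spell out the algorithm, so the following is only a minimal sketch of the general idea of combining a term-frequency score with visual style cues. It assumes the page has already been split into text fragments with their resolved CSS declarations. The names `style_weight` and `score_keywords`, the 16px baseline, and the 1.5 bold multiplier are all hypothetical choices made for illustration and are not taken from the paper.

```python
import re
from collections import defaultdict

def style_weight(style):
    """Map resolved CSS declarations to a multiplier (illustrative values only)."""
    weight = 1.0
    match = re.fullmatch(r"(\d+(?:\.\d+)?)px", style.get("font-size", ""))
    if match:
        # Assumption: scale relative to a 16px body font.
        weight *= float(match.group(1)) / 16.0
    if style.get("font-weight") in ("bold", "700"):
        # Assumption: bold text counts 1.5 times as much.
        weight *= 1.5
    return weight

def score_keywords(fragments, stopwords=frozenset()):
    """Rank terms by style-weighted frequency.

    fragments: iterable of (text, style_dict) pairs.
    Plain term frequency stands in here for a real automatic
    term recognition algorithm.
    """
    scores = defaultdict(float)
    for text, style in fragments:
        weight = style_weight(style)
        for token in re.findall(r"[a-z]+", text.lower()):
            if token not in stopwords:
                scores[token] += weight
    # Highest score first; ties broken alphabetically.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))

# Usage: a large bold heading and a plain body paragraph.
fragments = [
    ("Keyword extraction", {"font-size": "32px", "font-weight": "bold"}),
    ("extraction of the keyword from text text text", {"font-size": "16px"}),
]
ranked = score_keywords(fragments, stopwords=frozenset({"of", "the", "from"}))
```

In this toy input, "text" is the most frequent term in raw counts, but the heading's style weight moves "extraction" and "keyword" ahead of it, which is the kind of effect the abstract attributes to visual style information.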
