【24h】

Sentiment-Focused Web Crawling

机译:以情感为中心的网络爬网

获取原文

摘要

The sentiments and opinions that are expressed in web pages towards objects, entities, and products constitute an important portion of the textual content available in the Web. Despite the vast interest in sentiment analysis and opinion mining, somewhat surprisingly, the discovery of the sentimental or opinionated web content is mostly ignored. This work aims to fill this gap and address the problem of quickly discovering and fetching the sentimental content present in the Web. To this end, we design a sentiment-focused web crawling framework for faster discovery and retrieval of such content. In particular, we propose different sentiment-focused web crawling strategies that prioritize discovered URLs based on their predicted sentiment; scores. Through simulations, these strategies are shown to achieve considerable performance improvement over general-purpose web crawling strategies in discovering sentimental content.
机译:网页中表达的有关对象,实体和产品的情感和观点构成了Web上可用文本内容的重要部分。尽管人们对情感分析和观点挖掘有着浓厚的兴趣,但令人惊讶的是,人们通常忽略了对情感或固执的Web内容的发现。这项工作旨在填补这一空白,并解决快速发现和获取Web中存在的情感内容的问题。为此,我们设计了以情感为中心的网络爬网框架,以更快地发现和检索此类内容。特别是,我们提出了不同的针对情感的Web爬网策略,这些策略根据发现的URL的预测情感来对其进行优先级排序。分数。通过仿真,这些策略在发现情感内容方面比通用Web爬网策略显示出了显着的性能提升。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号