首页> 外文期刊>ACM transactions on the web >Sentiment-Focused Web Crawling
【24h】

Sentiment-Focused Web Crawling

机译:以情感为中心的网络爬网

获取原文
获取原文并翻译 | 示例
           

摘要

Sentiments and opinions expressed in Web pages towards objects, entities, and products constitute an important portion of the textual content available in the Web. In the last decade, the analysis of such content has gained importance due to its high potential for monetization. Despite the vast interest in sentiment analysis, somewhat surprisingly, the discovery of sentimental or opinionated Web content is mostly ignored. This work aims to fill this gap and addresses the problem of quickly discovering and fetching the sentimental content present in the Web. To this end, we design a sentiment-focused Web crawling framework. In particular, we propose different sentiment-focused Web crawling strategies that prioritize discovered URLs based on their predicted sentiment scores. Through simulations, these strategies are shown to achieve considerable performance improvement over general-purpose Web crawling strategies in discovery of sentimental Web content.
机译:网页中表达的针对对象,实体和产品的情感和观点构成了网页中文本内容的重要组成部分。在过去的十年中,由于其具有很高的获利潜力,因此对此类内容的分析变得越来越重要。尽管人们对情感分析有着极大的兴趣,但令人惊讶的是,人们通常忽略了对情感或固执的Web内容的发现。这项工作旨在填补这一空白,并解决快速发现和获取Web中存在的情感内容的问题。为此,我们设计了一个以情感为中心的Web爬网框架。特别是,我们提出了不同的针对情感的Web爬网策略,这些策略根据发现的URL的预测情感分数来对发现的URL进行优先排序。通过仿真,在发现多愁善感的Web内容时,这些策略显示出与通用Web爬网策略相比可实现显着的性能改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号