FUB, IASI-CNR and University of Tor Vergata at TREC 2008 Blog Track

Abstract

We take part in the opinion and polarity retrieval tasks of the blog track. A test collection, called Blog06, was created for the blog track in 2006 with three main components: feeds, permalinks and home pages. The collection contains spam as well as, possibly, non-blog and non-English pages. For our experiments only the permalinks were taken into consideration: 3.2 million Web pages totalling 88.8 GB, each containing a post and its related comments. The evaluation metrics are precision/recall based, namely Mean Average Precision (MAP) and R-Precision (RPrec), but we also focused on Precision at 10 (P10) because of its relevance in evaluating the effectiveness of Web search engines. As in 2007, we based our approach on the construction of ad hoc weighted dictionaries containing terms assumed to express a sentiment. The weight of a term measures how much sentiment the term expresses. To construct our dictionaries automatically, we assumed that opinion-bearing words are distributed more randomly in the set of opinionated documents than semantic-bearing terms, but less randomly than non-informative terms.
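The three evaluation measures named in the abstract can be illustrated with a minimal Python sketch. It uses the standard textbook definitions of P10, R-Precision and Average Precision, not the authors' evaluation code; the function names, the document identifiers and the assumption that a ranking contains no duplicate documents are all hypothetical choices made for illustration.

```python
from typing import Dict, Sequence, Set


def precision_at_k(ranking: Sequence[str], relevant: Set[str], k: int = 10) -> float:
    """Fraction of the top-k positions occupied by relevant documents.

    Missing positions (ranking shorter than k) count as non-relevant.
    """
    if k <= 0:
        return 0.0
    return sum(1 for doc in ranking[:k] if doc in relevant) / k


def r_precision(ranking: Sequence[str], relevant: Set[str]) -> float:
    """Precision at rank R, where R is the number of relevant documents."""
    r = len(relevant)
    if r == 0:
        return 0.0
    return sum(1 for doc in ranking[:r] if doc in relevant) / r


def average_precision(ranking: Sequence[str], relevant: Set[str]) -> float:
    """Mean of the precision values at each rank where a relevant document
    appears, divided by the total number of relevant documents."""
    if not relevant:
        return 0.0
    hits = 0
    total = 0.0
    for rank, doc in enumerate(ranking, start=1):
        if doc in relevant:
            hits += 1
            total += hits / rank
    return total / len(relevant)


def mean_average_precision(
    rankings: Dict[str, Sequence[str]], qrels: Dict[str, Set[str]]
) -> float:
    """MAP: average of the per-topic Average Precision over all judged topics."""
    if not qrels:
        return 0.0
    return sum(
        average_precision(rankings.get(topic, []), relevant)
        for topic, relevant in qrels.items()
    ) / len(qrels)


# Hypothetical toy topic: four retrieved documents, three relevant ones
# (one of which, "d5", was never retrieved).
ranking = ["d1", "d2", "d3", "d4"]
relevant = {"d1", "d3", "d5"}
```

For this toy topic, relevant documents appear at ranks 1 and 3, so Average Precision is (1/1 + 2/3) / 3, R-Precision is 2/3 and P10 is 2/10.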
