首页> 外文会议>IEEE International Conference on Data Mining Workshops >Towards Summarizing Popular Information from Massive Tourism Blogs
【24h】

Towards Summarizing Popular Information from Massive Tourism Blogs

机译:总结来自大量旅游博客的热门信息

获取原文

摘要

In this work, we propose a research method to summarize popular information from massive tourism blog data. First, we crawl blog contents from website and segment each of them into a semantic word vector separately. Then, we select the geographical terms in each word vector into a corresponding geographical term vector and present a new method to explore the hot tourism locations and, especially, their frequent sequential relations from a set of geographical term vectors. Third, we propose a novel word vector subdividing method to collect the local features for each hot location, and introduce the metric of max-confidence to identify the Things of Interest (ToI) associated to the location from the collected data. We illustrate the benefits of this approach by applying it to a Chinese online tourism blog data set. The experiment results show that the proposed method can be used to explore the hot locations, as well as their sequential relations and corresponding ToI, efficiently.
机译:在这项工作中,我们提出了一种从大量旅游博客数据中总结流行信息的研究方法。首先,我们从网站上抓取博客内容,并将它们分别分割成一个语义词向量。然后,我们将每个单词向量中的地理术语选择为对应的地理术语向量,并提出了一种新的方法来探索旅游热点,特别是从一组地理术语向量中探索它们频繁的顺序关系。第三,我们提出了一种新颖的词向量细分方法,以收集每个热点位置的局部特征,并引入最大置信度的度量,以从收集到的数据中识别与该位置相关的兴趣事物(ToI)。我们通过将这种方法应用于中国在线旅游博客数据集来说明这种方法的好处。实验结果表明,该方法可有效地探索热点位置及其顺序关系和对应的ToI。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号