首页> 外文期刊>Journal of software >A Method of Hot Topic Detection in Blogs Using TV-gram Model
【24h】

A Method of Hot Topic Detection in Blogs Using TV-gram Model

机译:电视文法模型的博客热点话题检测方法

获取原文
获取原文并翻译 | 示例
           

摘要

Over the last few years, blogs (web logs) have gained massive popularity and have become one of the most important web social media, through which people can get and release information. Hot topic detection in blogs is most commonly used in analyzing network public opinion. A method of hot topic detection using n-gram model and hotness of topic evaluation is proposed in this paper. Our approach consists of three steps. First of all, keywords during a given time period are obtained by means of calculating word's weight, and hot keywords are collected by combining keywords. Secondly, based on hot keywords, hot keyword groups are extracted using n-gram model. In the third step, hot keyword groups are extracted and hot topics are detected. The hotness of hot topic is evaluated by the value of keywords' weight, which is got in the second step. Evaluations on Chinese corpus show that when the size of n for n-gram is five, the proposed method is most effective.
机译:在过去的几年中,博客(网络日志)获得了广泛的普及,并已成为最重要的网络社交媒体之一,人们可以通过它获取和发布信息。博客中的热门话题检测最常用于分析网络舆情。提出了一种基于n-gram模型的热点话题检测和话题评价热点的方法。我们的方法包括三个步骤。首先,通过计算单词的权重获得给定时间段内的关键词,并通过组合关键词来收集热门关键词。其次,基于热关键词,使用n-gram模型提取热关键词组。第三步,提取热门关键词组并检测热门话题。热门话题的热门程度是通过第二步获得的关键字权重值来评估的。对中文语料库的评估表明,当n语法的n大小为5时,该方法最为有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号