首页> 外文期刊>IEEE Transactions on Knowledge and Data Engineering >Hot Topic Extraction Based on Timeline Analysis and Multidimensional Sentence Modeling
【24h】

Hot Topic Extraction Based on Timeline Analysis and Multidimensional Sentence Modeling

机译:基于时间线分析和多维句子建模的热点话题提取

获取原文
获取原文并翻译 | 示例
       

摘要

With the vast amount of digitized textual materials now available on the Internet, it is almost impossible for people to absorb all pertinent information in a timely manner. To alleviate the problem, we present a novel approach for extracting hot topics from disparate sets of textual documents published in a given time period. Our technique consists of two steps. First, hot terms are extracted by mapping their distribution over time. Second, based on the extracted hot terms, key sentences are identified and then grouped into clusters that represent hot topics by using multidimensional sentence vectors. The results of our empirical tests show that this approach is more effective in identifying hot topics than existing methods.
机译:互联网上现在有大量的数字化文本材料,人们几乎不可能及时吸收所有相关信息。为了缓解该问题,我们提出了一种新颖的方法,可以从给定时间段内发布的不同文本文档集中提取热门话题。我们的技术包括两个步骤。首先,通过映射热项随时间的分布来提取热项。其次,基于提取的热门术语,识别关键句子,然后通过使用多维句子向量将其分组为代表热门话题的聚类。我们的经验测试结果表明,这种方法比现有方法更有效地识别热门话题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号