首页> 外文期刊>Pattern recognition letters >Online Biterm Topic Model based short text stream classification using short text expansion and concept drifting detection
【24h】

Online Biterm Topic Model based short text stream classification using short text expansion and concept drifting detection

机译:使用短文本扩展和概念漂移检测的基于在线Biterm主题模型的短文本流分类

获取原文
获取原文并翻译 | 示例
       

摘要

Short text stream classification suffers from enormous challenges, due to the sparsity, high dimension and rapid variability of the short text stream. In this paper, we present a short text stream classification approach refined from online Biterm Topic Model (BTM) using short text expansion and concept drifting detection. Specifically, in our method, we firstly extend short text streams from an external resource to make up for the sparsity of data, and use online BTM to select representative topics instead of the word vector to represent the feature of short texts. Secondly, we propose a concept drift detection method based on the topic model to detect the hidden concept drifts in short text streams. Thirdly, we build an ensemble model using several data chunks and update with the newest data chunk and results of the concept drift detection. Finally, extensive experimental results demonstrate that compared to well-known baselines, our approach achieves a better performance in the classification and concept drifting detection. (C) 2018 Elsevier B.V. All rights reserved.
机译:由于短文本流的稀疏性,高维度和快速变化性,短文本流分类遭受巨大挑战。在本文中,我们提出了一种使用短文本扩展和概念漂移检测从在线Biterm主题模型(BTM)中提炼的短文本流分类方法。具体而言,在我们的方法中,我们首先从外部资源扩展了短文本流以弥补数据的稀疏性,然后使用在线BTM选择代表主题,而不是用词向量来代表短文本的特征。其次,提出一种基于主题模型的概念漂移检测方法,以检测短文本流中隐藏的概念漂移。第三,我们使用几个数据块构建一个集成模型,并使用最新的数据块和概念漂移检测的结果进行更新。最后,大量的实验结果表明,与众所周知的基准相比,我们的方法在分类和概念漂移检测中取得了更好的性能。 (C)2018 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号