【24h】

CEW-DTW: A new time series model for text mining

机译:CEW-DTW:用于文本挖掘的新时间序列模型

获取原文

摘要

The keyword information is usually applied to describe answers. In most of the previous studies, researchers usually rank answers according to keyword retrieval, which fails to consider the importance of the time sequence of keywords in answers. In this paper, we propose CEW-DTW, a new time series model for answer ranking. This model considers the importance of the time sequence of keywords as well as the amount of keywords. CEW-DTW is developed from a carefully designed model, Dynamic Time Warping-Delta (DTW-D). We choose Amazon question/answer data as our evaluation dataset. We apply Entropy to remove noise in answer vectors. In experiments, we apply normalized discounted cumulative gain (nDCG) as the assess rule to test models. CEW-DTW is proven to have a better performance than Dynamic Time Warping (DTW) and Dynamic Time Warping-Delta (DTW-D) in answer ranking. An extensive set of evaluation results demonstrates the effectiveness of the CEW-DTW model for answer ranking.
机译:关键字信息通常用于描述答案。在大多数以前的研究中,研究人员通常根据关键字检索对答案进行排名,而这并未考虑关键字在答案中的时间顺序的重要性。在本文中,我们提出了CEW-DTW,一种用于答案排名的新时间序列模型。该模型考虑了关键字时间序列的重要性以及关键字的数量。 CEW-DTW是从精心设计的模型动态时间扭曲增量(DTW-D)中开发出来的。我们选择Amazon问题/答案数据作为评估数据集。我们应用熵来去除答案向量中的噪声。在实验中,我们将标准化的折现累计收益(nDCG)作为评估规则应用于测试模型。事实证明,CEW-DTW在答案排名方面比动态时间规整(DTW)和动态时间规整-Delta(DTW-D)更好。大量的评估结果证明了CEW-DTW模型对答案排名的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号