...
首页> 外文期刊>Programming and Computer Software >Applying Time Series for Background User Identification Based on Their Text Data Analysis
【24h】

Applying Time Series for Background User Identification Based on Their Text Data Analysis

机译:基于时间序列的文本数据分析在背景用户识别中的应用

获取原文
获取原文并翻译 | 示例
           

摘要

An approach to user identification based on deviations of their topic trends in operation with text information is presented. An approach is proposed to solve this problem; the approach implies topic analysis of the user's past trends (behavior) in operation with text content of various (including confidential) categories and forecast of their future behavior. The topic analysis of user's operation implies determining the principal topics of their text content and calculating their respective weights at the given instants. Deviations in the behavior in the user's operation with the content from the forecast are used to identify this user. In the framework of this approach, our own original time series forecasting method is proposed based on orthogonal non-negative matrix factorization (ONMF). Note that ONMF has not been used to solve time series forecasting problems before. The experimental research held on the example of real-world corporate emailing formed out of the Enron data set showed the proposed user identification approach to be applicable.
机译:提出了一种基于文本信息在操作中基于其主题趋势的偏差进行用户识别的方法。提出了解决该问题的方法。该方法意味着使用各种(包括机密)类别的文本内容对用户过去的操作趋势(行为)进行主题分析,并预测其未来行为。用户操作的主题分析意味着确定其文本内容的主要主题,并在给定的瞬间计算其各自的权重。用户操作中行为与预测内容之间的偏差将用于识别该用户。在这种方法的框架内,基于正交非负矩阵分解(ONMF),提出了我们自己的原始时间序列预测方法。请注意,之前从未使用ONMF解决时间序列预测问题。对以Enron数据集构成的真实公司电子邮件示例进行的实验研究表明,所提出的用户识别方法是适用的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号