首页> 外文会议>ACM conference on information and knowledge management >Topic Detection and Organization of Mobile Text Messages
【24h】

Topic Detection and Organization of Mobile Text Messages

机译:主题检测和移动文本消息的组织

获取原文

摘要

How to organize and visualize big amount of text messages stored on one's mobile phone is a challenging problem, since they can hardly be organized by threads as we do for emails due to lack of necessary metadata such as "subject" and "reply-to". In this paper, we propose an innovative approach based on clustering algorithms and natural language processing methods. We first cluster the text messages into candidate conversations based on their temporal attributes, and then do further analysis using a semantic model based on Latent Dirichlet Allocation (LDA). Considering that the text messages are usually short and sparse, we trained the model using a large scale external data collected from twitter-like web sites, and applied the model to text messages. In the end, the text messages are organized as conversations based on their topics. We evaluated our approach based on 122,359 text messages collected from 50 university students during 6 months.
机译:如何组织和可视化存储在一个人的手机上的大量文本消息是一个具有挑战性的问题,因为由于缺乏必要的元数据,如“主题”和“回复”,我们几乎无法通过线程组织。 。本文提出了一种基于聚类算法和自然语言处理方法的创新方法。我们首先将文本消息群体基于它们的时间属性将文本消息集聚到候选对话中,然后使用基于潜在Dirichlet分配(LDA)的语义模型进行进一步的分析。考虑到文本消息通常短而稀疏,我们使用从像推特式网站收集的大规模外部数据训练模型,并将模型应用于短信。最后,文本消息被组织为基于主题的对话。我们根据6个月内从50名大学生收集的122,359名短信评估了我们的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号