首页> 外文会议>International Conference on Data Management Technologies and Applications >A Novel Method for Unsupervised and Supervised Conversational Message Thread Detection

【24h】

A Novel Method for Unsupervised and Supervised Conversational Message Thread Detection

机译：一种无监督和监督的对话消息线程检测的新方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Efficiently detecting conversation threads from a pool of messages, such as social network chats, emails, comments to posts, news etc., is relevant for various applications, including Web Marketing, Information Retrieval and Digital Forensics. Existing approaches focus on text similarity using keywords as features that are strongly dependent on the dataset. Therefore, dealing with new corpora requires further costly analyses conducted by experts to find out new relevant features. This paper introduces a novel method to detect threads from any type of conversational texts overcoming the issue of previously determining specific features for each dataset. To automatically determine the relevant features of messages we map each message into a three dimensional representation based on its semantic content, the social interactions in terms of sender/recipients and its timestamp; then clustering is used to detect conversation threads. In addition, we propose a supervised approach to detect conversation threads that builds a classification model which combines the above extracted features for predicting whether a pair of messages belongs to the same thread or not. Our model harnesses the distance measure of a message to a cluster representing a thread to capture the probability that a message is part of that same thread. We present our experimental results on seven datasets, pertaining to different types of messages, and demonstrate the effectiveness of our method in the detection of conversation threads, clearly outperforming the state of the art and yielding an improvement of up to a 19%.

机译：有效地检测来自一条消息池的对话线程，例如社交网络聊天，电子邮件，发布，新闻等，与各种应用相关，包括网络营销，信息检索和数字取证。现有方法专注于使用关键字作为强烈依赖于数据集的功能的文本相似性。因此，处理新的Corpora需要通过专家进行进一步的昂贵分析，以找到新的相关功能。本文介绍了一种从任何类型的会话文本中检测线程的新方法，克服了先前确定每个数据集的特定功能的问题。自动确定消息的相关特征，我们将每个消息映射到三维表示基于其语义内容，在发件人/收件人及其时间戳方面的社交交互;然后群集用于检测对话线程。此外，我们提出了一种监督方法来检测构建的对话线程，该对话线程构建组合上述提取特征的分类模型，以预测一对消息是否属于相同的线程。我们的模型利用表示表示线程的群集的消息的距离测量，以捕获消息是消息是该相同线程的一部分的概率。我们在七个数据集中介绍了我们的实验结果，与不同类型的信息有关，并展示了我们在检测对话线程中的方法的有效性，显然优于现有技术并产生高达19％的提高。

著录项

来源
《International Conference on Data Management Technologies and Applications 》|2016年|1(CD-ROM)|共12页
会议地点
作者
Giacomo Domeniconi; Konstantinos Semertzidis; Vanessa Lopez; Elizabeth M. Daly; Spyros Kotoulas; Gianluca Moro;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP274-53;
关键词
Clustering Algorithms; Conversation Threads; Topic Detection;

机译：聚类算法;会话线程;主题检测;

相似文献

外文文献
中文文献
专利

1. Unsupervised and supervised methods for the detection of hurriedly created profiles in recommender systems [J] . Panagiotakis Costas, Papadakis Harris, Fragopoulou Paraskevi International journal of machine learning and cybernetics . 2020 ,第9期

机译：在推荐系统中检测匆忙创建的配置文件的无监督和监督方法
2. An Overview of Deep Learning Based Methods for Unsupervised and Semi-Supervised Anomaly Detection in Videos [J] . B. Ravi Kiran, Dilip Mathew Thomas, Ranjith Parakkal Journal of Imaging . 2018 ,第2期

机译：基于深度学习的视频中无监督和半监督异常检测方法的概述
3. Automated lesion detection on MRI scans using combined unsupervised and supervised methods [J] . Dazhou Guo, Julius Fridriksson, Paul Fillmore, BMC Medical Imaging . 2015 ,第1期

机译：结合无监督和无监督方法对MRI扫描进行自动病变检测
4. A Novel Method for Unsupervised and Supervised Conversational Message Thread Detection [C] . Giacomo Domeniconi, Konstantinos Semertzidis, Vanessa Lopez, International Conference on Data Management Technologies and Applications . 2016

机译：一种无监督和监督的对话消息线程检测的新方法
5. Classifying Land Use/Land Cover Change Over Time within the Watershed Boundary of Keenjhar Lake Using Supervised, Unsupervised, and Hybrid Classification Methods [D] . Henry, Katherine Rae. 2021

机译：分类土地使用/陆地覆盖随着时间的推移在Keenjhar Lake的流域边界内使用监督，无监督和混合分类方法而变化
6. Automated lesion detection on MRI scans using combined unsupervised and supervised methods [O] . Dazhou Guo, Julius Fridriksson, Paul Fillmore, 2015

机译：使用无监督和监督方法相结合的MRI扫描自动病变检测
7. Supervised and Unsupervised Intrusion Detection Based on CAN Message Frequencies for In-vehicle Network [O] . Takuya Kuwahara, Yukino Baba, Hisashi Kashima, 2018

机译：基于CAN信息网络频率的监督和无监督的入侵检测

A Novel Method for Unsupervised and Supervised Conversational Message Thread Detection

摘要

著录项

相似文献

相关主题

期刊订阅