Simultaneously modeling semantics and structure of threaded discussions

机译：同时建模主题讨论的语义和结构

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The huge amount of knowledge in web communities has motivated the research interests in threaded discussions. The dynamic nature of threaded discussions poses lots of challenging problems for computer scientists. Although techniques such as semantic models and structural models have been shown to be useful in a number of areas, they are inefficient in understanding threaded discussions due to three reasons: (I) as most of users read existing messages before posting, posts in a discussion thread are temporally dependent on the previous ones; It causes the semantics and structure to be coupled with each other in threaded discussions; (II) in online discussion threads, there are a lot of junk posts which are useless and may disturb content analysis; and (III) it is very hard to judge the quality of a post. In this paper, we propose a sparse coding-based model named SMSS to Simultaneously Model Semantics and Structure of threaded discussions. The model projects each post into a topic space, and approximates each post by a linear combination of previous posts in the same discussion thread. Meanwhile, the model also imposes two sparse constraints to force a sparse post reconstruction in the topic space and a sparse post approximation from previous posts. The sparse properties effectively take into account the characteristics of threaded discussions. Towards the above three problems, we demonstrate the competency of our model in three applications: reconstructing reply structure of threaded discussions, identifying junk posts, and finding experts in a given board/sub-board in web communities. Experimental results show encouraging performance of the proposed SMSS model in all these applications.

机译：网络社区中的大量知识激发了螺纹讨论中的研究兴趣。讨论的动态本质给计算机科学家带来了许多具有挑战性的问题。尽管已显示诸如语义模型和结构模型之类的技术在许多领域都非常有用，但是由于以下三个原因，它们在理解主题讨论方面效率低下：（I）由于大多数用户在发布之前阅读了现有消息，因此在讨论中发表线程在时间上取决于先前的线程;它使语义和结构在多线程讨论中相互结合; （II）在在线讨论线程中，有很多垃圾帖子是无用的，可能会干扰内容分析; （三）很难判断一个职位的质量。在本文中，我们提出了一种基于稀疏编码的名为SMSS的模型，以同时对线程讨论的语义和结构进行建模。该模型将每个帖子投影到主题空间中，并通过同一讨论线程中以前的帖子的线性组合来近似每个帖子。同时，该模型还施加了两个稀疏约束，以强制在主题空间中进行稀疏的帖子重建，以及对先前帖子进行稀疏的帖子近似。稀疏属性有效地考虑了主题讨论的特征。针对上述三个问题，我们在三个应用程序中证明了我们模型的能力：重构线程讨论的回复结构，识别垃圾帖子以及在网络社区的给定董事会/子董事会中寻找专家。实验结果表明，所提出的SMSS模型在所有这些应用中均具有令人鼓舞的性能。

著录项

来源
《Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval》|2009年|P.131 - 138|共8页
会议地点
作者
Chen Lin; Jiang-Ming Yang; Rui Cai; Xin-Jing Wang; Wei Wang;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类情报检索;各种专用数据库;
关键词
expert finding; junk identification; reply reconstruction; sparse coding; threaded discussion;

机译：专家发现;垃圾识别;回复重建;稀疏编码;线程讨论;

相似文献

外文文献
中文文献
专利

1. Modelling structure and predicting dynamics of discussion threads in online boards [J] . Alexey N Medvedev, Jean-Charles Delvenne, Renaud Lambiotte, . 2018,第1期

机译：在线板上建模结构及预测讨论线程的动态
2. Sentence Embedding Based Semantic Clustering Approach for Discussion Thread Summarization [J] . Atif Khan, Qaiser Shah, M. Irfan Uddin, Complexity . 2020,第1期

机译：基于语句嵌入的语义聚类方法讨论线程汇总
3. A Model for Using Threaded Discussions in On-line Agricultural Education Courses [J] . Awoke Dollisso, Vikram Koundinya NACTA Journal . 2011,第1a4期

机译：在线农业教育课程中使用讨论式讨论的模型
4. Simultaneously modeling semantics and structure of threaded discussions [C] . Chen Lin, Jiang-Ming Yang, Rui Cai, International ACM SIGIR conference on Research and development in information retrieval . 2009

机译：同时建模语义和结构的线程讨论
5. Computer modeling of protein tertiary structure and DNA binding energetics. I. Empirical free energy analysis of the engrailed Q50K variant-DNA complex and its mutants. II. The predicted structure of the adenovirus E4 orf6 protein by threading and comparative protein modeling. [D] . Brown, Lawrence Milton, III. 2001

机译：蛋白质三级结构和DNA结合能学的计算机建模。 I.陷入困境的Q50K变异体-DNA复合体及其突变体的经验自由能分析。二。通过穿线和比较蛋白建模预测腺病毒E4 orf6蛋白的结构。
6. 3D Structure Prediction of Human β1-Adrenergic Receptor via Threading-Based Homology Modeling for Implications in Structure-Based Drug Designing [O] . Zaheer Ul-Haq, Maria Saeed, Sobia Ahsan Halim, -1

机译：通过基于线程的同源性建模对人β1-肾上腺素受体的3D结构预测对基于结构的药物设计具有重要意义
7. Modelling structure and predicting dynamics of discussion threads in online boards [O] . Alexey N Medvedev, Jean-Charles Delvenne, Renaud Lambiotte 2018

机译：在线板上建模结构及预测讨论线程的动态

Simultaneously modeling semantics and structure of threaded discussions

摘要

著录项

相似文献

相关主题

期刊订阅