首页> 外文会议>International Conference on Cyber and IT Service Management >The application of Centroid linkage hierarchical method and Hill climbing method in comments clustering online discussion forum
【24h】

The application of Centroid linkage hierarchical method and Hill climbing method in comments clustering online discussion forum

机译:质心联动层次方法和山攀爬方法在评论集群中的应用在线讨论论坛

获取原文
获取外文期刊封面目录资料

摘要

Several problems are rised in order to enhance the effectiveness of communication in online discussion. The similarity and repetition of comments in terms of questions in the sentences or text meanings as well as triggers the emerging of miscommunication amongst participants in a forum discussion are investigated. Moreover, some comments seems are ignored or not been touched by other participants and in advance the effective used of forum discussion as knowledge acquisition and sharing can not be achieved. This paper studies the application of Centroid Linkage Hierarchical Method (CLHM) Algorithm and Hill Climbing methods in findings the similarity value of participants comments and clustering based on it. The analysis follows the text mining process including text processing, text transformation, attribute selection and pattern discovery. In order to test the validity and accuracy of both application methods, confusion matrix in euclidean and consine similarity were calculated. As the results, from variety numbers of comments groups, including Bersosial.com in 17 comments, Indowebster.com in 27 comments and Teknojurnal.com in 51 data comments provided the value of well-separated clusters performed. This testing also defined that the alteration of threshold and altitude did not affect the clustering process. From the calculation of F-measure values in confusion matrix explained that consine similarity provided better result that euclidean distance where teknojurnal 0.89, indowebster,com 0.71 and bersosial.com 0.57. This showed that CLHM algorithm and Hill climbing methods are effective approaches and have been successfully applied in comments clustering of online discussion.
机译:提升了几个问题,以提高在线讨论中的沟通的有效性。在句子或文本含义中的问题和触发论坛讨论中,调查了在句子或文本含义中的疑问以及触发了在论坛讨论中的误解的情况下的相似性和重复。此外,其他参与者似乎忽略了一些评论,并提前涉及论坛讨论的有效使用,因为知识获取和共享无法实现。本文研究了质心联动等级方法(CLHM)算法和山坡攀爬方法在研究结果中的应用基于IT的参与者评论和聚类的相似性值。分析遵循文本挖掘过程,包括文本处理,文本转换,属性选择和模式发现。为了测试应用方法的有效性和准确性,计算欧几里德和欧几里德的混乱矩阵。作为结果,从各种评论组中,包括Bersosial.com在17评论中,Indowebster.com在27条评论中和Teknojurnal.com中的51个数据评论提供了分离良好的群集的值。该测试还定义了阈值和高度的改变不影响聚类过程。从F-量度值的混淆矩阵计算解释consine相似提供了更好的结果是欧几里德距离,其中teknojurnal 0.89,indowebster,玉米0.71和0.57 bersosial.com。这表明CLHM算法和山坡攀爬方法是有效的方法,并已成功应用于在线讨论的评论集群中。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号