首页> 外文期刊>Expert Systems with Application >A graph based keyword extraction model using collective node weight
【24h】

A graph based keyword extraction model using collective node weight

机译:使用集合节点权重的基于图的关键词提取模型

获取原文
获取原文并翻译 | 示例
       

摘要

In the recent times, a huge amount of text is being generated for social purposes on twitter social networking site. Summarizing and analysing of twitter content is an important task as it benefits many applications such as information retrieval, automatic indexing, automatic classification, automatic clustering, automatic filtering etc. One of the most important tasks in analyzing tweets is automatic keyword extraction. There are some graph based approaches for keyword extraction which determine keywords only based on centrality measure. However, the importance of a keyword in twitter depends on various parameters such as frequency, centrality, position and strength of neighbors of the keyword. Therefore, this paper proposes a novel unsupervised graph based keyword extraction method called Keyword Extraction using Collective Node Weight (KECNW) which determines the importance of a keyword by collectively taking various influencing parameters. The KECNW is based on Node Edge rank centrality with node weight depending on various parameters. The model is validated with five datasets: Uri Attack, American Election, Harry Potter, IPL and Donald Trump. The result of KECMW is compared with three existing models. It is observed from the experimental results that the proposed method is far better than the others. The performances are shown in terms of precision, recall and F-measure. (C) 2017 Published by Elsevier Ltd.
机译:最近,在Twitter社交网站上出于社交目的正在生成大量文本。对Twitter内容进行汇总和分析是一项重要的任务,因为它有益于许多应用程序,例如信息检索,自动索引,自动分类,自动聚类,自动过滤等。在分析推文中最重要的任务之一是自动关键词提取。存在一些基于图的关键字提取方法,这些方法仅基于中心度度量来确定关键字。但是,Twitter中关键字的重要性取决于各种参数,例如频率,中心性,关键字邻居的位置和强度。因此,本文提出了一种新的基于无监督图的关键字提取方法,即使用集体节点权重(KECNW)的关键字提取,该方法通过综合考虑各种影响参数来确定关键字的重要性。 KECNW基于节点边缘等级中心,节点权重取决于各种参数。该模型通过五个数据集进行了验证:Uri Attack,美国大选,哈利·波特,IPL和唐纳德·特朗普。将KECMW的结果与三个现有模型进行比较。从实验结果可以看出,该方法远优于其他方法。性能以精度,召回率和F量度表示。 (C)2017由Elsevier Ltd.发布

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号