首页> 外文期刊>The international arab journal of information technology >Enhanced Clustering-Based Topic Identification of Transcribed Arabic troadcast News
【24h】

Enhanced Clustering-Based Topic Identification of Transcribed Arabic troadcast News

机译:基于增强聚类的转录阿拉伯广播新闻主题识别

获取原文
获取原文并翻译 | 示例
           

摘要

This research presents an enhanced topic identification of transcribed Arabic broadcast news using clustering techniques. The enhancement includes applying new stemming technique rule-based light stemming to balance the negative effects of the stemming errors associated with light stemming and root-based stemming. New possibilistic-based clustering technique is also applied to evaluate the degree of membership that every transcribed document has in regard to every predefined topic, hence detecting documents causing topic confusions that negatively affect the accuracy of the topic-clustering process. The evaluation has showed that using rule-based light stemming in combination of spectral clustering technique achieved the highest accuracy, and this accuracy is further increased after excluding confusing documents.
机译:这项研究提出了使用聚类技术对转录的阿拉伯广播新闻进行增强的主题识别。增强功能包括应用基于规则的新茎技术,以平衡与茎和基于根的茎相关的茎错误的负面影响。还使用了新的基于可能性的聚类技术来评估每个转录文档对每个预定义主题的隶属程度,从而检测导致主题混乱的文档,从而对主题聚类过程的准确性产生负面影响。评估表明,将基于规则的光干与光谱聚类技术结合使用可获得最高的准确度,并且在排除了混乱的文档之后,该准确度进一步提高了。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号