International Conference on Document Analysis and Recognition
Arabic handwritten texts clusterization based on Feature Relation Graph (FRG)

Abstract

Text clustering is the problem of grouping texts so that all texts within a group share a common property (the same author, the same genre, etc.). This task has become very important over the last two decades because of the growing number of text documents in digital form that need to be organized and processed. We investigate a new metric based on the Feature Relation Graph (FRG) for clustering Arabic handwritten texts. This metric has proved effective for text-independent Persian writer identification; here we use it to solve the more general problem of text clustering. Pattern-based features are extracted from handwritten texts using Gabor and XGabor filters. The extracted features are represented for each cluster using the FRG. We apply several clustering algorithms in the space of FRGs. Numerical experiments are provided to demonstrate the effectiveness of the proposed metric and to compare the effectiveness of the different algorithms.
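
The feature-extraction step mentioned in the abstract can be illustrated with a minimal sketch of a standard Gabor filter bank. This is not the authors' implementation: the abstract does not define the XGabor filter or the FRG construction, so neither is attempted here. The function names `gabor_kernel` and `gabor_features`, the kernel size, the wavelengths, the number of orientations, and the choice of mean absolute response as the feature are all assumptions made for illustration.

```python
import numpy as np

def gabor_kernel(size, wavelength, theta, sigma):
    """Real part of a standard 2-D Gabor kernel with an isotropic Gaussian envelope."""
    half = size // 2
    y, x = np.mgrid[-half:half + 1, -half:half + 1]
    # Rotate coordinates so the sinusoid varies along direction theta.
    x_rot = x * np.cos(theta) + y * np.sin(theta)
    envelope = np.exp(-(x ** 2 + y ** 2) / (2.0 * sigma ** 2))
    carrier = np.cos(2.0 * np.pi * x_rot / wavelength)
    return envelope * carrier

def gabor_features(image, wavelengths=(4.0, 8.0), n_orientations=4, size=15):
    """Return one mean absolute response per (wavelength, orientation) pair.

    All parameter defaults are illustrative assumptions, not values from the paper.
    """
    image_spectrum = np.fft.fft2(image)
    features = []
    for wavelength in wavelengths:
        for k in range(n_orientations):
            theta = k * np.pi / n_orientations
            kernel = gabor_kernel(size, wavelength, theta, sigma=0.5 * wavelength)
            # Circular convolution via the FFT; adequate for a sketch.
            kernel_spectrum = np.fft.fft2(kernel, s=image.shape)
            response = np.fft.ifft2(image_spectrum * kernel_spectrum).real
            features.append(np.abs(response).mean())
    return np.array(features)
```

Applied to a grayscale page image stored as a 2-D array, `gabor_features` returns a vector with one entry per filter. Vectors of this kind could then serve as input to a graph construction or a clustering algorithm, though how the paper builds the FRG from them is not stated in the abstract.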