首页> 外文会议>Information Reuse and Integration, 2007 IEEE International Conference on >Comparative Analysis of Different Text Segmentation Algorithms on Arabic News Stories
【24h】

Comparative Analysis of Different Text Segmentation Algorithms on Arabic News Stories

机译:阿拉伯新闻故事中不同文本分割算法的比较分析

获取原文
获取原文并翻译 | 示例

摘要

The task of text segmentation represents an important step in many applications and while much work has been carried out to address this task for the English language, work on text segmentation for other languages is still lagging behind. In this paper a comparative analysis of three different text segmentation algorithms on Arabic news stories is presented. To assess how well each algorithm works on Arabic news stories, each was applied on an Arabic Reuters news story dataset and the results were compared. The work in this paper also describes a combination of two of these algorithms that was found to produce better results than any of the presented individual algorithms. It also presents a set of error reduction filters that were found to significantly reduce segmentation errors in the detection of borders in Arabic based news stories.
机译:文本分割的任务代表了许多应用程序中的重要一步,尽管为解决英语语言的这一任务已经进行了许多工作,但其他语言的文本分割工作仍然落后。本文对阿拉伯新闻报导的三种不同的文本分割算法进行了比较分析。为了评估每种算法在阿拉伯新闻报道上的效果,将每种算法应用于阿拉伯路透社新闻报道数据集,并对结果进行比较。本文中的工作还描述了这些算法中的两种的组合,发现这些组合产生的效果比所提出的任何单个算法都要好。它还提供了一组减少错误的过滤器,这些过滤器被发现可以显着减少基于阿拉伯语的新闻报道中的边界检测中的分段错误。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号