Venue: IEEE International Conference on Natural Language Processing and Knowledge Engineering

Improvement of the Dotplotting Method for Linear Text Segmentation

Abstract

The dotplotting method, employed by Reynar [1], is a state-of-the-art algorithm for automatic linear text segmentation. However, its density measure, which represents topical coherence, has several problems. The density function is asymmetric, which leads to the apparently false conclusion that a forward scan may produce a different segmentation from a backward scan. In addition, when determining the next boundary, the assessment strategy does not adequately take the previously located boundaries into account. In this paper we propose modified models that remedy these problems. We also make use of segment length to improve segmentation performance. Experimental results show that the modified models achieve considerable improvement in P_k value, precision, and recall over the original dotplotting method.
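The asymmetry mentioned in the abstract can be illustrated with a small sketch. The abstract does not give the density formula, so the code below assumes one common way of writing a dotplot-style density: each segment's word-count vector is compared only with the text that follows it, and the dot product is normalized by the two region lengths. The function names (`forward_density`, `backward_density`) and the toy token list are hypothetical and are not taken from the paper.

```python
from collections import Counter


def dot(a, b):
    """Dot product of two word-count vectors (Counter yields 0 for absent words)."""
    return sum(count * b[word] for word, count in a.items())


def forward_density(tokens, boundaries):
    """Assumed forward-scan density: each segment is compared with all text after it.

    `boundaries` are token offsets strictly between 0 and len(tokens).
    """
    n = len(tokens)
    points = [0] + sorted(boundaries)
    total = 0.0
    for prev, cur in zip(points, points[1:]):
        segment = Counter(tokens[prev:cur])   # words inside the segment
        rest = Counter(tokens[cur:n])         # words in everything after it
        total += dot(segment, rest) / ((cur - prev) * (n - cur))
    return total


def backward_density(tokens, boundaries):
    """Same measure applied to the reversed text with mirrored boundaries."""
    n = len(tokens)
    mirrored = [n - b for b in boundaries]
    return forward_density(tokens[::-1], mirrored)


# Toy document (hypothetical): six tokens, candidate boundaries after tokens 2 and 4.
tokens = ["a", "a", "b", "a", "c", "c"]
print(forward_density(tokens, [2, 4]))   # 0.25
print(backward_density(tokens, [2, 4]))  # 0.5
```

Under this assumed formula, the same segmentation receives a different score depending on scan direction, so a search that minimizes the score could pick different boundaries in each direction. With a single boundary the two scores coincide, because the dot product and the normalizer are both symmetric in that case.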
