首页> 外文会议>Advances in Natural Language Processing >Finding Text Boundaries and Finding Topic Boundaries: Two Different Tasks?
【24h】

Finding Text Boundaries and Finding Topic Boundaries: Two Different Tasks?

机译:查找文本边界和查找主题边界:两个不同的任务?

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

The goal of this paper is to demonstrate that usual evaluation methods for text segmentation are not adapted for every task linked to text segmentation. To do so we differentiated the task of finding text boundaries in a corpus of concatenated texts from the task of finding transitions between topics inside the same text. We worked on a corpus of twenty two French political discourses trying to find boundaries between them when they are concatenated, and to find topic boundaries inside them when they are not. We compared the results of our distance based method to the well known c99 algorithm.
机译:本文的目的是证明文本分割的常规评估方法并不适用于链接到文本分割的每个任务。为此,我们将查找连续文本语料库中文本边界的任务与查找同一文本内主题之间的过渡的任务区分开来。我们研究了22种法国政治话语的语料库,试图在它们串联时找到它们之间的界限,而在它们不串联时找到它们内部的主题界限。我们将基于距离的方法的结果与众所周知的c99算法进行了比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号