Automatic Annotation of Corpora for Text Summarisation: A Comparative Study

机译：用于文本摘要的语料库自动注释：一项比较研究

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper presents two methods which automatically produce annotated corpora for text summarisation on the basis of human produced abstracts. Both methods identify a set of sentences from the document which conveys the information in the human produced abstract best. The first method relies on a greedy algorithm, whilst the second one uses a genetic algorithm. The methods allow to specify the number of sentences to be annotated, which constitutes an advantage over the existing methods. Comparison between the two approaches investigated here revealed that the genetic algorithm is appropriate in cases where the number of sentences to be annotated is less than the number of sentences in an ideal gold standard with no length restrictions, whereas the greedy algorithm should be used in other cases.

机译：本文提出了两种方法，它们可以在人工产生的摘要的基础上自动生成带注释的语料库，用于文本摘要。两种方法都从文档中识别出一组句子，以最佳方式传达信息。第一种方法依靠贪婪算法，而第二种方法则使用遗传算法。该方法允许指定要注释的句子的数量，这构成了优于现有方法的优点。此处研究的两种方法的比较表明，在需要注释的句子数量少于没有长度限制的理想黄金标准中的句子数量的情况下，遗传算法是合适的，而其他算法则应使用贪婪算法。案件。

著录项

来源
《International Conference on Computational Linguistics and Intelligent Text Processing(CICLing 2005); 20050213-19; Mexico City(MX)》|2005年|P.670-681|共12页
会议地点 Mexico City(MX)
作者
Constantin Orasan;
展开▼
作者单位

Research Group in Computational Linguistics, School of Humanities, Languages and Social Sciences, University of Wolverhampton, Stafford St., Wolverhampton, WV1 1SB, UK;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类人工智能理论;程序语言、算法语言;
关键词
入库时间 2022-08-26 14:29:47

相似文献

外文文献
中文文献
专利

1. Interlingual annotation of parallel text corpora: a new framework for annotation and evaluation [J] . BONNIE J. DORR, REBECCA J. PASSONNEAU, DAVID FARWELL, Natural language engineering . 2010,第3期

机译：并行文本语料库的语际注释：注释和评估的新框架
2. Automatic summarisation and annotation of microarray data [J] . Pietro H. Guzzi, Maria Teresa Di Martino, Giuseppe Tradigo, Soft Computing - A Fusion of Foundations, Methodologies and Applications . 2011,第8期

机译：自动汇总和注释微阵列数据
3. Automatic summarisation and annotation of microarray data [J] . Guzzi P.H., Di Martino M.T., Tradigo G., Soft computing: A fusion of foundations, methodologies and applications . 2011,第8期

机译：自动汇总和注释微阵列数据
4. Automatic Annotation of Corpora for Text Summarisation: A Comparative Study [C] . Constantin Orasan International Conference on Computational Linguistics and Intelligent Text Processing . 2005

机译：关于文本汇总的自动注释：比较研究
5. The effects of multimedia annotations on L2 vocabulary immediate recall and reading comprehension: A comparative study of text-picture and audio-picture annotations under incidental and intentional learning conditions. [D] . Chen, Zhaohui. 2006

机译：多媒体注释对二语词汇即时回忆和阅读理解的影响：在偶然和有意学习条件下对文本图片和音频图片注释的比较研究。
6. Combining MEDLINE and publisher data to create parallel corpora for the automatic translation of biomedical text [O] . Antonio Jimeno Yepes, Élise Prieur-Gaston, Aurélie Névéol 2013

机译：结合MEDLINE和发布者数据以创建并行语料库以自动翻译生物医学文本
7. Automatic annotation of corpora for text summarisation: a comparative study [O] . 2015

机译：文本摘要语料库的自动注释：比较研究

Automatic Annotation of Corpora for Text Summarisation: A Comparative Study

摘要

著录项

相似文献

相关主题

期刊订阅