首页> 外文会议>International Conference on Computational Linguistics >A Framework for Identifying Textual Redundancy

【24h】

A Framework for Identifying Textual Redundancy

机译：识别文本冗余的框架

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The task of identifying redundant information in documents that are generated from multiple sources provides a significant challenge for summarization and QA systems. Traditional clustering techniques detect redundancy at the sentential level and do not guarantee the preservation of all information within the document. We discuss an algorithm that generates a novel graph-based representation for a document and then utilizes a set cover approximation algorithm to remove redundant text from it. Our experiments show that this approach offers a significant performance advantage over clustering when evaluated over an annotated dataset.

机译：在多个源生成的文档中识别冗余信息的任务为摘要和QA系统提供了重大挑战。传统聚类技术在句子级别检测冗余，并保证保存文档中的所有信息。我们讨论一种为文档生成基于图形的基于图形的算法，然后利用SET封面近似算法从中删除冗余文本。我们的实验表明，在通过注释的数据集进行评估时，这种方法在聚类时提供了显着的性能优势。

著录项

来源
《International Conference on Computational Linguistics 》|2008年||共8页
会议地点
作者

展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类程序设计、软件工程 ;
关键词

相似文献

外文文献
中文文献
专利

1. Richness, redundancy or relational salience? A comparison of the effect of textual and aural feedback modes on knowledge elaboration in higher education students' work [J] . Alan Gleaves, Caroline Walker Computers & education . 2013 ,第mara期

机译：丰富性，冗余性或关系显着性？文本和听觉反馈方式对高校学生工作中知识阐述的影响比较
2. Identifying duplicate functionality in textual use cases by aligning semantic actions [J] . Rago Alejandro, Marcos Claudia, Diaz-Pace J. Andres Software and systems modeling . 2016 ,第2期

机译：通过对齐语义动作来识别文本用例中的重复功能
3. A Linguistic Approach to Identify the Affective Dimension Expressed in Textual Messages [J] . Sandro Jose Rigo, Isa Mara da Rosa Alves, Jorge Luis Victoria Barbosa International Journal of Information and Communication Technology Education: An Official Pubblication of the Information Resources Management Association . 2015 ,第1期

机译：识别短信中情感维度的语言学方法
4. A Framework for Identifying Textual Redundancy [C] . Kapil Thadani, Kathleen McKeown 22nd International Conference on Computational Linguistics . 2008

机译：识别文本冗余的框架
5. Index compression and redundancy elimination in large textual collections. [D] . Yan, Hao. 2010

机译：大型文本集合中的索引压缩和冗余消除。
6. Development and Validation of Online Textual Pediatrician-Parent Communication Instrument Based on the SEGUE Framework [O] . Yuqi Xiong, Dan Wang, Haihong Chen, 2006

机译：基于SEGUE框架的在线文本儿科医生—父母沟通工具的开发与验证
7. A Framework for Identifying Textual Redundancy [O] . Kapil Thadani, Kathleen Mckeown 2011

机译：识别文本冗余的框架

A Framework for Identifying Textual Redundancy

摘要

著录项

相似文献

相关主题

期刊订阅