Exploiting Discourse Relations between Sentences for Text Clustering

机译：利用文本聚类句子的话语关系

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Over the years, the usage of discourse relations has been proven to enhance many applications such as text summarization, question answering and natural language generation. This paper proposes an approach that expands the benefit of discourse relations for natural language processing from a different aspect. We exploit the discourse relations existing between sentences to generate clusters of similar sentences from document sets. We first examined and defined the type of discourse relations that useful to retrieve sentences with identical content. We then assigned these relations to each sentence pair using a machine learning method. Finally we performed discourse relation-based clustering algorithm to generate clusters of similar sentences. We evaluated our method by measuring the cohesion and separation of the clusters and compared to a well recognized clustering method. The experimental result shows that our method performed significantly well, which demonstrated that discourse relation between sentences can be exploited for text clustering.

机译：多年来，已证明话语关系的使用是为了提高许多诸如文本摘要，问题应答和自然语言生成等申请。本文提出了一种拓展了不同方面的自然语言处理的话语关系的益处的方法。我们利用句子之间存在的话语关系来生成文档集的类似句子的集群。我们首先检查并定义了用于检索具有相同内容的句子的话语关系类型。然后，我们使用机器学习方法将这些关系分配给每个句子对。最后，我们执行了基于话语关系的聚类算法来生成类似句子的集群。我们通过测量簇的凝聚力和分离并与良好认可的聚类方法进行评估。实验结果表明，我们的方法效果显着良好，这表明可以利用句子之间的话语关系进行文本聚类。

著录项

来源
《Workshop on Advances in Discourse Analysis and its Computational Aspects》|2012年||共15页
会议地点
作者
Nik Adilah Hanin Zahri; Fumiyo Fukumoto; Suguru Matsuyoshi;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类程序设计、软件工程;
关键词
discourse relation; rhetorical relation; text clustering; SVMs; cluster validation;

机译：话语关系;修辞关系;文本聚类;SVM;群集验证;
入库时间 2022-08-20 19:56:47

相似文献

外文文献
中文文献
专利

1. Application of Rhetorical Relations Between Sentences to Cluster-Based Text Summarization [J] . N. Adilah Hanin Zahri, Fumiyo Fukumoto, Matsyoshi Suguru and Ong Bi Lynn Computer Science & Information Technology . 2015,第2期

机译：句间修辞关系在基于聚类的文本摘要中的应用
2. Text Cohesion in English Scientific Texts Written by Saudi Undergraduate Dentistry Students: A Multimodal Discourse Analysis of Textual and Logical Relations in Oral Biology Texts [J] . Hesham Suleiman Alyousef SAGE Open . 2021,第3期

机译：沙特本科牙科学生撰写的英语科学文本中的文本凝聚力：口腔生物学文本中文本与逻辑关系的多模式语篇论证分析
3. Clustering Sentence-Level Text Using a Novel Fuzzy Relational Clustering Algorithm [J] . Skabar Andrew, Abdalgader Khaled Knowledge and Data Engineering, IEEE Transactions on . 2013,第1期

机译：使用新型模糊关系聚类算法的句子级文本聚类
4. Exploiting Discourse Relations between Sentences for Text Clustering [C] . Nik Adilah Hanin Zahri, Fumiyo Fukumoto, Suguru Matsuyoshi Workshop on Advances in Discourse Analysis and its Computational Aspects . 2012

机译：利用句子之间的语篇关系进行文本聚类
5. Learning Representations of Text through Language and Discourse Modeling: From Characters to Sentences. [D] . Jernite, Yacine. 2018

机译：通过语言和话语建模学习文本表示形式：从字符到句子。
6. Exploiting Unlabeled Texts with Clustering-based Instance Selection for Medical Relation Classification [O] . Youngjun Kim, Ellen Riloff, Stéphane M. Meystre 2017

机译：通过基于聚类的实例选择来利用未标记的文本进行医疗关系分类
7. APPLICATION OF RHETORICAL RELATIONS BETWEEN SENTENCES TO CLUSTER-BASED TEXT SUMMARIZATION [O] . N. Adilah, Hanin Zahri, Fumiyo Fukumoto, 2015

机译：作者：张莹莹，襄樊学院学报JOURNaL OF XIaNGFaN UNIVERsITY句子之间的修辞关系在基于聚类的文本概述中的应用

Exploiting Discourse Relations between Sentences for Text Clustering

摘要

著录项

相似文献

相关主题

期刊订阅