首页> 外文会议>Australasian Joint Conference on Artificial Intelligence >Improving Sentence Similarity Measurement by Incorporating Sentential Word Importance

【24h】

Improving Sentence Similarity Measurement by Incorporating Sentential Word Importance

机译：通过结合句子词重要性来提高句子相似度测量

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Measuring similarity between sentences plays an important role in textual applications such as document summarization and question answering. While various sentence similarity measures have recently been proposed, these measures typically only take into account word importance by virtue of inverse document frequency (IDF) weighting. IDF values are based on global information compiled over a large corpus of documents, and we hypothesise that at the sentence level better performance can be achieved by using a measure of the importance of a word within the sentence that it appears. In this paper we show how the PageRank graph-centrality algorithm can be used to assign a numerical measure of importance to each word in a sentence, and how these values can be incorporated within various sentence similarity measures. Results from applying the measures to a difficult sentence clustering task demonstrates that incorporation of sentential word importance leads to statistically significant improvement in clustering performance as evaluated using a range of external clustering criteria.

机译：测量句子之间的相似性在文本应用中起重要作用，例如文件摘要和问题应答。虽然最近提出了各种句子相似度措施，但这些措施通常仅考虑凭借逆文档频率（IDF）加权来描述重要性。 IDF值基于由大型文档编译的全局信息，我们假设在句子水平上，通过使用它出现的句子中的句子中的单词的重要性来实现更好的性能。在本文中，我们展示了PageRank图表中心算法如何用于为句子中的每个单词分配数字测量值，以及如何在各种句子相似度量中结合这些值。将措施应用于困难的句子聚类任务的结果表明，并入句子词重要性导致使用一系列外部聚类标准评估的聚类性能的统计上显着改进。

著录项

来源
《Australasian Joint Conference on Artificial Intelligence 》|2010年||共10页
会议地点
作者
Andrew Skabar; Khaled Abdalgader;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP18-53;
关键词

相似文献

外文文献
中文文献
专利

1. Unsupervised similarity-based word sense disambiguation using context vectors and sentential word importance [J] . Franz Kurfess Computing reviews . 2013 ,第2期

机译：使用上下文向量和句子重要性的无监督基于相似度的词义消歧
2. Incorporating Prior Knowledge into Word Embedding for Chinese Word Similarity Measurement [J] . Huang Degen, Pei Jiahuan, Zhang Cong, ACM transactions on Asian language information processing . 2018 ,第3期

机译：将先验知识融合到词嵌入中以进行中文词相似度测量
3. Assessing Sentence Similarity Using WordNet based Word Similarity [J] . Hongzhe Liu, Pengfei Wang Journal of software . 2013 ,第6期

机译：使用基于WordNet的单词相似度评估句子相似度
4. Improving Sentence Similarity Measurement by Incorporating Sentential Word Importance [C] . Andrew Skabar, Khaled Abdalgader AI 2010: Advances in artificial intelligence . 2010

机译：通过结合句子的重要性来提高句子相似度的度量
5. Improved GloVe Word Embedding Using Linear Weighting Scheme for Word Similarity Tasks [D] . Lu, Qinglan. 2021

机译：使用线性加权方案进行改进的手套单词嵌入单词相似性任务
6. Toddlers encode similarities among novel words from meaningful sentences [O] . Erica H. Wojcik, Jenny R. Saffran -1

机译：幼儿编码有意义句子中新颖单词之间的相似性
7. Words and sentences: the effects of sentential-semantic context on spoken-word processing [O] . Zwitserlood C.M.E. 1989

机译：单词和句子：句子语义环境对口语单词处理的影响

Improving Sentence Similarity Measurement by Incorporating Sentential Word Importance

摘要

著录项

相似文献

相关主题

期刊订阅