首页> 外文会议>Text, speech and dialogue >Finding the Optimal Number of Clusters for Word Sense Disambiguation

【24h】

Finding the Optimal Number of Clusters for Word Sense Disambiguation

机译：为词义消歧找到最佳的簇数

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Ambiguity is an inherent problem for many tasks in Natural Language Processing. Unsupervised and semi-supervised approaches to ambiguity resolution are appealing as they lower the cost of manual labour. Typically, those methods struggle with estimation of number of senses without supervision. This paper shows research on using stopping functions applied to clustering algorithms for estimation of number of senses. The experiments were performed for Polish and English. We found that estimation based on PK2 stopping functions is encouraging, but only when using coarse-grained distinctions between senses.

机译：对于自然语言处理中的许多任务，歧义是一个固有的问题。无监督和半监督解决歧义的方法很有吸引力，因为它们降低了人工成本。通常，这些方法在没有监督的情况下就难以估计感觉。本文展示了将停止函数应用于聚类算法以估计感官数量的研究。实验是针对波兰语和英语进行的。我们发现基于PK2停止功能的估计令人鼓舞，但仅当使用感官之间的粗粒度区分时才如此。

著录项

来源
《Text, speech and dialogue》|2011年|p.388-394|共7页
会议地点 Pilsen(CZ);Pilsen(CZ)
作者
Bartosz Broda; Pawel Kgdzia;
展开▼
作者单位

Institute of Informatics, Wroclaw University of Technology, Poland;

Institute of Informatics, Wroclaw University of Technology, Poland;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类人工智能理论;
关键词

相似文献

外文文献
中文文献
专利

1. A Sense Annotated Corpus for All-Words Urdu Word Sense Disambiguation [J] . Saeed Ali, Nawab Rao Muhammad Adeel, Stevenson Mark, ACM transactions on Asian language information processing . 2019,第4期

机译：用于全词乌尔都语的词义注释语料库
2. Semi-supervised Learning with Induced Word Senses for State of the Art Word Sense Disambiguation [J] . Ba#351, kaya Osman, Jurgens David The Journal of Artificial Intelligence Research . 2016,第10期

机译：半监督学习与诱导词义相结合，可实现最先进的词义歧义消除
3. Semi-supervised Learning with Induced Word Senses for State of the Art Word Sense Disambiguation [J] . Baskaya Osman, Jurgens David The Journal of Artificial Intelligence Research . 2016,第Null期

机译：半监督学习与诱导词义相结合，可实现最先进的词义歧义消除
4. Finding the Optimal Number of Clusters for Word Sense Disambiguation [C] . Bartosz Broda, Pawel Kedzia International Conference on Text, Speech and Dialogue . 2011

机译：寻找最佳群集字母歧义
5. Finding meaning: Sense inventories for improved word sense disambiguation. [D] . Brown, Susan Windisch. 2010

机译：查找含义：感官清单可改善词义歧义。
6. Fast max-margin clustering for unsupervised word sense disambiguation in biomedical texts [O] . Weisi Duan, Min Song, Alexander Yates 2009

机译：快速最大边距聚类用于生物医学文本中无监督的词义消歧
7. Fine-Grained Word Sense Disambiguation Based on Parallel Corpora, Word Alignment, Word Clustering and Aligned Wordnets [O] . Tufis, Dan, Ion, Radu, Ide, Nancy 2005

机译：基于平行语料库，Word的细粒度词义消歧对齐，Word聚类和对齐的Wordnets
8. Word Domain Disambiguation via Word Sense Disambiguation [R] . Sanfilippo, A. 2006

机译：Word Word消歧通过Word sense消歧

Finding the Optimal Number of Clusters for Word Sense Disambiguation

摘要

著录项

相似文献

相关主题

期刊订阅