Graph-based lemmatization of Turkish words by using morphological similarity


Abstract

Lemmatization is an important preprocessing step in Natural Language Processing (NLP). In language applications such as part-of-speech tagging, spell-checking, and document clustering, selecting the right lemma together with its morphological features can improve results. In this study, we present a new hybrid approach for lemmatizing inflected Turkish words using graph models based on morphological similarity, a technique that has recently become popular in lemmatization. To this end, a novel similarity function for Turkish is developed to connect similar word forms. The proposed model is trained and tested on a double-checked Turkish lemmatization dataset. The empirical results are then compared with those of Zemberek, the most widely used Turkish lemmatization tool.
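As a rough illustration of the general idea of connecting similar word forms in a graph, the following is a minimal sketch and not the authors' method. The abstract does not specify the similarity function, so a shared-prefix ratio is used here as a stand-in; the names `prefix_similarity`, `build_graph`, `lemmatize`, the threshold value, and the rule of taking the shortest form in each connected component as the lemma are all assumptions made for this example.

```python
from collections import defaultdict


def prefix_similarity(a: str, b: str) -> float:
    """Stand-in similarity: shared-prefix length over the longer word's length.

    This is an assumption for illustration, not the paper's function.
    """
    shared = 0
    for ca, cb in zip(a, b):
        if ca != cb:
            break
        shared += 1
    return shared / max(len(a), len(b))


def build_graph(words, threshold=0.35):
    """Connect every pair of word forms whose similarity reaches the threshold."""
    graph = defaultdict(set)
    for w in words:
        graph[w]  # make sure isolated words are still nodes
    for i, u in enumerate(words):
        for v in words[i + 1:]:
            if prefix_similarity(u, v) >= threshold:
                graph[u].add(v)
                graph[v].add(u)
    return graph


def lemmatize(words, threshold=0.35):
    """Map each word to the shortest form in its connected component.

    Choosing the shortest form as the lemma is a simplifying assumption.
    """
    graph = build_graph(words, threshold)
    lemma_of = {}
    for start in words:
        if start in lemma_of:
            continue
        # Depth-first search to collect the connected component of `start`.
        component, stack = set(), [start]
        while stack:
            node = stack.pop()
            if node in component:
                continue
            component.add(node)
            stack.extend(graph[node] - component)
        lemma = min(component, key=lambda w: (len(w), w))
        for w in component:
            lemma_of[w] = lemma
    return lemma_of


if __name__ == "__main__":
    forms = ["ev", "evler", "evlerde", "kitap", "kitaplar"]
    for form, lemma in lemmatize(forms).items():
        print(form, "->", lemma)
```

In this toy setup "ev" and "evlerde" are not directly similar enough to be linked, but both connect through "evler", which is the kind of chaining a graph representation allows. A prefix-only measure would not handle Turkish stem changes such as consonant softening, which is one reason a language-specific similarity function would be needed in practice.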


Bibliographic details

Source
International Symposium on INnovations in Intelligent SysTems and Applications, 2016, pp. 1-5 (5 pages)
Conference location: Sinaia, Romania
Authors
Enis Arslan; Umut Orhan
Author affiliation
Computer Engineering Department, Cukurova University, Adana, Turkey
Original format: PDF
Keywords
Natural language processing; Computers; Speech; Tagging; Dictionaries; Transforms; Mathematical model


