A comparative study of stemming algorithms for use with the Uzbek language

机译：与乌兹别克语语言一起使用的词干算法的比较研究

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Stemming is one of the pipeline feature of Information Retrieval and commonly used in natural language processing and text mining. The main purpose of a stemming process is to reduce the inflectional or derivational word into its root form. The difficulties on developing stemming algorithm is to identify and remove affixes since each language in the world has unique characteristics and grammatical rules. This paper compares related study on existing stemmers to be used in Uzbek language. We discuss the type of stemming algorithms, an overview of available popular English stemmers and comparison between discussed stemmers as well as their evaluation and analysis of available stemmers on Uzbek language experiment. Based on the comparative study and experiment, we proposal our model of the Uzbek stemmer that enhances some of the features in Lovins stemmer to suit the requirements for the Uzbek language.

机译：提取是信息检索的管道功能之一，通常用于自然语言处理和文本挖掘。词干处理的主要目的是将变形词或派生词简化为词根形式。由于世界上每种语言都有独特的特征和语法规则，因此开发词干算法的困难在于识别和删除词缀。本文比较了有关将在乌兹别克语中使用的现有词干的相关研究。我们讨论了词干算法的类型，可用的流行英语词干概述，讨论的词干之间的比较以及它们在乌兹别克语语言实验中对可用词干的评估和分析。根据比较研究和实验，我们提出了乌兹别克语词干分析器模型，该模型增强了Lovins词干分析器的某些功能，以适应乌兹别克语的要求。

著录项

来源
《International Conference on Computer and Information Sciences;World Engineering, Science Technology Congress》|2016年|7-12|共6页
会议地点 Kuala Lumpur(MY)
作者
A. Ismailov; M.M. Abdul Jalil; Z. Abdullah; N.H. Abd Rahim;
展开▼
作者单位

School of Informatics and Applied Mathematics Universiti Malaysia Terengganu 21030 Kuala Malaysia;

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Algorithm design and analysis; Computers; Dictionaries; Databases; Informatics;

机译：算法设计与分析；电脑;字典；数据库；信息学;

相似文献

外文文献
中文文献
专利

1. A comparative study of unification algorithms for OR-parallel execution of logic languages [J] . Crammond Jim Computers, IEEE Transactions on . 1985,第10期

机译：逻辑语言“或”并行执行的统一算法的比较研究
2. COMPARATIVE STUDY OF TOPIC SEGMENTATION ALGORITHMS BASED ON LEXICAL COHESION: EXPERIMENTAL RESULTS ON ARABIC LANGUAGE [J] . Fouzi Harrag, Aboubekeur Hamdi-Cherif, Abdulmalik Salman Al-Salman The Arabian journal for science and engineering . 2010,第2C期

机译：基于词干凝聚的主题词分割算法的比较研究：阿拉伯语言的实验结果
3. A comparative study of programming languages for next-generation astrodynamics systems [J] . Helge Eichhorn, Juan Luis Cano, Frazer McLean, CEAS Space Journal . 2018,第1期

机译：下一代Astrocums系统编程语言的比较研究
4. A comparative study of stemming algorithms for use with the Uzbek language [C] . A. Ismailov, M.M. Abdul Jalil, Z. Abdullah, International Conference on Computer and Information Sciences . 2016

机译：乌兹别克语用作算法算法的比较研究
5. Understanding violent conflict: A comparative study of Tajikistan and Uzbekistan [D] . Tuncer Kilavuz, Idil 2007

机译：了解暴力冲突：塔吉克斯坦和乌兹别克斯坦的比较研究
6. Natural language processing algorithms for mapping clinical text fragments onto ontology concepts: a systematic review and recommendations for future studies [O] . Martijn G. Kersloot, Florentien J. P. van Putten, Ameen Abu-Hanna, 2020

机译：用于将临床文本碎片映射到本体概念的自然语言处理算法：未来研究的系统审查和建议
7. Comparative analysis of adjectives in English and Uzbek languages [O] . 2020

机译：英语和乌兹别克语语言形容词的比较分析

A comparative study of stemming algorithms for use with the Uzbek language

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅