Automatic Stemming for Indexing of an Agglutinative Language

机译：自动词干标注聚结语言

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Stemming is an essential process in information retrieval. Though there are extremely simple stemming algorithms for inflectional languages, the story goes totally different for agglutinative languages. It is even more difficult if significant portion of the vocabulary is new or unknown. This paper explores the possibility of stemming of an agglutinative language, in particular, Korean language, by unsupervised morphology learning. We use only raw corpus and make use of no dictionary. Unlike heuristic algorithms that are theoretically ungrounded, this method is based on statistical methods, which are widely accepted. Although the method is currently applied only to Korean language, the method can be adapted to other agglutinative languages with similar characteristics, since language-specific knowledge is not used.

机译：提取是信息检索中必不可少的过程。尽管有非常简单的词干变化算法，但对于胶合语言来说，情况却截然不同。如果词汇表的重要部分是新的或未知的，则更加困难。本文探讨了通过无监督形态学来阻止凝集性语言（尤其是朝鲜语）的可能性。我们仅使用原始语料库，不使用字典。与理论上没有根据的启发式算法不同，此方法基于统计方法，已被广泛接受。尽管该方法当前仅适用于朝鲜语，但是由于不使用特定于语言的知识，因此该方法可以适用于具有类似特征的其他凝集性语言。

著录项

来源
《Advances in Information Systems》|2002年|p.154-165|共12页
会议地点
作者
Sehyeong Cho; Seung-Soo Han;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类无线电电子学、电信技术;
关键词

相似文献

外文文献
中文文献
专利

1. Unsupervised Joint PoS Tagging and Stemming for Agglutinative Languages [J] . Bolucu Necva, Can Burcu ACM transactions on Asian language information processing . 2019,第3期

机译：胶合语言的无监督联合PoS标记和词干
2. Agglutinative Language Speech Recognition Using Automatic Allophone Deriving [J] . Ji Xu, Jielin Pan, Yonghong Yan Chinese Journal of Electronics . 2016,第2期

机译：使用自动变音位派生的凝集语言语音识别
3. Agglutinative Language Speech Recognition Using Automatic Allophone Deriving [J] . XU Ji, PAN Jielin, YAN Yonghong 电子学报（英文版） . 2016,第002期

机译：使用自动变音位派生的凝集语言语音识别
4. Automatic Stemming for Indexing of an Agglutinative Language [C] . Sehyeong Cho, Seung-Soo Han International conference on advances in information systems . 2002

机译：自动置入凝集语言的索引
5. A comparison of manual indexing and automatic indexing in the Humanities [D] . Sensuse, Dana Indra 2004

机译：人文领域中手动索引和自动索引的比较
6. Controlled Vocabularies Indexing and Medical Language Processing. Expert Indexing Systems: Research on Interactive Knowledge-Based Indexing: The MedIndEx Prototype [O] . Susanne M. Humphrey 1989

机译：受控词汇表索引编制和医学语言处理。专家索引系统：基于交互式知识的索引的研究：MedIndEx原型
7. DISCRIMINATIVE APPROACH TO LEXICAL ENTRY SELECTION FOR AUTOMATIC SPEECH RECOGNITION OF AGGLUTINATIVE LANGUAGE [O] . Mijit Ablimit, Tatsuya Kawahara, Askar Hamdulla 2012

机译：农业语言自动语音识别的词条选择方法

Automatic Stemming for Indexing of an Agglutinative Language

摘要

著录项

相似文献

相关主题

期刊订阅