Syllabification Model of Indonesian Language Named-Entity Using Syntactic n-Gram

Ahmad Muammar Fanani; Suyanto Suyanto

Abstract

Syllabification (or syllabication) is the task of detecting syllable boundaries in a word. There are two main approaches to automatic syllabification: rule-based and data-driven. The rule-based approach relies on general principles of syllabification, while the data-driven approach uses a set of already syllabified words to syllabify unseen words. Syllabification of words has been studied extensively. However, most of these studies deal only with formal words, and few address named entities, which tend to be more complicated than regular words. In this research, a syntactic n-gram is proposed and investigated for syllabifying named entities, since it builds on the n-gram, which achieves excellent accuracy and tends to be consistent across languages. Evaluation on 20 k named entities using 4-fold cross-validation shows that the proposed model gives a competitive syllable error rate (SER) compared to another similar n-gram-based model.
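To make the data-driven idea and the SER metric concrete, here is a minimal Python sketch. It is not the syntactic n-gram model of the paper, whose details the abstract does not give. It swaps in a plain character n-gram boundary classifier with back-off, and it assumes SER is the share of reference syllables that the prediction fails to reproduce at the same position. All function names and the toy words are hypothetical and are not taken from the paper's 20 k dataset.

```python
from collections import Counter, defaultdict


def boundaries(syllables):
    """Cut positions inside the word, e.g. ['san', 'ti'] -> {3}."""
    cuts, pos = set(), 0
    for syl in syllables[:-1]:
        pos += len(syl)
        cuts.add(pos)
    return cuts


def context(word, i, n):
    """The n letters on each side of the gap before word[i], padded with '#'."""
    padded = "#" * n + word + "#" * n
    j = i + n
    return padded[j - n:j], padded[j:j + n]


def train(corpus, max_n=2):
    """Count, per letter context, how often a gap is a syllable boundary."""
    counts = defaultdict(Counter)
    for syllables in corpus:
        word = "".join(syllables)
        cuts = boundaries(syllables)
        for i in range(1, len(word)):
            for n in range(1, max_n + 1):
                counts[context(word, i, n)][i in cuts] += 1
    return counts


def syllabify(word, counts, max_n=2):
    """Cut wherever the widest context seen in training mostly marks a boundary."""
    out, start = [], 0
    for i in range(1, len(word)):
        for n in range(max_n, 0, -1):  # back off from wide to narrow context
            seen = counts.get(context(word, i, n))
            if seen:
                if seen[True] > seen[False]:
                    out.append(word[start:i])
                    start = i
                break
    out.append(word[start:])
    return out


def spans(syllables):
    """Each syllable as a (start, end) character span within its word."""
    out, pos = set(), 0
    for syl in syllables:
        out.add((pos, pos + len(syl)))
        pos += len(syl)
    return out


def syllable_error_rate(references, predictions):
    """Assumed SER: reference syllables missing from the prediction / all reference syllables."""
    wrong = total = 0
    for ref, pred in zip(references, predictions):
        ref_spans = spans(ref)
        wrong += len(ref_spans - spans(pred))
        total += len(ref_spans)
    return wrong / total if total else 0.0


# Toy illustration only (hypothetical words).
toy_corpus = [["bu", "di"], ["san", "ti"], ["ha", "di"]]
model = train(toy_corpus)
print(syllabify("hanti", model))
print(syllable_error_rate([["san", "ti"]], [["sa", "nti"]]))
```

In a 4-fold cross-validation like the one the abstract mentions, the word list would be split into four parts, with `train` run on three parts and `syllable_error_rate` computed on the held-out part, once per fold.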

Bibliographic details

Source: Procedia Computer Science, 2021, Issue 1, 7 pages
Original format: PDF
Keywords: syllabification; named-entities; syntactic n-gram
