Automatic Syllabification for Manipuri language

Abstract

Developing hand-crafted rules for syllabifying the words of a language is an expensive task. This paper proposes several data-driven methods for automatic syllabification of words written in the Manipuri language, one of the scheduled languages of India. First, we propose a language-independent rule-based approach formulated using entropy-based phonotactic segmentation. Second, we cast syllabification as a sequence labeling problem and evaluate several sequence labeling approaches. Third, we combine sequence labeling with the rule-based method and evaluate the resulting hybrid approach. Experiments show that the proposed methods outperform the baseline rule-based method: entropy-based phonotactic segmentation achieves a word accuracy of 96%, a CRF (a sequence labeling approach) achieves 97%, and the hybrid approach achieves 98%.
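
One common way to cast syllabification as sequence labeling is to tag each character as beginning (B) or continuing (I) a syllable. The sketch below assumes that encoding. The B/I tag set, the function names, and the romanized toy words are illustrative assumptions, not the paper's actual label scheme or data. It also computes word accuracy, taken here to mean the fraction of words whose predicted syllabification matches the reference exactly, which is an assumed definition.

```python
from typing import List


def to_labels(syllables: List[str]) -> List[str]:
    """Encode a syllabified word as one label per character.

    'B' marks the first character of a syllable, 'I' every other character.
    (Assumed tag set; the paper's actual labels are not given in the abstract.)
    """
    labels: List[str] = []
    for syl in syllables:
        if not syl:
            continue  # ignore empty syllables
        labels.append("B")
        labels.extend("I" * (len(syl) - 1))
    return labels


def from_labels(word: str, labels: List[str]) -> List[str]:
    """Decode per-character labels back into a list of syllables."""
    if len(word) != len(labels):
        raise ValueError("word and labels must have the same length")
    syllables: List[str] = []
    for ch, lab in zip(word, labels):
        if lab == "B" or not syllables:
            syllables.append(ch)   # start a new syllable
        else:
            syllables[-1] += ch    # extend the current syllable
    return syllables


def word_accuracy(gold: List[List[str]], pred: List[List[str]]) -> float:
    """Fraction of words whose predicted syllables equal the reference exactly."""
    if len(gold) != len(pred):
        raise ValueError("gold and pred must have the same number of words")
    if not gold:
        return 0.0
    correct = sum(1 for g, p in zip(gold, pred) if g == p)
    return correct / len(gold)


if __name__ == "__main__":
    # Toy romanized strings for illustration only, not Manipuri data.
    reference = [["ba", "na", "na"], ["to", "ma", "to"]]
    predicted = [["ba", "na", "na"], ["tom", "a", "to"]]
    print(to_labels(reference[0]))
    print(word_accuracy(reference, predicted))
```

With this encoding, any sequence labeler (a CRF, for example) can be trained on the per-character labels, and its output decoded with `from_labels` before scoring.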

Bibliographic record

Source
International Conference on Computational Linguistics | 2016 | pp. 349-357 | 9 pages
Authors
Loitongbam Gyanendro Singh; Lenin Laitonjam; Sanasam Ranbir Singh
Original format: PDF
