A hybrid approach to Pali Sandhi segmentation using BiLSTM and rule-based analysis

Klangjai Tammanam; Nuttachot Promrit; Sajjaporn Waijanya


Abstract

Pali Sandhi is a phonetic transformation from two words into a new word. The phonemes of the neighbouring words are changed and merged. Pali Sandhi word segmentation is more challenging than Thai word segmentation because Pali is a highly inflected language. This study proposes a novel approach that predicts splitting locations by classifying the sample Sandhi words into five classes with a bidirectional long short-term memory model. We applied the classified rules to rectify the words from the splitting locations. We identified 6,345 Pali Sandhi words from Dhammapada Atthakatha. We evaluated the performance of our proposed model on the basis of the accuracy of the splitting locations and compared the results with the dataset. Results showed that 92.20% of the splitting locations were correct, 1.10% of the Pali Sandhi words were predicted as non-splitting location words and 5.83% were not matched with the answers (incomplete segmentation).
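
The two-stage idea in the abstract (predict a splitting location and a class, then apply the rule for that class to restore the original words) can be sketched as follows. This is a minimal illustration, not the authors' implementation: the abstract does not list the five classes or the rules, so the rule names in `RULES`, the helper functions `rectify` and `tally`, and the reading of the three reported outcome categories are all assumptions. The BiLSTM itself is omitted; its output is represented by a split index and a rule name.

```python
import unicodedata
from collections import Counter
from typing import Dict, List, Optional, Tuple

Pair = Tuple[str, str]

# Hypothetical rule classes (the abstract does not enumerate the paper's five).
# Each rule takes the raw left/right pieces and returns the restored words.
RULES = {
    # The pieces are already the original words.
    "no_change": lambda left, right: (left, right),
    # A final "a" of the first word was elided: na + atthi -> natthi.
    "restore_elided_a": lambda left, right: (left + "a", right),
    # Two short "a" vowels merged into a long one: tatra + ayaṃ -> tatrāyaṃ.
    "shorten_long_a": lambda left, right: (left[:-1] + "a", "a" + right),
}


def rectify(word: str, split_at: int, rule: str) -> Pair:
    """Cut `word` at the predicted location, then apply the class rule."""
    # Normalise so each diacritic letter counts as one character.
    word = unicodedata.normalize("NFC", word)
    left, right = word[:split_at], word[split_at:]
    return RULES[rule](left, right)


def tally(predicted: List[Optional[Pair]], gold: List[Pair]) -> Dict[str, float]:
    """Percentage of words per outcome, under an assumed reading of the
    abstract's categories: correct, predicted as non-splitting, mismatched."""
    if not gold or len(predicted) != len(gold):
        raise ValueError("predicted and gold must be non-empty and equal in length")
    counts = Counter()
    for pred, answer in zip(predicted, gold):
        if pred is None:          # model proposed no splitting location
            counts["no_split"] += 1
        elif pred == answer:      # restored words match the answer
            counts["correct"] += 1
        else:                     # split produced, but it does not match
            counts["mismatch"] += 1
    return {k: 100.0 * counts[k] / len(gold)
            for k in ("correct", "no_split", "mismatch")}


if __name__ == "__main__":
    print(rectify("natthi", 1, "restore_elided_a"))
    print(rectify("tatrāyaṃ", 5, "shorten_long_a"))
```

Keeping the location prediction separate from the rule application means a wrong restoration can be traced either to the predicted position or to the rule chosen for that class.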


Bibliographic details

Source: Engineering and Applied Science Research, 2021, Issue 5, 13 pages
Original format: PDF
Chinese Library Classification: General industrial technology
Keywords: BiLSTM; Pali Sandhi; Thai Pali; Rule base; Pali Sandhi splitting
Date added to database: 2022-08-19 03:15:29
