Improving Rule-Based Method for Arabic POS Tagging Using HMM Technique

Meryeme Hadni; Said Alaoui Ouatik; Abdelmonaime Lachkar; Mohammed Meknassi

首页> 外文期刊>Computer Science & Information Technology >Improving Rule-Based Method for Arabic POS Tagging Using HMM Technique

【24h】

Improving Rule-Based Method for Arabic POS Tagging Using HMM Technique

机译：使用HMM技术改进基于规则的阿拉伯语POS标记方法

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Part-of-speech (POS) tagger plays an important role in Natural Language Applications like Speech Recognition, Natural Language Parsing, Information Retrieval and Multi Words Term Extraction. This study proposes a building of an efficient and accurate POS Tagging technique for A rabic language using statistical approach. Arabic Rule-Based method suffers from misclassified and unanalyzed words due to the ambiguity issue. To overcome these two problems, we propose a Hidden Markov Model (HMM) integrated with Arabic Rule-Based method. Our POS tagger generates a set of 4 POS tags: Noun, Verb, Particle, and Quranic Initial (INL). The proposed technique uses the different contextual information of the words with a variety of the features which are helpful to predict the various POS classes. To evaluate its accuracy, the proposed method has been trained and tested with the Holy Quran Corpus containing 77 430 terms for undiacritized Classical Arabic language. The experiment results demonstrate the efficiency of our method for Arabic POS Tagging. The obtained accuracies are 97.6% and 94.4% for respectively our method and for the Rule based tagger method

机译：词性（POS）标记器在自然语言应用（例如语音识别，自然语言解析，信息检索和多词术语提取）中发挥重要作用。这项研究提出了一种使用统计方法针对阿拉伯语的高效，准确的POS标记技术的构建。由于含糊不清的问题，基于阿拉伯规则的方法存在单词分类错误和未分析的问题。为了克服这两个问题，我们提出了一种与阿拉伯基于规则的方法相集成的隐马尔可夫模型（HMM）。我们的POS标记器会生成4个POS标记集：名词，动词，质点和古兰经首字母（INL）。所提出的技术使用具有各种特征的单词的不同上下文信息，这些特征有助于预测各种POS类。为了评估其准确性，该方法已经过圣古兰经语料库的培训和测试，该古兰经语料库包含77 430个不偏音的古典阿拉伯语术语。实验结果证明了我们的阿拉伯POS标记方法的有效性。对于我们的方法和基于规则的标记器方法，获得的准确性分别为97.6％和94.4％

著录项

来源
《Computer Science & Information Technology》 |2013年第8期|共13页
作者
Meryeme Hadni; Said Alaoui Ouatik; Abdelmonaime Lachkar; Mohammed Meknassi;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. span xmlns="http://www.wiley.com/namespaces/wiley" cssStyle="font-family:monospace">HMMoce/span>HMMoce : An span xmlns="http://www.wiley.com/namespaces/wiley" cssStyle="font-family:monospace">R/span>R package for improved geolocation of archival‐tagged fishes using a hidden Markov method [J] . Braun Camrin D., Galuardi Benjamin, Thorrold Simon R., Methods in Ecology and Evolution . 2018,第5期

机译：＆ span xmlns =“http://www.wiley.com/namespaces/wiley”cssstyle =“font-family：monospace”> hmmoce＆ / span> hmmoce：一个＆ span xmlns =“http：// www。 wiley.com/namespaces/wiley“cssstyle =”font-family：monospace“> r＆ / span> R包，用于使用隐藏的马尔可夫方法改进档案标记的鱼类的地理位置
2. A novel Arabic OCR post-processing using rule-based and word context techniques [J] . Iyad Abu Doush, Faisal Alkhateeb, Anwaar Hamdi Gharaibeh International Journal on Document Analysis and Recognition . 2018,第1a2期

机译：使用基于规则和单词上下文技术的新颖阿拉伯语OCR后处理
3. Bidirectional HMM-based Arabic POS tagging [J] . Ayoub Kadim, Azzeddine Lazrek International journal of speech technology . 2016,第2期

机译：基于双向HMM的阿拉伯语POS标记
4. The second-order derivatives of MFCC for improving spoken Arabic digits recognition using Tree distributions approximation model and HMMs [C] . Hammami Nacereddine, Bedda Mouldi, Nadir Farah 2012 2nd International Conference on Communications and Information Technology . 2012

机译：MFCC的二阶导数，用于使用树分布近似模型和HMM改善语音阿拉伯数字识别
5. Methods and techniques for improving figure of merit for wide tuning range quadrature voltage controlled oscillators. [D] . El Gouhary, Amany. 2014

机译：用于改善宽调谐范围正交压控振荡器的品质因数的方法和技术。
6. Improving prokaryotic transposable elements identification using a combination of de novo and profile HMM methods [O] . Choumouss Kamoun, Thibaut Payen, Aurélie Hua-Van, 2013

机译：结合使用de novo和profile HMM方法改进原核转座因子鉴定
7. IMPROVING RULE-BASED METHOD FOR ARABIC POS TAGGING USING HMM TECHNIQUE [O] . Meryeme Hadni, Said Alaoui Ouatik, Abdelmonaime Lachkar, 2014

机译：利用Hmm技术改进基于规则的阿拉伯pOs标记方法
8. Improved Active Tagging Non-Destructive Evaluation Techniques for Full- Scale Structural Composite Elements [R] . Chen, Z., Giurgiutiu, V., Rogers, C. A., 1997

机译：改进的全尺寸结构复合材料主动标记无损评估技术

Improving Rule-Based Method for Arabic POS Tagging Using HMM Technique

摘要

著录项

相似文献

相关主题

期刊订阅