NLTK tagger for Albanian using iterative approach

Abstract

This paper presents research on a part-of-speech (POS) tagging model for Albanian texts built with the NLTK toolkit. The model cascades three taggers with backoff. It uses a dictionary of around 32,000 words with their corresponding POS tags, together with a set of regular-expression rules. A lemmatization module converts nouns and verbs to their lemmas. The text is first tagged with a unigram tagger based on the dictionary, which serves as the baseline tagger for a regular-expression tagger. Words that were lemmatized incorrectly are then corrected, producing a third, lookup tagger, which is used with the first and second taggers as backoff.
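The cascade described in the abstract can be illustrated with a minimal, dependency-free Python sketch. It is not the author's implementation: the class names, the tag labels (N, CONJ, NUM), the toy word entries, the single regular-expression rule, and the order in which the dictionary and regular-expression taggers are consulted are all assumptions made for illustration. In NLTK itself, the same roles are played by `UnigramTagger` and `RegexpTagger` chained through their `backoff` argument.

```python
import re


class BackoffTagger:
    """Base class: try this tagger first, otherwise ask the backoff tagger."""

    def __init__(self, backoff=None):
        self.backoff = backoff

    def choose_tag(self, token):
        raise NotImplementedError

    def tag_one(self, token):
        tag = self.choose_tag(token)
        if tag is None and self.backoff is not None:
            return self.backoff.tag_one(token)
        return tag

    def tag(self, tokens):
        return [(token, self.tag_one(token)) for token in tokens]


class LookupTagger(BackoffTagger):
    """Tags a token from a word -> tag table."""

    def __init__(self, table, backoff=None):
        super().__init__(backoff)
        self.table = table

    def choose_tag(self, token):
        return self.table.get(token)


class RegexTagger(BackoffTagger):
    """Tags a token with the first rule whose pattern matches the whole token."""

    def __init__(self, rules, backoff=None):
        super().__init__(backoff)
        self.rules = [(re.compile(pattern), tag) for pattern, tag in rules]

    def choose_tag(self, token):
        for pattern, tag in self.rules:
            if pattern.fullmatch(token):
                return tag
        return None


def build_tagger():
    # Toy stand-ins (assumptions): the paper's dictionary has ~32,000 entries.
    dictionary = {"dhe": "CONJ", "libër": "N", "shtëpi": "N"}
    rules = [(r"\d+", "NUM")]      # hypothetical rule, not from the paper
    corrections = {"librat": "N"}  # hypothetical correction entry

    regex_tagger = RegexTagger(rules)
    dictionary_tagger = LookupTagger(dictionary, backoff=regex_tagger)
    # The correction lookup is consulted first; the other two act as backoff.
    return LookupTagger(corrections, backoff=dictionary_tagger)


tagger = build_tagger()
print(tagger.tag(["librat", "dhe", "shtëpi", "2013", "xyz"]))
```

A token that no stage recognizes, such as `xyz` here, is left with the tag `None`; a real pipeline might end the chain with a default tag instead.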


Bibliographic details

Source: International Conference on Information Technology Interfaces, 2013, 6 pages
Author: Kadriu Arbana
Original format: PDF
Chinese Library Classification: G202-53
Keywords: Albanian language; NLTK; POS tagging


