Text chunker for Malayalam using Memory-Based Learning

机译：使用基于内存的学习的Malayalam文本块

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Text chunking consists of dividing a text into syntactically correlated parts of words. Given the words and their morphosyntactic class, a chunker will decide which words can be grouped as chunks. Malayalam is a free word order language and has relatively unrestricted phrase structures that make the problem of chunking quite challenging. This paper aims to develop a text chunker for Malayalam using Memory-Based Learning (MBL) approach. Memory-Based Learning is a machine learning methodology based on the idea that the direct reuse of examples using analogical reasoning is more suited for solving language processing problems than the application of rules extracted from those examples. The chunker was trained using the tool Memory-Based Tagger (MBT) with words and their POS tags as features. The chunker demonstrated an accuracy of 97.14%.

机译：文本块包括将文本划分为语法相关的单词。鉴于单词和它们的语气职业类，散货员将决定可以将哪些单词分组为块。 Malayalam是一种免费的单词秩序语言，并且具有相对不受限制的短语结构，使得大小是充满挑战性的问题。本文旨在使用基于内存的学习（MBL）方法为Malayalam开发一个文本块。基于内存的学习是一种机器学习方法，基于使用模拟推理的示例直接重复使用的想法更适合于求解语言处理问题，而不是从这些示例中提取的规则的应用。块训练使用基于工具内存的标签（MBT）用单词及其POS标记作为功能。块状物证明了97.14％的准确性。

著录项

来源
《International Conference on Control, Communication Computing India》|2015年||共5页
会议地点
作者
Rekha Raj C. T.; Reghu Raj P. C.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类自动化技术及设备;
关键词
Machine Learning; Malayalam Chunking; Memory Based Learning; Memory Based Natural Language Processing; Natural Language Processing; POS Tagging; Shallow parsing;

机译：机器学习;Malayalam Chunking;基于记忆的学习;基于记忆的自然语言处理;自然语言处理;POS标记;浅析;

相似文献

外文文献
中文文献
专利

1. Statistically Induced Chunking Recall: A Memory-Based Approach to Statistical Learning [J] . Isbilen Erin S., McCauley Stewart M., Kidd Evan, Cognitive science . 2020,第7期

机译：统计上诱发的划分召回：基于记忆的统计学习方法
2. Memory-Based Hypothesis Formation: Heuristic Learning of Commonsense Causal Relations from Text [J] . H. Cem Bozsahin, Nicholas V. Findler Cognitive science . 1992,第4期

机译：基于记忆的假设形成：基于文本的常识因果关系的启发式学习
3. Unicode-based method for text steganography with Malayalam text [J] . Vidhya P. M., Paul Varghese Journal of intelligent & fuzzy systems: Applications in Engineering and Technology . 2015,第4期

机译：基于Unicode的带有Malayalam文本的文本隐写方法
4. Text chunker for Malayalam using Memory-Based Learning [C] . Rekha Raj C. T., Reghu Raj P. C. 2015 International Conference on Control, Communication amp; Computing India . 2015

机译：马拉雅拉姆语的文本分块器，使用基于内存的学习
5. Memory-based approach to learning commonsense causal relations from text [D] . Bozsahin, Huseyin Cem. 1990

机译：基于记忆的文本常识因果关系学习方法
6. Chunking or not chunking? How do we find words in artificial languagelearning? [O] . Ana Franco, Arnaud Destrebecqz 2012

机译：分块还是不分块？我们如何找到人造语言的单词学习？
7. Chunking and Extracting Text Content for Mobile Learning: A Query-focused Summarizer Based on Relevance Language Model [O] . Yang Guangbing, Kinshuk, Sutinen Erkki, 2013

机译：移动学习的分块和提取文本内容：基于关联语言模型的查询聚焦摘要

Text chunker for Malayalam using Memory-Based Learning

摘要

著录项

相似文献

相关主题

期刊订阅