A Computational Approach for Corpus Based Analysis of Reduplicated Words in Bengali

机译：基于语料库的孟加拉语重叠词分析计算方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Reduplication is an important phenomenon in language studies especially in Indian languages. The definition of reduplication is the repetition of the smallest linguistic unit partially or completely i.e. repetition of phoneme, morpheme, word, phrase, clause or the utterance as a whole and it gives different meaning in syntax as well as semantic level. The reduplicated words has important role in many natural language processing (NLP) applications, namely in machine translation (MT), text summarization, identification of multiword expressions, etc. This article focuses on an algorithm for identifying the reduplicated words from a text corpus and computing statistics (descriptive statistics) of reduplicated words frequently used in Bengali.

机译：在语言研究中，特别是在印度语言中，重复是一个重要现象。重复的定义是部分或全部重复最小的语言单元，即重复音素，词素，词，词组，从句或整体的话语，它在语法和语义层次上给出了不同的含义。重叠词在许多自然语言处理（NLP）应用程序中具有重要作用，即在机器翻译（MT），文本摘要，多词表达的标识等方面。本文着重于从文本语料库和文本语料库中识别重叠词的算法。计算孟加拉语中经常使用的重复单词的统计信息（描述性统计信息）。

著录项

来源
《International conference on intelligent text processing and computational linguistics》|2015年|456-466|共11页
会议地点
作者
Apurbalal Senapati; Utpal Garain;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Reduplication; Bengali; Corpus; Descriptive statistics; Evaluation;

机译：重复孟加拉;语料库;描述性统计;评价;
入库时间 2022-08-26 15:23:07

相似文献

外文文献
中文文献
专利

1. Deep Learning Based Sentiment Analysis in a Code-Mixed English-Hindi and English-Bengali Social Media Corpus [J] . Jamatia Anupam, Swamy Steve Durairaj, Gamback Bjorn, International Journal of Artificial Intelligence Tools: Architectures, Languages, Algorithms . 2020,第5期

机译：基于码混合英语 - 印度和英语 - 孟加拉社交媒体语料库的深度学习情感分析
2. Detection of Opinion: Approach based on Corpus vs. Approach based on SentiWordNet [J] . Mohamed Amine Boudia, Reda Mohamed Hamou, Abdelmalek Amine International journal of organizational and collective intelligence . 2015,第2期

机译：意见检测：基于语料库的方法与基于SentiWordNet的方法
3. Statistical analysis of orthographic and phonemic language corpus for word-based and phoneme-based Polish language modelling [J] . Piotr K?osowski EURASIP journal on audio, speech, and music processing . 2017,第1期

机译：基于单词和音素的波兰语语言建模的正字法和音位语料库的统计分析
4. A Computational Approach for Corpus Based Analysis of Reduplicated Words in Bengali [C] . Apurbalal Senapati, Utpal Garain Annual International Conference on Computational Linguistics and Intelligent Text Processing . 2015

机译：基于语料库的计算方法分析孟加拉语重复词汇
5. A machine-aided approach to intelligent index generation: Using natural language processing and latent semantic analysis to determine the contexts and relationships among words in a corpus. [D] . Lukon, Shelly Candita. 2006

机译：一种机器辅助的智能索引生成方法：使用自然语言处理和潜在语义分析来确定语料库中单词之间的上下文和关系。
6. A computational approach to candidate gene prioritization for X-linked mental retardation using annotation-based binary filtering and motif-based linear discriminatory analysis [O] . Zané Lombard, Chungoo Park, Kateryna D Makova, 2011

机译：一种基于注释的二进制过滤和基于基元的线性判别分析的X连锁智力障碍候选基因优先级计算方法
7. Amharic Internal Reduplication and Foot Structure: A Word-Based Approach [O] . Kevin Schluter 2008

机译：Amharic内部重新删除和脚结构：基于词的方法

A Computational Approach for Corpus Based Analysis of Reduplicated Words in Bengali

摘要

著录项

相似文献

相关主题

期刊订阅