首页> 外文会议>International Conference on Computing for Sustainable Global Development >Implementation Approach of Indian Language Gujarati Grammar's Concept “sandhi” using the Concepts of Rule-based NLP
【24h】

Implementation Approach of Indian Language Gujarati Grammar's Concept “sandhi” using the Concepts of Rule-based NLP

机译:印度语言古吉拉蒂语法的概念“Sandhi”的实施方法,使用基于规则的NLP概念

获取原文

摘要

The term ‘language’ in NLP has to be understood as natural languages like Gujarati, Hindi, English etc., which we use in daily life to communicate. Most of the NLP research has been centered on English & other European Languages. NLP research concerning the Indian language like Gujarati is commenced in the last few years. The centre of attention of this paper is to demonstrate the road map of implementation of Gujarati grammar's concept “sandhi ”. In our words sandhi is a word segmentation process & it is present in most of the South Asian language, such as Devnagri, Sanskrit, Hindi, and Gujarati & even in Chinese & Thai languages.” Sandhi leads to phonetic transformation at word boundaries of a written chunk (small part), and the sounds at the end of word join together to form a single chunk of the character sequence.” Our main spotlight is on rule-based implementation of “sandhi”. Similar to every Indian scripting language Gujarati language (Grammar) also has its own specified rules of composition for combining the consonants, vowels and modifiers. We have identified certain rules by which we accomplish the practical implementation of “sandhi ”. There are many sandhi rules available, each denoting a unique combination of phonetic transformations, documented in the grammatical tradition of Gujarati. The Sandhi does not make any syntactic or semantic changes to the words implicated. Sandhi is an elective operation that depends only on the alertness of the writer.
机译:NLP中的“语言”一词必须被理解为古吉拉特,印地语,英语等的自然语言,我们在日常生活中沟通。大多数NLP研究都以英语和其他欧洲语言为中心。关于印度语言的NLP研究,如古吉拉蒂在过去几年开始。本文的关注中心是展示古吉拉蒂语法“Sandhi”实施的路线图。在我们的话语中,Sandhi是一个词分割过程,它存在于大多数南亚语中,例如Devnagri,Sanskrit,Hindi和Gujarati甚至是中文和泰语语言。“ Sandhi在书面块(小部分)的字边界上导致语音变换,以及单词末尾的声音连接在一起形成一个字符序列的单个块。“我们的主要聚光灯是基于规则的“Sandhi”实施。类似于每个印度脚本语言古吉拉蒂语言(语法)也有自己指定的组合规则,用于组合辅音,元音和修饰符。我们已经确定了我们完成“Sandhi”的实际实施规则。有许多Sandhi规则可用,每个Sandhi规则都表示语音变换的独特组合,记录在Gujarati的语法传统中。 Sandhi不会对牵连的单词作出任何句法或语义改变。 Sandhi是一项选修业务,只取决于作者的警觉性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号