首页> 外文会议>International Conference on Data Science, Machine Learning and Applications >A Tool for Statistical Analysis of Alphabets and Words of Hindi
【24h】

A Tool for Statistical Analysis of Alphabets and Words of Hindi

机译:印地语字母和单词的统计分析工具

获取原文

摘要

Natural language processing (NLP) is an approach to analyze and understand human language in a smart and useful way. Statistical analysis is an approach of mathematics to examine and perform analysis to map data in the form of numeric values. Frequency analysis of Hindi alphabets and words can help to examine occurrence frequency of words and alphabets in according to the provided word list. In this paper, an integrated model has been developed and implemented to perform statistical analysis of alphabets and words for the Hindi language. The complete model includes three modules for word-list preparation, frequency count and statistical analysis of alphabets and words. Hindi News articles of local and national paper have been considered as the input source. The tool evaluates the length, frequency of occurrence of practical and dictionary words.
机译:自然语言处理(NLP)是一种以智能和有用的方式分析和理解人类语言的方法。统计分析是一种数学方法,用于检查和执行分析以映射数值形式的数据。印地语字母和单词的频率分析可以帮助根据提供的单词列表检查单词和字母的出现频率。在本文中,已经开发并实施了一个集成模型来对印地语进行字母和单词的统计分析。完整的模型包括三个模块,用于单词列表的准备,频率计数以及字母和单词的统计分析。本地和国家报纸的《印地语新闻》文章已被视为输入来源。该工具可以评估实际单词和字典单词的长度,出现频率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号