首页> 外文会议>International Conference on Informatics, Electronics amp;amp;amp;amp;amp;amp; Vision >Bengali Stop Phrase Detection Mechanism using Corpus Based Method
【24h】

Bengali Stop Phrase Detection Mechanism using Corpus Based Method

机译:基于语料库的方法,孟加拉止术检测机制

获取原文
获取外文期刊封面目录资料

摘要

This paper discusses a corpus-based method for the detection of the stop phrase. These phrases must be detected and eliminated during NLP in the quest for attaining efficient indexing in modern Information Retrieval (IR) systems. A complete set of stop phrases for the Bengali language has not been developed yet. In this paper, a corpus-based approach is introduced for recognizing and extracting Bengali stop phrases. This proposed technique indicates that an input paragraph will be tokenized in several required manners and after that identification of stop phrases will be obtained by checking through the corpus. Accepted stop phrases will be sent for uniqueness. Outcomes of this proposed approach for stop phrases detection are notable where accuracy, precision, and recall results are observable. Eliminating these stop phrases will further reduce the time complexity of those algorithms, which were used in case of text summarizing and IR system.
机译:本文讨论了一种基于语料库的检测方法的方法。必须在NLP期间检测和消除这些短语,以便在现代信息检索(IR)系统中获得高效索引。孟加拉语的一整套停止短语尚未开发。本文介绍了一种基于语料库的方法,用于识别和提取孟加拉止舌。该提出的技术表明输入段落将以几种所需的方式令牌化,并且在通过检查语料库来获得停止短语的识别之后。被接受的停止短语将以唯一性发送。这种止动句子检测方法的结果是值得注意的,其中准确,精度和召回结果是可观察到的。消除这些停止短语将进一步降低这些算法的时间复杂性,这些算法在文本总结和IR系统的情况下使用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号