首页> 外文会议>International Conference on Computing, Communication and Networking Technologies >Bengali Question Answering System Using Seq2Seq Learning Based on General Knowledge Dataset
【24h】

Bengali Question Answering System Using Seq2Seq Learning Based on General Knowledge Dataset

机译:基于常识数据集的Seq2Seq学习孟加拉语问答系统

获取原文

摘要

Context-based QA system is a leading research area in NLP. An automatic QA system that can respond to the answer, and which is related to the given context. A deep learning-based model provides a more factual result for today's QA system. Here we introduce a deep learning-based Seq2Seq model for Bengali context-based QA system using general knowledge dataset. Where context and question are part of the encoder and related answer is part of the decoder. All automatic system can discern any language by machine translation. Sequence wise learning is a good solution for those types of automatic system learning. Input tokens are encoded by the encoder and output tokens are decoded by the decoder. Each sequence is stored in LSTM cell that maintains the sequence of input and output. Most of the AI system is developed in different languages. Compared with other languages the Bengali language needs to expand research this field. The major perspective of this research to develop an AI based QA system for the Bengali language. For experiment total, two thousand Bengali general knowledge data is used that also provides a dataset for Bengali QA system. In the dataset, the context contains the main feature for the question. After training our model it gives 99 % accuracy for this dataset and 89% accuracy for validation. The trained model gives a good response in answer prediction.
机译:基于上下文的质量保证系统是NLP领域的领先研究领域。一个自动的质量检查系统,可以响应答案,并且与给定的上下文有关。基于深度学习的模型为当今的质量保证体系提供了更加真实的结果。在这里,我们介绍了使用通用知识数据集的孟加拉语基于上下文的QA系统的基于深度学习的Seq2Seq模型。其中上下文和问题是编码器的一部分,而相关答案是解码器的一部分。全自动系统可以通过机器翻译识别任何语言。对于那些类型的自动系统学习,按顺序学习是一个很好的解决方案。输入令牌由编码器编码,输出令牌由解码器解码。每个序列都存储在LSTM单元中,该单元维护输入和输出的序列。大多数AI系统是用不同的语言开发的。与其他语言相比,孟加拉语需要扩展该领域的研究。本研究的主要观点是为孟加拉语言开发基于AI的质量检查系统。对于实验总数,使用了两千个孟加拉语常识数据,该数据还提供了孟加拉语QA系统的数据集。在数据集中,上下文包含问题的主要特征。训练我们的模型后,该数据集的准确性为99%,验证的准确性为89%。经过训练的模型在答案预测中给出了良好的响应。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号