首页> 外文OA文献 >Acquisition of context-based word recognition by reinforcement learning using a recurrent neural network
【2h】

Acquisition of context-based word recognition by reinforcement learning using a recurrent neural network

机译:通过使用递归神经网络进行强化学习来获取基于上下文的单词识别

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

The eye movement and recognition in humans seem very flexible and intelligent. The flexible and intelligent recognition is not only depending on the informatiOn that belonged by target object, but also is supported by other information, including past knowledge and contextual information. For an example, when we read a book, we do not usually seem to read it -by recognizing each character one by one. We would predict a word from the first two or several characters, and also would utilize the story context to expect the next character or word. In order to make flexible recognition, we consider many things simultaneously, and move our eyes, recognize and understand the context flexibly. Such parallel consideration and flexibility must be achieved by our parallel and flexible brain, and learning plays an important role in, it. udIn our laboratory, the coupling of a neural network (NN) and reinforcement learning (RL) is considered useful because of its autonomous, parallel and flexible learning ability. In the previous research, it was verified through simulations and also using a real camera that the appropriate camera motion, recognition and recognition timing were successfully acquired. However, since a regular layered neural network was used, the recognition and movement functions were limited to the case where the whole pattern is in the visual field. Furthermore, only two patterns were used. udIn this thesis, context-based word recognition learning system was developed. 6 words that need context-based recognition function for the words to be recognized were chosen. The learning system was trained to recognize all the chosen words. As a learning method, the combination of Reinforcement Learning and a Recurrent Neural Network (RNN) was applied. The developed learning system has a 4-layered RNN and it was trained by BPTT method based on teaching signal that was generated by Q-Learning algorithm. The learning system was trained on several tasks using simulations or real time learning in order to verify whether flexible recognition could emerge through this context-based word recognition learning. There are two types of problem tasks in this thesis. The first type using ideal- images asudlearning data and the second one using real camera captured images. For the first type, two tasks were done through simulations. The one is a fixed initial position task, and the second one is a random initial position task. As results, for both simulations the system manages to recognize all the prepared words. Here, the relation between the parameter of discount factor teaching signal was also discussed and how to choose the appropriate settings was proposed. In the second type of task, both simulations were trained using real camera captured images that were prepared beforehand as samples. After the learning was successfully verified, finally, the system was trained for the same task in real time. After the real time learning, learning was successful and system manages to recognize the entire prepared patterns. All these results show that by applying a combination of Reinforcement Learning and a Recurrent Neural Network learning method, the context-based word recognition can be achieved. Flexible recognition function in the appropriate timing is mostly acquired.ud
机译:人类的眼睛运动和识别似乎非常灵活和智能。灵活而智能的识别不仅取决于目标对象所属的信息,而且还受其他信息(包括过去的知识和上下文信息)的支持。例如,当我们读一本书时,我们通常似乎并不阅读它,而是逐个识别每个字符。我们将根据前两个或几个字符来预测一个单词,并利用故事情境来期待下一个字符或单词。为了进行灵活的识别,我们会同时考虑很多事情,并移动眼睛,灵活地识别和理解上下文。这种并行考虑和灵活性必须由我们并行灵活的大脑来实现,而学习在其中发挥着重要作用。 ud在我们的实验室中,神经网络(NN)和强化学习(RL)的结合被认为是有用的,因为它具有自主,并行和灵活的学习能力。在先前的研究中,通过模拟以及使用真实相机进行了验证,可以成功获取适当的相机运动,识别和识别时间。但是,由于使用了规则的分层神经网络,因此识别和移动功能仅限于整个模式都在视野内的情况。此外,仅使用了两种模式。 ud本论文开发了基于上下文的单词识别学习系统。选择了需要基于上下文的识别功能的6个单词进行识别。学习系统经过训练可以识别所有选定的单词。作为一种学习方法,应用了强化学习和递归神经网络(RNN)的组合。所开发的学习系统具有4层RNN,并根据BPTT方法基于Q学习算法生成的教学信号对其进行了训练。使用模拟或实时学习对学习系统进行了多项任务训练,以验证是否可以通过基于上下文的单词识别学习来实现灵活的识别。本论文有两种类型的问题任务。第一种使用理想图像作为 udlearning数据,第二种使用真实相机捕获的图像。对于第一类,通过仿真完成了两项任务。一个是固定的初始位置任务,第二个是随机的初始位置任务。结果,对于这两种模拟,系统设法识别所有准备好的单词。在此,还讨论了折扣因子示教信号的参数之间的关系,并提出了如何选择合适的设置。在第二种类型的任务中,两种模拟都是使用预先准备好的真实相机捕获图像作为样本进行训练的。在成功地验证了学习之后,最终,该系统被实时训练以完成相同的任务。实时学习后,学习成功,系统设法识别出整个准备好的模式。所有这些结果表明,通过结合强化学习和递归神经网络学习方法,可以实现基于上下文的单词识别。在适当的时间获得了灵活的识别功能。 ud

著录项

  • 作者

    Ahmad Afif Mohd Faudzi;

  • 作者单位
  • 年度 2012
  • 总页数
  • 原文格式 PDF
  • 正文语种
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号