首页> 外文期刊>Systems and Computers in Japan >An Architecture of Self-Organizing Map for Temporal Signal Processing and Its Application to a Braille Recognition Task
【24h】

An Architecture of Self-Organizing Map for Temporal Signal Processing and Its Application to a Braille Recognition Task

机译:用于时间信号处理的自组织映射架构及其在盲文识别任务中的应用

获取原文
获取原文并翻译 | 示例
       

摘要

A self-organizing map (SOM) performs a mapping of an object preserving its topological relations between input and output spaces, and also can be seen as a coordinate transformer that preserves adjacency relations. Since the standard SOM cannot deal with temporal data intrinsically, in this paper we provide new feedback pathways around the competitive layer to refer to context information of the past history. An extra output layer is added next to the competitive layer to represent secondary candidates and a quantitative measure of confidence. From the viewpoint of its structural similarities, we refer to this architecture as an Elman-type feedback SOM. In order to clarify the effectiveness of the proposed model, we then adopt a temporal signal processing task of Braille recognition. Braille is a character set for visually impaired people made up of 6 dots on a 3 by 2 grid. When Braille is read by running the fingertip over the characters, the vertical axis can be seen as providing spatial information and the horizontal axis as providing temporal information. As a result of computer simulations with partially common four city names, we confirm that the proposed model can recognize them appropriately. In addition, we find that this approach is robust to both the temporal elasticity and spatial displacement. Moreover, when we conduct an analysis with the neuro-bar model, we find that the Braille recognition task is executed based on a series of state transition along a type of pathway that has been developed during training.
机译:自组织映射(SOM)执行对象的映射,以保留其在输入和输出空间之间的拓扑关系,并且还可以视为保留邻接关系的坐标转换器。由于标准SOM无法本质上处理时间数据,因此在本文中,我们提供了围绕竞争层的新反馈路径,以引用过去历史的上下文信息。在竞争层旁边增加了一个额外的输出层,以表示次要候选人和定量的置信度。从其结构相似性的角度来看,我们将此体系结构称为Elman型反馈SOM。为了阐明所提出模型的有效性,我们然后采用了盲文识别的时间信号处理任务。盲文是一种用于视障人士的字符集,由3点乘2的网格上的6个点组成。当通过在字符上移动指尖来读取盲文时,可以将垂直轴视为提供空间信息,将水平轴视为提供时间信息。通过对部分相同的四个城市名称进行计算机模拟的结果,我们确认所提出的模型可以正确识别它们。另外,我们发现这种方法对于时间弹性和空间位移都是鲁棒的。此外,当我们使用神经棒模型进行分析时,我们发现盲文识别任务是基于沿着训练过程中发展出的一种路径的一系列状态转变而执行的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号