Journal of Neural Engineering

Brain2Char: a deep architecture for decoding text from brain recordings

Abstract

Decoding language representations directly from the brain can enable new brain-computer interfaces (BCIs) for high-bandwidth human-human and human-machine communication. Clinically, such technologies can restore communication in people with neurological conditions affecting their ability to speak. Approach. In this study, we propose a novel deep network architecture, Brain2Char, for directly decoding text (specifically, character sequences) from direct brain recordings (electrocorticography, ECoG). The Brain2Char framework combines state-of-the-art deep learning modules: 3D Inception layers for multiband spatiotemporal feature extraction from neural data, followed by bidirectional recurrent layers and dilated convolution layers with a language-model-weighted beam search to decode character sequences, optimized with a connectionist temporal classification (CTC) loss. Additionally, given the highly non-linear transformations that underlie the conversion of cortical function to character sequences, we regularize the network's latent representations, motivated by insights into the cortical encoding of speech production and by artifactual aspects specific to ECoG data acquisition. To do this, we impose auxiliary losses on the latent representations for articulatory movements, speech acoustics and session-specific non-linearities. Main results. In three (out of four) participants reported here, Brain2Char achieves word error rates of 10.6%, 8.5%, and 7.0%, respectively, on vocabulary sizes ranging from 1200 to 1900 words. Significance. These results establish a new end-to-end approach to decoding text from brain signals and demonstrate the potential of Brain2Char as a high-performance communication BCI.
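To make the pipeline described in the abstract concrete, below is a minimal PyTorch sketch of a Brain2Char-style model. It is an illustration under stated assumptions, not the authors' implementation: plain 3D convolutions stand in for the 3D Inception feature extractor, bidirectional recurrent and dilated convolution layers produce per-frame character logits trained with a CTC loss, and auxiliary regression heads on the latent sequence approximate the articulatory and acoustic regularizers. All layer sizes, tensor shapes, loss weights and names such as Brain2CharSketch are hypothetical; the session-specific loss term and the language-model-weighted beam search used at decode time are omitted.

```python
# Minimal sketch of a Brain2Char-style decoder (illustrative only; layer sizes,
# shapes and loss weights are assumptions, not the paper's configuration).
import torch
import torch.nn as nn
import torch.nn.functional as F


class Brain2CharSketch(nn.Module):
    """ECoG features -> character logits, with auxiliary regression heads
    on the latent representation (articulatory kinematics, speech acoustics)."""

    def __init__(self, n_chars=30, n_artic=33, n_acoustic=32, hidden=256):
        super().__init__()
        # Stand-in for the 3D Inception layers: plain 3D convolutions over
        # (frequency band, electrode, time) multiband spatiotemporal input.
        self.feat = nn.Sequential(
            nn.Conv3d(1, 32, kernel_size=3, padding=1),
            nn.ReLU(),
            nn.Conv3d(32, 64, kernel_size=3, padding=1),
            nn.ReLU(),
        )
        # Bidirectional recurrent layers over the pooled time series.
        self.rnn = nn.LSTM(64, hidden, num_layers=2, batch_first=True,
                           bidirectional=True)
        # Auxiliary heads regularizing the latent space (articulatory movements
        # and speech acoustics, per the auxiliary losses in the abstract).
        self.artic_head = nn.Linear(2 * hidden, n_artic)
        self.acoustic_head = nn.Linear(2 * hidden, n_acoustic)
        # Dilated 1D convolutions producing per-frame character logits
        # (+1 output class for the CTC blank symbol, used as the last index).
        self.char_head = nn.Sequential(
            nn.Conv1d(2 * hidden, hidden, kernel_size=3, dilation=2, padding=2),
            nn.ReLU(),
            nn.Conv1d(hidden, hidden, kernel_size=3, dilation=4, padding=4),
            nn.ReLU(),
            nn.Conv1d(hidden, n_chars + 1, kernel_size=1),
        )

    def forward(self, x):
        # x: (batch, 1, bands, electrodes, time) multiband ECoG features
        h = self.feat(x)                    # (batch, 64, bands, electrodes, time)
        h = h.mean(dim=(2, 3))              # pool over bands/electrodes -> (batch, 64, time)
        z, _ = self.rnn(h.transpose(1, 2))  # latent sequence: (batch, time, 2*hidden)
        artic = self.artic_head(z)          # auxiliary target: articulator kinematics
        acoustic = self.acoustic_head(z)    # auxiliary target: speech acoustics
        logits = self.char_head(z.transpose(1, 2)).transpose(1, 2)  # (batch, time, chars+1)
        return logits, artic, acoustic


def brain2char_loss(logits, char_targets, input_lens, target_lens,
                    artic_pred, artic_true, acoustic_pred, acoustic_true,
                    w_artic=0.1, w_acoustic=0.1):
    """CTC loss on characters plus auxiliary regression losses (weights are guesses)."""
    log_probs = logits.log_softmax(-1).transpose(0, 1)  # CTC expects (time, batch, classes)
    ctc = F.ctc_loss(log_probs, char_targets, input_lens, target_lens,
                     blank=logits.size(-1) - 1, zero_infinity=True)
    aux = w_artic * F.mse_loss(artic_pred, artic_true) \
        + w_acoustic * F.mse_loss(acoustic_pred, acoustic_true)
    return ctc + aux
```

At inference time, the per-frame character probabilities from a model like this would be passed to a language-model-weighted beam search rather than decoded greedily, which is how the framework arrives at its final character sequences.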
