Quality prediction of synthesized speech based on tensor structured EEG signals

Abstract

This study investigates methods for predicting the quality of synthesized speech from EEG. Training a predictive model on EEG is challenging because of the small number of training trials, the low signal-to-noise ratio, and the high correlation among independent variables. When a predictive model is trained with a machine learning algorithm, the features extracted from multi-channel EEG signals are usually organized as a vector, and their structure is ignored even though the signals are highly structured. This study predicts subjective rating scores of synthesized speech, including overall impression, valence, and arousal, using tensor-structured features instead of vectorized ones in order to exploit that structure. We extracted various features and assembled them into a tensor feature that preserves their structure. Both vectorized and tensorial features were used to predict the rating scores, and the experimental results showed that prediction with tensorial features achieved better predictive performance. Among the features, the alpha and beta bands were particularly effective for prediction, which agrees with previous neurophysiological studies.
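The difference between vectorized and tensor-structured features can be illustrated with a minimal sketch. It is not the authors' pipeline: the log band-power features, the band edges in `BANDS`, the sampling rate, the synthetic data, and the rank-1 bilinear weights `u` and `v` are all assumptions chosen for illustration. The abstract itself names only multi-channel EEG and the alpha and beta bands.

```python
import numpy as np

# Hypothetical band edges in Hz (assumed for this example).
BANDS = {"theta": (4, 8), "alpha": (8, 13), "beta": (13, 30)}

def band_power_tensor(eeg, fs):
    """Map EEG of shape (trials, channels, samples) to log band power
    of shape (trials, channels, bands), keeping the channel and band
    axes separate instead of flattening them."""
    freqs = np.fft.rfftfreq(eeg.shape[-1], d=1.0 / fs)
    power = np.abs(np.fft.rfft(eeg, axis=-1)) ** 2
    feats = [
        power[..., (freqs >= lo) & (freqs < hi)].mean(axis=-1)
        for lo, hi in BANDS.values()
    ]
    return np.log(np.stack(feats, axis=-1))

rng = np.random.default_rng(0)
eeg = rng.standard_normal((20, 8, 256))       # synthetic placeholder data
X_tensor = band_power_tensor(eeg, fs=128)     # (trials, channels, bands)
X_vector = X_tensor.reshape(len(X_tensor), -1)  # (trials, channels * bands)

n_channels, n_bands = X_tensor.shape[1:]

# A rank-1 bilinear predictor uses one weight per channel and one per band.
u = rng.standard_normal(n_channels)
v = rng.standard_normal(n_bands)
y_tensor = np.einsum("tcb,c,b->t", X_tensor, u, v)

# The same predictor expressed on the vectorized features needs a full
# weight vector with one entry per (channel, band) pair.
y_vector = X_vector @ np.outer(u, v).ravel()

params_tensor = n_channels + n_bands   # structured model
params_vector = n_channels * n_bands   # unstructured model
```

In this toy setting the structured model has `n_channels + n_bands` weights, while an unconstrained linear model on the flattened features has `n_channels * n_bands`. A smaller parameter count of this kind is one plausible reason a structure-aware model could help when training trials are scarce.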

Bibliographic details

Journal: PLoS Clinical Trials
Authors: Hayato Maki; Sakriani Sakti; Hiroki Tanaka; Satoshi Nakamura
Year (volume), issue: 2012 (13), 6
Article number: e0193521
Total pages: 13
Original format: PDF

