首页> 美国卫生研究院文献>PLoS Clinical Trials >Quality prediction of synthesized speech based on tensor structured EEG signals
【2h】

Quality prediction of synthesized speech based on tensor structured EEG signals

机译:基于张量结构脑电信号的合成语音质量预测

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

This study investigates quality prediction methods for synthesized speech using EEG. Training a predictive model using EEG is challenging due to a small number of training trials, a low signal-to-noise ratio, and a high correlation among independent variables. When a predictive model is trained with a machine learning algorithm, the features extracted from multi-channel EEG signals are usually organized as a vector and their structures are ignored even though they are highly structured signals. This study predicts the subjective rating scores of synthesized speeches, including their overall impression, valence, and arousal, by creating tensor structured features instead of vectorized ones to exploit the structure of the features. We extracted various features to construct a tensor feature that maintained their structure. Vectorized and tensorial features were used to predict the rating scales, and the experimental result showed that prediction with tensorial features achieved the better predictive performance. Among the features, the alpha and beta bands are particularly more effective for predictions than other features, which agrees with previous neurophysiological studies.
机译:这项研究调查了使用脑电图的合成语音质量预测方法。使用脑电图训练预测模型具有挑战性,原因是训练试验数量少,信噪比低以及自变量之间的相关性高。当使用机器学习算法训练预测模型时,从多通道EEG信号中提取的特征通常被组织为矢量,并且即使它们是高度结构化的信号,其结构也将被忽略。这项研究通过创建张量结构特征而不是矢量化特征来利用特征的结构,从而预测了合成语音的主观评分得分,包括其总体印象,效价和唤醒程度。我们提取了各种特征以构造张量特征以维持其结构。使用矢量化和张量特征预测等级量表,实验结果表明,具有张量特征的预测具有较好的预测性能。在这些特征中,α和β谱带比其他特征在预测方面特别有效,这与以前的神经生理学研究一致。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号