The present disclosure provides a technical solution for highly empathetic TTS processing. The solution not only takes semantic and linguistic features into consideration, but also assigns a sentence ID to each sentence in a training text to distinguish the sentences from one another. These sentence IDs may be introduced as training features when training a machine learning model, enabling the model to learn how the acoustic codes of sentences change with sentence context. Performing TTS processing with the trained model may output speech whose rhythm and tone change naturally, making the TTS more empathetic. A highly empathetic audio book may be generated using the TTS processing provided herein, and an online system for generating highly empathetic audio books may be established with this TTS processing as its core technology.
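The sentence-ID assignment step can be illustrated with a minimal Python sketch. The abstract does not say how sentences are segmented or how the IDs are encoded, so everything here is an assumption made for illustration: the punctuation-based splitting rule, the sequential integer IDs, and the hypothetical names `SentenceRecord` and `assign_sentence_ids`.

```python
import re
from dataclasses import dataclass
from typing import List


@dataclass
class SentenceRecord:
    """One training sentence paired with its ID (hypothetical structure)."""
    sentence_id: int
    text: str


def assign_sentence_ids(training_text: str) -> List[SentenceRecord]:
    """Split a training text into sentences and give each a distinct ID.

    Assumption: sentences end with '.', '!' or '?' followed by whitespace.
    A real system would likely use a proper sentence segmenter.
    """
    parts = re.split(r"(?<=[.!?])\s+", training_text.strip())
    sentences = [p for p in parts if p]  # drop empty fragments
    # Sequential integers are one simple way to keep sentences distinguishable.
    return [SentenceRecord(i, s) for i, s in enumerate(sentences)]


if __name__ == "__main__":
    for record in assign_sentence_ids("It was late. Who was there? Nobody answered!"):
        print(record.sentence_id, record.text)
```

In a training pipeline of the kind the abstract describes, each `sentence_id` would presumably be supplied to the model alongside the semantic and linguistic features of that sentence. That part is not sketched here because the abstract gives no detail about it.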