Mixed wideband speech and music coding using a speech/music discriminator

Abstract

In multimedia applications such as videoconferencing, users are demanding higher-quality speech/audio transmission than POTS can offer. 7 kHz wideband speech/audio offers a good compromise between bandwidth and sound quality. It improves the intelligibility and naturalness of speech and adds a feeling of transparent communication. Currently, the only existing international standard for coding such signals is the G.722 wideband speech/audio coder. While its coding quality is satisfactory, its bit rate leaves much to be desired. The CELP-based approach has been very successful in telephone-bandwidth speech coding, but is not suitable for coding non-speech signals because of the assumed signal production model. This paper proposes an alternative approach to mixed speech/music coding, which uses a discriminator to separate music signals from speech, and codes them with the G.722 coder and a G.723.1-based speech coder, respectively. Simulations show very promising results.
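
The switching idea in the abstract can be illustrated with a minimal Python sketch. It is not the paper's implementation: the abstract does not specify the discriminator's features or the frame format, so `toy_is_music`, `fake_g722_encode`, and `fake_g7231_encode` are hypothetical stand-ins, and the amplitude threshold is arbitrary. Only the routing (music frames to the G.722 path, speech frames to the G.723.1-based path) follows the abstract.

```python
from typing import Callable, Iterable, List, Tuple

Frame = List[float]
Encoder = Callable[[Frame], bytes]


def switched_encode(
    frames: Iterable[Frame],
    is_music: Callable[[Frame], bool],
    music_coder: Encoder,
    speech_coder: Encoder,
) -> List[Tuple[str, bytes]]:
    """Route each frame to one of two coders and tag the payload with the choice."""
    encoded = []
    for frame in frames:
        if is_music(frame):
            encoded.append(("music", music_coder(frame)))
        else:
            encoded.append(("speech", speech_coder(frame)))
    return encoded


# Hypothetical stand-ins below: none of these come from the paper or from
# any real G.722 / G.723.1 library.
def toy_is_music(frame: Frame) -> bool:
    """Placeholder decision rule: mean absolute amplitude above an arbitrary 0.5."""
    return sum(abs(s) for s in frame) / len(frame) > 0.5


def fake_g722_encode(frame: Frame) -> bytes:
    """Placeholder for the wideband (G.722) path: emits a marker and the frame length."""
    return b"M" + bytes([len(frame)])


def fake_g7231_encode(frame: Frame) -> bytes:
    """Placeholder for the G.723.1-based speech path: emits a marker and the frame length."""
    return b"S" + bytes([len(frame)])


frames = [[0.9, -0.8], [0.1, 0.0, -0.1]]
result = switched_encode(frames, toy_is_music, fake_g722_encode, fake_g7231_encode)
```

Tagging each payload with the chosen path reflects the fact that a decoder in a switched scheme needs to know which coder produced each frame; how the paper signals that choice is not stated in the abstract.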

Bibliographic record

Source: IEEE Region 10 Annual Conference, 1997, 4 pages
Author: Rong-Yu Iao; Institute of Electric and Electronic Engineer
Original format: PDF
Chinese Library Classification: Automation technology, computer technology

Similar literature

1. Discriminative Weight Training for Support Vector Machine-Based Speech/Music Classification in 3GPP2 SMV Codec [J]. Sang-Kyun KIM, Joon-Hyuk CHANG. IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences, 2010, no. 1.
2. When speech enhances Spatial Musical Association of Response Codes: Joint spatial associations of pitch and timbre in nonmusicians [J]. Weis Tina, Estner Barbara, Lachmann Thomas. The Quarterly Journal of Experimental Psychology: QJEP, 2016, no. 9.
3. Speech and music perception with the new fine structure speech coding strategy: preliminary results [J]. Arnoldner C, Riss D, Brunner M, Acta Oto-Laryngologica, 2007, no. 12.
4. Mixed wideband speech and music coding using a speech/music discriminator [C]. Rong-Yu Iao. 1997.
5. Investigating the Effect of Musical Training on Speech-in-Speech Perception: The Role of f0, Timing, and Spectral Cues [D]. Cohn, Michelle Dana. 2018.
6. Do Musicians and Non-musicians Differ in Speech-on-Speech Processing? [O]. Elif Canseza Kaplan, Anita E. Wagner, Paolo Toffanin, 2021.
7. A robust speech/music discriminator for switched audio coding [O]. Fuchs Guillaume. 2015.
