New linear predictive methods for digital speech processing

Abstract

Speech processing is needed whenever speech is to be compressed, synthesised or recognised by means of electrical equipment. Different types of phones, multimedia equipment and interfaces to various electronic devices all require digital speech processing. As an example, a GSM phone applies speech processing in its RPE-LTP encoder/decoder (ETSI, 1997). In this coder, 20 ms of speech is analysed first in the short-term prediction (STP) part and then in the long-term prediction (LTP) part. Finally, speech compression is achieved in the RPE encoding part, where only 1/3 of the encoded samples are selected to be transmitted.

This thesis presents modifications for one of the most widely applied techniques in digital speech processing, namely linear prediction (LP). During recent decades linear prediction has played an important role in telecommunications and other areas related to speech compression and recognition. In linear prediction, a sample s(n) is predicted from its p previous samples by forming a linear combination of those samples and minimising the prediction error. This procedure in the time domain corresponds to modelling the spectral envelope of the speech spectrum in the frequency domain. How accurately the spectral envelope matches the speech spectrum depends strongly on the order of the resulting all-pole filter. This order, in turn, is usually related to the number of parameters required to define the model, and hence to be transmitted.

Our study presents new predictive methods, which are modified from conventional linear prediction by selecting the previous samples for the linear combination differently. This algorithmic development aims at new all-pole techniques that could represent speech spectra with fewer parameters.
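The conventional linear prediction described in the abstract can be illustrated with a minimal sketch. It assumes the autocorrelation method for minimising the prediction error, which the abstract does not specify, and it does not reproduce the modified methods of the thesis. The function names `lp_coefficients` and `prediction_error` are hypothetical and chosen only for this illustration.

```python
import numpy as np


def lp_coefficients(s, p):
    """Estimate order-p linear prediction coefficients (autocorrelation method).

    Returns a such that s[n] is approximated by sum_{k=1..p} a[k-1] * s[n-k].
    Assumption: the frame is analysed as-is, without windowing.
    """
    s = np.asarray(s, dtype=float)
    n = len(s)
    # Autocorrelation values r[0..p] of the frame.
    r = np.array([np.dot(s[: n - k], s[k:]) for k in range(p + 1)])
    # Normal equations R a = r[1:], where R[i, j] = r[|i - j|] (Toeplitz).
    R = np.array([[r[abs(i - j)] for j in range(p)] for i in range(p)])
    return np.linalg.solve(R, r[1:])


def prediction_error(s, a):
    """Residual e[n] = s[n] - sum_{k=1..p} a[k-1] * s[n-k], for n >= p."""
    s = np.asarray(s, dtype=float)
    p = len(a)
    # s[n-p:n][::-1] is (s[n-1], s[n-2], ..., s[n-p]).
    pred = np.array([np.dot(a, s[n - p:n][::-1]) for n in range(p, len(s))])
    return s[p:] - pred
```

As a usage example, a 20 ms frame sampled at 8 kHz holds 160 samples, so `lp_coefficients(frame, 8)` would return eight coefficients for that frame; the order 8 here is only an illustrative choice. A higher order `p` generally follows the spectrum more closely at the cost of more parameters, which is the trade-off the abstract refers to.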

Bibliographic record

Author: Varho, Susanna
Year: 2001
Original format: PDF
Language: en

Similar literature

1. Predicting the Intelligibility of Noisy and Nonlinearly Processed Binaural Speech [J] . Asger Heidemann Andersen, Jan Mark de Haan, Zheng-Hua Tan, IEEE/ACM Transactions on Audio, Speech, and Language Processing . 2016, No. 11
2. Steganography in vector quantization process of linear predictive coding for low-bit-rate speech codec [J] . Liu Peng, Li Songbin, Wang Haiqiang, Multimedia Systems . 2017, No. 4
3. Speech to Text Synthesis from Video Automated Subtitling using Levinson Durbin Method of Linear Predictive Coding [J] . Shuaian Wang, Lin Xiao, Zhiyuan Liu, International journal of instrumentation science and engineering . 2018, No. 1
4. Code-excited linear predictive speech processing for digital transmission and storage [C] . Langi, A., Kinsner, . 1991
5. Strong field nonlinear optics in atoms and polyatomic molecules: Application of quantum mechanical methods to predict and control laser-induced processes. [D] . Tarazkar, Maryam. 2015
6. Computer aided tool for diagnosis of ENT pathologies using digital signal processing of speech and stroboscopic images [O] . Amaia Méndez Zorrilla, Begoña García Zapirain, Agustín Pérez Izquierdo -1
7. Disease Diagnostic System: Abnormalities in Human Nail [O] . Pranav S. Wazarkar. 2020
Abstract: Various body parts or organs can be analysed to identify different diseases in the human body, and fingernail analysis is one such way. Nails are the body part farthest from the heart and therefore receive oxygen last. As a result, the nails are the first to show the symptoms of a disease. Fingernails can be easily captured for diagnosis, and no heavy equipment or specific conditions are required to use a nail image for disease diagnosis, unlike other tests and scanning processes. Human nails deliver useful information about complaints or nutritive imbalances in the body through their shape, texture and colour. Numerous systemic and skin diseases can be analysed through careful examination of the nails of both limbs, and many nail illnesses have been found to be primary signs of underlying systemic illnesses. Considering these properties, a system is proposed that uses digital image processing (DIP) methods to identify such changes in the human nail, to get more precise results and predict numerous diseases. With the emerging Internet of Things (IoT) concept, the generated report is made available remotely, which helps users reduce transportation efforts. As the system has to deal with large and private data, the security of the data must be ensured; to keep the data confidential, the Blockchain concept is used. The paper contains the implementation of digital image processing for feature extraction of nail images, the use of IoT (ThingSpeak cloud) for data storage, and the implementation of Blockchain to keep the system secure and theft free.
Key words: Internet of Things (IoT), image processing, ThingSpeak, RGB values, mean pixel values, Blockchain, hash key.
8. New linear Predictive Methods for Digital Speech Processing [R] . Varho, S. 2000