首页> 外文学位 >Diagnosis of disease in newborn infants by analysis of cry signals.
【24h】

Diagnosis of disease in newborn infants by analysis of cry signals.

机译:通过分析哭声信号诊断新生儿疾病。

获取原文
获取原文并翻译 | 示例

摘要

Crying is the first sound the baby makes when he enters the world outside of his mother's stomach, which is a very positive sign of a new healthy life. Well, we elders can talk but the newborn infant isn't old enough to do that yet. Cry is all a baby can do to express any discomfort it feels. When initially reading it, the first thing that comes to mind is why the cry is such an important aspect of health care for newborn infants? Although studying on infant's cry was pioneered in the late 1960s, but it never crossed anybody's mind that sick infants might be identified from their cries. Statistical reports by World Health Organization state that the congenital anomalies or birth defects affect approximately 1 in 33 infants born every year and almost all of the world's infant deaths happen in developing countries. Therefore, it is imperative to provide an inexpensive health care system, with no need of complex and advanced technology for poor mothers with newborn babies in low-income countries to survive more babies beyond the first months of life. In spite of the fact that there are a lot of maternal issues that can raise the risks of complications and anomalies in newborn infants, we are curious to examine the ability of solely the concealed information inside infant's cry to clarify the infant's physiological anatomy and psychological condition. The creative idea behind of such a non-invasive diagnostic system is based on the evidence extracted from past research studies for potential ability of infant's cry to distinguish between healthy and sick infants. This innovative idea can tackle key global health and development problems.;The purpose of this study is to develop a newborn cry-based diagnostic system to classify healthy and sick infants with different pathological conditions. First, an informed choice of pathological states and collecting of the infant cry data base is necessary and still in progress to complete the infant cry data base. In many of today's application domains, it is often unavoidable to have data with high dimensionality and small sample size. Both small sample size problem and dimensionality reduction methods have been studied extensively but the combination of imbalanced data and small sample size presents a new challenge to the community. In this situation, learning algorithm often fail to generalize inductive rules over the sample space when presented with this form of imbalance. In fact, the combination of small sample size and high dimensionality hinders learning because of difficulty involved in forming conjugations over the high degree of features with limited samples. In the next part, data preprocessing, including selection and extraction of pathologically-informed features suitably with the best possible precision and then quantifying them for each pathological condition without any human intervention is considered in the system. In order to obtain the full benefit of the information embedded in the cry signal, Mel Frequency Cepstrum Coefficient (MFCC) analysis will be done on both expiratory and inspiratory cry vocalizations separately in this study. To avoid the need of human effort in labeling the boundaries of the corresponding corpus, automatic labeling of cry signals is required for an ideal cry-based diagnostic system. However, to alleviate the segmentation task in this study, it has been manually performed so far.;Finite mixtures are a flexible and powerful probabilistic tool for modeling univariate and multivariate data among all available approaches to do modeling and classification tasks. In this regard, we come up with Gaussian Mixture Models (GMMs) that is a special case of Hidden Markov Models (HMMs) with one state, as a new representation of cry signals according to extracted feature streams. The next part of this thesis is dedicated to enhancement of learning of GMMs that are usually trained using the iterative Expectation Maximization (EM) algorithm. However, considering the risk of overfitting due to small training sample size in some pathological conditions, and the fact that the number of mixtures is fixed in the traditional EM-based re-estimation algorithm, a new learning method based on boosting algorithm is introduced to learn growing mixture models in an incremental and recursive manner.;The idea of Universal Background Model (UBM) used in speaker recognition and verification systems is employed to represent general feature characteristics of infant cry signals. Then, a variant of boosted mixture learning (BML) method is employed in order to derive subclass models for each enrolled disease from the GMM-UBM by adaptation of GMM parameters. The crux of the design was to fuse two subsystems that are based on expiratory and inspiratory sounds in baby cry recordings into a single effective system. Such systems are expected to be more reliable due to the presence of multiple, (fairly) independent pieces of evidence. We present log-likelihood ratio score fusion to stop worrying on the feature compatibility and rigid fusion.;Apart from all of the above-mentioned modeling and learning methods, our work is different from previous works in that while other systems usually deal with binary classification tasks between healthy and sick infant with only one specific disorder. Our cry-based diagnostic system has a hierarchical scheme that focuses into multi-pathology classification problem via combination of individual classifiers. Moreover, it is worthwhile mentioning that the chosen diseases have not been previously studied.
机译:哭泣是婴儿进入母亲肚子以外的世界时发出的第一声声音,这是新的健康生活的非常积极的信号。好吧,我们的长者可以说话,但是刚出生的婴儿还不够大。哭泣是婴儿能够表达出任何不适感的全部方法。最初阅读它时,首先想到的是为什么哭声对新生儿的医疗保健如此重要?尽管对婴儿啼哭的研究是在1960年代后期开创的,但从未有人想到可以从他们的哭声中识别出患病的婴儿。世界卫生组织的统计报告指出,先天畸形或先天缺陷影响着每年出生的33名婴儿中的大约1名,世界上几乎所有婴儿死亡都发生在发展中国家。因此,当务之急是要提供一种廉价的医疗保健系统,而对于低收入国家中有新生婴儿的贫穷母亲来说,无需复杂而先进的技术就能在出生后的头几个月中存活更多的婴儿。尽管事实上有很多产妇问题会增加新生儿并发症和异常的风险,但我们很好奇地检查了婴儿哭声中隐藏信息的能力,以阐明婴儿的生理解剖结构和心理状况。这种无创诊断系统背后的创新思想是基于从以往研究中获得的证据,即婴儿的啼哭可能会区分健康婴儿和生病婴儿的潜在能力。这个创新的想法可以解决关键的全球健康和发展问题。本研究的目的是开发一种基于新生儿啼哭的诊断系统,以对具有不同病理状况的健康和患病婴儿进行分类。首先,病理状态的明智选择和婴儿啼哭数据库的收集是必要的,并且仍在完善婴儿啼哭数据库的过程中。在当今的许多应用领域中,通常都不可避免地需要具有高维数和小样本量的数据。小样本量问题和降维方法都得到了广泛的研究,但是数据不平衡和小样本量的结合对社区提出了新的挑战。在这种情况下,学习算法在出现这种形式的不平衡时,常常无法归纳出样本空间上的归纳规则。实际上,小样本量和高维数的组合阻碍了学习,因为在有限的样本上很难形成高度特征的共轭。在下一部分中,系统中将考虑数据预处理,包括适当地以最佳可能的精度选择和提取病理信息,然后针对每种病理条件对它们进行量化,而无需任何人工干预。为了获得嵌入在哭声信号中的信息的全部好处,在这项研究中,将分别对呼气和吸气哭声进行梅尔频率倒谱系数(MFCC)分析。为了避免在标记相应语料库的边界时需要人工,对于理想的基于哭声的诊断系统,需要自动标记哭声信号。但是,为减轻此研究中的分割任务,到目前为止,它是手动执行的。有限混合是一种灵活而强大的概率工具,用于在所有可用的建模和分类任务方法中对单变量和多变量数据进行建模。在这方面,我们提出了高斯混合模型(GMM),它是具有一种状态的隐马尔可夫模型(HMM)的特例,作为根据提取的特征流的哭声信号的新表示。本文的下一部分致力于增强GMM的学习,而GMM通常是使用迭代期望最大化(EM)算法进行训练的。但是,考虑到在某些病理条件下训练样本量较小导致过度拟合的风险,以及传统基于EM的重新估计算法中混合数量固定的事实,引入了一种基于Boosting算法的新学习方法以递增和递归的方式学习不断增长的混合模型。;说话人识别和验证系统中使用的通用背景模型(UBM)的思想被用来代表婴儿啼哭信号的一般特征。然后,采用增强混合学习(BML)方法的一种变体,以便通过修改GMM参数从GMM-UBM导出每种已入院疾病的子类模型。该设计的关键是将基于婴儿啼哭记录中的呼气和吸气声音的两个子系统融合到一个有效的系统中。由于存在多个,因此预期此类系统会更可靠,(相当)独立的证据。我们提出对数似然比分数融合以消除对特征兼容性和刚性融合的担忧。;除上述所有建模和学习方法外,我们的工作与以前的工作不同之处在于,其他系统通常处理二进制分类仅患有一种特定疾病的健康和患病婴儿之间的任务。我们基于哭泣的诊断系统具有分层方案,该方案通过组合各个分类器来关注多病理学分类问题。此外,值得一提的是,以前尚未研究过所选的疾病。

著录项

  • 作者

    Farsaie Alaie, Hesam.;

  • 作者单位

    Ecole de Technologie Superieure (Canada).;

  • 授予单位 Ecole de Technologie Superieure (Canada).;
  • 学科 Artificial intelligence.;Electrical engineering.;Applied mathematics.
  • 学位 D.Eng.
  • 年度 2015
  • 页码 197 p.
  • 总页数 197
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号