首页> 外文会议>Workshop on Automatic Speech Recognition and Understanding >FIXED-DIMENSIONAL ACOUSTIC EMBEDDINGS OF VARIABLE-LENGTH SEGMENTS IN LOW-RESOURCE SETTINGS

【24h】

FIXED-DIMENSIONAL ACOUSTIC EMBEDDINGS OF VARIABLE-LENGTH SEGMENTS IN LOW-RESOURCE SETTINGS

机译：低资源设置中可变长度段的固定尺寸声学嵌入

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Measures of acoustic similarity between words or other units are critical for segmental exemplar-based acoustic models, spoken term discovery, and query-by-example search. Dynamic time warping (DTW) alignment cost has been the most commonly used measure, but it has well-known inadequacies. Some recently proposed alternatives require large amounts of training data. In the interest of finding more efficient, accurate, and low-resource alternatives, we consider the problem of embedding speech segments of arbitrary length into fixed-dimensional spaces in which simple distances (such as cosine or Euclidean) serve as a proxy for linguistically meaningful (phonetic, lexical, etc.) dissimilarities. Such embeddings would enable efficient audio indexing and permit application of standard distance learning techniques to segmental acoustic modeling. In this paper, we explore several supervised and unsupervised approaches to this problem and evaluate them on an acoustic word discrimination task. We identify several embedding algorithms that match or improve upon the DTW baseline in low-resource settings.

机译：单词或其他单位之间的声学相似度对于基于分段示例的声学模型，说话术语发现和查询逐个搜索至关重要。动态时间翘曲（DTW）对齐成本是最常用的措施，但它具有众所周知的不足。一些最近提出的替代方案需要大量的培训数据。为了找到更高效，准确和低资源的替代方案，我们考虑将任意长度的语音段嵌入到固定尺寸空间中，其中简单的距离（如余弦或欧几里德）用作语言上有意义的代理（语音，词汇等）异化。这种嵌入式将实现高效的音频索引并允许将标准距离学习技术应用于分段声学建模。在本文中，我们探讨了这个问题的几种监督和无人监督的方法，并在声学词歧视任务上评估它们。我们识别几个嵌入算法，匹配或改进DTW基线在低资源设置中。

著录项

来源
《Workshop on Automatic Speech Recognition and Understanding 》|2013年||共6页
会议地点
作者
Keith Levin; Katharine Henry; Aren Jansen; Karen Livescu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TN912.3-532;
关键词
Fixed-dimensional embedding; segmental acoustic modeling; query-by-example search; speech indexing;

机译：固定维嵌入;分段声学建模;逐个示例搜索;语音索引;

相似文献

外文文献
中文文献
专利

1. Including ultrasound scans in antenatal care in low-resource settings: Considering the complementarity of obstetric ultrasound screening and maternity waiting homes in strengthening referral systems in low-resource, rural settings [J] . Swanson David L., Franklin Holly L., Swanson Jonathan O., Seminars in perinatology . 2019 ,第5期

机译：在低资源环境中的产前护理中包括超声波扫描：考虑到产科超声波筛查和产妇等待家庭在加强低资源，农村环境中的推荐系统中的互补性
2. The quest for a framework for sustainable and institutionalised priority-setting for health research in a low-resource setting: the case of Zambia [J] . Lydia Kapiriri, Pascalina Chanda-Kapata Health Research Policy and Systems . 2018 ,第1期

机译：在低资源环境中追求可持续和制度化优先级的框架，以便在低资源环境中进行健康研究：赞比亚的案例
3. Cardiovascular risk assessment in low-resource settings: A consensus document of the European Society of Hypertension Working Group on Hypertension and Cardiovascular Risk in Low Resource Settings [J] . ModestiP.A., AgostoniP., AgyemangC., Journal of hypertension . 2014 ,第5期

机译：资源贫乏地区的心血管风险评估：欧洲高血压学会高血压和资源贫乏地区心血管疾病风险工作组的共识文件
4. Fixed-dimensional acoustic embeddings of variable-length segments in low-resource settings [C] . Levin Keith, Henry Katharine, Jansen Aren, IEEE Workshop on Automatic Speech Recognition and Understanding . 2013

机译：低资源设置中可变长度段的固定维声学嵌入
5. Parallel Sentence Detection in Comparable Corpora with Bilingual Word Embeddings for Low-Resource Languages [D] . Cadigan, John. 2018

机译：与低资源语言的双语单词嵌入式的同类语料中的并行句子检测
6. Simulation and team training embedded nurse mentoring programme and improvement in intrapartum and newborn care in a low-resource setting in Bihar India [O] . Rakesh Ghosh, Hilary Spindler, Jessica Dyer, 2020

机译：嵌入式护士指导计划的模拟和团队培养媒体媒体指导计划及其在印度的低资源环境中携带的indramartum和新生儿的改进
7. Fixed-dimensional acoustic embeddings of variable-length segments in low-resource settings [O] . Keith Levin, Katharine Henry, Aren Jansen, 2013

机译：低资源环境中可变长度段的固定声学嵌入

FIXED-DIMENSIONAL ACOUSTIC EMBEDDINGS OF VARIABLE-LENGTH SEGMENTS IN LOW-RESOURCE SETTINGS

摘要

著录项

相似文献

相关主题

期刊订阅