A Free Synthetic Corpus for Speaker Diarization Research

机译：用于说话人差异化研究的免费合成语料库

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

A synthetic corpus of dialogs was constructed from the Libri-Speech corpus, and is made freely available for diarization research. It includes over 90 h of training data, and over 9 h each of development and test data. Both 2-person and 3-person dialogs, with and without overlap, are included. Timing information is provided in several formats, and includes not only speaker segmentations, but also phoneme segmentations. As such, it is a useful starting point for general, particularly early-stage, diarization system development.

机译：从Libri-Speech语料库构建了一个对话的综合语料库，并免费提供给进行差异化研究。它包括90多个小时的培训数据，以及每个9个小时以上的开发和测试数据。包括2人对话和3人对话，有或没有重叠。定时信息以几种格式提供，不仅包括说话者细分，还包括音素细分。这样，它对于一般的，特别是早期的二值化系统开发是有用的起点。

著录项

来源
《International Conference on speech and computer》|2018年|113-122|共10页
会议地点
作者
Erik Edwards; Michael Brenndoerfer; Amanda Robinson; Najmeh Sadoughi; Greg P. Finley; Maxim Korenevsky; Nico Axtmann; Mark Miller; David Suendermann-Oeft;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Speaker diarization Speech activity detection; Open-source corpora;

机译：说话人区分语音活动检测;开源语料库;

相似文献

外文文献
中文文献
专利

1. Hybridization DE with K-means for speaker clustering in speaker diarization of broadcasts news [J] . Dabbabi Karim, Hajji Salah, Cherif Adnen International journal of speech technology . 2019,第4期

机译：与K-means的混合DE用于演讲者广播新闻的演讲者聚类
2. Probabilistic Speaker Diarization With Bag-of-Words Representations of Speaker Angle Information [J] . Ishiguro K., Yamada T., Araki S., Audio, Speech, and Language Processing, IEEE Transactions on . 2012,第2期

机译：说话者角度信息的词袋表示概率的说话人区分
3. Development of a Speaker Diarization System for Speaker Tracking in Audio Broadcast News: a Case Study [J] . Mihelic France, Vesnicer Bostjan, Zibert Janez Journal of computing and information technology . 2008,第3期

机译：音频广播新闻中演讲者跟踪的演讲者区分系统的开发：一个案例研究
4. A Free Synthetic Corpus for Speaker Diarization Research [C] . Erik Edwards, Michael Brenndoerfer, Amanda Robinson, International Conference on Speech and Computer . 2018

机译：一种免费的扬声器日复变化研究语料库
5. Automatic Speaker Recognition and Diarization in Co-Channel Speech [D] . Shokouhi, Navid. 2017

机译：同频道语音中的说话人自动识别和区分
6. Supervised Speaker Diarization Using Random Forests: A Tool for Psychotherapy Process Research [O] . Lukas Fürer, Nathalie Schenk, Volker Roth, 2020

机译：使用随机森林监督扬声器日期：一种心理治疗过程研究的工具
7. End-to-End Neural Speaker Diarization with Permutation-Free Objectives [O] . Yusuke Fujita, Naoyuki Kanda, Shota Horiguchi, 2019

机译：终端到底神经扬声器和无置换目标的日益衰退
8. Robust Speech Processing & Recognition: Speaker ID, Language ID, Speech Recognition/Keyword Spotting, Diarization/Co-Channel/Environmental Characterization, Speaker State Assessment. [R] . Hansen, J. H. 2015

机译：强大的语音处理和识别：说话者ID，语言ID，语音识别/关键字识别，Diarization / Co-Channel /环境表征，说话者状态评估。

A Free Synthetic Corpus for Speaker Diarization Research

摘要

著录项

相似文献

相关主题

期刊订阅