首页> 外文会议>International Speech Communication Association >Addressing database mismatch in forensic speaker recognition with Ahumada III: a public real-casework database in Spanish

【24h】

Addressing database mismatch in forensic speaker recognition with Ahumada III: a public real-casework database in Spanish

机译：通过Ahumada III解决法医扬声器识别中的数据库不匹配：西班牙语中的公共案例组织数据库

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper presents and describes Ahumada III, a speech database in Spanish collected from real forensic cases. In its current release, the database presents 61 male speakers recorded using the systems and procedures followed by Spanish Guardia Civil police force. The paper also explores the usefulness of such a corpus for facing the important problem of database mis-match in speaker recognition, understood as the difference be-tween the database used for tuning a speaker recognition sys-tem and the data which the system will handle in operational conditions. This problem is typical in forensics, where variabil-ity in speech conditions may be extreme and difficult to model. Therefore, this work also presents a study evaluating the im-pact of such problem, for which a corpus quoted as NIST4M MultiMic MisMatch) has been constructed from NIST SRE 2006 data. NIST4M presents microphone data both in the enrolled models and in the test segments, allowing the genera-tion of trials in a variety of strongly mismatching conditions. Database mismatch is simulated by eliminating some micro-phone channels of interest from the background data, and com-puting scores with speech from such microphones in unknown testing conditions as usually happens in forensic speaker recog-nition. Finally, we show how the incorporation of Ahumada III as background data is useful to face database mismatch in real-world forensic conditions.

机译：本文提出并描述了由真正的法医案例收集的西班牙语演讲数据库的Ahumada III。在其目前的发布中，数据库呈现了使用系统和程序记录的61名男性扬声器，然后录制了西班牙监护人民警部队。本文还探讨了这种语料库，用于面对扬声器识别的数据库错误匹配的重要问题，理解为差异是用于调整扬声器识别系统的数据库和系统将处理的数据在运营条件下。这个问题在法医中典型，其中语音条件中的VariaBil-Ity可能是极端的，难以模拟。因此，这项工作还提出了一种评估此类问题的IM-PACT的研究，该问题引用了作为NIST4M多米错配的语料库）已经由NIST 2006数据构成。 NIST4M在注册的型号和测试段中展示了麦克风数据，允许在各种强不匹配的条件下进行试验。通过消除来自背景数据的一些微电话频道的数据库不匹配，以及在法医扬声器Recog-nition中的未知测试条件下的这种麦克风中的语音与语音的分数。最后，我们展示了如何纳入Ahumada III作为背景数据是有用的，对于在现实世界的法医条件下面对数据库不匹配。

著录项

来源
《International Speech Communication Association》|2008年||共4页
会议地点
作者
Daniel Ramos; Joaquin Gonzalez-Rodriguez; Javier Gonzalez-Dominguez; Jose Juan Lucena-Molina;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TN912.3-532;
关键词
database; NIST; conditions;

机译：数据库;nist;条件;

相似文献

外文文献
中文文献
专利

1. Sensitivity of likelihood-ratio based forensic voice comparison under mismatched conditions of within-speaker sample sizes across databases [J] . Ishihara Shunichi The Australian journal of forensic sciences . 2018,第4期

机译：基于扬声器样本尺寸的错配条件下基于似然比基于票据的敏感性
2. Spanish public awareness regarding DNA profile databases in forensic genetics: what type of DNA profiles should be included? [J] . Gamero JJ, Romero JL, Peralta JL, Journal of medical ethics . 2007,第10期

机译：西班牙公众对法医遗传学中DNA谱数据库的认识：应包括哪种类型的DNA谱？
3. Public participation in genetic databases: crossing the boundaries between biobanks and forensic DNA databases through the principle of solidarity [J] . Machado Helena, Silva Susana Journal of medical ethics . 2015,第10期

机译：公众参与基因数据库：通过团结原则跨越生物库和法医DNA数据库之间的界限
4. Addressing database mismatch in forensic speaker recognition with Ahumada III: a public real-casework database in Spanish [C] . Daniel Ramos, Joaquin Gonzalez-Rodriguez, Javier Gonzalez-Dominguez, International Speech Communication Association . 2008

机译：通过Ahumada III解决法医扬声器识别中的数据库不匹配：西班牙语中的公共案例组织数据库
5. The "Denver Multimedia Database": A Forensic Database for Digital Audio, Video, and Image Media [D] . Janson, Craig Andrew 2019

机译：“丹佛多媒体数据库”：数字音频，视频和图像媒体的法医数据库
6. Spanish public awareness regarding DNA profile databases in forensic genetics: what type of DNA profiles should be included? [O] . Joaquín J Gamero, Jose‐Luis Romero, Juan‐Luis Peralta, 2007

机译：西班牙公众对法医遗传学中的DNA谱数据库的认识：应包括哪种类型的DNA谱？
7. Addressing database mismatch in forensic speaker recognition with Ahumada III: A public real-casework database in Spanish [O] . Ramos, Daniel, González-Rodríguez, Joaquín, González Domínguez, Javier, 2008

机译：使用ahumada III解决法医说话人识别中的数据库不匹配：西班牙语的公共实际案例数据库
8. Soil Properties Database of Spanish Soils. Volume VIII.- Castilla-La Mancha (a): Toledo and Ciudad Real. [R] . Trueba, C., Millan, R., Schmid, T., 1999

机译：西班牙土壤的土壤特性数据库。第八卷 - 卡斯蒂利亚 - 拉曼查（a）：托莱多和雷阿尔城。

Addressing database mismatch in forensic speaker recognition with Ahumada III: a public real-casework database in Spanish

摘要

著录项

相似文献

相关主题

期刊订阅