首页> 外国专利> END-TO-END SPEAKER RECOGNITION USING DEEP NEURAL NETWORK

END-TO-END SPEAKER RECOGNITION USING DEEP NEURAL NETWORK

机译：使用深神经网络的端到端扬声器识别

页面导航

摘要
著录项
相似文献

摘要

The present invention is directed to a deep neural network (DNN) having a triplet network architecture, which is suitable to perform speaker recognition. In particular, the DNN includes three feed-forward neural networks, which are trained according to a batch process utilizing a cohort set of negative training samples. After each batch of training samples is processed, the DNN may be trained according to a loss function, e.g., utilizing a cosine measure of similarity between respective samples, along with positive and negative margins, to provide a robust representation of voiceprints.

机译：本发明涉及一种具有三重态网络架构的深神经网络（DNN），其适合于执行扬声器识别。特别地，DNN包括三个前馈神经网络，其根据利用群组的负训练样本进行批量培训。在处理每批训练样本之后，可以根据损耗函数训练DNN，例如，利用各个样本之间的相似性的余弦测量，以及正和负边缘，以提供鲁棒的声明图。

著录项

公开/公告号EP3501025B1

专利类型
公开/公告日2021-08-11

原文格式PDF
申请/专利权人 PINDROP SECURITY INC.;
展开▼

申请/专利号EP20170772184
发明设计人 KHOURY ELIE;GARLAND MATTHEW;
展开▼

申请日2017-09-11
分类号G10L17/04;G10L17/18;
国家 EP
入库时间 2022-08-24 20:33:09

相似文献

专利
外文文献
中文文献