首页> 外文会议>International workshop on spoken dialogue systems technology >Enabling Spoken Dialogue Systems for Low-Resourced Languages—End-to-End Dialect Recognition for North Sami
【24h】

Enabling Spoken Dialogue Systems for Low-Resourced Languages—End-to-End Dialect Recognition for North Sami

机译:为低资源语言的口语对话系统 - 北萨米的端到端方言识别

获取原文

摘要

In this paper, we tackle the challenge of identifying dialects using deep learning for under-resourced languages. Recent advances in spoken dialogue technology have been strongly influenced by the availability of big corpora, while our goal is to work on the spoken interactive application for the North Sami language, which is classified as one of the less-resourced languages spoken in Northern Europe. North Sami has various variations and dialects which are influenced by the majority languages of the areas in which it is spoken: Finnish and Norwegian. To provide reliable and accurate speech components for an interactive system, it is important to recognize the speakers with their Finnish or Norwegian accent. Conventional approaches compute universal statistical models which require a large amount of data to form reliable statistics, and thus they are vulnerable to small data where there is only a limited number of utterances and speakers available. In this paper we will discuss dialect and accent recognition in under-resourced context, and focus on training an attentive network for leveraging unlabeled data in a semi-supervised scenario for robust feature learning. Validation of our approach is done via two DigiSami datasets: conversational and read corpus.
机译:在本文中,我们使用深度学习为资源不足的语言来解决识别方言的挑战。口头对话技术的最新进展受到大公司可用性的强烈影响,而我们的目标是致力于北萨米语的口头互动申请,该申请被归类为北欧中所说的较低资源的语言之一。北萨米有各种各样的变体和方言,受到它所说的主要语言的各种变化和方言:芬兰和挪威语。为了为交互式系统提供可靠和准确的语音组件,重要的是要用芬兰语或挪威口音识别扬声器。传统方法计算需要大量数据以形成可靠统计数据的通用统计模型,因此它们容易受到只有有限数量的话语和扬声器的小数据。在本文中,我们将讨论资源不足的上下文中的方言和重点识别,并专注于培训专注网络,以利用鲁棒特征学习的半监督场景中的未标记数据。我们的方法验证是通过两个Digisami数据集完成的:会话和读取语料库。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号