首页> 外文会议>International Conference on Mobile Data Management >Utterance Classification Using Linguistic and Non-Linguistic Information for Network-Based Speech-To-Speech Translation Systems
【24h】

Utterance Classification Using Linguistic and Non-Linguistic Information for Network-Based Speech-To-Speech Translation Systems

机译:用语言和非语言信息对基于网络的语音翻译系统的语言和非语言信息进行话语分类

获取原文

摘要

Network-based mobile services, such as speech-to-speech translation and voice search, enable the construction of large-scale log database including speech. We have developed a smartphone application called VoiceTra for speech-to-speech translation and have collected 10,000,000 utterances so far. This huge corpus is unique in size and spatio-temporal information; it contains information on anonymized user locations. This spatiotemporal corpus can be used for improving the accuracy of its speech recognition and machine translation, and it will open the door for the study of the location dependency of vocabulary and new applications for location-based services. This paper first analyzes the corpus and then presents a novel method for classifying utterances using linguistic and non-linguistic information. L2-regularized Logistic Regression is used for utterance classification. Our experiments performed on the VoiceTra log corpus revealed that our proposed method outperformed baseline methods in terms of F measure.
机译:基于网络的移动服务,例如语音到语音转换和语音搜索,可以构建包括语音的大规模日志数据库。我们开发了一个名为VoiceTra的智能手机应用程序,用于语音转换,并到目前为止收集了10,000,000个的话语。这种巨大的尸体在尺寸和时空信息中是独一无二的;它包含有关匿名用户位置的信息。这种时空语料库可用于提高其语音识别和机器翻译的准确性,它将为基于位置的服务的词汇和新应用的位置依赖,打开门。本文首先通过语言和非语言信息分析了语料库,然后提出了一种用于分类话语的新方法。 L2 - 正则逻辑回归用于话语分类。我们在VoiceTra Log Corpus上进行的实验表明,我们所提出的方法在F度量方面表现出基线方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号