INTERSPEECH 2012

Improved Prediction of Japanese Word Accent Sandhi Using CRF

Abstract

In Japanese, every content word has its own mora-based H/L pitch pattern when uttered in isolation, called its accent type. When a written sentence is read aloud, however, this lexical H/L pattern often changes according to context, a phenomenon known as word accent sandhi. In our previous work, we developed an accent sandhi predictor using CRF [1]. In this paper, we improve that predictor through feature engineering, focusing especially on phrases that include numerals and phrases that include loanwords, because our previous work showed relatively low prediction performance for those phrases. To optimize the features used for the CRF, it is critical to take the mechanism of word accent sandhi into account. We review linguistic and technical literature that attempted to characterize accent sandhi in phrases including numerals and loanwords, and we redesign the features to reflect these characteristics. Experiments show that the proposed predictor improves performance by 37% and 41% in relative terms, respectively.