首页> 外文会议>IEEE International Conference on Acoustics, Speech, and Signal Processing >AN EMPIRICAL INVESTIGATION OF SPARSE LOG-LINEAR MODELS FOR IMPROVED DIALOGUE ACT CLASSIFICATION
【24h】

AN EMPIRICAL INVESTIGATION OF SPARSE LOG-LINEAR MODELS FOR IMPROVED DIALOGUE ACT CLASSIFICATION

机译:改进对话法分类的稀疏对数线性模型的实证研究

获取原文

摘要

Previous work on dialogue act classification have primarily focused on dense generative and discriminative models. However, since the automatic speech recognition (ASR) outputs are often noisy, dense models might generate biased estimates and overfit to the training data. In this paper, we study sparse modeling approaches to improve dialogue act classification, since the sparse models maintain a compact feature space, which is robust to noise. To test this, we investigate various element-wise frequentist shrinkage models such as lasso, ridge, and elastic net, as well as structured sparsity models and a hierarchical sparsity model that embed the dependency structure and interaction among local features. In our experiments on a real-world dataset, when augmenting N-best word and phone level ASR hypotheses with confusion network features, our best sparse log-linear model obtains a relative improvement of 19.7% over a rule-based baseline, a 3.7% significant improvement over a traditional non-sparse log-linear model, and outperforms a state-of-the-art SVM model by 2.2%.
机译:以前的对话法案分类的工作主要集中在密集的生成和歧视模型上。然而,由于自动语音识别(ASR)输出通常嘈杂,因此密集模型可能会产生偏置估计和过度装备到训练数据。在本文中,我们研究了稀疏建模方法来改进对话法案分类,因为稀疏模型保持紧凑的特征空间,这是强大的噪声。为了测试这一点,我们调查各种元素 - 明智的频率收缩型号,如套索,脊和弹性网,以及结构化的稀疏模型和分层稀疏模型,嵌入了本地特征之间的依赖关系和交互。在我们对真实数据集的实验中,在增强N最佳的单词和电话级ASR假设时,我们最好的稀疏对数线性模型在基于规则的基线上获得了19.7%的相对提升,3.7%对传统的非稀疏对数线性模型的显着改进,并且优于最先进的SVM模型2.2%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号