首页> 外文会议>International Conference on Natural Language Processing and Chinese Computing >An Effective Approach for Chinese News Headline Classification Based on Multi-representation Mixed Model with Attention and Ensemble Learning
【24h】

An Effective Approach for Chinese News Headline Classification Based on Multi-representation Mixed Model with Attention and Ensemble Learning

机译:基于多标识混合模型的中国新闻标题分类的有效方法,关注和集合学习

获取原文

摘要

In NLPCC 2017 shared task two, we propose an efficient approach for Chinese news headline classification based on multi-representation mixed model with attention and ensemble learning. Firstly, we model the headline semantic both on character and word level via Bi-directional Long Short-Term Memory (BiLSTM), with the concatenation of output states from hidden layer as the semantic representation. Meanwhile, we adopt attention mechanism to highlight the key characters or words related to the classification decision, and we get a preliminary test result. Then, for samples with lower confidence level in the preliminary test result, we utilizing ensemble learning to determine the final category of the whole test samples by sub-models voting. Testing on the NLPCC 2017 official test set, the overall F1 score of our model eventually reached 0.8176, which can be ranked No. 3.
机译:在NLPCC 2017共享任务二方面,我们提出了一种基于多标识混合模型的中文新闻标题分类方法,注意力和集合学习。首先,我们通过双向长期短期内存(BILSTM)在字符和字级别上模拟标题语义,并将输出状态从隐藏层作为语义表示的串联连接。同时,我们采用注意机制来突出显示与分类决定相关的关键字或单词,我们得到了初步测试结果。然后,对于在初步测试结果中具有较低置信水平的样本,我们利用集合学习通过子模型投票确定整个测试样本的最终类别。在NLPCC 2017官方测试集上测试,我们模型的总体F1得分最终达到0.8176,可以排名第3。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号