首页> 外文会议>IEEE International Conference on Intelligence and Security Informatics >Pre-trained Contextualized Representation for Chinese Conversation Topic Classification
【24h】

Pre-trained Contextualized Representation for Chinese Conversation Topic Classification

机译:汉语会话话题分类的预训练情境表示

获取原文

摘要

Topic classification plays an important role in facilitating security-related applications, which can help people reduce data scope and acquire key information quickly. Conversation is one of the important ways of communication between people. The utterances in a conversation may contain vital clues, such as people's opinions, emotions and political slants. To explore more effective approaches for Chinese conversational topic classification, in this paper, we propose a neural network architecture with pre-trained contextualized representation. We firstly apply pretrained BERT model to fine-tune and generate the conversational embeddings, which are the inputs of our neural network models. Then we design several models based on neural networks to extract task-oriented advanced features for topic classification. Experimental results indicate that the models based on our neural network architecture all outperform the baseline only fine-tuned with the pre-trained BERT model. It demonstrates that the pretrained representations are effective to Chinese conversational topic classification, and the proposed architecture can further capture the salient features from the representations. And we release the code and dataset of this paper that can be obtained from https://github.comjoe9/pretrained_representation.
机译:主题分类在促进与安全相关的应用程序中起着重要作用,可以帮助人们缩小数据范围并快速获取关键信息。对话是人与人之间交流的重要方式之一。对话中的话语可能包含重要的线索,例如人们的意见,情感和政治倾向。为了探索汉语会话话题分类的更有效方法,本文提出了一种具有预训练的上下文表示的神经网络体系结构。我们首先应用预训练的BERT模型来微调并生成对话嵌入,这是我们神经网络模型的输入。然后,我们基于神经网络设计了几种模型,以提取面向任务的高级特征进行主题分类。实验结果表明,基于我们的神经网络架构的模型均优于仅使用预训练的BERT模型进行微调的基线。结果表明,经过预训练的表征对于汉语会话话题的分类是有效的,并且所提出的体系结构可以进一步从表征中捕捉到显着特征。然后,我们发布了可从https://github.com/njoe9/pretrained_representation获得的本文代码和数据集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号