Pre-trained Contextualized Representation for Chinese Conversation Topic Classification

机译：汉语会话话题分类的预训练情境表示

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Topic classification plays an important role in facilitating security-related applications, which can help people reduce data scope and acquire key information quickly. Conversation is one of the important ways of communication between people. The utterances in a conversation may contain vital clues, such as people's opinions, emotions and political slants. To explore more effective approaches for Chinese conversational topic classification, in this paper, we propose a neural network architecture with pre-trained contextualized representation. We firstly apply pretrained BERT model to fine-tune and generate the conversational embeddings, which are the inputs of our neural network models. Then we design several models based on neural networks to extract task-oriented advanced features for topic classification. Experimental results indicate that the models based on our neural network architecture all outperform the baseline only fine-tuned with the pre-trained BERT model. It demonstrates that the pretrained representations are effective to Chinese conversational topic classification, and the proposed architecture can further capture the salient features from the representations. And we release the code and dataset of this paper that can be obtained from https://github.comjoe9/pretrained_representation.

机译：主题分类在促进与安全相关的应用程序中起着重要作用，可以帮助人们缩小数据范围并快速获取关键信息。对话是人与人之间交流的重要方式之一。对话中的话语可能包含重要的线索，例如人们的意见，情感和政治倾向。为了探索汉语会话话题分类的更有效方法，本文提出了一种具有预训练的上下文表示的神经网络体系结构。我们首先应用预训练的BERT模型来微调并生成对话嵌入，这是我们神经网络模型的输入。然后，我们基于神经网络设计了几种模型，以提取面向任务的高级特征进行主题分类。实验结果表明，基于我们的神经网络架构的模型均优于仅使用预训练的BERT模型进行微调的基线。结果表明，经过预训练的表征对于汉语会话话题的分类是有效的，并且所提出的体系结构可以进一步从表征中捕捉到显着特征。然后，我们发布了可从https://github.com/njoe9/pretrained_representation获得的本文代码和数据集。

著录项

来源
《IEEE International Conference on Intelligence and Security Informatics》|2019年|122-127|共6页
会议地点
作者
Yujun Zhou; Changliang Li; Saike He; Xiaoqi Wang; Yiming Qiu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Bit error rate; Task analysis; Neural networks; Feature extraction; Context modeling; Adaptation models; Training;

机译：误码率;任务分析;神经网络;特征提取;上下文建模;适应模型;训练;

相似文献

外文文献
中文文献
专利

1. Contextual Sentiment Topic Model for Adaptive Social Emotion Classification [J] . Rao Yanghui Intelligent Systems, IEEE . 2016,第1期

机译：适应性社会情绪分类的情境主题模型
2. Chinese Auto-Clustering of Oral Conversation Corpus Based on Contextual Features [J] . Yue Chen, Qi Chen, Minghu Jiang Signal Processing Research . 2015,第期

机译：基于上下文特征的口语会话语料库中文自动聚类
3. Contextual Word Representation and Deep Neural Networks-based Method for Arabic Question Classification [J] . Alami Hamza, Noureddine En-Nahnahi, Said El Alaoui Ouatik Advances in Science, Technology and Engineering Systems . 2020,第5期

机译：基于词语表示和基于深度神经网络的阿拉伯语问题分类方法
4. Pre-trained Contextualized Representation for Chinese Conversation Topic Classification [C] . Yujun Zhou, Changliang Li, Saike He, IEEE International Conference on Intelligence and Security Informatics . 2019

机译：用于中文对话主题分类的预先训练的上下文化表示
5. Up from the roots: Contextualizing medicinal plant classifications of Tibetan doctors in Rgyalthang, PRC (China). [D] . Glover, Denise M. 2005

机译：扎根：在中华人民共和国（中国）Rgyalthang的藏族医生将药用植物分类进行语境化。
6. Single-cell conventional pap smear image classification using pre-trained deep neural network architectures [O] . Mohammed Aliy Mohammed, Fetulhak Abdurahman, Yodit Abebe Ayalew 2021

机译：使用预先培训的深神经网络架构进行单细胞常规PAP涂片图像分类
7. Improved Word Sense Disambiguation Using Pre-Trained Contextualized Word Representations [O] . Christian Hadiwinoto, Hwee Tou Ng, Wee Chung Gan 2019

机译：使用预先训练的上下文化词表示改进了词语歧义

Pre-trained Contextualized Representation for Chinese Conversation Topic Classification

摘要

著录项

相似文献

相关主题

期刊订阅