Word-Level and Character-Level Mixed Features for Chinese Short Text Classification


Abstract

This paper proposes a method that addresses the limited representational power of using character-level features or word-level features alone. Because short texts are brief, sparse, and strongly context-dependent, the method takes word-level vectors and character-level vectors as inputs simultaneously and encodes sentence semantics with two Long Short-Term Memory networks (LSTMs) or bidirectional LSTMs (BiLSTMs). The representation of the entire sentence combines the two outputs obtained from the word-level vectors and the character-level vectors. Experiments on Chinese short text classification show that word embeddings and character embeddings complement each other in representing sentence semantics, which helps improve classification performance.
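The two-encoder idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how the two outputs are combined, so concatenation of the final hidden states is assumed here, and the dimensions, random (untrained) weights, hand-segmented example sentence, and all class and function names are hypothetical. Biases are omitted for brevity.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def softmax(scores):
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def random_matrix(rows, cols, rng):
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


def matvec(matrix, vec):
    return [sum(w * v for w, v in zip(row, vec)) for row in matrix]


class Embedding:
    """Toy lookup table; a real system would load trained embeddings."""

    def __init__(self, dim, rng):
        self.dim, self.rng, self.table = dim, rng, {}

    def lookup(self, tokens):
        for tok in tokens:
            if tok not in self.table:
                self.table[tok] = [self.rng.uniform(-0.5, 0.5) for _ in range(self.dim)]
        return [self.table[tok] for tok in tokens]


class LSTMEncoder:
    """Single-layer LSTM that returns its final hidden state (assumed pooling)."""

    def __init__(self, input_dim, hidden_dim, rng):
        self.hidden_dim = hidden_dim
        # One weight matrix per gate, applied to the concatenation [x; h].
        self.weights = {
            gate: random_matrix(hidden_dim, input_dim + hidden_dim, rng)
            for gate in ("input", "forget", "output", "candidate")
        }

    def encode(self, vectors):
        h = [0.0] * self.hidden_dim
        c = [0.0] * self.hidden_dim
        for x in vectors:
            xh = list(x) + h
            i = [sigmoid(v) for v in matvec(self.weights["input"], xh)]
            f = [sigmoid(v) for v in matvec(self.weights["forget"], xh)]
            o = [sigmoid(v) for v in matvec(self.weights["output"], xh)]
            g = [math.tanh(v) for v in matvec(self.weights["candidate"], xh)]
            c = [fj * cj + ij * gj for fj, cj, ij, gj in zip(f, c, i, g)]
            h = [oj * math.tanh(cj) for oj, cj in zip(o, c)]
        return h


rng = random.Random(0)
EMBED_DIM, HIDDEN_DIM, NUM_CLASSES = 8, 6, 3  # arbitrary illustrative sizes

word_emb, char_emb = Embedding(EMBED_DIM, rng), Embedding(EMBED_DIM, rng)
word_lstm = LSTMEncoder(EMBED_DIM, HIDDEN_DIM, rng)
char_lstm = LSTMEncoder(EMBED_DIM, HIDDEN_DIM, rng)
classifier = random_matrix(NUM_CLASSES, 2 * HIDDEN_DIM, rng)

words = ["我", "喜欢", "北京"]            # hand-segmented example sentence
chars = [ch for w in words for ch in w]  # the same sentence split into characters

# Encode each view with its own LSTM, then concatenate (assumed combination).
sentence_vec = word_lstm.encode(word_emb.lookup(words)) + char_lstm.encode(char_emb.lookup(chars))
probs = softmax(matvec(classifier, sentence_vec))
print(probs)
```

A bidirectional variant could be sketched in the same way by running a second encoder over the reversed sequence and joining both final states; with untrained weights the printed probabilities carry no meaning beyond showing the data flow.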


Bibliographic details

Source
IEEE International Conference on Computer and Communications, 2018, pp. 2344-2348 (5 pages)
Authors
Jingwen Li; Xinxin Wan; Sujuan Qin
Original format: PDF
Keywords
Semantics; Training; Mathematical model; Text categorization; Logic gates; Recurrent neural networks; Deep learning


Similar literature


1. Chinese text classification based on character-level CNN and SVM [J]. Huaiguang Wu, Daiyi Li, Ming Cheng. International journal of intelligent information and database systems. 2019, No. 3
2. Orthographic features for emotion classification in Chinese in informal short texts [J]. Chen I-Hsuan, Long Yunfei, Lu Qin. Language Resources and Evaluation. 2021, No. 2
3. Chinese Short-Text Classification Based on Topic Model with High-Frequency Feature Expansion [J]. Hu Y. Jun, Jiang J. Xin, Chang H. You. Journal of Multimedia. 2013, No. 4
4. Word-Level and Character-Level Mixed Features for Chinese Short Text Classification [C]. Jingwen Li, Xinxin Wan, Sujuan Qin. IEEE International Conference on Computer and Communications. 2018
5. Improving Sentiment Classification for Arabic Short Text Using Deep Learning Approaches [D]. Alwehaibi, Ali. 2021
6. Text mining for the Vaccine Adverse Event Reporting System: medical text classification using informative feature selection [O]. Taxiarchis Botsis, Michael D Nguyen, Emily Jane Woo. 2011
7. Combining Word-Level and Character-Level Representations for Relation Classification of Informal Text [O]. Dongyun Liang, Weiran Xu, Yinge Zhao. 2017
