In this paper, a novel method is proposed to address the insufficient representational power of character-level features or word-level features used alone. Because short texts are brief, sparse, and strongly context-dependent, our method takes word-level vectors and character-level vectors as inputs simultaneously and encodes sentence semantics with two Long Short-Term Memory (LSTM) networks or two bidirectional LSTM (BiLSTM) networks. The representation of the entire sentence combines the two outputs from the word-level and character-level encoders. Our experiments on Chinese short text classification show that word embeddings and character embeddings complement each other in representing sentence semantics, which helps to improve classification performance on Chinese short text.
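The dual-channel design described above can be sketched as follows. This is a minimal illustration in PyTorch, not the authors' implementation: the class name, layer sizes, vocabulary sizes, and the choice to concatenate the two sentence vectors are all assumptions made for the example.

```python
import torch
import torch.nn as nn

class DualChannelClassifier(nn.Module):
    """Hypothetical sketch: encode a sentence twice (word-level and
    character-level), each with a BiLSTM, then combine the two sentence
    vectors before classification. All sizes are illustrative."""

    def __init__(self, word_vocab=20000, char_vocab=5000,
                 emb_dim=128, hidden=64, num_classes=10):
        super().__init__()
        self.word_emb = nn.Embedding(word_vocab, emb_dim)
        self.char_emb = nn.Embedding(char_vocab, emb_dim)
        # One BiLSTM per channel; each yields a 2 * hidden sentence vector.
        self.word_lstm = nn.LSTM(emb_dim, hidden,
                                 batch_first=True, bidirectional=True)
        self.char_lstm = nn.LSTM(emb_dim, hidden,
                                 batch_first=True, bidirectional=True)
        self.fc = nn.Linear(4 * hidden, num_classes)

    def forward(self, word_ids, char_ids):
        # Final hidden states of each channel: shape (2, batch, hidden).
        _, (hw, _) = self.word_lstm(self.word_emb(word_ids))
        _, (hc, _) = self.char_lstm(self.char_emb(char_ids))
        # Join forward/backward directions: (batch, 2 * hidden) each.
        hw = torch.cat([hw[0], hw[1]], dim=-1)
        hc = torch.cat([hc[0], hc[1]], dim=-1)
        # Combine word-level and character-level sentence representations.
        return self.fc(torch.cat([hw, hc], dim=-1))

model = DualChannelClassifier()
words = torch.randint(0, 20000, (4, 12))  # batch of 4 sentences, 12 words
chars = torch.randint(0, 5000, (4, 30))   # same sentences, 30 characters
logits = model(words, chars)
print(logits.shape)  # torch.Size([4, 10])
```

Concatenation is only one way to combine the two channels; averaging or a gating layer would also fit the description in the abstract.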