Combining Statistics-Based and CNN-Based Information for Sentence Classification

机译：结合基于统计数据和基于CNN的信息，用于句子分类

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Sentence classification, serving as the foundation of the subsequent text-based processing, continues attracting researchers attentions. Recently, with the great success of deep learning, convolutional neural network (CNN), a kind of common architecture of deep learning, has been widely used to this filed and achieved excellent performance. However, most CNN-based studies focus on using complex architectures to extract more effective category information, requiring more time in training models. With the aim to get better performance with less time cost on classification, this paper proposes two simple and effective methods by fully combining information both extracted from statistics and CNN. The first method is S-SFCNN, which combines statistical features and CNN-based probabilistic features of classification to build feature vectors, and then the vectors are used to train the logistic regression classifiers. And the second method is C-SFCNN, which combines CNN-based features and statistics-based probabilistic features of classification to build feature vectors. In the two methods, the Naive Bayes log-count ratios are selected as the text statistical features and the single-layer and single channel CNN is used as our CNN architecture. The testing results executed on 7 tasks show that our methods can achieve better performance than many other complex CNN models with less time cost. In addition, we summarized the main factors influencing the performance of our methods though experiment.

机译：句子分类，作为随后的基于文本的处理的基础，继续吸引研究人员的注意。最近，随着深度学习的巨大成功，卷积神经网络（CNN），一种深入学习的一种常见建筑，已被广泛用于这一提交并取得了良好的性能。然而，基于CNN的大多数基于CNN的研究侧重于使用复杂的架构提取更有效的类别信息，需要更多的培训模型时间。旨在通过较少的分类成本获得更好的性能，通过完全组合从统计和CNN提取的信息，提出了两个简单有效的方法。第一种方法是S-SFCNN，其组合了分类的统计特征和基于CNN的概率特征来构建特征向量，然后使用该向量来训练逻辑回归分类器。第二种方法是C-SFCNN，它将基于CNN的特征和基于统计学的概率特征组合到构建特征向量的分类。在这两种方法中，选择朴素的贝叶斯日志计数比作为文本统计特征，单层和单层CNN用作我们的CNN架构。在7个任务中执行的测试结果表明，我们的方法可以实现比许多其他复杂的CNN模型更好的性能，而且具有较少的时间成本。此外，我们总结了影响我们方法表现的主要因素虽然实验。

著录项

来源
《IEEE International Conference on Tools with Artificial Intelligence》|2016年|xxix 576 p. :|共7页
会议地点
作者
Lang Zhining; Gu Xiaozhuo; Zhou Quan; Xu Taizhong;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类人工智能理论;
关键词
Feature extraction; Probabilistic logic; Training; Computer architecture; Machine learning; Data mining; Logistics;

机译：特征提取;概率逻辑;培训;计算机架构;机器学习;数据挖掘;物流;

相似文献

外文文献
中文文献
专利

1. CNN-Based Land Cover Classification Combining Stratified Segmentation and Fusion of Point Cloud and Very High-Spatial Resolution Remote Sensing Image Data [J] . Keqi Zhou, Dongping Ming, Xianwei Lv, Remote Sensing . 2019,第17期

机译：基于CNN的土地覆盖分类，结合点云和超高分辨率的遥感影像数据的分层分割和融合
2. Sentence sentiment classification using fuzzy word matching combined with fuzzy sentiment classifier [J] . Marcin PIETRAS Przeglad Elektrotechniczny . 2015,第2期

机译：模糊词匹配与模糊情感分类器相结合的句子情感分类
3. Image classification using higher-order statistics-based ICA for NOAA multispectral satellite image [J] . T. Venkata Krishnamoorthy, G. Umamaheswara Reddy International journal of advanced intelligence paradigms . 2020,第1a2期

机译：使用高阶统计信息的ICA用于NOAA多光谱卫星图像的图像分类
4. Combining Statistics-Based and CNN-Based Information for Sentence Classification [C] . Lang Zhining, Gu Xiaozhuo, Zhou Quan, IEEE International Conference on Tools with Artificial Intelligence . 2016

机译：结合基于统计信息和基于CNN的信息进行句子分类
5. The Effects of Sentence-Combining on The Longitudinal Development of Syntactic Complexity in L2 Writing [D] . Marlowe, J. Paul. 2019

机译：句子组合对L2写作句法复杂性纵向发展的影响
6. pcPromoter-CNN: A CNN-Based Prediction and Classification of Promoters [O] . Muhammad Shujaat, Abdul Wahab, Hilal Tayara, 2020

机译：PCPROMoter-CNN：基于CNN的预测和启动子分类
7. An Algorithm Combining Statistics-based and Rules-based for Chunk Identification of Chinese Sentences [O] . Wang Rongbo, Chi Zheru 2006

机译：统计与规则相结合的汉语句子块识别算法

Combining Statistics-Based and CNN-Based Information for Sentence Classification

摘要

著录项

相似文献

相关主题

期刊订阅