Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology

Improving unsupervised neural aspect extraction for online discussions using out-of-domain classification



Abstract

Deep learning architectures based on self-attention have recently achieved and surpassed state-of-the-art results in the tasks of unsupervised aspect extraction and topic modeling. While models such as neural attention-based aspect extraction (ABAE) have been successfully applied to user-generated texts, they are less coherent when applied to traditional data sources such as news articles and newsgroup documents. In this work, we introduce a simple approach based on sentence filtering in order to improve the topical aspects learned from newsgroup-based content without modifying the basic mechanism of ABAE. We train a probabilistic classifier to distinguish between out-of-domain texts (outer dataset) and in-domain texts (target dataset). Then, during data preparation, we filter out sentences that have a low probability of being in-domain and train the neural model on the remaining sentences. The positive effect of sentence filtering on topic coherence is demonstrated in comparison with aspect extraction models trained on unfiltered texts.
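As a rough illustration of the filtering step described in the abstract, the sketch below trains a logistic regression classifier on TF-IDF features to score how likely each sentence is to be in-domain, then discards low-probability sentences before training the aspect model. The classifier type, TF-IDF features, the 0.5 threshold, and the `train_abae` call are assumptions for illustration only; the paper specifies only a probabilistic out-of-domain classifier applied during data preparation.

```python
# Minimal sketch of out-of-domain sentence filtering, assuming scikit-learn.
# Dataset names, the 0.5 threshold, and train_abae() are illustrative
# placeholders, not the authors' exact setup.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression


def filter_in_domain(target_sentences, outer_sentences, threshold=0.5):
    """Keep only target-dataset sentences the classifier scores as in-domain."""
    # Label target (in-domain) sentences 1 and outer (out-of-domain) sentences 0.
    texts = list(target_sentences) + list(outer_sentences)
    labels = [1] * len(target_sentences) + [0] * len(outer_sentences)

    vectorizer = TfidfVectorizer(max_features=50000)
    features = vectorizer.fit_transform(texts)

    clf = LogisticRegression(max_iter=1000)
    clf.fit(features, labels)

    # Score each target sentence and drop those with low in-domain probability.
    probs = clf.predict_proba(vectorizer.transform(target_sentences))[:, 1]
    return [s for s, p in zip(target_sentences, probs) if p >= threshold]


# The filtered sentences would then be passed to the unmodified ABAE model, e.g.:
# train_abae(filter_in_domain(newsgroup_sentences, outer_sentences))
```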


