Topic modeling in short-text using non-negative matrix factorization based on deep reinforcement learning

Shahbazi Zeinab; Byun Yung-Cheol

首页> 外文期刊>Journal of intelligent & fuzzy systems: Applications in Engineering and Technology >Topic modeling in short-text using non-negative matrix factorization based on deep reinforcement learning

【24h】

Topic modeling in short-text using non-negative matrix factorization based on deep reinforcement learning

机译：基于深度加强学习的非负矩阵分解的短文本模型主题建模

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Topic modeling for short texts is a challenging and interesting problem in the machine learning and knowledge discovery domains. Nowadays, millions of documents published on the internet from various sources. Internet websites are full of various topics and information, but there is a lot of similarity between topics, contents, and total quality of sources, which causes data repetition and gives the user the same information. Another issue is data sparsity and ambiguity because the length of the short text is limited, which causes unsatisfactory results and give irrelevant results to end-users. All these mentioned issues in short texts made an interesting topic for researchers to use machine learning and knowledge discovery techniques to discover underlying topics from a massive amount of data. In this paper, we propose a combination of deep reinforcement learning (RL) and semantics-assisted non-negative matrix factorization model to extract meaningful and underlying topics from short document contents. The main objective of this work is to reduce the problem of repetitive information and data sparsity in short texts to help the users to get meaningful and relevant contents. Furthermore, our propose model reviews an issue of the Seq2Seq approach based on the reinforcement learning perspective and provides a combination of reinforcement learning and SeaNMF formulation using the block coordinate descent algorithm. Moreover, we compare different real-world datasets by using numerical calculation and present a couple of state-of-art models to get better performance on short text document topic modeling. Based on experimental results and comparative analysis, our propose model outperforms the state of art techniques in terms of short document topic modeling.

机译：短文本的主题建模是机器学习和知识发现域中有挑战性和有趣的问题。如今，来自各种来源的互联网上发表了数百万的文件。 Internet网站充满了各种主题和信息，但主题，内容和源的总质量之间存在很多相似性，这导致数据重复并给出用户相同的信息。另一个问题是数据稀疏性和歧义，因为短文本的长度是有限的，这导致不令人满意的结果并对最终用户提供无关的结果。所有这些中提到的简短文本问题对研究人员来说，使用机器学习和知识发现技术来发现来自大量数据的基础主题。在本文中，我们提出了深度加强学习（RL）和语义辅助非负矩阵分解模型的组合，以从短文档内容中提取有意义和基础的主题。这项工作的主要目标是减少短文本中重复信息和数据稀疏问题的问题，以帮助用户获得有意义和相关的内容。此外，我们的建议模式根据加强学习的角度，通过块坐标阶级算法提供增强学习和SeanMF配方的组合。此外，我们通过使用数值计算来比较不同的现实数据集，并在几个最先进的模型中展示了在短文本文档主题建模上获得更好的性能。基于实验结果和比较分析，我们提出的模型在短文档主题建模方面优于现有技术的现实状态。

著录项

来源
《Journal of intelligent & fuzzy systems: Applications in Engineering and Technology》 |2020年第1期|共18页
作者
Shahbazi Zeinab; Byun Yung-Cheol;
展开▼
作者单位

Jeju Natl Univ Dept Comp Engn Jejusi 63243 Jeju Special Se South Korea;

Jeju Natl Univ Dept Comp Engn Jejusi 63243 Jeju Special Se South Korea;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类自动化系统;
关键词
Topic modeling; knowledge discovery; short text; non-negative matrix factorization; machine learning;

机译：主题建模;知识发现;短文本;非负矩阵分解;机器学习;
入库时间 2022-08-20 10:32:47

相似文献

外文文献
中文文献
专利

1. Topic modeling in short-text using non-negative matrix factorization based on deep reinforcement learning [J] . Shahbazi Zeinab, Byun Yung-Cheol Journal of intelligent & fuzzy systems: Applications in Engineering and Technology . 2020,第1期

机译：基于深度加强学习的非负矩阵分解的短文本模型主题建模
2. Deep Non-Negative Matrix Factorization Architecture Based on Underlying Basis Images Learning [J] . Zhao Yang, Wang Huiyang, Pei Jihong IEEE Transactions on Pattern Analysis and Machine Intelligence . 2021,第6期

机译：基于基础图像学习的深度非负矩阵分解架构
3. Automatic annotation of histopathological images using a latent topic model based on non-negative matrix factorization [J] . Angel Cruz-Roa, Gloria Diaz, Eduardo Romero, Journal of Pathology Informatics . 2011,第3期

机译：使用基于非负矩阵分解的潜在主题模型自动标注组织病理图像
4. Short-Text Feature Expansion and Classification Based on Non-negative Matrix Factorization [C] . Ling Zhang, Wenchao Jiang, Zhiming Zhao International Conference on Machine Learning for Cyber Security . 2020

机译：基于非负矩阵分解的短文本功能扩展和分类
5. Robotic Swarm Control Using Deep Reinforcement Learning Strategies Based on Mean-Field Models [D] . Kakish, Zahi. 2021

机译：基于平均场模型的深增强学习策略，机器人群控制
6. A Deep Non-negative Matrix Factorization Model for Big Data Representation Learning [O] . Zhikui Chen, Shan Jin, Runze Liu, 2021

机译：大数据表示学习的深度非负矩阵分解模型
7. Short-Text Feature Expansion and Classification Based on Non-negative Matrix Factorization [O] . Ling Zhang, Wenchao Jiang, Zhiming Zhao 2020

机译：基于非负矩阵分解的短文本功能扩展和分类

Topic modeling in short-text using non-negative matrix factorization based on deep reinforcement learning

摘要

著录项

相似文献

相关主题

期刊订阅