Graph-induced restricted Boltzmann machines for document modeling

Tu Dinh Nguyen; Truyen Tran; Dinh Phung; Venkatesh Svetha

首页> 外文期刊>Information Sciences: An International Journal >Graph-induced restricted Boltzmann machines for document modeling

【24h】

Graph-induced restricted Boltzmann machines for document modeling

机译：图诱导受限玻尔兹曼机用于文档建模

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Discovering knowledge from unstructured texts is a central theme in data mining and machine learning. We focus on fast discovery of thematic structures from a corpus. Our approach is based on a versatile probabilistic formulation - the restricted Boltzmann machine (RBM) - where the underlying graphical model is an undirected bipartite graph. Inference is efficient - document representation can be computed with a single matrix projection, making RBMs suitable for massive text corpora available today. Standard RBMs, however, operate on bag-of-words assumption, ignoring the inherent underlying relational structures among words. This results in less coherent word thematic grouping. We introduce graph-based regularization schemes that exploit the linguistic structures, which in turn can be constructed from either corpus statistics or domain knowledge. We demonstrate that the proposed technique improves the group coherence, facilitates visualization, provides means for estimation of intrinsic dimensionality, reduces overfitting, and possibly leads to better classification accuracy. (C) 2015 Elsevier Inc. All rights reserved.

机译：从非结构化文本中发现知识是数据挖掘和机器学习的中心主题。我们专注于从语料库快速发现主题结构。我们的方法基于一种通用的概率公式化-受限玻尔兹曼机（RBM）-其中基础图形模型是无向二部图。推理是有效的-可以使用单个矩阵投影来计算文档表示，这使得RBM适用于当今的大量文本语料库。但是，标准的RBM在假设单词袋的情况下运行，而忽略了单词之间固有的潜在关系结构。这导致词主题分组的连贯性降低。我们介绍了利用语言结构的基于图的正则化方案，而语言结构又可以从语料统计或领域知识中构建。我们证明了所提出的技术提高了组的连贯性，促进了可视化，提供了用于估计固有维数的方法，减少了过度拟合，并可能导致更好的分类精度。（C）2015 Elsevier Inc.保留所有权利。

著录项

来源
《Information Sciences: An International Journal》 |2016年第null期|共16页
作者
Tu Dinh Nguyen; Truyen Tran; Dinh Phung; Venkatesh Svetha;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类自动信息理论;
关键词
Document modeling; Restricted Boltzmann machine; Feature group discovery; Topic coherence; Word graphs;

机译：文档建模;受限玻尔兹曼机;特征群发现;主题一致性;词图;

相似文献

外文文献
中文文献
专利

1. Graph-induced restricted Boltzmann machines for document modeling [J] . Tu Dinh Nguyen, Truyen Tran, Dinh Phung, Information Sciences: An International Journal . 2016,第Null期

机译：图诱导受限玻尔兹曼机用于文档建模
2. Restricted Boltzmann Machines as Models of Interacting Variables [J] . Nicola Bulso, Yasser Roudi Neural computation . 2021,第10期

机译：限制Boltzmann Machines作为交互变量的模型
3. Data-Driven Fuzzy Modeling Using Restricted Boltzmann Machines and Probability Theory [J] . de la Rosa Erick, Yu Wen IEEE Transactions on Systems, Man, and Cybernetics . 2020,第7期

机译：采用限制博尔兹曼机械和概率理论的数据驱动模糊建模
4. Deep Transfer Learning via Restricted Boltzmann Machine for Document Classification [C] . Zhang Jian Machine Learning and Applications and Workshops (ICMLA), 2011 10th International Conference on . 2011

机译：通过受限的Boltzmann机进行深度传输学习以进行文档分类
5. Efficient Machine Learning Inference for Embedded Systems with Integer Based Restricted Boltzmann Machines Classifiers [D] . Sosa Barillas, Bryan Samuel. 2019

机译：基于整数的受限Boltzmann机器分类器的嵌入式系统有效的机器学习推断
6. Correction: Gaussian-binary restricted Boltzmann machines for modeling natural image statistics [O] . Jan Melchior, Nan Wang, Laurenz Wiskott 2012

机译：校正：用于自然图像统计建模的高斯二元受限玻尔兹曼机
7. Convolutional restricted Boltzmann machine aided Monte Carlo: An application to Ising and Kitaev models [O] . Daniel Alcalde Puente, Ilya M. Eremin 2020

机译：卷积限制Boltzmann机器辅助蒙特卡罗：ising和kitaev型号的应用

Graph-induced restricted Boltzmann machines for document modeling

摘要

著录项

相似文献

相关主题

期刊订阅