Semisupervised learning of hierarchical latent trait models for data visualization

Nabney I.T.; Sun Y.; Tino P.; Kaban A.

首页> 外文期刊>IEEE Transactions on Knowledge and Data Engineering >Semisupervised learning of hierarchical latent trait models for data visualization

【24h】

Semisupervised learning of hierarchical latent trait models for data visualization

机译：用于数据可视化的分层潜在特征模型的半监督学习

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Recently, we have developed the hierarchical generative topographic mapping (HGTM), an interactive method for visualization of large high-dimensional real-valued data sets. We propose a more general visualization system by extending HGTM in three ways, which allows the user to visualize a wider range of data sets and better support the model development process. 1) We integrate HGTM with noise models from the exponential family of distributions. The basic building block is the latent trait model (LTM). This enables us to visualize data of inherently discrete nature, e.g., collections of documents, in a hierarchical manner. 2) We give the user a choice of initializing the child plots of the current plot in either interactive, or automatic mode. In the interactive mode, the user selects "regions of interest", whereas in the automatic mode, an unsupervised minimum message length (MML)-inspired construction of a mixture of LTMs is employed. The unsupervised construction is particularly useful when high-level plots are covered with dense clusters of highly overlapping data projections, making it difficult to use the interactive mode. Such a situation often arises when visualizing large data sets. 3) We derive general formulas for magnification factors in latent trait models. Magnification factors are a useful tool to improve our understanding of the visualization plots, since they can highlight the boundaries between data clusters. We illustrate our approach on a toy example and evaluate it on three more complex real data sets.

机译：最近，我们开发了分层的生成地形图（HGTM），这是一种用于可视化大型高维实值数据集的交互式方法。我们通过三种方式扩展HGTM，提出了一个更通用的可视化系统，该系统允许用户可视化更广泛的数据集并更好地支持模型开发过程。 1）我们将HGTM与来自指数分布族的噪声模型集成在一起。基本构件是潜在特征模型（LTM）。这使我们能够以分层方式可视化固有离散性质的数据，例如文档集合。 2）我们为用户提供了以交互方式或自动方式初始化当前图的子图的选择。在交互模式下，用户选择“感兴趣区域”，而在自动模式下，采用无监督的最小消息长度（MML）启发的LTM混合结构。当高级绘图被高度重叠的数据投影的密集簇覆盖时，使用无交互模式时，无监督构造特别有用。当可视化大型数据集时，经常会出现这种情况。 3）推导了潜在性状模型中放大因子的一般公式。放大倍数是提高我们对可视化图的理解的有用工具，因为它们可以突出显示数据集群之间的边界。我们通过一个玩具示例来说明我们的方法，并在三个更复杂的真实数据集上对其进行评估。

著录项

来源
《IEEE Transactions on Knowledge and Data Engineering》 |2005年第3期|p.384-400|共17页
作者
Nabney I.T.; Sun Y.; Tino P.; Kaban A.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术;
关键词
Gaussian noise; data mining; data visualisation; exponential distribution; interactive systems; learning (artificial intelligence); self-organising feature maps; very large databases; data visualization; document mining; hierarchical generative topographic mapping;

机译：高斯噪声;数据挖掘;数据可视化;指数分布;交互式系统;学习（人工智能）;自组织特征图;大型数据库;数据可视化;文档挖掘;分层生成地形图;

相似文献

外文文献
中文文献
专利

1. Deep Learning of Semisupervised Process Data With Hierarchical Extreme Learning Machine and Soft Sensor Application [J] . Le Yao, Zhiqiang Ge Industrial Electronics, IEEE Transactions on . 2018,第2期

机译：分层极限学习机和软传感器在半监督过程数据深度学习中的应用
2. A hierarchical latent variable model for data visualization [J] . Bishop C.M., Tipping M.E. IEEE Transactions on Pattern Analysis and Machine Intelligence . 1998,第3期

机译：用于数据可视化的分层潜在变量模型
3. Comparing Fits of Latent Trait and Latent Class Models Applied to Sparse Binary Data: An Illustration with Human Resource Management Data [J] . LILIAN M. DE MENEZES, ANA LASAOSA Journal of applied statistics . 2007,第3a4期

机译：比较适用于稀疏二进制数据的潜在特征和潜在类模型的拟合度：与人力资源管理数据的说明
4. Reading Difficulty in Adults with Intellectual Disabilities: Analysis with a Hierarchical Latent Trait Model [C] . Martin Jansche, Lijun Feng, Matt Huenerfauth 12th international ACM SIGACCESS conference on computers and accessibility 2010 . 2010

机译：智力障碍成年人的阅读困难：分层隐性特征模型的分析
5. Learning Latent Hierarchical Structures via Probabilistic Models and Deep Learning [D] . Arabshahi, Forough 2018

机译：通过概率模型和深度学习来学习潜在的层次结构
6. A Bayesian hierarchical latent trait model for estimating rater bias and reliability in large-scale performance assessment [O] . Kaja Zupanc, Erik Štrumbelj -1

机译：用于估计大规模绩效评估中评估者偏倚和可靠性的贝叶斯分层潜在性状模型
7. Semisupervised learning of hierarchical latent trait models for data visualization [O] . Nabney, Ian T., Sun, Yi, Tiňo, Peter, 2005

机译：用于数据可视化的分层潜在特征模型的半监督学习

Semisupervised learning of hierarchical latent trait models for data visualization

摘要

著录项

相似文献

相关主题

期刊订阅