Latent semantic analysis for multiple-type interrelated data objects

机译：多种相互关联的数据对象的潜在语义分析

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Co-occurrence data is quite common in many real applications. Latent Semantic Analysis (LSA) has been successfully used to identify semantic relations in such data. However, LSA can only handle a single co-occurrence relationship between two types of objects. In practical applications, there are many cases where multiple types of objects exist and any pair of these objects could have a pairwise co-occurrence relation. All these co-occurrence relations can be exploited to alleviate data sparseness or to represent objects more meaningfully. In this paper, we propose a novel algorithm, M-LSA, which conducts latent semantic analysis by incorporating all pairwise co-occurrences among multiple types of objects. Based on the mutual reinforcement principle, M-LSA identifies the most salient concepts among the co-occurrence data and represents all the objects in a unified semantic space. M-LSA is general and we show that several variants of LSA are special cases of our algorithm. Experiment resultsshow that M-LSA outperforms LSA on multiple applications, including collaborative filtering, text clustering, and text categorization.

机译：共现数据在许多实际应用中非常普遍。潜在语义分析（LSA）已成功用于识别此类数据中的语义关系。但是，LSA仅能处理两种类型的对象之间的单一共现关系。在实际应用中，在许多情况下存在多种类型的对象，并且这些对象中的任何一对都可能具有成对的共现关系。所有这些共现关系都可以用来减轻数据稀疏性或更有意义地表示对象。在本文中，我们提出了一种新颖的算法 M-LSA ，该算法通过合并多种类型对象之间的所有成对共现来进行潜在的语义分析。基于互增强原理，M-LSA识别同现数据中最重要的概念，并在统一语义空间中表示所有对象。 M-LSA是通用的，我们证明了LSA的几种变体是我们算法的特例。实验结果表明，在包括协同过滤，文本聚类和文本分类在内的多个应用程序中，M-LSA的性能优于LSA。 展开▼

著录项

来源
《Annual international ACM SIGIR conference on Research and development in information retrieval;International ACM SIGIR conference on Research and development in information retrieval》|2006年|P.236-243|共8页

会议地点

作者
Xuanhui Wang; Jian-Tao Sun; Zheng Chen; ChengXiang Zhai; PXuanhui Wang; PJian-Tao Sun; PZheng Chen; PChengXiang Zhai;
展开▼

作者单位

展开▼

会议组织

原文格式 PDF

正文语种

中图分类各种专用数据库;

关键词
mutual reinforcement principle;

机译：互助原则;

入库时间 2022-08-26 14:55:08

相似文献

外文文献

中文文献

专利

1. M:N object matching between image and map object data sets by means of latent semantic analysis [J] . Yong Huh, Jiyoung Kim, Kiyun Yu, International journal of remote sensing . 2014,第17a18期

机译：通过潜在语义分析在图像和地图对象数据集之间进行M：N对象匹配

2. A comparative analysis of Latent Semantic analysis and Latent Dirichlet allocation topic modeling methods using Bible data [J] . Vasantha Kumari Garbhapu, Prajna Bodapati Indian Journal of Science and Technology . 2020,第44期

机译：潜在语义分析与潜在的Dirichlet分配主题建模方法的比较分析

3. Comparison of Latent Semantic Analysis and Probabilistic Latent Semantic Analysis for Documents Clustering [J] . Kuta, Marcin, Kitowski, Computing and informatics . 2015,第3期

机译：文档聚类的潜在语义分析与概率潜在语义分析的比较

4. Latent semantic analysis for multiple-type interrelated data objects [C] . Xuanhui Wang, Jian-Tao Sun, Zheng Chen, Annual international ACM SIGIR conference on Research and development in information retrieval . 2006

机译：多型相互关联数据对象的潜在语义分析

5. Performance Evaluation of Probabilistic Latent Semantic Analysis for Unstructured Social Media Data. [D] . Prakash, Bharat. 2014

机译：非结构化社交媒体数据的概率潜在语义分析的性能评估。

6. MOWDOC: A Dataset of Documents From Taking the Measure of Work for Building a Latent Semantic Analysis Space [O] . Kim F. Nimon 2020

机译：mowdoc：从衡量建立潜在语义分析空间的工作的文件数据集

7. Latent semantic analysis for multiple-type interrelated data objects [O] . Xuanhui Wang, Jian-tao Sun, Zheng Chen, 2006

机译：多类型相关数据对象的潜在语义分析

1. 利用潜在语义分析和关联规则挖掘构造同义与关联词集 [J] . 张文东 ,易轶虎 . 计算机工程与科学 . 2007,第001期

2. 高维稀疏数据对象-属性的非关联子空问分析 [J] . 祝琴 ,戴爱明 . 中国管理信息化 . 2011,第009期

3. 简化达到优化、协调为了统一——从地方标准的制定看它们相互关联、相互渗透、相互依存的关系 [J] . 彭同心 ,葛建华 . 仪器仪表标准化与计量 . 2009,第005期

4. 分句间的多种意念关系和多种关联词语初探 [J] . 孙云 . 天津师范大学学报：社会科学版 . 1981,第006期

5. 两模光场与原子相互作用中光场周期性和频效应-理想Kerr介质腔中非关联双模相干态光场与V型三能级原子相互作用系统中光场的不等阶和压缩效应 [J] . 赖振讲 ,侯洵 ,杨志勇 . 光子学报 . 2002,第12期

6. 基于秩方法的相互关联与相互依赖多网络体系脆弱性分析 [C] . Jin Wei-xin ,金伟新 . 第17届中国系统仿真技术及其应用学术年会（17th CCSSTA 2016) . 2016

7. 分子间相互作用与其磁耦合相互作用的关联研究 [A] . 张程程 . 2017

1. 一种实现多种商品码相互关联调取的系统及方法 [P] . 中国专利： CN109299952B . 2021.10.29

2. 一种实现多种商品码相互关联调取的系统及方法 [P] . 中国专利： CN109299952A . 2019-02-01

3. System and method of structuring data for search using latent semantic analysis techniques [P] . 外国专利： US9183288B2 . 2015-11-10

机译：使用潜在语义分析技术构建搜索数据的系统和方法

4. SYSTEM AND METHOD OF STRUCTURING DATA FOR SEARCH USING LATENT SEMANTIC ANALYSIS TECHNIQUES [P] . 外国专利： US2011225159A1 . 2011-09-15

机译：利用潜在语义分析技术构建搜索数据的系统和方法

5. SCENE ACTIVITY ANALYSIS USING STATISTICAL AND SEMANTIC FEATURE LEARNT FROM OBJECT TRAJECTORY DATA [P] . 外国专利： EP2659456B1 . 2020-02-19

机译：利用对象轨迹数据学习的统计和语义特征进行场景活动分析

相关主题

Latent semantic analysis for multiple-type interrelated data objects

摘要

著录项

相似文献

相关主题

期刊订阅