Applying Latent Semantic Analysis to Optimize Second-order Co-occurrence Vectors for Semantic Relatedness Measurement

机译：应用潜在语义分析优化用于语义相关性测量的二阶共现向量

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Measures of semantic relatedness are largely applicable in intelligent tasks of NLP and Bioinformatics. By taking these automated measures into account, this paper attempts to improve Second-order Co-occurrence Vector semantic relatedness measure for more effective estimation of relatedness between two given concepts. Typically, this measure, after constructing concepts definitions (Glosses) from a thesaurus, considers the cosine of the angle between the concepts' gloss vectors as the degree of relatedness. Nonetheless, these computed gloss vectors of concepts are impure and rather large in size which would hinder the expected performance of the measure. By employing latent semantic analysis (LSA), we try to conduct some level of insignificant feature elimination to generate economic gloss vectors. Applying both approaches to the biomedical domain, using MEDLINE as corpus, UMLS as thesaurus, and reference standard of biomedical concept pairs manually rated for relatedness, we show LSA implementation enforces positive impact in terms of performance and efficiency.

机译：语义相关性的度量很大程度上适用于NLP和生物信息学的智能任务。通过考虑这些自动化措施，本文尝试改进二阶共现向量语义相关性度量，以更有效地估计两个给定概念之间的相关性。通常，在从同义词库构造概念定义（光泽度）之后，此度量将概念光泽度向量之间的角度的余弦视为关联度。尽管如此，这些计算出的概念光泽度向量是不纯净的，并且大小较大，这会阻碍该度量的预期性能。通过使用潜在语义分析（LSA），我们尝试进行某种程度的无关紧要的特征消除，以生成经济的光泽向量。将这两种方法应用于生物医学领域，以MEDLINE为语料库，以UMLS为同义词库，以及对生物医学概念对进行手动相关性评估的参考标准，我们证明LSA实施对性能和效率产生了积极影响。

著录项

来源
《International conference on mining intelligence and knowledge exploration》|2013年|588-599|共12页
会议地点
作者
Ahmad Pesaranghader; Ali Pesaranghader; Azadeh Rezaei;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Biomedical Text Mining; Bioinformatics; Semantic Relatedness; Latent Semantic Analysis; MEDLINE; UMLS; Natural Language Processing;

机译：生物医学文本挖掘;生物信息学语义相关性;潜在语义分析; MEDLINE; UMLS;自然语言处理;
入库时间 2022-08-26 15:12:14

相似文献

外文文献
中文文献
专利

1. Semantic relatedness measurement based on Wikipedia link co-occurrence analysis [J] . Masahiro Ito, Kotaro Nakayama, Takahiro Hara, International journal of web information systems . 2011,第1期

机译：基于维基百科链接共现分析的语义相关性度量
2. Computing semantic relatedness using latent semantic analysis and fuzzy formal concept analysis [J] . Shivani Jain, K.R. Seeja, Rajni Jindal International journal of reasoning-based intelligent systems . 2021,第2期

机译：使用潜在语义分析和模糊正式概念分析计算语义相关性
3. A New Methodology for Computing Semantic Relatedness: Modified Latent Semantic Analysis by Fuzzy Formal Concept Analysis [J] . Shivani Jain, K.R. Seeja, Rajni Jindal Procedia Computer Science . 2020,第5期

机译：用于计算语义相关性的新方法：模糊正式概念分析修改潜在语义分析
4. Applying Latent Semantic Analysis to Optimize Second-order Co-occurrence Vectors for Semantic Relatedness Measurement [C] . Ahmad Pesaranghader, Ali Pesaranghader, Azadeh Rezaei International Conference Mining Intelligence and Knowledge Exploration . 2013

机译：应用潜在语义分析优化用于语义相关性测量的二阶共生载体
5. Measurement from a semantic perspective: A model of scale construction using latent semantic analysis [D] . Sherman, Geoffrey William 2006

机译：从语义角度进行度量：使用潜在语义分析的量表构建模型
6. Searching for Semantic Knowledge: A Vector Space Semantic Analysis of the Feature Generation Task [O] . Rebecca A. Cutler, Melissa C. Duff, Sean M. Polyn 2019

机译：搜索语义知识：特征生成任务的向量空间语义分析
7. The feasibility of applying Latent Semantic Analysis to analyze Item similarity [O] . 陳彥霖, Chen Yan-Lin 2011

机译：应用潜在语义分析来分析项目相似度的可行性

Applying Latent Semantic Analysis to Optimize Second-order Co-occurrence Vectors for Semantic Relatedness Measurement

摘要

著录项

相似文献

相关主题

期刊订阅