首页> 外文期刊>Decision support systems >Independent component analysis for near-synonym choice
【24h】

Independent component analysis for near-synonym choice

机译:独立成分分析,用于近义词选择

获取原文
获取原文并翻译 | 示例
       

摘要

Despite their similar meanings, near-synonyms may have different usages in different contexts, and the development of algorithms that can verify whether near-synonyms do match their given contexts has been the focus of increasing concern. Such algorithms have many applications such as query expansion for information retrieval (IR), alternative word selection for writing support systems, and (near-)duplicate detection for text summarization. In this paper, we propose a framework that incorporates latent semantic analysis (LSA) and independent component analysis (ICA) to automatically select suitable near-synonyms according to the given context. LSA is used to discover useful latent features that do not frequently occur in the contexts of near-synonyms, and ICA is used to estimate a set of independent components by minimizing the dependence between features. An SVM classifier is then trained with the independent components for best near-synonym prediction. In experiments, we evaluate the proposed method on both Chinese and English sentences, and compare its performance to state-of-the-art supervised and unsupervised methods. Experimental results show that training on the independent components that contain useful contextual features with minimized term dependence can improve the classifiers' ability to discriminate among near-synonyms, thus yielding better performance.
机译:尽管近义词具有相似的含义,但它们在不同上下文中的用法可能有所不同,并且可以验证近义词是否确实匹配其给定上下文的算法的开发一直是人们日益关注的焦点。这样的算法具有许多应用,例如用于信息检索(IR)的查询扩展,用于写作支持系统的替代单词选择以及用于文本摘要的(近)重复检测。在本文中,我们提出了一个框架,该框架结合了潜在语义分析(LSA)和独立成分分析(ICA),可以根据给定的上下文自动选择合适的近义词。 LSA用于发现在近义词上下文中不经常出现的有用的潜在特征,而ICA用于通过最小化特征之间的依赖性来估计一组独立的分量。然后,使用独立的组件对SVM分类器进行训练,以实现最佳的近义词预测。在实验中,我们对中文和英文句子中的拟议方法进行了评估,并将其与最新的监督和非监督方法进行比较。实验结果表明,对包含有用上下文特征且具有最小术语依赖性的独立组件进行训练可以提高分类器区分近义词的能力,从而产生更好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号