首页> 外文会议>International conference on computational linguistics;COLING-96 >Inherited Feature-based Similarity Measure Based on Large Semantic Hierarchy and Large Text Corpus
【24h】

Inherited Feature-based Similarity Measure Based on Large Semantic Hierarchy and Large Text Corpus

机译:大语义层次和大文本语料库的基于特征的继承相似性度量

获取原文

摘要

We describe a similarity calculation model called IFSM (Inherited Feature Similarity Measure) between objects (words/concepts) based on their common and distinctive features. We propose an implementation method for obtaining features based on abstracted triples extracted from a large text corpus utilizing taxonomical knowledge. This model represents an integration of traditional methods, i.e,. relation based similarity measure and distribution based similarity measure. An experiment, using our new concept abstraction method which we call the flat probability grouping method, over 80,000 surface triples, shows that the abstraction level of 3000 is a good basis for feature description.
机译:我们基于对象(单词/概念)的共同和独特特征,描述了一种称为IFSM(继承特征相似性度量)的相似度计算模型。我们提出了一种基于分类学知识的,基于从大型文本语料库中提取的抽象三元组来获取特征的实现方法。该模型代表了传统方法的集成,即基于关系的相似性度量和基于分布的相似性度量。一项使用新概念抽象方法(称为扁平概率分组方法)的实验对超过80,000个表面三元组进行了测试,结果表明3000的抽象水平是进行特征描述的良好基础。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号