首页> 外文会议>Asian conference on intelligent information and database systems >Bounds on Lengths of Real Valued Vectors Similar with Regard to the Tanimoto Similarity
【24h】

Bounds on Lengths of Real Valued Vectors Similar with Regard to the Tanimoto Similarity

机译:关于Tanimoto相似性的实值向量的长度的界

获取原文

摘要

The Tanimoto similarity measure finds numerous applications in chemistry, bio-informatics, information retrieval and text mining. A typical task in these applications is finding most similar vectors. The task is very time consuming in the case of very large data sets. Thus methods that allow for efficient restriction of the number of vectors that have a chance to be sufficiently similar to a given vector are of high importance. To this end, recently, we have derived bounds on lengths of vectors similar with respect to the Tanimoto similarity. In this paper, we recall those results and derive new bounds on lengths of real valued vectors that have a chance to be Tanimoto similar to a given vector in a required degree. Finally, we compare the previous and current results and illustrate their usefulness.
机译:Tanimoto相似性度量在化学,生物信息学,信息检索和文本挖掘中发现了许多应用。这些应用程序中的典型任务是找到最相似的向量。对于非常大的数据集,此任务非常耗时。因此,允许有效限制矢量数量的方法的方法非常重要,这些矢量有机会与给定的矢量充分相似。为此,最近,我们已经得出了与谷本相似性相似的矢量长度的界线。在本文中,我们回顾了这些结果,并得出了有价向量的长度上的新界限,这些向量有可能在所需程度内类似于给定向量的谷本。最后,我们比较了先前和当前的结果,并说明了它们的有用性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号