首页> 外文期刊>ChemMedChem >Introduction of a Generally Applicable Method to Estimate Retrieval of Active Molecules for Similarity Searching using Fingerprints
【24h】

Introduction of a Generally Applicable Method to Estimate Retrieval of Active Molecules for Similarity Searching using Fingerprints

机译:介绍一种普遍适用的估计活性分子检索率的方法,以利用指纹进行相似性搜索

获取原文
获取原文并翻译 | 示例
           

摘要

Fingerprints are bit string representations of molecular structure and properties and are among the most widely used computational tools for similarity searching and database screening.Various fingerprint designs are available and their search performance is in general strongly dependent on the compound classes under study and the chemical characteristics of screening databases.Currently,it is not possible to predict the probability of identifying novel hits through fingerprint searching.However,for practical applications,such estimations would be very useful because one might be able,for example,to prioritize fingerprints and compound selection strategies or decide whether or not a similarity search campaign with subsequent experimental evaluation of candidate compounds would be promising at all.We have developed a method that makes it possible to predict the outcome of similarity search calculations using any type of keyed fingerprint.The methodology incorporates bit frequency distributions of reference molecules and the screening database into an information-theoretic function and determines the principally possible recall of active compounds within selection sets of varying size.We calibrate the function on diverse compound classes and accurately predict compound recovery in retrospective virtual screening trials.Furthermore,we correctly predict fingerprint search performance on two experimental high-throughput screening data sets(HTS).Our findings indicate that given a set of reference molecules,a fingerprint,and a screening database,we can readily estimate how likely it will be to retrieve active compounds,without knowledge about the distribution of potential hits in the database.
机译:指纹是分子结构和性质的位串表示,是用于相似度搜索和数据库筛选的最广泛使用的计算工具之一。可以使用各种指纹设计,并且其搜索性能通常强烈取决于所研究的化合物类别和化学特性目前尚无法预测通过指纹搜索识别出新颖命中的可能性。但是,对于实际应用,这样的估计将非常有用,因为它可以例如对指纹和化合物选择策略进行优先级排序或决定是否进行相似性搜索活动并随后进行候选化合物的实验评估。我们开发了一种方法,可以使用任何类型的键控指纹预测相似性搜索计算的结果。频率分布将参考分子和筛选数据库之间的关系转化为信息理论功能,并确定在不同大小的选择集中可能有效回收的活性化合物。我们在回顾性虚拟筛选试验中校准了各种化合物类别的功能并准确预测了化合物的回收率。 ,我们可以在两个实验性高通量筛选数据集(HTS)上正确预测指纹搜索性能。我们的发现表明,给定一组参考分子,一个指纹和一个筛选数据库,我们可以轻松地估算出检索的可能性活性化合物,无需了解数据库中潜在命中的分布情况。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号