首页> 外文期刊>Metron >Gini diversity index, Hamming distance, and curse of dimensionality
【24h】

Gini diversity index, Hamming distance, and curse of dimensionality

机译:基尼多样性指数,汉明距离和维数诅咒

获取原文
获取原文并翻译 | 示例
       

摘要

The celebrated Gini(-Simpson) biodiversity index has found very useful applications in ecology, bio-environmetrics, econometry, psychometry, genetics, and lately in bioinformatics as well. In econommic, social and environmental sciences, there are some ordinal categorical variables or grouped data sets, so that it needs an alternative means of data summarization and interpretation. However, in many other applications in genetics and bioinformatics, mostly, categorical data models, without possibly an ordering of the categories, crop up, which may preempt routine use of conventional measures of quantitative diversity analysis. Further, in real life problems, mostly, genuine and (often enormously large) multidimensional data models, are encountered. The Hamming distance incorporates and generalizes the concept of the Gini-Simpson diversity index in a variety of multidimensional setups, without making very stringent structural regularity assumptions. In bioinformatics as well as many other large biological system analyses, the curse of dimensionality is a genuine concern. The role of Hamming distance based analysis is appraised in this perspective. Subgroup or MANOVA decomposability aspects are specially appraised in this setup.
机译:著名的吉尼(-Simpson)生物多样性指数已在生态学,生物环境计量学,计量经济学,心理计量学,遗传学以及最近在生物信息学中找到了非常有用的应用。在经济,社会和环境科学中,存在一些有序的分类变量或分组的数据集,因此它需要另一种数据汇总和解释的方法。但是,在遗传学和生物信息学的许多其他应用中,大多数情况下是分类数据模型,没有对类别进行排序,因此可能无法常规使用定量多样性分析的常规方法。此外,在现实生活中,通常会遇到真正的(通常是非常大的)多维数据模型。汉明距离在各种多维设置中合并并推广了基尼-辛普森分集指数的概念,而没有做出非常严格的结构规律性假设。在生物信息学以及许多其他大型生物系统分析中,维数的诅咒是一个真正的问题。从这个角度评估了基于汉明距离的分析的作用。在此设置中,专门评估了子组或MANOVA可分解性方面。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号