Diagnosability of mtDNA with Random Forests: Using sequence data to delimit subspecies

首页> 外文期刊>Marine Mammal Science >Diagnosability of mtDNA with Random Forests: Using sequence data to delimit subspecies

【24h】

Diagnosability of mtDNA with Random Forests: Using sequence data to delimit subspecies

机译：MTDNA与随机林的诊断：使用序列数据分隔亚种

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

We examine the use of an ensemble method, Random Forests, to delimit subspecies using mitochondrial DNA (mtDNA) sequences. Diagnosability, a measure of the ability to correctly determine the taxon of a specimen of unknown origin, has historically been used to delimit subspecies, but few studies have explored how to estimate it from DNA sequences. Using simulated and empirical data sets, we demonstrate that Random Forests produces classification models that perform well for diagnosing subspecies and species. Populations with strong social structure and relatively low abundances (e.g., killer whales, Orcinus orca) were found to be as diagnosable as species. Conversely, comparisons involving subspecies that are abundant (e.g., spinner and spotted dolphins, Stenella longirostris and S. attenuata), are only as diagnosable as many population comparisons. Estimates of diagnosability reported in subspecies and species descriptions should include confidence intervals, which are influenced by the sample sizes of the training data. We also stress the importance of reporting the certainty with which individuals in the training data are classified in order to communicate the strength of the classification model and diagnosability estimate. Guidance as to ideal minimum diagnosability thresholds for subspecies will improve with more comprehensive analyses; however, values in the range of 80%-90% are considered appropriate.

机译：None

著录项

来源
《Marine Mammal Science》 |2017年第1期|共31页
作者

展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类水生生物学;
关键词
taxonomy; subspecies; mtDNA; random forests; machine learning; species; population genetics; systematics; classification;

机译：分类;亚种;MTDNA;随机森林;机器学习;物种;人口遗传;系统性;分类;
入库时间 2022-08-20 18:19:42

相似文献

外文文献
中文文献
专利

1. Forest type identification by random forest classification combined with SPOT and multitemporal SAR data [J] . Ying Yu, Mingze Li, Yu Fu 林业研究（英文版） . 2018,第005期
2. How random is the random forest ? Random forest algorithm on the service of structural imaging biomarkers for Alzheimer's disease: from Alzheimer's disease neuroimaging initiative (ADNI) database [J] . Stavros I.Dimitriadis, Dimitris Liparas 中国神经再生研究（英文版） . 2018,第006期
3. A Data-Driven Car-Following Model Based on the Random Forest [J] . Huili Shi, Tingli Wang, Fusheng Zhong, 世界工程和技术（英文） . 2021,第003期
4. Diagnosability of mtDNA with Random Forests: Using sequence data to delimit subspecies [J] . Marine Mammal Science . 2017,第Suppla1期

机译：MTDNA与随机林的诊断：使用序列数据分隔亚种
5. Feature selection and classification of urinary mRNA microarray data by iterative random forest to diagnose renal fibrosis: a two-stage study [J] . Le-Ting Zhou, Yu-Han Cao, Lin-Li Lv, Scientific reports. . 2017,第1期

机译：迭代随机森林对尿mRNA基因芯片数据进行特征选择和分类以诊断肾纤维化的两阶段研究
6. Performance of random forests and logic regression methods using mini-exome sequence data [J] . Yoonhee Kim, Qing Li, Cheryl D Cropp, BMC proceedings. . 2011,第S9期

机译：使用迷你exome序列数据的随机林和逻辑回归方法的性能
7. Agricultural field delimitation using active learning and random forests margin [C] . Ghariani Karim, Chehata Nesrine, Le Bris Arnaud, IEEE International Geoscience and Remote Sensing Symposium . 2014

机译：利用主动学习和随机森林余量进行农田划界
8. The larvae of Chinese Hydropsychidae (Insecta: Trichoptera): Delimiting species boundaries using morphology and DNA sequences. [D] . Zhou, Xin. 2007

机译：中国水psych科（Insecta：Trichoptera）的幼虫：使用形态学和DNA序列划定物种边界。
9. Feature selection and classification of urinary mRNA microarray data by iterative random forest to diagnose renal fibrosis: a two-stage study [O] . Le-Ting Zhou, Yu-Han Cao, Lin-Li Lv, -1

机译：迭代随机森林对尿mRNA基因芯片数据进行特征选择和分类以诊断肾纤维化的两阶段研究
10. Feature selection and classification of urinary mRNA microarray data by iterative random forest to diagnose renal fibrosis: a two-stage study [O] . Le-Ting Zhou, Yu-Han Cao, Lin-Li Lv, 2017

机译：迭代随机森林特征选择和分类尿mRNA微阵列数据诊断肾纤维化：两阶段研究
11. Paired-End Sequence Mapping Detects Extensive Genomic Rearrangement and Translocation During Divergence of Francisella tularensis Subspecies Tularensis and Francisella tularensis Subspecies holarctica Populations [R] . Dempsey, M. P. , Nietfeldt, J. , Ravel, J. , 2006

机译：配对末端序列图谱检测土拉弗朗西斯菌亚种Tularensis和土拉弗朗西斯菌亚种holarctica种群发散过程中的广泛基因组重排和易位

Diagnosability of mtDNA with Random Forests: Using sequence data to delimit subspecies

摘要

著录项

相似文献

相关主题

期刊订阅