Selecting informative subsets of sparse supermatrices increases the chance to find correct trees

Abstract

Background: Character matrices with extensive missing data are frequently used in phylogenomics, with potentially detrimental effects on the accuracy and robustness of tree inference. Many investigators therefore select taxa and genes with high data coverage. The drawback of such selections is that they rely exclusively on data coverage and ignore the actual signal in the data, so they may not deliver optimal data matrices in terms of potential phylogenetic signal. To circumvent this problem, we developed a heuristic, implemented in the software mare, which (1) assesses the information content of genes in supermatrices using a measure of potential signal combined with data coverage and (2) reduces supermatrices with a simple hill-climbing procedure to submatrices with high total information content. We conducted simulation studies using matrices of 50 taxa × 50 genes with heterogeneous phylogenetic signal among genes and data coverage of 10–30%.
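The reduction step described in the abstract can be illustrated with a minimal hill-climbing sketch. This is not the mare implementation: the per-cell information values, the `score` objective (mean information content raised to a hypothetical exponent `alpha`, multiplied by the fraction of cells retained), and the function names are all assumptions chosen only to show the general idea of greedily dropping taxa or genes while an objective improves.

```python
from typing import List, Tuple

Matrix = List[List[float]]  # rows = taxa, columns = genes


def score(info: Matrix, taxa: List[int], genes: List[int], alpha: float = 3.0) -> float:
    """Hypothetical objective (an assumption, not mare's formula):
    mean information content of the retained cells, raised to `alpha`,
    times the fraction of the original cells that are retained."""
    if not taxa or not genes:
        return 0.0
    n_cells = len(taxa) * len(genes)
    mean_info = sum(info[t][g] for t in taxa for g in genes) / n_cells
    kept_fraction = n_cells / (len(info) * len(info[0]))
    return mean_info ** alpha * kept_fraction


def hill_climb(info: Matrix, alpha: float = 3.0) -> Tuple[List[int], List[int], float]:
    """Greedily remove one taxon or one gene per step, always taking the
    removal that raises the score most; stop when no removal improves it."""
    taxa = list(range(len(info)))
    genes = list(range(len(info[0])))
    best = score(info, taxa, genes, alpha)
    while True:
        candidates = []
        for t in taxa:  # try dropping each remaining taxon
            rest = [x for x in taxa if x != t]
            candidates.append((score(info, rest, genes, alpha), rest, genes))
        for g in genes:  # try dropping each remaining gene
            rest = [x for x in genes if x != g]
            candidates.append((score(info, taxa, rest, alpha), taxa, rest))
        top = max(candidates, key=lambda c: c[0])
        if top[0] <= best:  # local optimum reached
            return taxa, genes, best
        best, taxa, genes = top


# Toy input: made-up per-cell information values in [0, 1];
# 0.0 stands for a gene that is missing for that taxon.
EXAMPLE_INFO: Matrix = [
    [0.9, 0.8, 0.0, 0.7],
    [0.8, 0.9, 0.0, 0.6],
    [0.7, 0.8, 0.1, 0.9],
    [0.0, 0.0, 0.2, 0.0],
]
```

On the toy matrix, `hill_climb(EXAMPLE_INFO)` first drops the nearly empty taxon (row 3), then the weakly informative gene (column 2), and stops with taxa `[0, 1, 2]` and genes `[0, 1, 3]`. Like any hill-climbing procedure, the search only guarantees a local optimum of whatever objective is used.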

Bibliographic details

Journal: BMC Bioinformatics
Authors: Bernhard Misof; Benjamin Meyer; Björn Marcus von Reumont; Patrick Kück; Katharina Misof; Karen Meusemann
Year (volume): 2013 (14)
Page: 348
Total pages: 13
Original format: PDF
Chinese Library Classification: applied microbiology; biochemical genetics; biochemical pharmacology

