首页> 美国卫生研究院文献>other >Developmental Stage Annotation of Drosophila Gene Expression Pattern Images via an Entire Solution Path for LDA
【2h】

Developmental Stage Annotation of Drosophila Gene Expression Pattern Images via an Entire Solution Path for LDA

机译:果蝇基因表达模式图像通过LDA的整个解决方案路径的发展阶段注释。

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Gene expression in a developing embryo occurs in particular cells (spatial patterns) in a time-specific manner (temporal patterns), which leads to the differentiation of cell fates. Images of a Drosophila melanogaster embryo at a given developmental stage, showing a particular gene expression pattern revealed by a gene-specific probe, can be compared for spatial overlaps. The comparison is fundamentally important to formulating and testing gene interaction hypotheses. Expression pattern comparison is most biologically meaningful when images from a similar time point (developmental stage) are compared. In this paper, we present LdaPath, a novel formulation of Linear Discriminant Analysis (LDA) for automatic developmental stage range classification. It employs multivariate linear regression with the L1-norm penalty controlled by a regularization parameter for feature extraction and visualization. LdaPath computes an entire solution path for all values of regularization parameter with essentially the same computational cost as fitting one LDA model. Thus, it facilitates efficient model selection. It is based on the equivalence relationship between LDA and the least squares method for multi-class classifications. This equivalence relationship is established under a mild condition, which we show empirically to hold for many high-dimensional datasets, such as expression pattern images. Our experiments on a collection of 2705 expression pattern images show the effectiveness of the proposed algorithm. Results also show that the LDA model resulting from LdaPath is sparse, and irrelevant features may be removed. Thus, LdaPath provides a general framework for simultaneous feature selection and feature extraction.
机译:发育中的胚胎中的基因表达以特定于时间的方式(时间模式)出现在特定的细胞(空间模式)中,这导致细胞命运的分化。可以比较果蝇在给定发育阶段的图像,该图像显示出由基因特异性探针揭示的特定基因表达模式,可以比较空间重叠。该比较对于制定和检验基因相互作用假设至关重要。当比较来自相似时间点(发育阶段)的图像时,表达模式比较在生物学上最有意义。在本文中,我们介绍了LdaPath,这是一种线性判别分析(LDA)的新公式,用于自动发育阶段范围分类。它采用多元线性回归,其中L1范数惩罚由正则化参数控制,用于特征提取和可视化。 LdaPath为正则化参数的所有值计算一条完整的解决方案路径,其计算成本与拟合一个LDA模型基本相同。因此,它有助于有效的模型选择。它基于LDA和最小二乘法之间的等价关系进行多类分类。这种等效关系是在温和的条件下建立的,根据经验我们可以证明它适用于许多高维数据集,例如表达模式图像。我们对2705个表达模式图像的收集实验证明了该算法的有效性。结果还表明,由LdaPath生成的LDA模型是稀疏的,并且可以删除无关的功能。因此,LdaPath提供了用于同时进行特征选择和特征提取的通用框架。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号