首页> 外文期刊>BMC Bioinformatics >Dissecting systems-wide data using mixture models: application to identify affected cellular processes
【24h】

Dissecting systems-wide data using mixture models: application to identify affected cellular processes

机译:使用混合模型解剖系统范围的数据:识别受影响的细胞过程的应用

获取原文
           

摘要

Background Functional analysis of data from genome-scale experiments, such as microarrays, requires an extensive selection of differentially expressed genes. Under many conditions, the proportion of differentially expressed genes is considerable, making the selection criteria a balance between the inclusion of false positives and the exclusion of false negatives. Results We developed an analytical method to determine a p -value threshold from a microarray experiment that is dependent on the quality and design of the data set. To this aim, populations of p -values are modeled as mathematical functions in which the parameters to describe these functions are estimated in an unsupervised manner. The strength of the method is exemplified by its application to a published gene expression data set of sporadic and familial breast tumors with BRCA1 or BRCA2 mutations. Conclusion We present an objective and unsupervised way to set thresholds adapted to the quality and design of the experiment. The resulting mathematical description of the data sets of genome-scale experiments enables a probabilistic approach in systems biology.
机译:背景技术对来自基因组规模的实验(例如微阵列)的数据进行功能分析需要大量选择差异表达的基因。在许多情况下,差异表达基因的比例相当大,从而使选择标准在包含假阳性和排除假阴性之间取得平衡。结果我们开发了一种分析方法,可从微阵列实验中确定p值阈值,该阈值取决于数据集的质量和设计。为此,将p值总体建模为数学函数,其中以无监督方式估算描述这些函数的参数。该方法的强度通过将其应用于已发布的具有BRCA1或BRCA2突变的散发性和家族性乳腺肿瘤的基因表达数据集而得到例证。结论我们提出了一种客观且不受监督的方法来设置适合于实验质量和设计的阈值。所得的基因组规模实验数据集的数学描述使系统生物学中的概率方法成为可能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号