...
首页> 外文期刊>BMC Bioinformatics >A bag-of-words approach for Drosophila gene expression pattern annotation
【24h】

A bag-of-words approach for Drosophila gene expression pattern annotation

机译:果蝇基因表达模式注释的一词袋方法

获取原文
   

获取外文期刊封面封底 >>

       

摘要

Background Drosophila gene expression pattern images document the spatiotemporal dynamics of gene expression during embryogenesis. A comparative analysis of these images could provide a fundamentally important way for studying the regulatory networks governing development. To facilitate pattern comparison and searching, groups of images in the Berkeley Drosophila Genome Project (BDGP) high-throughput study were annotated with a variable number of anatomical terms manually using a controlled vocabulary. Considering that the number of available images is rapidly increasing, it is imperative to design computational methods to automate this task. Results We present a computational method to annotate gene expression pattern images automatically. The proposed method uses the bag-of-words scheme to utilize the existing information on pattern annotation and annotates images using a model that exploits correlations among terms. The proposed method can annotate images individually or in groups (e.g., according to the developmental stage). In addition, the proposed method can integrate information from different two-dimensional views of embryos. Results on embryonic patterns from BDGP data demonstrate that our method significantly outperforms other methods. Conclusion The proposed bag-of-words scheme is effective in representing a set of annotations assigned to a group of images, and the model employed to annotate images successfully captures the correlations among different controlled vocabulary terms. The integration of existing annotation information from multiple embryonic views improves annotation performance.
机译:背景果蝇基因表达模式图像记录了胚胎发生过程中基因表达的时空动态。对这些图像进行比较分析可以为研究控制发展的监管网络提供根本上重要的途径。为了促进模式比较和搜索,使用受控词汇表,使用可变数量的解剖学术语注释了伯克利果蝇基因组计划(BDGP)高通量研究中的图像组。考虑到可用图像的数量正在迅速增加,因此必须设计出使该任务自动化的计算方法。结果我们提出了一种计算方法来自动注释基因表达模式图像。所提出的方法使用词袋方案来利用关于模式注释的现有信息,并使用利用项之间的相关性的模型来对图像进行注释。所提出的方法可以单独地或成组地(例如,根据发育阶段)注释图像。另外,所提出的方法可以整合来自不同二维胚胎视图的信息。来自BDGP数据的胚胎模式结果表明,我们的方法明显优于其他方法。结论所提出的词袋方案有效地表示了分配给一组图像的一组注释,并且用于注释图像的模型成功捕获了不同受控词汇之间的相关性。来自多个原始视图的现有注释信息的集成提高了注释性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号