Free factories: From the quantum coreworld to the Personal Genome Project.

Abstract

This dissertation develops technical and governance infrastructure for a "free factory" by building on parallels with free and open source software and related communities. By viewing varied technologies and people as comprising free factories (or a federation of co-operating and competing factories with certain common ideals and infrastructure), I argue many scientific questions become easier to answer.

In the first chapter, I briefly summarize the dissertation. I then describe the hardware, staff and other resources required to implement the computational aspects of a free factory with reasonable economies of scale. In the next chapter, I use the infrastructure to search for DNA and RNA editing events in more than 600 million genomic traces from ten organisms at NCBI. I find numerous examples of traces that support the existence of these phenomena and set the stage for a more comprehensive investigation. The subsequent chapter uses the same tools to analyze four individual human genomes for variants of clinical interest. This work demonstrates such analyses need not lead to costly or harmful medical workup. In the last chapter, I describe the initial data release of the Personal Genome Project. The release is derived from two gigabases of targeted sequence data from ten individuals. I investigate the quality of the data by comparison with Affymetrix 500K SNPs and discuss one variant of clinical interest. This data release (linking scientists, physicians and members of the general public) demonstrates the utility of free factories for advancing the state-of-the-art in personalized, genomic medicine.

In Appendix A, I indicate how the Quantum Coreworld (earlier work on a digital evolution system consistent with the rules of quantum information processing) could efficiently use free factories. Such projects could allow free factories to fully utilize idle resources. Finally, in Appendix B, a novel, open-source primary data analysis pipeline is used to reprocess 100 gigabytes of image data derived from the exome of a Personal Genome Project participant. This approach demonstrates a 14% increase in placeable reads, on the PGP sample, over the vendor's pipeline.

Bibliographic record

  • Author

    Zaranek, Alexander Wait

  • Author affiliation

    Harvard University

  • Degree-granting institution Harvard University
  • Subject Biology Bioinformatics; Computer Science; Biophysics General
  • Degree Ph.D.
  • Year 2009
  • Pages 132 p.
  • Original format PDF
  • Language of text English
