GEN2VCF: a converter for human genome imputation output format to VCF format

Abstract

Background: In human genome-wide association studies, genotype imputation is an essential tool for improving association mapping power. When IMPUTE software is used for imputation, its output (GEN format) needs to be converted to variant call format (VCF) with imputed genotype dosage before association analysis. However, this conversion requires a pipeline of multiple software packages and a large amount of processing time.

Objective: We developed GEN2VCF, a fast and convenient GEN-to-VCF conversion tool with dosage support.

Methods: The performance of GEN2VCF was compared with that of BCFtools, QCTOOL, and Oncofunco. The test data set was a 1 Mb GEN-formatted file of 5000 samples. To assess performance across sample sizes, tests were run on 1000 to 5000 samples in steps of 1000. Runtime and memory usage were used as performance measures.

Results: GEN2VCF performed markedly better in both runtime and memory usage. Its runtime was at least 1.4-fold lower, and its memory usage at least 7.4-fold lower, than those of the other methods.

Conclusions: GEN2VCF provides efficient conversion from GEN format to VCF with the best-guess genotype, genotype posterior probabilities, and genotype dosage, and it can be combined flexibly with other software packages in a pipeline.
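
To make the conversion described in the abstract concrete, the following is a minimal Python sketch of how one GEN-format line could be turned into one VCF data line carrying a best-guess genotype (GT), genotype posterior probabilities (GP), and dosage (DS). It is not the GEN2VCF implementation. The function name `gen_line_to_vcf`, the `threshold` parameter for calling a best-guess genotype, the choice of allele A as REF, and the `GT:GP:DS` field layout are all assumptions made for illustration. The chromosome is passed in explicitly because the first GEN column does not reliably hold it.

```python
# Illustrative sketch only; names and defaults are hypothetical, not GEN2VCF's.
GENOTYPES = ("0/0", "0/1", "1/1")  # AA, AB, BB with allele A assumed to be REF


def gen_line_to_vcf(line, chrom, threshold=0.9):
    """Convert one GEN line into one tab-separated VCF data line."""
    fields = line.split()
    # GEN layout: SNP ID, rsID, position, allele A, allele B, then
    # three probabilities (AA, AB, BB) per sample.
    _snp_id, rsid, pos, allele_a, allele_b = fields[:5]
    probs = [float(x) for x in fields[5:]]
    if len(probs) % 3 != 0:
        raise ValueError("expected three probabilities per sample")

    samples = []
    for i in range(0, len(probs), 3):
        triple = probs[i:i + 3]
        p_aa, p_ab, p_bb = triple
        # Expected count of allele B (the assumed ALT allele).
        dosage = p_ab + 2.0 * p_bb
        # Best-guess genotype: most probable call, or missing when the
        # top probability is below the (assumed) threshold.
        best = max(range(3), key=lambda k: triple[k])
        gt = GENOTYPES[best] if triple[best] >= threshold else "./."
        samples.append(f"{gt}:{p_aa:.3f},{p_ab:.3f},{p_bb:.3f}:{dosage:.3f}")

    fixed = [chrom, pos, rsid, allele_a, allele_b, ".", ".", ".", "GT:GP:DS"]
    return "\t".join(fixed + samples)


if __name__ == "__main__":
    example = "--- rs1 1000 A G 0.9 0.1 0 0 0.2 0.8"
    print(gen_line_to_vcf(example, chrom="1"))
```

A complete converter would also need to write the VCF header (file format line, FORMAT definitions, and sample names, which typically come from a separate sample file) and to stream large inputs line by line rather than loading them into memory.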

Bibliographic details

  • Source
    Genes and genomics | 2020, Issue 10 | 6 pages
  • Author affiliations

    Natl Inst Hlth Ctr Genome Sci Div Genome Res Cheongju 28159 Chungcheongbug South Korea;

    Natl Inst Hlth Ctr Genome Sci Div Genome Res Cheongju 28159 Chungcheongbug South Korea;

    Natl Inst Hlth Ctr Genome Sci Div Genome Res Cheongju 28159 Chungcheongbug South Korea;

    Ton Duc Thang Univ Fac Informat Technol Data Sci Lab Ho Chi Minh City 700000 Vietnam;

    Natl Inst Hlth Ctr Genome Sci Div Genome Res Cheongju 28159 Chungcheongbug South Korea;

  • Indexing information
  • Original format: PDF
  • Language: English
  • Chinese Library Classification: Biological sciences
  • Keywords

    Human genome; Imputation; SNP; Converter; Parsing;
