首页> 美国卫生研究院文献>other >The CAIRR Pipeline for Submitting Standards-Compliant B and T Cell Receptor Repertoire Sequencing Studies to the National Center for Biotechnology Information Repositories
【2h】

The CAIRR Pipeline for Submitting Standards-Compliant B and T Cell Receptor Repertoire Sequencing Studies to the National Center for Biotechnology Information Repositories

机译:将符合标准的B和T细胞受体库测序研究提交国家生物技术信息库中心的CAIRR管道

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

The adaptation of high-throughput sequencing to the B cell receptor and T cell receptor has made it possible to characterize the adaptive immune receptor repertoire (AIRR) at unprecedented depth. These AIRR sequencing (AIRR-seq) studies offer tremendous potential to increase the understanding of adaptive immune responses in vaccinology, infectious disease, autoimmunity, and cancer. The increasingly wide application of AIRR-seq is leading to a critical mass of studies being deposited in the public domain, offering the possibility of novel scientific insights through secondary analyses and meta-analyses. However, effective sharing of these large-scale data remains a challenge. The AIRR community has proposed minimal information about adaptive immune receptor repertoire (MiAIRR), a standard for reporting AIRR-seq studies. The MiAIRR standard has been operationalized using the National Center for Biotechnology Information (NCBI) repositories. Submissions of AIRR-seq data to the NCBI repositories typically use a combination of web-based and flat-file templates and include only a minimal amount of terminology validation. As a result, AIRR-seq studies at the NCBI are often described using inconsistent terminologies, limiting scientists’ ability to access, find, interoperate, and reuse the data sets. In order to improve metadata quality and ease submission of AIRR-seq studies to the NCBI, we have leveraged the software framework developed by the Center for Expanded Data Annotation and Retrieval (CEDAR), which develops technologies involving the use of data standards and ontologies to improve metadata quality. The resulting CEDAR-AIRR (CAIRR) pipeline enables data submitters to: (i) create web-based templates whose entries are controlled by ontology terms, (ii) generate and validate metadata, and (iii) submit the ontology-linked metadata and sequence files (FASTQ) to the NCBI BioProject, BioSample, and Sequence Read Archive databases. Overall, CAIRR provides a web-based metadata submission interface that supports compliance with the MiAIRR standard. This pipeline is available at , and will facilitate the NCBI submission process and improve the metadata quality of AIRR-seq studies.
机译:高通量测序对B细胞受体和T细胞受体的适应性使得以前所未有的深度表征自适应免疫受体库(AIRR)成为可能。这些AIRR测序(AIRR-seq)研究提供了巨大的潜力,可增进人们对疫苗学,传染病,自身免疫和癌症的适应性免疫反应的了解。 AIRR-seq的日益广泛的应用正导致大量的研究被存入公共领域,通过二次分析和荟萃分析提供了新颖的科学见解的可能性。但是,有效共享这些大规模数据仍然是一个挑战。 AIRR社区已提出有关自适应免疫受体库(MiAIRR)的最少信息,MiAIRR是报告AIRR-seq研究的标准。 MiAIRR标准已使用国家生物技术信息中心(NCBI)信息库进行了操作。向NCBI存储库提交AIRR-seq数据通常使用基于Web的模板和平面文件模板的组合,并且仅包含最少量的术语验证。结果,NCBI的AIRR-seq研究经常使用不一致的术语来描述,从而限制了科学家访问,查找,互操作和重用数据集的能力。为了提高元数据质量并简化向NCBI提交AIRR-seq研究的工作,我们利用了扩展数据注释和检索中心(CEDAR)开发的软件框架,该中心开发了涉及使用数据标准和本体的技术,提高元数据质量。由此产生的CEDAR-AIRR(CAIRR)管道使数据提交者能够:(i)创建基于Web的模板,其条目受本体术语控制,(ii)生成和验证元数据,以及(iii)提交与本体链接的元数据和序列文件(FASTQ)保存到NCBI BioProject,BioSample和Sequence Read Archive数据库中。总体而言,CAIRR提供了基于Web的元数据提交界面,该界面支持符合MiAIRR标准。该管道可在上找到,这将有助于NCBI提交过程并提高AIRR-seq研究的元数据质量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号