首页> 外文期刊>Physical biology >BCR CDR3 length distributions differ between blood and spleen and between old and young patients, and TCR distributions can be used to detect myelodysplastic syndrome
【24h】

BCR CDR3 length distributions differ between blood and spleen and between old and young patients, and TCR distributions can be used to detect myelodysplastic syndrome

机译:BCR CDR3长度分布在血液和脾脏之间以及老年患者和年轻患者之间有所不同,TCR分布可用于检测骨髓增生异常综合症

获取原文
获取原文并翻译 | 示例
           

摘要

Complementarity-determining region 3 (CDR3) is the most hyper-variable region in B cell receptor (BCR) and T cell receptor (TCR) genes, and the most critical structure in antigen recognition and thereby in determining the fates of developing and responding lymphocytes. There are millions of different TCR V beta chain or BCR heavy chain CDR3 sequences in human blood. Even now, when high-throughput sequencing becomes widely used, CDR3 length distributions (also called spectratypes) are still a much quicker and cheaper method of assessing repertoire diversity. However, distribution complexity and the large amount of information per sample (e. g. 32 distributions of the TCR alpha chain, and 24 of TCR beta) calls for the use of machine learning tools for full exploration. We have examined the ability of supervised machine learning, which uses computational models to find hidden patterns in predefined biological groups, to analyze CDR3 length distributions from various sources, and distinguish between experimental groups. We found that (a) splenic BCR CDR3 length distributions are characterized by low standard deviations and few local maxima, compared to peripheral blood distributions; (b) healthy elderly people's BCR CDR3 length distributions can be distinguished from those of the young; and (c) a machine learning model based on TCR CDR3 distribution features can detect myelodysplastic syndrome with approximately 93% accuracy. Overall, we demonstrate that using supervised machine learning methods can contribute to our understanding of lymphocyte repertoire diversity.
机译:互补决定区3(CDR3)是B细胞受体(BCR)和T细胞受体(TCR)基因中变化最大的区域,并且是抗原识别中最关键的结构,从而决定了发育和应答淋巴细胞的命运。人血中有数百万种不同的TCR Vβ链或BCR重链CDR3序列。即使在现在,当高通量测序得到广泛使用时,CDR3长度分布(也称为光谱类型)仍然是一种更快速,更便宜的评估库多样性的方法。然而,分布复杂性和每个样本的大量信息(例如,TCRα链的32个分布,和TCRβ的24个分布)要求使用机器学习工具进行全面探索。我们已经检查了监督机器学习的能力,该能力使用计算模型在预定义的生物组中找到隐藏的模式,分析来自各种来源的CDR3长度分布,并区分实验组。我们发现(a)与外周血分布相比,脾脏BCR CDR3长度分布的特点是标准差低,局部最大值少。 (b)健康的老年人的BCR CDR3长度分布可与年轻人区分开; (c)基于TCR CDR3分布特征的机器学习模型可以以大约93%的准确性检测骨髓增生异常综合症。总体而言,我们证明使用监督的机器学习方法可以有助于我们对淋巴细胞库多样性的理解。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号