【24h】

Computational Molecular Biology of Genome Expression and Regulation

机译:基因组表达与调控的计算分子生物学

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Technological advances in experimental and computational molecular biology have revolutionized the whole fields of biology and medicine. Large-scale sequencing, expression and localization data have provided us with a great opportunity to study biology at the system level. I will introduce some outstanding problems in genome expression and regulation network in which better modern statistical and machine learning technologies are desperately needed. Recent revolution in genomics has transformed life science. For the first time in history, mankind has been able to sequence the entire human own genome. Bioinformatics, especially computational molecular biology, has played a vital role in extracting knowledge from vast amount of information generated by the high throughput genomics technologies. Today, I am very happy to deliver this key lecture at the First International Conference on Pattern Recognition and Machine Intelligence at the world renowned Indian Statistical Institute (ISI) where such luminaries as Mahalanobis, Bose, Rao and others had worked before. And it is very timely that genomics has attracted new generation of talented young statisticians, reminding us the fact that statistics was essentially conceived from and continuously nurtured by biological problems. Pattern/rule recognition is at the heart of all learning process and hence of all disciplines of sciences, and comparison is the fundamental method: it is the similarities that allow inferring common rules; and it is the differences that allow deriving new rules. Gene expression, normally referring to the cellular processes that lead to protein production, is controlled and regulated at multiple levels. Cells use this elaborate system of "circuits" and "switches" to decide when, where and by how much each gene should be turned on (activated, expressed) or off (repressed, silenced) in response to environmental clues. Genome expression and regulation refer to coordinated expression and regulation of many genes at large-scales for which advanced computational methods become indispensable. Due to space limitations, I can only highlight some of the pattern recognition problems in transcriptional regulation, which is the most important and best studied. Currently, there are two general outstanding problems in transcriptional regulation studies: (1) How to find the regulatory regions, in particular, the promoters regions in the genome (throughout most of this lecture, we use promoter to refer to proximal promoters, e.g. ~1kb DNA at the beginning of each gene); (2) How to identify functional cis-regulatory DNA elements within each such region.
机译:实验和计算分子生物学的技术进步彻底改变了生物学和医学的整个领域。大规模测序,表达和定位数据为我们提供了一个在系统级别研究生物学的绝好机会。我将介绍基因组表达和调控网络中的一些突出问题,其中迫切需要更好的现代统计和机器学习技术。基因组学的最新革命已经改变了生命科学。人类有史以来第一次能够对整个人类基因组进行测序。生物信息学,尤其是计算分子生物学,在从高通量基因组学技术产生的大量信息中提取知识方面发挥了至关重要的作用。今天,我非常高兴在世界著名的印度统计研究所(ISI)的第一届模式识别和机器智能国际会议上发表这一重要演讲,此前,诸如Mahalanobis,Bose,Rao等人曾在此工作。基因组学已经吸引了新一代有才华的年轻统计学家,这非常及时,这使我们想起了这样一个事实,即统计学本质上是由生物学问题构想的,并不断受到其困扰。模式/规则识别是所有学习过程的核心,因此也是所有科学学科的核心,而比较是最基本的方法:相似之处可以推论出共同的规则;正是这些差异允许推导新规则。基因表达,通常是指导致蛋白质产生的细胞过程,在多个水平上受到控制和调节。细胞使用这种复杂的“电路”和“开关”系统来决定应根据环境线索在何时,何处以及以多少打开(激活,表达)或关闭(抑制,沉默)每个基因。基因组表达和调控是指许多基因的大规模协调表达和调控,因此高级计算方法变得不可或缺。由于篇幅所限,我只能重点介绍转录调控中的一些模式识别问题,这是最重要和研究最好的。当前,在转录调控研究中存在两个普遍存在的突出问题:(1)如何找到调控区,特别是基因组中的启动子区域(在本讲座的大部分内容中,我们使用启动子指代近端启动子,例如〜每个基因开头的1kb DNA); (2)如何在每个这样的区域内识别功能性顺式调控DNA元件。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号