Multi-Factored Gene-Gene Proximity Measures Exploiting Biological Knowledge Extracted from Gene Ontology: Application in Gene Clustering

首页> 外文期刊>IEEE/ACM transactions on computational biology and bioinformatics >Multi-Factored Gene-Gene Proximity Measures Exploiting Biological Knowledge Extracted from Gene Ontology: Application in Gene Clustering

【24h】

Multi-Factored Gene-Gene Proximity Measures Exploiting Biological Knowledge Extracted from Gene Ontology: Application in Gene Clustering

机译：利用从基因本体论中提取生物学知识的多因素基因-基因邻近度测量方法：在基因聚类中的应用

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

To describe the cellular functions of proteins and genes, a potential dynamic vocabulary is Gene Ontology (GO), which comprises of three sub-ontologies namely, Biological-process, Cellular-component, and Molecular-function. It has several applications in the field of bioinformatics like annotating/measuring gene-gene or protein-protein semantic similarity, identifying genes/proteins by their GO annotations for disease gene and target discovery, etc. To determine semantic similarity between genes, several semantic measures have been proposed in literature, which involve information content of GO-terms, GO tree structure, or the combination of both. But, most of the existing semantic similarity measures do not consider different topological and information theoretic aspects of GO-terms collectively. Inspired by this fact, in this article, we have first proposed three novel semantic similarity/distance measures for genes covering different aspects of GO-tree. These are further implanted in the frameworks of well-known multi-objective and single-objective based clustering algorithms to determine functionally similar genes. For comparative analysis, 10 popular existing GO based semantic similarity/distance measures and tools are also considered. Experimental results on Mouse genome, Yeast, and Human genome datasets evidently demonstrate the supremacy of multi-objective clustering algorithms in association with proposed multi-factored similarity/distance measures. Clustering outcomes are further validated by conducting some biological/statistical significance tests. Supplementary information is available at https://www.iitp.ac.in/sriparna/journals.html.

机译：为了描述蛋白质和基因的细胞功能，潜在的动态词汇是基因本体论（GO），它由三个亚本体论组成，即生物过程，细胞成分和分子功能。它在生物信息学领域有多种应用，例如注释/测量基因-基因或蛋白质-蛋白质的语义相似性，通过它们对疾病基因的GO注释和目标发现来识别基因/蛋白质等。要确定基因之间的语义相似性，需要采取几种语义措施在文献中已经提出了涉及GO术语，GO树结构或两者的组合的信息内容。但是，大多数现有的语义相似性度量并没有共同考虑GO术语的不同拓扑和信息理论方面。受这一事实的启发，在本文中，我们首先针对覆盖GO树不同方面的基因提出了三种新颖的语义相似度/距离度量。这些被进一步植入众所周知的基于多目标和单目标的聚类算法的框架中，以确定功能相似的基因。为了进行比较分析，还考虑了10种流行的现有基于GO的语义相似度/距离度量和工具。在小鼠基因组，酵母和人类基因组数据集上的实验结果显然证明了多目标聚类算法与拟议的多因素相似性/距离测量方法相辅相成的优势。通过进行一些生物学/统计显着性检验，进一步验证了聚类结果。有关补充信息，请访问https://www.iitp.ac.in/sriparna/journals.html。

著录项

来源
《IEEE/ACM transactions on computational biology and bioinformatics》 |2020年第1期|207-219|共13页
作者

展开▼
作者单位

Indian Inst Technol Patna Dept Comp Sci & Engn Patna 801103 Bihar India;

Sikkim Manipal Inst Technol Dept Comp Applicat Rangpo 737132 Sikkim India;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Semantics; Integrated circuits; Bioinformatics; Ontologies; Tools; Genomics; Current measurement; Gene ontology (GO); gene clustering; semantic similarity; distance measure; gene-gene similarity matrix; multi-objective clustering;

机译：语义学集成电路;生物信息学本体;工具;基因组学电流测量基因本体论（GO）;基因聚类;语义相似度;距离测量基因-基因相似度矩阵多目标聚类;

相似文献

外文文献
中文文献
专利

1. Novel symmetry-based gene-gene dissimilarity measures utilizing Gene Ontology: Application in gene clustering [J] . Sudipta Acharya, Sriparna Saha, Prasanna Pradhan Gene: An International Journal Focusing on Gene Cloning and Gene Structure and Function . 2018,第期

机译：利用基因本体学的基于新的基于对称的基因基因异化测量：在基因聚类中的应用
2. Development and application of an interaction network ontology for literature mining of vaccine-associated gene-gene interactions [J] . Junguk Hur, Arzucan ?zgür, Zuoshuang Xiang, Journal of Biomedical Semantics . 2015,第S1期

机译：疫苗相关基因-基因相互作用文献挖掘的相互作用网络本体的开发与应用
3. Principal interactions analysis for repeated measures data: Application to gene-gene and gene-environment interactions [J] . MukherjeeB., KoY.-A., VanderweeleT., Statistics in medicine . 2012,第22期

机译：重复测量数据的主要相互作用分析：在基因-基因和基因-环境相互作用中的应用
4. Integration of Mutual Information and Text Mining Methods for Extracting Gene-Gene Interactions from Gene Expression Data [C] . David H. Millis, Jeffrey L. Solka, Lakshmi K. Matukumalli IEEE International Conference on Bioinformatics and Biomedicine Workshop . 2009

机译：用于从基因表达数据中提取基因基因相互作用的互信息和文本挖掘方法的整合
5. Search procedure for identifying gene-gene interaction based on entropy measures. [D] . Milanov, Valentin B. 2005

机译：基于熵测度识别基因-基因相互作用的搜索程序。
6. Unsupervised gene selection using biological knowledge : application in sample clustering [O] . Sudipta Acharya, Sriparna Saha, N. Nikhil 2017

机译：利用生物学知识进行无监督基因选择：在样品聚类中的应用
7. Unsupervised gene selection using biological knowledge : application in sample clustering [O] . Sudipta Acharya, Sriparna Saha, N. Nikhil 2017

机译：使用生物学知识的无监督基因选择：在样品聚类中的应用

Multi-Factored Gene-Gene Proximity Measures Exploiting Biological Knowledge Extracted from Gene Ontology: Application in Gene Clustering

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅