首页> 外文学位 >Studies on information-theoretics based data-sequence pattern-discriminant algorithms: Applications in bioinformatic data mining.

【24h】

Studies on information-theoretics based data-sequence pattern-discriminant algorithms: Applications in bioinformatic data mining.

机译：基于信息理论的数据序列模式判别算法研究：在生物信息数据挖掘中的应用。

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

This research refers to studies on information-theoretic (IT) aspects of data-sequence patterns and developing thereof discriminant algorithms that enable distinguishing the features of underlying sequence patterns having characteristic, inherent stochastical attributes. The application potentials of such algorithms include bioinformatic data mining efforts.; Consistent with the scope of the study as above, considered in this research are specific details on information-theoretics and entropy considerations vis-á-vis sequence patterns (having stochastical attributes) such as DNA sequences of molecular biology. Applying information-theoretic concepts (essentially in Shannon's sense), the following distinct sets of metrics are developed and applied in the algorithms developed for data-sequence pattern-discrimination applications: (i) Divergence or cross-entropy algorithms of Kullback-Leibler type and of general Czizár class; (ii) statistical distance measures; (iii) ratio-metrics; (iv) Fisher type linear-discriminant measure and (v) complexity metric based on information redundancy.; These measures are judiciously adopted in ascertaining codon-noncodon delineations in DNA sequences that consist of crisp and/or fuzzy nucleotide domains across their chains. The Fisher measure is also used in codon-noncodon delineation and in motif detection. Relevant algorithms are used to test DNA sequences of human and some bacterial organisms. The relative efficacy of the metrics and the algorithms is determined and discussed. The potentials of such algorithms in supplementing the prevailing methods are indicated. Scope for future studies is identified in terms of persisting open questions.

机译：这项研究涉及对数据序列模式的信息理论（IT）方面的研究，并开发可区分具有特定的，固有的随机属性的基础序列模式的特征的判别算法。这种算法的应用潜力包括生物信息数据挖掘工作。与上述研究范围相一致，本研究中考虑的是信息理论和熵考虑的具体细节，例如分子生物学的DNA序列，vis-á-vis序列模式（具有随机属性）。应用信息理论概念（本质上是Shannon的意思），开发了以下不同的指标集并将其应用于为数据序列模式区分应用开发的算法中：（i）Kullback-Leibler类型的散度或交叉熵算法和一般的齐齐尔阶级；（ii）统计距离度量；（iii）比率指标；（iv）Fisher类型的线性判别度量和（v）基于信息冗余的复杂性度量;在确定DNA序列中由跨链的脆性和/或模糊核苷酸结构域组成的密码子-非密码子描述时，应明智地采用这些措施。 Fisher度量也用于密码子-非密码子的描绘和图案检测。相关算法用于测试人类和某些细菌有机体的DNA序列。确定和讨论度量标准和算法的相对功效。指出了这种算法在补充主流方法方面的潜力。未来研究的范围是根据持续存在的开放性问题确定的。

著录项

作者
Arredondo, Tomas Vidal.;
展开▼
作者单位

Florida Atlantic University.;

展开▼
授予单位 Florida Atlantic University.;
学科 Engineering Biomedical.; Engineering Electronics and Electrical.
学位 Ph.D.
年度 2003
页码 376 p.
总页数 376
原文格式 PDF
正文语种 eng
中图分类生物医学工程;无线电电子学、电信技术;
关键词
入库时间 2022-08-17 11:45:40

相似文献

外文文献
中文文献
专利

1. A comparative study of machine learning algorithms applied to predictive toxicology data mining. [J] . Neagu DC, Guo G, Trundle PR, Alternatives to laboratory animals: ATLA . 2007,第1期

机译：机器学习算法在预测毒理学数据挖掘中的比较研究。
2. Bioinformatics Identified 17 Immune Genes as Prognostic Biomarkers for Breast Cancer: Application Study Based on Artificial Intelligence Algorithms [J] . Zhiqiao Zhang, Jing Li, Tingshan He, Frontiers in Oncology . 2020,第4期

机译：生物信息学确定了17个免疫基因，作为乳腺癌的预后生物标志物：基于人工智能算法的应用研究
3. Data Compression Concepts and Algorithms and Their Applications to Bioinformatics [J] . #xD6, zkan U. Nalbantog#x303, lu, Entropy . 2009,第1期

机译：数据压缩的概念和算法及其在生物信息学中的应用
4. Research on the Application of Pattern Selection Algorithm in Bioinformatic Data Bases on Mutual Information [C] . Li Xin, Hong Wenxue, Zhao Chun 2010 First International Conference on Pervasive Computing Signal Processing and Applications . 2010

机译：模式选择算法在互信息的生物信息数据库中的应用研究
5. Applications of genetic algorithms in data mining. [D] . Ciocoiu, Malina Mihaela. 2001

机译：遗传算法在数据挖掘中的应用。
6. Bioinformatics Identified 17 Immune Genes as Prognostic Biomarkers for Breast Cancer: Application Study Based on Artificial Intelligence Algorithms [O] . Zhiqiao Zhang, Jing Li, Tingshan He, 2020

机译：生物信息学鉴定了17种免疫基因作为乳腺癌的预后生物标志物：基于人工智能算法的应用研究
7. The importance of data quality and traceability in data mining. Applications of robust methods for multivariate data analysis. A case-study conducting the herring industry [O] . Frosch Stina 2006

机译：数据质量和可追溯性在数据挖掘中的重要性。稳健方法在多变量数据分析中的应用。进行鲱鱼产业的个案研究

Studies on information-theoretics based data-sequence pattern-discriminant algorithms: Applications in bioinformatic data mining.

摘要

著录项

相似文献

相关主题

期刊订阅