GENOMIN: A SOFTWARE FRAMEWORK FOR READING GENOMIC SIGNALS

PAUL GAGNIUC; DANUT CIMPONERIU; CONSTANTIN IONESCU-TIRGOVISTE; CRISTIAN GUJA; POMPILIA APOSTOL; MONICA STAVARACHI; LUCIAN GAVRILA

首页> 外文期刊>Proceedings of the Romanian Academy, Series B. Chemistry, life sciences and geosciences >GENOMIN: A SOFTWARE FRAMEWORK FOR READING GENOMIC SIGNALS

【24h】

GENOMIN: A SOFTWARE FRAMEWORK FOR READING GENOMIC SIGNALS

机译：Gennomin：用于阅读基因信号的软件框架

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Data mining produces models that capture and represent hidden patterns in the DNA structure. Any attempt to develop and test new algorithms for data mining in the field of bioinformatics, must begin with an optimal method by which even the huge FASTA files can be read step by step. The aim of the GENOMIN software is to provide an open source software platform which can work with large files like a whole chromosome or genome sequence. We have created an open source template software, named GENOMIN, for analyzing genetic data of sequences of different sizes downloaded from NCBI servers. Large NCBI FASTA files which store sequences of individual chromosomes come from other processing systems like UNIX. Processing these files on other operating systems is difficult due to different markers which indicate the end of each line. The GENOMIN software, reads the FASTA files by continuous buffer reading, without taking into account the end of line markers. The result of this type of reading is a brute, noisy free DNA sequence of the entire file regardless of its size. We presented three examples to demonstrate how the program can be used in biology: the estimation of GC content, identification of repetitive elements and search for sequences with different biological functions (e.g. duplicated regions or potential binding sites for transcription factors). Development of this open source software is limited only by the researcher programming skills. The results of our tests have been shown that GENOMIN can perform various tests on large sequences files and can work with different algorithms used in biology.

机译：数据挖掘产生的模型可以捕获并表示DNA结构中的隐藏模式。在生物信息学领域中开发和测试用于数据挖掘的新算法的任何尝试都必须以一种最佳方法开始，通过该方法，即使是巨大的FASTA文件也可以逐步读取。 GENOMIN软件的目的是提供一个开放源代码软件平台，该平台可以处理诸如整个染色体或基因组序列之类的大文件。我们创建了一个名为GENOMIN的开源模板软件，用于分析从NCBI服务器下载的不同大小序列的遗传数据。存储单个染色体序列的大型NCBI FASTA文件来自UNIX等其他处理系统。由于不同的标记指示每行的结尾，因此很难在其他操作系统上处理这些文件。 GENOMIN软件通过连续读取缓冲区来读取FASTA文件，而无需考虑行尾标记。这种读取的结果是整个文件的粗暴，嘈杂的自由DNA序列，而不管其大小如何。我们提供了三个示例来说明该程序如何在生物学中使用：GC含量的估算，重复元素的识别以及搜索具有不同生物学功能（例如重复区域或转录因子的潜在结合位点）的序列。此开源软件的开发仅受研究人员编程技能的限制。我们的测试结果表明，GENOMIN可以对大型序列文件执行各种测试，并且可以与生物学中使用的不同算法一起使用。

著录项

来源
《Proceedings of the Romanian Academy, Series B. Chemistry, life sciences and geosciences》 |2011年第1期|共10页
作者
PAUL GAGNIUC; DANUT CIMPONERIU; CONSTANTIN IONESCU-TIRGOVISTE; CRISTIAN GUJA; POMPILIA APOSTOL; MONICA STAVARACHI; LUCIAN GAVRILA;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类化学;
关键词
Genomin; open source; data mining; nucleotide sequence; CpG;

机译：Genomin;开源;数据挖掘;核苷酸序列;CpG;

相似文献

外文文献
中文文献
专利

1. GENOMIN: A SOFTWARE FRAMEWORK FOR READING GENOMIC SIGNALS [J] . PAUL GAGNIUC, DANUT CIMPONERIU, CONSTANTIN IONESCU-TIRGOVISTE, Proceedings of the Romanian Academy, Series B. Chemistry, life sciences and geosciences . 2011,第1期

机译：Gennomin：用于阅读基因信号的软件框架
2. PHASE AND FRACTAL ANALYSIS OF DNA AND RE-ORIENTED READING FRAME GENOMIC SIGNALS [J] . PAUL DAN CRISTEA Serie Electrotechnique et Energetique . 2003,第2a3期

机译：DNA和重新定向的阅读框架基因信号的相和分形分析
3. The Signals of Opportunity Coherent Bistatic Scattering Simulator: A Free Open Source Framework [Software and Data Sets] [J] . Eroglu Orhan, Boyd Dylan R., Kurum Mehmet Geoscience and Remote Sensing . 2020,第3期

机译：机会相干双体散射模拟器的信号：自由开源框架[软件和数据集]
4. PYPOP: A SOFTWARE FRAMEWORK FOR POPULATION GENOMICS: ANALYZING LARGE-SCALE MULTI-LOCUS GENOTYPE DATA [C] . ALEX LANCASTER, MARK P. NELSON, DIOGO MEYER, Eighth Pacific Symposium on Biocomputing (PSB), Jan 3-7, 2003, Kauai, Hawaii . 2003

机译：PYPOP：人口基因组学的软件框架：分析大型多地点基因型数据
5. A Scalable Software Framework for Solving PDES on Distributed Octree Meshes Using Nite Element MethodsA scalable software framework for solving pdes on distributed octree meshes using finite element methods [D] . Lofquist, Alec Dale. 2018

机译：使用NITE元素MetableA可扩展软件框架在分布式Octree网格上求解PDE的可扩展软件框架，用于使用有限元方法在分布式Octree网格上求解PDES
6. PyPop: A Software Framework for Population Genomics: Analyzing Large-Scale Multi-Locus Genotype Data [O] . Alex Lancaster, Mark P. Nelson, Diogo Meyer, -1

机译：PyPop：人口基因组学的软件框架：分析大规模多基因座基因型数据
7. AthenaMT: Upgrading the ATLAS Software Framework for the Many-Core World with Multi-Threading [O] . Leggett, Charles, Baines, John, Bold, Tomasz, 2016

机译：AthenaMT：使用多线程升级ATLAS软件框架以应对多核世界

GENOMIN: A SOFTWARE FRAMEWORK FOR READING GENOMIC SIGNALS

摘要

著录项

相似文献

相关主题

期刊订阅