Indexing genomic sequence libraries

OKane KC; Lockner MJ

首页> 外文期刊>Information Processing & Management >Indexing genomic sequence libraries

【24h】

Indexing genomic sequence libraries

机译：索引基因组序列文库

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper describes an extensible, open-source (GPL) data repository and retrieval system that supports fast, efficient, keyword based retrieval of genomic sequences from 1 multiple libraries with retrieved sequences post-processed by FASTA, Smith-Waterman and other analysis software. This application is implemented for Linux and is written in Mumps, C, and C++ with supporting components that include the Berkeley Data Base, the Perl Compatible Regular Expression Library, GLADE, and tools such as FASTA, Smith-Waterman, and modules from EMBOSS. The package described here can quickly index data sets of up to 256 terabytes using a B-tree based multi-dimensional data model. An example is presented that indexes the text of the full NCBI Genbank library. (C) 2003 Elsevier Ltd. All rights reserved.

机译：本文介绍了一种可扩展的开源（GPL）数据存储和检索系统，该系统支持从1个多个库中快速，高效，基于关键词的基因组序列检索，并使用FASTA，Smith-Waterman和其他分析软件对检索到的序列进行后处理。该应用程序是为Linux实现的，用Mumps，C和C ++编写，具有包括伯克利数据库，Perl兼容正则表达式库，GLADE在内的支持组件，以及FASTA，Smith-Waterman之类的工具以及EMBOSS的模块。使用基于B树的多维数据模型，此处描述的包可以快速索引多达256 TB的数据集。给出了一个示例，该示例为整个NCBI Genbank库的文本建立索引。（C）2003 Elsevier Ltd.保留所有权利。

著录项

来源
《Information Processing & Management》 |2005年第2期|p. 265-274|共10页
作者
OKane KC; Lockner MJ;
展开▼
作者单位

Univ No Iowa, Dept Comp Sci, Cedar Falls, IA 50613 USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类图书馆学、图书馆事业;
关键词
bioinformatics; sequence retrieval; genomics; information retrieval; mumps; TERM DISCRIMINATION VALUES; INFORMATION-RETRIEVAL; DOCUMENT-RETRIEVAL; EXPERT SYSTEM; ALGORITHM; DATABASE; PACKAGE; BLAST;

机译：生物信息学;序列检索;基因组学;信息检索;腮腺炎;术语鉴别值;信息检索;文件检索;专家系统;算法;数据库;包装;爆炸;

相似文献

外文文献
中文文献
专利

1. Kmerind: A Flexible Parallel Library for K-mer Indexing of Biological Sequences on Distributed Memory Systems [J] . Tony Pan, Patrick Flick, Chirag Jain, IEEE/ACM transactions on computational biology and bioinformatics . 2019,第4期

机译：Kmerind：灵活的并行库，用于分布式存储系统上生物序列的K-mer索引
2. Development of genomic simple sequence repeat markers from an enriched genomic library of grass pea (Lathyrus sativus L.) [J] . Lioi Lucia, Galasso Incoronata Plant Breeding . 2013,第6期

机译：从豌豆（Lathyrus sativus L.）丰富的基因组文库中开发基因组简单序列重复标记
3. BonnMu: A Sequence-Indexed Resource of Transposon-Induced Maize Mutations for Functional Genomics Studies [J] . Marcon Caroline, Altrogge Lena, Win Yan Naing, Plant physiology . 2020,第2期

机译：BONNMU：用于功能基因组学研究的转座诱导的玉米突变的序列分度资源
4. Indexing Genomic Sequences on the IBM Blue Gene [C] . Amol Ghoting, rnKonstantin Makarychev International conference on high performance computing, networking, storage and analysis 2009 . 2009

机译：在IBM Blue基因上索引基因组序列
5. Identifying functional lox sequences: A genomic search and randomized libraries. [D] . Sheren, Jamie Elizabeth. 2007

机译：识别功能性lox序列：基因组搜索和随机文库。
6. A Sequence-Indexed Mutator Insertional Library for Maize Functional Genomics Study [O] . Lei Liang, Ling Zhou, Yuanping Tang, 2019

机译：玉米功能基因组学研究的序列索引突变体插入文库
7. A Human Genomic Library Enriched in Transcriptionally Active Sequences (aDNA Library) [O] . Pelling, Anna L., Thorne, Alan W., Crane-Robinson, Colyn 2000

机译：富含转录活性序列的人类基因组文库（aDNA文库）

Indexing genomic sequence libraries

摘要

著录项

相似文献

相关主题

期刊订阅