首页> 外文期刊>BioData Mining >Rapid development of entity-based data models for bioinformatics with persistence object-oriented design and structured interfaces
【24h】

Rapid development of entity-based data models for bioinformatics with persistence object-oriented design and structured interfaces

机译:具有持久性面向对象设计和结构化接口的生物信息学基于实体的数据模型的快速开发

获取原文
       

摘要

Databases are imperative for research in bioinformatics and computational biology. Current challenges in database design include data heterogeneity and context-dependent interconnections between data entities. These challenges drove the development of unified data interfaces and specialized databases. The curation of specialized databases is an ever-growing challenge due to the introduction of new data sources and the emergence of new relational connections between established datasets. Here, an open-source framework for the curation of specialized databases is proposed. The framework supports user-designed models of data encapsulation, objects persistency and structured interfaces to local and external data sources such as MalaCards, Biomodels and the National Centre for Biotechnology Information (NCBI) databases. The proposed framework was implemented using Java as the development environment, EclipseLink as the data persistency agent and Apache Derby as the database manager. Syntactic analysis was based on J3D, jsoup, Apache Commons and w3c.dom open libraries. Finally, a construction of a specialized database for aneurysms associated vascular diseases is demonstrated. This database contains 3-dimensional geometries of aneurysms, patient’s clinical information, articles, biological models, related diseases and our recently published model of aneurysms’ risk of rapture. Framework is available in: http:/bel-lab.com .
机译:数据库对于生物信息学和计算生物学的研究势在必行。数据库设计中的当前挑战包括数据异构性和数据实体之间的上下文相关互连。这些挑战推动了统一数据接口和专用数据库的开发。由于引入了新的数据源以及已建立的数据集之间新的关系连接的出现,专业数据库的管理是一个日益严峻的挑战。在这里,提出了一个用于管理专业数据库的开源框架。该框架支持用户设计的数据封装模型,对象持久性以及与本地和外部数据源(例如MalaCard,生物模型和国家生物技术信息中心(NCBI)数据库)的结构化接口。所提出的框架是使用Java作为开发环境,使用EclipseLink作为数据持久性代理以及使用Apache Derby作为数据库管理器来实现的。语法分析基于J3D,jsoup,Apache Commons和w3c.dom开放库。最后,证明了与动脉瘤相关的血管疾病的专门数据库的构建。该数据库包含动脉瘤的3维几何形状,患者的临床信息,文章,生物学模型,相关疾病以及我们最近发布的动脉瘤被提风险模型。可以从以下网站获得该框架:http:/bel-lab.com。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号