Rapid development of entity-based data models for bioinformatics with persistence object-oriented design and structured interfaces

Research output: Contribution to journalArticlepeer-review

Abstract

Databases are imperative for research in bioinformatics and computational biology. Current challenges in database design include data heterogeneity and context-dependent interconnections between data entities. These challenges drove the development of unified data interfaces and specialized databases. The curation of specialized databases is an ever-growing challenge due to the introduction of new data sources and the emergence of new relational connections between established datasets. Here, an open-source framework for the curation of specialized databases is proposed. The framework supports user-designed models of data encapsulation, objects persistency and structured interfaces to local and external data sources such as MalaCards, Biomodels and the National Centre for Biotechnology Information (NCBI) databases. The proposed framework was implemented using Java as the development environment, EclipseLink as the data persistency agent and Apache Derby as the database manager. Syntactic analysis was based on J3D, jsoup, Apache Commons and w3c.dom open libraries. Finally, a construction of a specialized database for aneurysms associated vascular diseases is demonstrated. This database contains 3-dimensional geometries of aneurysms, patient's clinical information, articles, biological models, related diseases and our recently published model of aneurysms' risk of rapture. Framework is available in: http://nbel-lab.com.

Original languageEnglish
Article number11
JournalBioData Mining
Volume10
Issue number1
DOIs
StatePublished - 11 Mar 2017
Externally publishedYes

Bibliographical note

Funding Information:
This work was supported by a JCT research grant.

Publisher Copyright:
© 2017 The Author(s).

Keywords

  • Apache Derby
  • EclipseLink
  • Object oriented programming
  • Object-relational databases
  • Specialized databases

Fingerprint

Dive into the research topics of 'Rapid development of entity-based data models for bioinformatics with persistence object-oriented design and structured interfaces'. Together they form a unique fingerprint.

Cite this