Abstract
The Genome Data Management domain is particularly complex in terms of volume, heterogeneity and dispersion, therefore Information System Engineering (ISE) techniques are strictly required. We work with the Valencian Institute of Agrarian Research (IVIA) to improve its genomic analysis processes. To address this challenge we present in this paper our Model-driven Development (MDD), conceptual modeling-based, experience. The selection of the most appropriate technology is an additional relevant aspect. NoSQL-based solutions where the technology that better fit the needs of the studied domain − the IVIA Research Centre using its Information System in a real-world industrial environment− and therefore used. The contributions of the paper are twofold: to show how ISE in practice provides a better solution using conceptual models as the basic software artefacts, and to reinforce the idea that the adequate technology selection must be the result of a practical ISE-based exercise.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
References
Aguilera, D., Gómez, C., Olivé, A.: Enforcement of conceptual schema quality issues in current integrated development environments. In: Salinesi, C., Norrie, M.C., Pastor, Ó. (eds.) CAiSE 2013. LNCS, vol. 7908, pp. 626–640. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-38709-8_40
Ashburner, M., et al.: Gene ontology: tool for the unification of biology, May 2000. https://doi.org/10.1038/75556
Balaur, I., et al.: EpiGeNet: a graph database of interdependencies between genetic and epigenetic events in colorectal cancer. J. Comput. Biol. 24(10), 969–980 (2017). https://doi.org/10.1089/cmb.2016.0095
Batini, C., Scannapieco, M.: Data and Information Quality. DSA. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-24106-7
Cingolani, P., et al.: A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly 6(2), 80–92 (2012)
Collins, F.S., Varmus, H.: A new initiative on precision medicine. N. Engl. J. Med. 372(9), 793–795 (2015). https://doi.org/10.1056/NEJMp1500523
Danecek, P., et al.: The variant call format and VCFtools. Bioinformatics 27(15), 2156–2158 (2011). https://doi.org/10.1093/bioinformatics/btr330
García Simón, A.: Gestión de datos genómicos basada en Modelos Conceptuales (2018). https://hdl.handle.net/10251/111666
Garwood, K., et al.: Model-driven user interfaces for bioinformatics data resources: regenerating the wheel as an alternative to reinventing it. BMC Bioinf. 7(1), 532 (2006). https://doi.org/10.1186/1471-2105-7-532
Mcphee, N.F., Donatucci, D., Helmuth, T.: Using Graph Databases to Explore the Dynamics of Genetic Programming Runs, pp. 185–201 (2016). https://doi.org/10.1007/978-3-319-34223-8_11
NHGRI: DNA Sequencing Costs: Data - National Human Genome Research Institute (NHGRI) (2015). https://www.genome.gov/sequencingcostsdata/
Olive, A.: Conceptual Modeling of Information Systems, 1st edn. Springer, Heidelberg (2007)
Paine, J.A., et al.: Improving the nutritional value of Golden Rice through increased pro-vitamin a content. Nat. Biotechnol. 23(4), 482–487 (2005). https://doi.org/10.1038/nbt1082
Palacio, A.L., López, Ó.P., Ródenas, J.C.C.: A method to identify relevant genome data: conceptual modeling for the medicine of precision. In: Trujillo, J.C., et al. (eds.) ER 2018. LNCS, vol. 11157, pp. 597–609. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-00847-5_44
Pareja-Tobes, P., Pareja-Tobes, E., Manrique, M., Pareja, E., Tobes, R.: Bio4j: an open source biological data integration platform. In: Rojas, I., Guzman, F.M.O. (eds.) IWBBIO. p. 281. Copicentro Editorial (2013). http://iwbbio.ugr.es/papers/iwbbio_051.pdf
Pastor, O.: Conceptual modeling of life: beyond the homo sapiens. In: Comyn-Wattiau, I., Tanaka, K., Song, I.-Y., Yamamoto, S., Saeki, M. (eds.) ER 2016. LNCS, vol. 9974, pp. 18–31. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46397-1_2
Paton, N.W., et al.: Conceptual modelling of genomic information. Bioinformatics 16(6), 548–557 (2000). https://doi.org/10.1093/bioinformatics/16.6.548
Pérez, J.A., Poon, C.C.Y., Merrifield, R.D., Wong, S.T.C., Yang, G.: Big data for health. IEEE J. Biomed. Health Inform. 19(4), 1193–1208 (2015). https://doi.org/10.1109/JBHI.2015.2450362
Ram, S., Wei, W.: Modeling the semantics of 3D protein structures. In: Atzeni, P., Chu, W., Lu, H., Zhou, S., Ling, T.-W. (eds.) ER 2004. LNCS, vol. 3288, pp. 696–708. Springer, Heidelberg (2004). https://doi.org/10.1007/978-3-540-30464-7_52
Reyes Román, J.F.: Diseño y Desarrollo de un Sistema de Información Genómica Basado en un Modelo Conceptual Holístico del Genoma Humano. Ph.D. thesis, Universitat Politècnica de València (2018). https://riunet.upv.es/handle/10251/99565
Swainston, N., et al.: Recon 2.2: from reconstruction to modelof human metabolism. Metabolomics 12(7), 109 (2016). https://doi.org/10.1007/s11306-016-1051-4
Wu, G.A., et al.: Genomics of the origin and evolution of Citrus. Nature 554(7692), 311–316 (2018). https://doi.org/10.1038/nature25447
Acknowledgments
This work was supported by the Spanish Ministry of Science and Innovation through Project DataME (ref: TIN2016-80811-P) and the Generalitat Valenciana through project GISPRO (PROMETEO/2018/176). The authors would like to thank members of the PROS Research Centre Genome group, the IVIA Research group, especially Manuel Talón and Javier Terol for the fruitful discussions and their valuable assistance regarding the application of CM in the genomics domain and José Marín Navarro for his advice and help.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
García S., A., Reyes Román, J.F., Casamayor, J.C., Pastor, O. (2019). Towards an Effective and Efficient Management of Genome Data: An Information Systems Engineering Perspective. In: Cappiello, C., Ruiz, M. (eds) Information Systems Engineering in Responsible Information Systems. CAiSE 2019. Lecture Notes in Business Information Processing, vol 350. Springer, Cham. https://doi.org/10.1007/978-3-030-21297-1_9
Download citation
DOI: https://doi.org/10.1007/978-3-030-21297-1_9
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-21296-4
Online ISBN: 978-3-030-21297-1
eBook Packages: Computer ScienceComputer Science (R0)