Towards an Effective and Efficient Management of Genome Data: An Information Systems Engineering Perspective

  • Conference paper
Information Systems Engineering in Responsible Information Systems (CAiSE 2019)

Abstract

Genome data management is a particularly complex domain in terms of data volume, heterogeneity and dispersion, so Information Systems Engineering (ISE) techniques are strictly required. We work with the Valencian Institute of Agrarian Research (IVIA) to improve its genomic analysis processes. To address this challenge, this paper presents our experience with Model-driven Development (MDD) based on conceptual modeling. Selecting the most appropriate technology is an additional relevant aspect. NoSQL-based solutions were the technology that best fit the needs of the studied domain (the IVIA Research Centre, which uses its Information System in a real-world industrial environment) and were therefore adopted. The contributions of the paper are twofold: to show how ISE in practice provides a better solution by using conceptual models as the basic software artefacts, and to reinforce the idea that adequate technology selection must be the result of a practical ISE-based exercise.

Notes

  1. http://www.geneontology.org/

Acknowledgments

This work was supported by the Spanish Ministry of Science and Innovation through Project DataME (ref: TIN2016-80811-P) and the Generalitat Valenciana through project GISPRO (PROMETEO/2018/176). The authors would like to thank the members of the PROS Research Centre Genome group and the IVIA Research group, especially Manuel Talón and Javier Terol, for the fruitful discussions and their valuable assistance regarding the application of conceptual modeling (CM) in the genomics domain, and José Marín Navarro for his advice and help.

Author information

Corresponding author

Correspondence to Alberto García S.

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Cite this paper

García S., A., Reyes Román, J.F., Casamayor, J.C., Pastor, O. (2019). Towards an Effective and Efficient Management of Genome Data: An Information Systems Engineering Perspective. In: Cappiello, C., Ruiz, M. (eds) Information Systems Engineering in Responsible Information Systems. CAiSE 2019. Lecture Notes in Business Information Processing, vol 350. Springer, Cham. https://doi.org/10.1007/978-3-030-21297-1_9

  • DOI: https://doi.org/10.1007/978-3-030-21297-1_9

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-21296-4

  • Online ISBN: 978-3-030-21297-1

  • eBook Packages: Computer Science (R0)
