A Framework for Mining Life Sciences Data on the Semantic Web in an Interactive, Graph-Based Environment

Lysenko, Artem; Grzebyta, Jacek; Hindle, Matthew M.; Rawlings, Chris J.; Splendiani, Andrea

doi:10.1007/978-3-319-09042-9_16

Artem Lysenko⁷,
Jacek Grzebyta^7,8,
Matthew M. Hindle⁹,
Chris J. Rawlings⁷ &
…
Andrea Splendiani⁷

Part of the book series: Lecture Notes in Computer Science ((LNBI,volume 8452))

Included in the following conference series:

International Meeting on Computational Intelligence Methods for Bioinformatics and Biostatistics

970 Accesses

Abstract

The last decade saw the marked increase in the availability of the Life Sciences data on the Semantic Web. At the same time, the need to interactively explore complex and extensive biological datasets lead to development of advanced visualisation tools, many of which present the data in the form of a network graph. Semantic Web technologies offer both a means to define rich semantics necessary to describe complex biological systems and allow large amounts of data to be shared effectively. However, at present the need to be familiar with relevant technologies greatly impedes access to these datasets by the non-specialist Life Sciences researches. To address this, we have developed a software frame-work that facilitates both access to the resources and presents the data returned in an intuitive, graph-based format. Our framework is closely integrated with Ondex, an established data integration solution in the Life Sciences domain. The implementation consists of two parts. The first one is a query console that allows expert users to execute Semantic Web queries directly. The second one is a graph-based interactive browsing solution that can be used to launch stock queries by choosing items in the menu. In both cases, the result is re-formatted and visualised as a graph in Ondex frontend.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Smoot, M.E., Ono, K., Ruscheinski, J., Wang, P.L., Ideker, T.: Cytoscape 2.8: new features for data integration and network visualization. Bioinformatics 27(3), 431–432 (2011)
Article Google Scholar
Goble, C., Stevens, R.: State of the nation in data integration for bioinformatics. J. Biomed. Inform. 41(5), 687–693 (2008)
Article Google Scholar
Jenssen, T.K., Hovig, E.: The semantic web and biology. Drug Discov. Today 7(19), 992 (2002)
Article Google Scholar
W3C: Resource Description Framework (RDF) Model and Syntax Specification, vol. 2013 (1999). http://www.w3.org/TR/PR-rdf-syntax/
W3C: SPARQL Query Language for RDF, vol. 2013 (2008). http://www.w3.org/TR/rdf-sparql-query/
Berners-Lee, T.: RFC 3986 Uniform Resource Identifier (URI): Generic Syntax, vol. 2013 (2005). http://www.rfc-editor.org/rfc/rfc3986.txt
Kohler, J., Baumbach, J., Taubert, J., Specht, M., Skusa, A., Ruegg, A., Rawlings, C., Verrier, P., Philippi, S.: Graph-based analysis and visualization of experimental results with ondex. Bioinformatics 22(11), 1383–1390 (2006)
Article Google Scholar
Longabaugh, W.J.: Biotapestry: a tool to visualize the dynamic properties of gene regulatory networks. Meth. Mol. Biol. 786, 359–394 (2012)
Article Google Scholar
Taubert, J., Sieren, K., Hindle, M., Hoekman, B., Winnenburg, R., Philippi, S., Rawlings, C., Khler, J.: The oxl format for the exchange of integrated datasets. J. Integr. Bioinform. 4(3), 62 (2007)
Google Scholar
Splendiani, A., Rawlings, C.J., Kuo, S.-C., Stevens, R., Lord, P.: Lost in translation: data integration tools meet the semantic web (experiences from the ondex project). In: Gaol, F.L. (ed.) Recent Progress in DEIT, Vol. 2. LNEE, vol. 157, pp. 87–97. Springer, Heidelberg (2012)
Chapter Google Scholar
Bizer, C., Heath, T., Berners-Lee, T.: Linked data - the story so far. Int. J. Seman. Web Inf. Sys. (IJSWIS) 5(3), 1–22 (2009)
Article Google Scholar
Apweiler, R., Bairoch, A., Wu, C.H., Barker, W.C., Boeckmann, B., Ferro, S., Gasteiger, E., Huang, H., Lopez, R., Magrane, M., Martin, M.J., Natale, D.A., O’Donovan, C., Redaschi, N., Yeh, L.S.: Uniprot: the universal protein knowledgebase. Nucleic Acids Res. 32(Database issue), D115–D119 (2004)
Article Google Scholar
Goble, C.A., Bhagat, J., Aleksejevs, S., Cruickshank, D., Michaelides, D., Newman, D., Borkum, M., Bechhofer, S., Roos, M., Li, P., De Roure, D.: myExperiment: a repository and social network for the sharing of bioinformatics workflows. Nucleic Acids Res. 38(Web Server issue), W677–W682 (2010)
Article Google Scholar
Belleau, F., Nolin, M.A., Tourigny, N., Rigault, P., Morissette, J.: Bio2rdf: towards a mashup to build bioinformatics knowledge systems. J Biomed. Inform. 41(5), 706–716 (2008)
Article Google Scholar
Rhee, S.Y., Beavis, W., Berardini, T.Z., Chen, G., Dixon, D., Doyle, A., Garcia-Hernandez, M., Huala, E., Lander, G., Montoya, M., Miller, N., Mueller, L.A., Mundodi, S., Reiser, L., Tacklind, J., Weems, D.C., Wu, Y., Xu, I., Yoo, D., Yoon, J., Zhang, P.: The arabidopsis information resource (tair): a model organism database providing a centralized, curated gateway to arabidopsis biology, research materials and community. Nucleic Acids Res. 31(1), 224–228 (2003)
Article Google Scholar

Download references

Acknowledgments

Rothamsted Research receives grant in aid from the Biotechnology and Biological Sciences Research Council (BBSRC). This work was supported by the BBSRC award BBS/E/C/00005034.

Author information

Authors and Affiliations

Centre for Mathematical and Computational Biology, Rothamsted Research, Harpenden, Herts, AL5 2JQ, UK
Artem Lysenko, Jacek Grzebyta, Chris J. Rawlings & Andrea Splendiani
Structural and Molecular Biology, University College London, London, WC1E 6BT, UK
Jacek Grzebyta
SynthSys, University of Edinburgh, Edinburgh, EH9 3JD, UK
Matthew M. Hindle

Authors

Artem Lysenko
View author publications
You can also search for this author in PubMed Google Scholar
Jacek Grzebyta
View author publications
You can also search for this author in PubMed Google Scholar
Matthew M. Hindle
View author publications
You can also search for this author in PubMed Google Scholar
Chris J. Rawlings
View author publications
You can also search for this author in PubMed Google Scholar
Andrea Splendiani
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Artem Lysenko or Andrea Splendiani .

Editor information

Editors and Affiliations

University Nice Sophia Antipolis, Sophia Antipolis, France
Enrico Formenti
University of Salerno, Fisciano, Italy
Roberto Tagliaferri
University of Groningen, AG Groningen, The Netherlands
Ernst Wit

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Lysenko, A., Grzebyta, J., Hindle, M.M., Rawlings, C.J., Splendiani, A. (2014). A Framework for Mining Life Sciences Data on the Semantic Web in an Interactive, Graph-Based Environment. In: Formenti, E., Tagliaferri, R., Wit, E. (eds) Computational Intelligence Methods for Bioinformatics and Biostatistics. CIBB 2013. Lecture Notes in Computer Science(), vol 8452. Springer, Cham. https://doi.org/10.1007/978-3-319-09042-9_16

Download citation

DOI: https://doi.org/10.1007/978-3-319-09042-9_16
Published: 16 July 2014
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-09041-2
Online ISBN: 978-3-319-09042-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics