Skip to main content

ConQuR-Bio: Consensus Ranking with Query Reformulation for Biological Data

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNBI,volume 8574))

Abstract

This paper introduces ConQuR-Bio which aims at assisting scientists when they query public biological databases. Various reformulations of the user query are generated using medical terminologies. Such alternative reformulations are then used to rank the query results using a new consensus ranking strategy. The originality of our approach thus lies in using consensus ranking techniques within the context of query reformulation. The ConQuR-Bio system is able to query the EntrezGene NCBI database. Our experiments demonstrate the benefit of using ConQuR-Bio compared to what is currently provided to users. ConQuR-Bio is available to the bioinformatics community at http://conqur-bio.lri.fr .

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   34.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   44.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Ailon, N.: Aggregation of Partial Rankings, p-Ratings and Top-m Lists. Algorithmica 57, 284–300 (2010)

    Article  MATH  MathSciNet  Google Scholar 

  2. Aronson, A.R.: Effective mapping of biomedical text to the umls metathesaurus: the metamap program. In: Proceedings of the AMIA Symposium, p. 17. American Medical Informatics Association (2001)

    Google Scholar 

  3. Bodenreider, O.: The unified medical language system (umls): integrating biomedical terminology. Nucleic Acids Research 32(suppl. 1), D267–D270 (2004)

    Google Scholar 

  4. de Borda, J.C.: Mémoire sur les élection au scrutin. Histoire de l’academie royal des sciences, 657–664 (1781)

    Google Scholar 

  5. Bradley, A.P.: The use of the area under the roc curve in the evaluation of machine learning algorithms. Pattern Recognition 30, 1145–1159 (1997)

    Article  Google Scholar 

  6. Brancotte, B., Biton, A., Bernard-Pierrot, I., Radvanyi, F., Reyal, F., Cohen-Boulakia, S.: Gene List significance at-a-glance with GeneValorization. Bioinformatics 27(8), 1187–1189 (2011)

    Article  Google Scholar 

  7. Cohen-Boulakia, S., Denise, A., Hamel, S.: Using Medians to Generate Consensus Rankings for Biological Data. In: Bayard Cushing, J., French, J., Bowers, S. (eds.) SSDBM 2011. LNCS, vol. 6809, pp. 73–90. Springer, Heidelberg (2011)

    Chapter  Google Scholar 

  8. Demner-Fushman, D., Abhyankar, S., Jimeno-Yepes, A., Loane, R.F., Rance, B., Lang, F.-M., Ide, N.C., Apostolova, E., Aronson, A.R.: A knowledge-based approach to medical records retrieval. TREC (2011)

    Google Scholar 

  9. Dwork, C., Kumar, R., Naor, M., Sivakumar, D.: Rank aggregation methods for the web. In: Proceedings of the 10th World Widw Web Conference, pp. 613–622. ACM, New York (2001)

    Google Scholar 

  10. Fagin, R., Kumar, R., Sivakumar, D.: Efficient similarity search and classification via rank aggregation. In: Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data, pp. 301–312. ACM (2003)

    Google Scholar 

  11. Fagin, R., Kumar, R., Mahdian, M., Sivakumar, D., Vee, E.: Comparing and aggregating rankings with ties. In: Proceedings of the Twenty-Third ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, PODS 2004, pp. 47–58. ACM, New York (2004)

    Chapter  Google Scholar 

  12. Kendall, M.: A new measure of rank correlation. Biometrika 30, 81–89 (1938)

    Article  MATH  MathSciNet  Google Scholar 

  13. Carolyn, E.: Lipscomb. Medical subject headings (mesh). Bulletin of the Medical Library Association 88(3), 265 (2000)

    Google Scholar 

  14. Maglott, D., Ostell, J., Pruitt, K.D., Tatusova, T.: Entrez gene: gene-centered information at ncbi. Nucleic Acids Research 39(sp.1), D52–D57 (2011)

    Google Scholar 

  15. Sayers, E.W., Barrett, T., Benson, D.A., Bolton, E., Bryant, S.H., Canese, K., Chetvernin, V., Church, D.M., DiCuccio, M., Federhen, S., et al.: Database resources of the national center for biotechnology information. Nucleic Acids Research 39(suppl. 1), D38–D51 (2011)

    Google Scholar 

  16. Stearns, M.Q., Price, C., Spackman, K.A., Wang, A.Y.: Snomed clinical terms: overview of the development process and project status. In: Proceedings of the AMIA Symposium, p. 662 (2001)

    Google Scholar 

  17. Steen, L.A., Seebach, A., Steen, L.A.: Counterexamples in topology. Springer (1978)

    Google Scholar 

  18. Whetzel, P.L., Noy, N.F., Shah, N.H., Alexander, P.R., Nyulas, C., Tudorache, T., Musen, M.A.: Bioportal: enhanced functionality via new web services from the national center for biomedical ontology to access and use ontologies in software applications. Nucleic Acids Research 39(suppl. 2), D541–D545 (2011)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Brancotte, B., Rance, B., Denise, A., Cohen-Boulakia, S. (2014). ConQuR-Bio: Consensus Ranking with Query Reformulation for Biological Data. In: Galhardas, H., Rahm, E. (eds) Data Integration in the Life Sciences. DILS 2014. Lecture Notes in Computer Science(), vol 8574. Springer, Cham. https://doi.org/10.1007/978-3-319-08590-6_13

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-08590-6_13

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-08589-0

  • Online ISBN: 978-3-319-08590-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics