ConQuR-Bio: Consensus Ranking with Query Reformulation for Biological Data
Abstract
This paper introduces ConQuR-Bio which aims at assisting scientists when they query public biological databases. Various reformulations of the user query are generated using medical terminologies. Such alternative reformulations are then used to rank the query results using a new consensus ranking strategy. The originality of our approach thus lies in using consensus ranking techniques within the context of query reformulation. The ConQuR-Bio system is able to query the EntrezGene NCBI database. Our experiments demonstrate the benefit of using ConQuR-Bio compared to what is currently provided to users. ConQuR-Bio is available to the bioinformatics community at http://conqur-bio.lri.fr .
Keywords
Lynch Syndrome MeSH Term Median Ranking Query Reformulation Consensus RankingPreview
Unable to display preview. Download preview PDF.
References
- 1.Ailon, N.: Aggregation of Partial Rankings, p-Ratings and Top-m Lists. Algorithmica 57, 284–300 (2010)CrossRefzbMATHMathSciNetGoogle Scholar
- 2.Aronson, A.R.: Effective mapping of biomedical text to the umls metathesaurus: the metamap program. In: Proceedings of the AMIA Symposium, p. 17. American Medical Informatics Association (2001)Google Scholar
- 3.Bodenreider, O.: The unified medical language system (umls): integrating biomedical terminology. Nucleic Acids Research 32(suppl. 1), D267–D270 (2004)Google Scholar
- 4.de Borda, J.C.: Mémoire sur les élection au scrutin. Histoire de l’academie royal des sciences, 657–664 (1781)Google Scholar
- 5.Bradley, A.P.: The use of the area under the roc curve in the evaluation of machine learning algorithms. Pattern Recognition 30, 1145–1159 (1997)CrossRefGoogle Scholar
- 6.Brancotte, B., Biton, A., Bernard-Pierrot, I., Radvanyi, F., Reyal, F., Cohen-Boulakia, S.: Gene List significance at-a-glance with GeneValorization. Bioinformatics 27(8), 1187–1189 (2011)CrossRefGoogle Scholar
- 7.Cohen-Boulakia, S., Denise, A., Hamel, S.: Using Medians to Generate Consensus Rankings for Biological Data. In: Bayard Cushing, J., French, J., Bowers, S. (eds.) SSDBM 2011. LNCS, vol. 6809, pp. 73–90. Springer, Heidelberg (2011)CrossRefGoogle Scholar
- 8.Demner-Fushman, D., Abhyankar, S., Jimeno-Yepes, A., Loane, R.F., Rance, B., Lang, F.-M., Ide, N.C., Apostolova, E., Aronson, A.R.: A knowledge-based approach to medical records retrieval. TREC (2011)Google Scholar
- 9.Dwork, C., Kumar, R., Naor, M., Sivakumar, D.: Rank aggregation methods for the web. In: Proceedings of the 10th World Widw Web Conference, pp. 613–622. ACM, New York (2001)Google Scholar
- 10.Fagin, R., Kumar, R., Sivakumar, D.: Efficient similarity search and classification via rank aggregation. In: Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data, pp. 301–312. ACM (2003)Google Scholar
- 11.Fagin, R., Kumar, R., Mahdian, M., Sivakumar, D., Vee, E.: Comparing and aggregating rankings with ties. In: Proceedings of the Twenty-Third ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, PODS 2004, pp. 47–58. ACM, New York (2004)CrossRefGoogle Scholar
- 12.Kendall, M.: A new measure of rank correlation. Biometrika 30, 81–89 (1938)CrossRefzbMATHMathSciNetGoogle Scholar
- 13.Carolyn, E.: Lipscomb. Medical subject headings (mesh). Bulletin of the Medical Library Association 88(3), 265 (2000)Google Scholar
- 14.Maglott, D., Ostell, J., Pruitt, K.D., Tatusova, T.: Entrez gene: gene-centered information at ncbi. Nucleic Acids Research 39(sp.1), D52–D57 (2011)Google Scholar
- 15.Sayers, E.W., Barrett, T., Benson, D.A., Bolton, E., Bryant, S.H., Canese, K., Chetvernin, V., Church, D.M., DiCuccio, M., Federhen, S., et al.: Database resources of the national center for biotechnology information. Nucleic Acids Research 39(suppl. 1), D38–D51 (2011)Google Scholar
- 16.Stearns, M.Q., Price, C., Spackman, K.A., Wang, A.Y.: Snomed clinical terms: overview of the development process and project status. In: Proceedings of the AMIA Symposium, p. 662 (2001)Google Scholar
- 17.Steen, L.A., Seebach, A., Steen, L.A.: Counterexamples in topology. Springer (1978)Google Scholar
- 18.Whetzel, P.L., Noy, N.F., Shah, N.H., Alexander, P.R., Nyulas, C., Tudorache, T., Musen, M.A.: Bioportal: enhanced functionality via new web services from the national center for biomedical ontology to access and use ontologies in software applications. Nucleic Acids Research 39(suppl. 2), D541–D545 (2011)Google Scholar