Advertisement

A Recursive Approach to Entity Ranking and List Completion Using Entity Determining Terms, Qualifiers and Prominent n-Grams

  • Madhu Ramanathan
  • Srikant Rajagopal
  • Venkatesh Karthik
  • Meenakshi Sundaram Murugeshan
  • Saswati Mukherjee
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6203)

Abstract

This paper presents our approach for INEX 2009 Entity Ranking track which consists of two subtasks viz. Entity Ranking and List Completion. Retrieving the correct entities according to the user query is a three-step process viz. extracting the required information from the query and the provided categories, extracting the relevant documents which may be either prospective entities or intermediate pointers to prospective entities by making use of the structure available in the Wikipedia Corpus and finally ranking the resultant set of documents. We have extracted the Entity Determining Terms (EDTs), Qualifiers and prominent n-grams from the query, strategically exploited the relation between the extracted terms and the structure and connectedness of the corpus to retrieve links which are highly probable of being entities and then used a recursive mechanism for retrieving relevant documents through the Lucene Search. Our ranking mechanism combines various approaches that make use of category information, links, titles and WordNet information, initial description and the text of the document.

Keywords

Entity Ranking List Completion Entity Determining Terms (EDTs) Qualifiers Prominent n-grams Named Qualifiers Wikipedia tags 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Voorhees, E.M.: Overview of the TREC 2001 Question Answering Track. In: Proceedings of the 10th Text Retrieval Conference (2001)Google Scholar
  2. 2.
    Craswell, N., de Vries, A.P., Soboroff, I.: Overview of the TREC 2005 Enterprise Track. In: Proceedings of the 14th Text Retrieval Conference (2005)Google Scholar
  3. 3.
    de Vries, A.P.: Overview of the INEX 2007 Entity Ranking Track. In: Fuhr, N., Kamps, J., Lalmas, M., Trotman, A. (eds.) INEX 2007. LNCS, vol. 4862, pp. 245–251. Springer, Heidelberg (2008)CrossRefGoogle Scholar
  4. 4.
    Chen, J., Diekema, A., Taffett, M.D., McCracken, N., Ozgencil, N.E., Yilmazel, O., Liddy, E.D.: Question Answering: CNLP at the TREC 10 Question Answering Track. In: Proceedings of the 10th Text Retrieval Conference (2001)Google Scholar
  5. 5.
    Yang, H., Chua, T.-S.: Web-based list question answering. In: Proceedings of the 20th International Conference on Computational Linguistics (2004)Google Scholar
  6. 6.
    Kazama, J., Torisawa, K.: Exploiting Wikipedia as External Knowledge for Named Entity Recognition. In: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, EMNLP-CoNLL (2007)Google Scholar
  7. 7.
    Hermjakob, U., Hovy, E.H., Lin, C.-Y.: Knowledge-Based Question Answering. In: Proceedings of the 6th World Multiconference on Systems, Cybernatics and Informatics (SCI 2002), Orlando, FL, U.S.A., July 14-18 (2002)Google Scholar
  8. 8.
    Murugeshan, M.S., Mukherjee, S.: An n-Gram and Initial Description Based Approach for Entity Ranking Track. In: Fuhr, N., Kamps, J., Lalmas, M., Trotman, A. (eds.) INEX 2007. LNCS, vol. 4862, pp. 293–305. Springer, Heidelberg (2008)CrossRefGoogle Scholar
  9. 9.
    Thom, J.A., Pehcevski, J., Vercoustre, A.-M.: Use of Wikipedia Categories in Entity Ranking. In: Proceedings of the 12th Australasian Document Computing Symposium (ADCS 2007), Melbourne, Australia, December 10 (2007)Google Scholar
  10. 10.
    Cilibrasi, R., Vitanyi, P.: The Google Similarity Distance. IEEE Trans. Knowledge and Data Engineering 19(3), 370–383 (2007)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • Madhu Ramanathan
    • 1
  • Srikant Rajagopal
    • 1
  • Venkatesh Karthik
    • 1
  • Meenakshi Sundaram Murugeshan
    • 1
  • Saswati Mukherjee
    • 1
  1. 1.Department of Computer Science and Engineering, College of Engineering, GuindyAnna UniversityChennaiIndia

Personalised recommendations