A Supervised Learning Approach to Entity Search

  • Guoping Hu
  • Jingjing Liu
  • Hang Li
  • Yunbo Cao
  • Jian-Yun Nie
  • Jianfeng Gao
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4182)


In this paper we address the problem of entity search. Expert search and time search are used as examples. In entity search, given a query and an entity type, a search system returns a ranked list of entities in the type (e.g., person name, time expression) relevant to the query. Ranking is a key issue in entity search. In the literature, only expert search was studied and the use of co-occurrence was proposed. In general, many features may be useful for ranking in entity search. We propose using a linear model to combine the uses of different features and employing a supervised learning approach in training of the model. Experimental results on several data sets indicate that our method significantly outperforms the baseline method based solely on co-occurrences.


Information Retrieval Entity Search Expert Search Time Search Supervised Learning 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Brill, E., Dumais, S., Banko, M.: An Analysis of the AskMSR Question- Answering System. In: EMNLP 2002 (2002)Google Scholar
  2. 2.
    Campbell, C.S., Maglio, P.P., Cozzi, A., Dom, B.: Expertise Identification using Email Communications. In: CILM 2003 (2003)Google Scholar
  3. 3.
    Craswell, N., Hawking, D., Vercoustre, A.M., Wilkins, P.: P@NOPTIC Expert: Searching for Experts not just for Documents. In: Ausweb (2001)Google Scholar
  4. 4.
    Cormack, G.V., Clarke, C.L.A., Kisman, D.I.E., Palmer, C.R.: Fast Automatic Passage Scoring (MultiText Experiments for TREC-8). In: TREC 1999 (1999)Google Scholar
  5. 5.
    Cormack, G.V., Lynam, T.R.: Statistical Precision of Information Retrieval Evaluation. In: SIGIR 2006, Seattle, Washington, USA, August 6-11 (2006)Google Scholar
  6. 6.
    Deerwester, S., Dumais, S.T., Fumas, G.W., Landauer, T.K., Harshman, R.: Indexing by Latent Structure Analysis. Journal of the American Society for Information Sciences, 391–407 (1990)Google Scholar
  7. 7.
    Dom, B., Eiron, I., Cozzi, A., Yi, Z.: Graph-Based Ranking Algorithms for E-mail Expertise Analysis. In: Proc. of the 8th ACM SIGMOD workshop on Research issues in data mining and knowledge discovery (2003)Google Scholar
  8. 8.
    Gao, J., Qi, H., Xia, X., Nie, J.-Y.: Linear discriminant model for information retrieval. In: SIGIR 2005 (2005)Google Scholar
  9. 9.
    Herbrich, R., Graepel, T., Obermayer, K.: Large margin rank boundaries for ordinal regression. In: Advances in Large Margin Classifiers, pp. 115–132. MIT Press, Cambridge (2000)Google Scholar
  10. 10.
    Hovy, E.H., Gerber, L., Hermjakob, U., Junk, M., Lin, C.-Y.: Question Answering in Webclopedia. In: TREC 2000 (2000)Google Scholar
  11. 11.
    Ittycheriah, A., Roukos, S.: IBM’s Statistical Question Answering System-TREC 11. In: TREC 2002 (2002)Google Scholar
  12. 12.
    Kwok, C.C.T., Etzioni, O., Weld, D.S.: Scaling question answering to the Web. In: WWW-2001, pp. 150–161 (2001)Google Scholar
  13. 13.
    Maron, M.E., Curry, S., Thompson, P.: An inductive search system: Theory, design and implementation. IEEE Transaction on Systems, Man and Cybernetics 16(1), 21–28 (1986)CrossRefGoogle Scholar
  14. 14.
    Mattox, D., Maybury, M., Morey, D.: Enterprise Expert and Knowledge Discovery. In: Proceedings of the HCI International 1999 (1999)Google Scholar
  15. 15.
    McDonald, D.W.: Evaluating Expertise Recommendations. In: Proc. of the ACM 2001 international conference on Supporting Group Work (GROUP 2001), Boulder, CO (2001)Google Scholar
  16. 16.
    Morgan, W., Greiff, W., Henderson, J.: Direct Maximization of Average Precision by Hill-Climbing, with a Comparison to a Maximum Entropy Approach. In: HLTNAACL 2004, pp. 93–96 (2004)Google Scholar
  17. 17.
    Ponte, J.M., Croft, W.B.: A Language Modeling Approach to Information Retrieval. In: SIGIR 1998, pp. 275–281 (1998)Google Scholar
  18. 18.
    Radev, D.R., Fan, W., Qi, H., Wu, H., Grewal, A.: Probabilistic question answering on the web. In: WWW 2002, pp. 408–419 (2002)Google Scholar
  19. 19.
    Robertson, S.E., Walker, S., Hancock-Beaulieu, M., Gatford, M., Payne, A.: Okapi at TREC-4. In: TREC 1995 (1995)Google Scholar
  20. 20.
    Salton, G., Allan, J., Buckley, C.: Approaches to Passage Retrieval in Full Text Information Systems. In: SIGIR 1993, pp. 49–58 (1993)Google Scholar
  21. 21.
    Sayyadian, M., Shakery, A., Doan, A., Zhai, C.: Toward Entity Retrieval over Structured and Text Data, WIRD 2004. In: The first Workshop on the Integration of Information Retrieval and Databases, WIRD 2004 (2004)Google Scholar
  22. 22.
    Steer, L.A., Lochbaum, K.E.: An Expert/Expert Locating System Based on Automatic Representation of Semantic Structure. In: Proc. of the Fourth IEEE Conference on Artificial Intelligence Applications (1988)Google Scholar
  23. 23.
    Zhang, L., Pan, Y., Zhang, T.: Focused named entity recognition using machine learning. In: SIGIR 2004, pp. 281–288.Google Scholar
  24. 24.
    World Web Consortium (W3C),
  25. 25.

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Guoping Hu
    • 1
  • Jingjing Liu
    • 2
  • Hang Li
    • 4
  • Yunbo Cao
    • 4
  • Jian-Yun Nie
    • 3
  • Jianfeng Gao
    • 4
  1. 1.iFly Speech LabUniversity of Science and Technology of ChinaHefeiChina
  2. 2.College of Information Science & TechnologyNankai UniversityTianjinChina
  3. 3.Département d’informatique et de recherche opérationnelleUniversité de Montréal 
  4. 4.Microsoft Research AsiaBeijingChina

Personalised recommendations