Online Forum Thread Retrieval Using Pseudo Cluster Selection and Voting Techniques

Part of the Smart Innovation, Systems and Technologies book series (SIST, volume 20)

Abstract

Online forums facilitate knowledge seeking and sharing on the Web. However, the shared knowledge is not fully utilized due to information overload. Thread retrieval is one method to overcome information overload. In this paper, we propose a model that combines two existing approaches: the Pseudo Cluster Selection and the Voting Techniques. In both, a retrieval system first scores a list of messages and then ranks threads by aggregating their scored messages. They differ on what and how to aggregate. The pseudo cluster selection focuses on input, while voting techniques focus on the aggregation method. Our combined models focus on the input and the aggregation methods. The result shows that some combined models are statistically superior to baseline methods.

Keywords

Forum thread search Voting techniques 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Aslam, J.A., Montague, M.: Models for metasearch. In: Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2001, pp. 276–284. ACM, New York (2001)CrossRefGoogle Scholar
  2. 2.
    Bhatia, S., Mitra, P.: Adopting inference networks for online thread retrieval. In: Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence, Atlanta, Georgia, USA, July 11-15, pp. 1300–1305 (2010)Google Scholar
  3. 3.
    Elsas, J.L.: Ancestry.com online forum test collection. Technical Report CMU-LTI-017, Language Technologies Institute, School of Computer Science, Carnegie Mellon University (2011)Google Scholar
  4. 4.
    Elsas, J.L., Arguello, J., Callan, J., Carbonell, J.G.: Retrieval and feedback models for blog feed search. In: Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2008, pp. 347–354. ACM, New York (2008)CrossRefGoogle Scholar
  5. 5.
    Elsas, J.L., Carbonell, J.G.: It pays to be picky: an evaluation of thread retrieval in online forums. In: Proceedings of the 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2009, pp. 714–715. ACM, New York (2009)CrossRefGoogle Scholar
  6. 6.
    Macdonald, C., Ounis, I.: Key blog distillation: ranking aggregates. In: Proceedings of the 17th ACM Conference on Information and Knowledge Management, CIKM 2008, pp. 1043–1052. ACM, New York (2008)CrossRefGoogle Scholar
  7. 7.
    Macdonald, C., Ounis, I.: Voting techniques for expert search. Knowl. Inf. Syst. 16(3), 259–280 (2008)CrossRefGoogle Scholar
  8. 8.
    Metzler, D., Croft, W.B.: Combining the language model and inference network approaches to retrieval. Inf. Process. Manage. 40(5), 735–750 (2004)CrossRefGoogle Scholar
  9. 9.
    Ogilvie, P., Callan, J.: Combining document representations for known-item search. In: Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Informaion Retrieval, SIGIR 2003, pp. 143–150. ACM, New York (2003)CrossRefGoogle Scholar
  10. 10.
    Seo, J., Croft, W.B., Smith, D.: Online community search using conversational structures. Information Retrieval 14, 547–571 (2011), doi:10.1007/s10791-011-9166-8CrossRefGoogle Scholar
  11. 11.
    Seo, J., Croft, W.B.: Blog site search using resource selection. In: Proceedings of the 17th ACM Conference on Information and Knowledge Management, CIKM 2008, pp. 1053–1062. ACM, New York (2008)CrossRefGoogle Scholar
  12. 12.
    Shaw, J.A., Fox, E.A.: Combination of multiple searches. In: The Second Text REtrieval Conference (TREC-2), pp. 243–252 (1994)Google Scholar
  13. 13.
    Zhai, C., Lafferty, J.: A study of smoothing methods for language models applied to information retrieval. ACM Trans. Inf. Syst. 22(2), 179–214 (2004)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2013

Authors and Affiliations

  1. 1.Faculty of Computer Science and Information SystemUniversiti Teknologi MalaysiaSkudaiMalaysia

Personalised recommendations