Capacity-Constrained Query Formulation

  • Matthias Hagen
  • Benno Maria Stein
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6273)


Given a set of keyphrases, we analyze how Web queries with these phrases can be formed that, taken altogether, return a specified number of hits. The use case of this problem is a plagiarism detection system that searches the Web for potentially plagiarized passages in a given suspicious document. For the query formulation problem we develop a heuristic search strategy based on co-occurrence probabilities. Compared to the maximal termset strategy [3], which can be considered as the most sensible non-heuristic baseline, our expected savings are on average 50% when queries for 9 or 10 phrases are to be constructed.


Frequent Itemset Query Formulation Apriori Algorithm Frequent Itemset Mining Plagiarism Detection 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Agrawal, R., Srikant, R.: Fast algorithms for mining association rules in large databases. In: Proc. of VLDB 1994, pp. 487–499 (1994)Google Scholar
  2. 2.
    Barker, K., Cornacchia, N.: Using noun phrase heads to extract document keyphrases. In: Proc. of AI 2000, pp. 40–52 (2000)Google Scholar
  3. 3.
    Pôssas, B., Ziviani, N., Ribeiro-Neto, B.A., Meira Jr., W.: Maximal termsets as a query structuring mechanism. In: Proc. of CIKM 2005, pp. 287–288 (2005)Google Scholar
  4. 4.
    Shapiro, J., Taksa, I.: Constructing web search queries from the user’s information need expressed in a natural language. In: Matsui, M., Zuccherato, R.J. (eds.) SAC 2003. LNCS, vol. 3006, pp. 1157–1162. Springer, Heidelberg (2004)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • Matthias Hagen
    • 1
  • Benno Maria Stein
    • 1
  1. 1.Faculty of MediaBauhaus UniversityWeimarGermany

Personalised recommendations