A New Method of Clustering Search Results Using Frequent Itemsets with Graph Structures

Su, I-Fang; Chung, Yu-Chi; Lee, Chiang; Lin, Xuanyou

doi:10.1007/978-94-007-2598-0_9

I-Fang Su⁵,
Yu-Chi Chung⁶,
Chiang Lee⁷ &
…
Xuanyou Lin⁷

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 107))

1029 Accesses

Abstract

The representation of search results from the World Wide Web has received considerable attention in the database research community. Systems have been proposed for clustering search results into meaningful semantic categories for presentation to the end user. This paper presents a novel clustering algorithm, which is based on the concept of frequent itemsets mining over a graph structure, to efficiently generate search result clusters. The performance study reveals that the algorithm was highly efficient and significantly outperformed previous approaches in clustering search results.

This work is supported by National Science Council of Taiwan (R.O.C.) under Grants NSC99-2218-E-268-001, NSC99-2221-E-006-133, and NSC100-2221-E-309-011.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

References

Agrawal R, Srikant R (1994) Fast algorithms for mining association rules in large databases. In: VLDB, pp 487–499
Google Scholar
Bernardini A, Carpineto C, D’Amico M (2009) Full-subtopic retrieval with keyphrase-based search results clustering. In: Web intelligence, pp 206–213
Google Scholar
Carpineto C, Osinski S, Romano G, Weiss D (2009) A survey of web clustering engines. ACM Comput Surv 41(3):1–38
Article Google Scholar
Carpineto C, Romano G (2010) Optimal meta search results clustering. In: SIGIR, pp 170–177
Google Scholar
Giacomo ED, Didimo W, Grilli L, Liotta G (2007) Graph visualization techniques for web clustering engines. IEEE Trans Vis Comput Graph 13(2):294–304
Article Google Scholar
Han J, Pei J, Yin Y (2000) Mining frequent patterns without candidate generation. In Proceedings of the 2000 ACM SIGMOD international conference on Management of data, pp 1–12 ACM
Google Scholar
Manning CD, Raghavan P, Schtze H (2008) Introduction to information retrieval. Cambridge University Press, New York
Book MATH Google Scholar
Osinski S, Stefanowski J, Weiss D (2004) Lingo: search results clustering algorithm based on singular value decomposition. In: Intelligent information systems, pp 359–368
Google Scholar
Rijsbergen CV (1979) Information retrieval. Butterworth-Heinemann, Newton
Google Scholar
Zaki MJ (2000) Scalable algorithms for association mining. IEEE Trans Knowl Data Eng 12(3):372–390
Article MathSciNet Google Scholar

Download references

Author information

Authors and Affiliations

Department of Information Management, Fotech, 831, Kaohsiung, Taiwan
I-Fang Su
Department of CSIE, CJCU, 711, Tainan, Taiwan
Yu-Chi Chung
Department of CSIE, NCKU, 701, Tainan, Taiwan
Chiang Lee & Xuanyou Lin

Authors

I-Fang Su
View author publications
You can also search for this author in PubMed Google Scholar
Yu-Chi Chung
View author publications
You can also search for this author in PubMed Google Scholar
Chiang Lee
View author publications
You can also search for this author in PubMed Google Scholar
Xuanyou Lin
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to I-Fang Su .

Editor information

Editors and Affiliations

SeoulTech, Computer Science and Engineering, Seoul University of Science & Technology, Gongreung 2-dong 172, Seoul, 139-743, Korea, Republic of (South Korea)
James J. Park
, Computer Science, University of Georgia, GSRC 415, Athens, 30602-7404, Georgia, USA
Hamid Arabnia
, Business Administration, Daejin University, Hogukro 1007, Pocheon-Si, 487-711, Kyonggi-do, Korea, Republic of (South Korea)
Hang-Bae Chang
, Division of Information and Computer Eng, Ajou University, San 5, Suwon, Gyeonggido, 443-749, Korea, Republic of (South Korea)
Taeshik Shon

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Su, IF., Chung, YC., Lee, C., Lin, X. (2011). A New Method of Clustering Search Results Using Frequent Itemsets with Graph Structures. In: Park, J., Arabnia, H., Chang, HB., Shon, T. (eds) IT Convergence and Services. Lecture Notes in Electrical Engineering, vol 107. Springer, Dordrecht. https://doi.org/10.1007/978-94-007-2598-0_9

Download citation

DOI: https://doi.org/10.1007/978-94-007-2598-0_9
Published: 01 November 2011
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-007-2597-3
Online ISBN: 978-94-007-2598-0
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics