Knowledge and Information Systems

, Volume 39, Issue 1, pp 61–88

High-dimensional clustering: a clique-based hypergraph partitioning framework

  • Tianming Hu
  • Chuanren Liu
  • Yong Tang
  • Jing Sun
  • Hui Xiong
  • Sam Yuan Sung
Regular Paper

DOI: 10.1007/s10115-012-0609-3

Cite this article as:
Hu, T., Liu, C., Tang, Y. et al. Knowl Inf Syst (2014) 39: 61. doi:10.1007/s10115-012-0609-3

Abstract

Hypergraph partitioning has been considered as a promising method to address the challenges of high-dimensional clustering. With objects modeled as vertices and the relationship among objects captured by the hyperedges, the goal of graph partitioning is to minimize the edge cut. Therefore, the definition of hyperedges is vital to the clustering performance. While several definitions of hyperedges have been proposed, a systematic understanding of desired characteristics of hyperedges is still missing. To that end, in this paper, we first provide a unified clique perspective of the definition of hyperedges, which serves as a guide to define hyperedges. With this perspective, based on the concepts of shared (reverse) nearest neighbors, we propose two new types of clique hyperedges and analyze their properties regarding purity and size issues. Finally, we present an extensive evaluation using real-world document datasets. The experimental results show that, with shared (reverse) nearest neighbor-based hyperedges, the clustering performance can be improved significantly in terms of various external validation measures without the need for fine tuning of parameters.

Keywords

Clique Shared nearest neighbor Hypergraph partitioning High-dimensional clustering 

Copyright information

© Springer-Verlag London 2013

Authors and Affiliations

  • Tianming Hu
    • 1
  • Chuanren Liu
    • 2
  • Yong Tang
    • 3
  • Jing Sun
    • 4
  • Hui Xiong
    • 2
  • Sam Yuan Sung
    • 5
  1. 1.Dongguan University of TechnologyDongguanChina
  2. 2.Department of Management Science and Information SystemsRutgers, The State University of New JerseyNewarkUSA
  3. 3.South China Normal UniversityGuangzhouChina
  4. 4.University of AucklandAucklandNew Zealand
  5. 5.South Texas CollegeMcAllenUSA

Personalised recommendations