A Two-Step Agglomerative Hierarchical Clustering Method for Patent Time-Dependent Data

  • Hongshu Chen
  • Guangquan Zhang
  • Jie Lu
  • Donghua Zhu
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 213)


Patent data have time-dependent property and also semantic attributes. Technology clustering based on patent time-dependent data processed by trend analysis has been used to help technology relationship identification. However, the raw patent data carry more features than processed data. This paper aims to develop a new methodology to cluster patent frequency data based on its time-related properties. To handle time-dependent attributes of patent data, this study first compares it with typical time series data to propose preferable similarity measurement approach. It then presents a two-step agglomerative hierarchical technology clustering method to cluster original patent time-dependent data directly. Finally, a case study using communication-related patents is given to illustrate the clustering method.


Patent analysis Technology clustering Patent time-dependent data Agglomerative hierarchical clustering 


  1. 1.
    Richard SC (1983) Patent trends as a technological forecasting tool. World Pat Inf 5(3):137–143Google Scholar
  2. 2.
    Cozzens S et al (2010) Emerging technologies: quantitative identification and measurement. Technol Anal Strateg Manag 22(3):361–376Google Scholar
  3. 3.
    Bengisu M, Nekhili R (2006) Forecasting emerging technologies with the aid of science and technology databases. Technol Forecast Soc Chang 73(7):835–844Google Scholar
  4. 4.
    Robinson DKR et al (2013) Forecasting innovation pathways (FIP) for new and emerging science and technologies. Technol Forecast and Soc Chang 80(2):267–285 Google Scholar
  5. 5.
    Chen Y-L, Chang Y-C (2012) A three-phase method for patent classification. Inf Process & Manag 48(6):1017−1030Google Scholar
  6. 6.
    Yoon J, Kim K (2012) TrendPerceptor: a property-function based technology intelligence system for identifying technology trends from patents. Expert Syst Appl 39(3):2927–2938Google Scholar
  7. 7.
    Lee H, Lee S, Yoon B (2011) Technology clustering based on evolutionary patterns: the case of information and communications technologies. Technol Forecast Soc Chang 78(6):953–967Google Scholar
  8. 8.
    Trappey CV et al (2011) Using patent data for technology forecasting: China RFID patent analysis. Adv Eng Inform 25(1):53–64Google Scholar
  9. 9.
    Lee S, Lee H, Yoon B (2012) Modeling and analyzing technology innovation in the energy sector: patent-based HMM approach. Comput & Ind Eng 63(3):564–577Google Scholar
  10. 10.
    Dereli T et al (2011) Enhancing technology clustering through heuristics by using patent counts. Expert Syst Appl 38(12):15383–15391CrossRefGoogle Scholar
  11. 11.
    Berndt D, Clifford J (1994) Using dynamic time warping to find patterns in time series. In: Workshop on knowledge discovery in databases KDD-94 proceedings, SeattleGoogle Scholar
  12. 12.
    Warren Liao T (2005) Clustering of time series data—a survey. Pattern Recognit 38(11):1857–1874Google Scholar
  13. 13.
    Keogh E, Pazzani M (1998) An enhanced representation of time series which allows fast and accurate classification, clustering and relevance feedback. In: Workshop on knowledge discovery in databases KDD-98 proceedingsGoogle Scholar
  14. 14.
    Ward JH (1993) Hierarchical grouping to optimize an objective function. J Am Stat Assoc 58:236–244Google Scholar
  15. 15.
    Maimon O, Rokach L (2010) Data mining and knowledge discovery handbook, vol 1. Springer Science + Business Media, LLCCrossRefMATHGoogle Scholar
  16. 16.
    Goldin D, Kanellakis P (1995) On similarity queries for time-series data: constraint specification and implementation. In: The 1st international conference on the principles and practice of constraint programming, Springer, CassisGoogle Scholar
  17. 17.
    Keogh E, Kasetty S (2003) On the need for time series data mining benchmarks: a survey and empirical demonstration. Data Min Knowl Disc 7(4):349–371Google Scholar
  18. 18.
    United States Patent and Trademark Office, Classes within the U.S. Classification System (2012).

Copyright information

© Springer-Verlag Berlin Heidelberg 2014

Authors and Affiliations

  • Hongshu Chen
    • 1
    • 2
  • Guangquan Zhang
    • 1
  • Jie Lu
    • 1
  • Donghua Zhu
    • 2
  1. 1.Decision Systems and e-Service Intelligence Lab, Centre for Quantum Computation and Intelligent SystemsFaculty of Engineering and Information Technology, University of TechnologySydneyAustralia
  2. 2.School of Management and EconomicsBeijing Institute of TechnologyBeijingChina

Personalised recommendations