Skip to main content

A Newton’s Universal Gravitation Inspired Firefly Algorithm for Document Clustering

  • Conference paper
Advances in Computer Science and its Applications

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 279))

Abstract

The divisive clustering has the advantage to build a hierarchical structure that is more efficient to represent documents in search engines. Its operation employs one of the partition clustering algorithms that leads to being trapped in a local optima. This paper proposes a Firefly algorithm that is based on Newton’s law of universal gravitation, known as Gravitation Firefly Algorithm (GFA), for document clustering. GFA is used to find centers of clusters based on objective function that maximizes the force between each document and an initial center. Upon identification of a center, the algorithm then locates documents that are similar to the center using cosine similarity function. The process of finding centers for new clusters continues by sorting the light intensity values of the balance documents. Experimental results on Reuters datasets showed that the proposed Newton inspired Firefly algorithm is suitable to be used for document clustering in text mining.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 259.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 329.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 329.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Das, S., Abraham, A., Konar, A.: Metaheuristic Clustering. Studies in Computational Intelligence, vol. 178. Springer, Heidelberg (2009)

    MATH  Google Scholar 

  2. Senthilnath, J., Omkar, S.N., Mani, V.: Clustering Using Firefly Algorithm: Performance Study. Elsevier, Swarm and Evolutionary Computation 1(3), 164–171 (2011)

    Google Scholar 

  3. Wilson, H., Boots, B., Millward, A.A.: A Comparison of Hierarchical and PartitionalClustering. Techniques for Multispectral Image Classification 3, 1624–1626 (2002)

    Google Scholar 

  4. Poomagal, S., Hamsapriya, T.: Optimized K-means Clustering with Intelligent Initial Centroids selection for web search using URL and Tag contents. In: Proceedings of the International Conference on Web Intelligence, Mining and Semantics. ACM (2011)

    Google Scholar 

  5. Mishra, B.K., Nayak, N.R., Rath, A., Swain, S.: Far Efficient K-means Clustering Algorithm. In: Proceedings of the International Conference on Advances in Computing, Communications and Informatics, pp. 106–110. ACM (2012)

    Google Scholar 

  6. Hu, G., Zhou, S., Guan, J., Hu, X.: Towards Effective Document Clustering: A Constrained K-means Based Approach. Information Processing and Management 44(4), 1397–1409 (2008)

    Article  Google Scholar 

  7. Hassanzadeh, T., Meybodi, M.R.: A New Hybrid Approach for Data Clustering Using FireflyAlgorithm and K-means. In: Proceedings of the 16th IEEECSI International Symposium on Artificial Intelligence and Signal Processing (AISP), pp. 007–011 (2012)

    Google Scholar 

  8. Tang, R., Fong, S., Yang, X.S., Deb, S.: Integrating Nature-Inspired Optimization Algorithms to K-means Clustering. In: Proceedings of the 7th InternationalConference on Digital Information Management (ICDIM), pp. 116–123. IEEE, Macau (2012)

    Google Scholar 

  9. Hatamlou, A., Abdullah, S., Nezamabdi-pour, H.: A Combined Approach for Clustering Based on K-means and Gravitational Search Algorithms. Elsevier, Swarm and Evolutionary Computation 6, 47–52 (2012)

    Article  Google Scholar 

  10. Forsati, R., Mahdavi, M., Shamsfar, M., Meybodi, M.R.: Efficient stochastic algorithms for document clustering. Elsevier, Information Sciences 220, 269–29 (2013)

    Google Scholar 

  11. Yang, X.S.: Nature-Inspired MetaheuristicAlgorithms. Luniver Press, United Kingdom (2011)

    Google Scholar 

  12. Lewis, D.: The reuters-21578 text categorization test collection (1999), http://kdd.ics.uci.edu/database/reuters21578/reuters21578.html

  13. Murugesan, K., Zhang, J.: Hybrid Bisect K-means Clustering Algorithm. In: Proceedings of the IEEEInternational Conference on Business Computing and Global Information, pp. 216–219. IEEE, Shanghai (2011)

    Google Scholar 

  14. Michael, S., George, K., Vipin, K.: A Comparison of Document Clustering Techniques. In: Proceedings of the KDD Workshop on Text Mining (2000)

    Google Scholar 

  15. Manning, C.D., Raghavan, P., Schütze, H.: Introduction to Information Retrieval, 1st edn. Cambridge University Press (2008)

    Google Scholar 

  16. Mohammed, A.J., Yusof, Y., Husni, H.: Weight-Based Firefly Algorithm for Document Clustering. In: Proceedings of the First International Conference on Advanced Data Engineering (DaEng 2013) (2013) (accepted for publication)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Athraa Jasim Mohammed .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Mohammed, A.J., Yusof, Y., Husni, H. (2014). A Newton’s Universal Gravitation Inspired Firefly Algorithm for Document Clustering. In: Jeong, H., S. Obaidat, M., Yen, N., Park, J. (eds) Advances in Computer Science and its Applications. Lecture Notes in Electrical Engineering, vol 279. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-41674-3_174

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-41674-3_174

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-41673-6

  • Online ISBN: 978-3-642-41674-3

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics