Aspect-Based Text Summarization Using MapReduce Optimization

  • V. PriyaEmail author
  • K. Umamaheswari
Part of the EAI/Springer Innovations in Communication and Computing book series (EAISICC)


Aspect-based summarization techniques require improvement in accuracy of the generated summaries. These systems show significant role in the analysis of web review contents. Also they are useful to analyse the reviews in the web which increases drastically day by day. So the generated summary representing the entire review in a concise manner for each feature or aspect in the reviews would be useful for users. An overview about aspect-based summarization system is provided along with the proposed MapReduce based on node optimization algorithm. The system uses MapReduce framework and an in-node mapper, reducer algorithm is devised for generating summaries. The accuracy of the system-generated summary is better than existing MapReduce-based summarization systems.


Text summarization MapReduce In-node mapper Optimization Partitioner 


  1. Blake, C. (2006). A comparison of document, sentence and term event spaces. In Joint 21st international conference on Computational Linguistics (COLING) and the 44th annual meeting of the Association for Computational Linguistics (ACL), Sydney, Australia, 17–21 July 2006 (pp. 601–608). New York: ACM.Google Scholar
  2. Boiy, E., & Moens, M. (2009). A machine learning approach to sentiment analysis in multilingual web texts. Journal of Information Retrieval, 12, 526–558.CrossRefGoogle Scholar
  3. Dean, J., & Ghemawat, S. (2008). MapReduce: Simplified data processing on large clusters. Communications of ACM, 51, 107–113.CrossRefGoogle Scholar
  4. Deerwester, S., Dumais, S. T., Furnas, G. W., et al. (1990). Indexing by latent semantic analysis. Journal of the American Society for Information Science, 41, 391–407.CrossRefGoogle Scholar
  5. Ferreira, R., et al. (2013). Assessing sentence scoring techniques for extractive text summarization. Expert Systems with Applications, 40(14), 5755–5764.CrossRefGoogle Scholar
  6. Gamon, M., Aue, A., Corston-Oliver, S., & Ringger, E. (2005). Pulse: Mining customer opinions from free text. In Sixth international symposium on Intelligent Data Analysis (IDA), Madrid, Spain, 8–10 September 2005. Paper no. LNCS 3646 (pp. 121–132). Heidelberg: Springer.Google Scholar
  7. Golghate, A. A., & Shende, S. W. (2014). Parallel K-means clustering based on Hadoop and Hama. International Journal of Computing and Technology, 1, 33–37.CrossRefGoogle Scholar
  8. Gomaa, W. H., & Fahmy, A. A. (2013). A survey of text similarity approaches. International Journal of Computer Applications, 68(13), 13–18.CrossRefGoogle Scholar
  9. Hotel Datasets. (2001). Accessed 22 Jan 2016.
  10. Jayashri, K., & Mayura, K. (2013). Latent semantic analysis used for Mobile rating and review summarization. International Journal of Computer Science and Telecommunication, 4, 61–67.Google Scholar
  11. Large Movie Review Dataset. (2001). Accessed 22 Jan 2016.
  12. Lin, C. Y. (2004). ROUGE: A package for automatic evaluation summaries. In Workshop on text summarization branches out, Barcelona, Spain, 25–26 July 2004 (pp. 74–81). Barcelona, Spain: ACL.Google Scholar
  13. Lin, J., & Dyer, C. (2010). Data-intensive text processing with MapReduce. University of Maryland, College Park Manuscript (pp. 28–30). San Rafael: Morgan & Claypool Publishers.Google Scholar
  14. Nenkova, A., & McKeown, K. (2012). A survey of text summarization techniques. In Mining text data (pp. 43–76). Heidelberg: Springer.CrossRefGoogle Scholar
  15. Priya, V., & Umamaheswari, K. (2016). Ensemble based parallel k means using MapReduce for aspect based summarization. In International conference on informatics and analytics article no 26, Pondicherry, India, 25–26 August 2016, Paper No 26. New York: ACM.Google Scholar
  16. Shah, N., & Mahajan, S. (2014). Distributed document clustering using K-means. International Journal of Advanced Research in Computer Science and Software Engineering, 4, 24–29.Google Scholar
  17. Tadano, R., Shimada, K., & Endo, T. (2010). Multi-aspects review summarization based on identification of important opinions and their similarity. In 24th Pacific Asia conference on language, information and computation (PACLIC), Sendai, Japan, 4–7 November 2010 (pp. 685–692). Sendai, Japan: Institute for Digital Enhancement of Cognitive Development.Google Scholar
  18. Umamaheswari, K., & Priya, V. (2016). Aspect ranking based on author specific information aggregation. Journal of Scientific and Industrial Research, 75, 534–539.Google Scholar
  19. Woo-Hyun Lee, Hee-Gook Jun, & Hyoung-Joo Kim. (2015). Hadoop Mapreduce performance enhancement using in-node combiners. International Journal of Computer Science & Information Technology, 7(5), 1–17.Google Scholar
  20. Zhao Weizhong, Ma Huifang, & He Qing. (2009). Parallel K-means clustering based on MapReduce. In First international conference on CloudCom 2009, Beijing, China, 1–4 December 2009, Paper no. LNCS 5931 (pp. 674–679). Heidelberg: Springer.Google Scholar
  21. Zhuang, L., Jing, F., & Zhu, X. (2006). Movie review mining and summarization. In International Conference on Information and Knowledge Management (CIKM), Arlington, VA, USA, 5–11 November 2006 (pp. 43–50). New York: ACM.Google Scholar

Copyright information

© Springer Nature Switzerland AG 2019

Authors and Affiliations

  1. 1.Department of Computer Science and EngineeringDr Mahalingam College of Engineering and TechnologyPollachiIndia
  2. 2.Department of Information TechnologyPSG College of TechnologyCoimbatoreIndia

Personalised recommendations