Skip to main content

Machine Learning Techniques for Prediction of Pre-fetched Objects in Handling Big Data Storage

  • Conference paper
  • First Online:
Data Mining and Big Data (DMBD 2017)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10387))

Included in the following conference series:

  • 3850 Accesses

Abstract

Large data storage has to serve high volume transactions of data everyday when users request the data that can cause latency. Therefore, intelligent methods are required to solve the insufficient data storage experienced by some providers. Pre-fetching technique is one of the best techniques that enable assuming the data will be needed by the user in the near future. Consequently, users easily access their data at high speed to avoid latency. However, pre-fetch the wrong objects cause slow down the data management performance. In this context, this research proposes Machine Learning (ML) techniques to predicting the pre-fetched objects accurately. This paper also compares the Rough Decision Tree (RDT) with others ML techniques including J48 Decision Tree, Random Tree (RT), Naïve Bayes (NB), and Rough Set (RS). The experimental results reveal the propose RDT performs better compared with RS single-alone. However, J48 performs well in classifying the web objects for IrCache, UTM blog data, and Proxy Cloud Storage (CS) data sets. Hence, J48 was proposing to be implementing into the future work of mobile cloud storage services.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Roudaki, A., Kong, J., Yu, N.: A classification of web browsing on mobile devices. J. Vis. Lang. Comput. 26, 82–98 (2015)

    Article  Google Scholar 

  2. Hussien, N.S., Sulaiman, S.: Mobile cloud computing architecture on data management for big data storage. Int. J. Adv. Soft Comput. Appl. 8, 139–160 (2016)

    Google Scholar 

  3. Gao, J., Bai, X., Tsai, W.: Cloud testing- issues, challenges, needs and practice. Int. J. 1, 9–23 (2011)

    Google Scholar 

  4. Kumar, P.N.V., Reddy, V.R.: Novel web proxy cache replacement algorithms using machine learning. Int. J. Eng. Sci. Res. Technol. 3, 339–346 (2014)

    Google Scholar 

  5. Sulaiman, S., Shamsuddin, S.M., Abraham, A.: Meaningless to meaningful web log data for generation of web pre-caching decision rules using rough set. In: 4th Conference on Data Mining and Optimization, vol. 1, pp. 2–4 (2012)

    Google Scholar 

  6. Gupta, A.: A survey on stock market prediction using various algorithms. Int. J. Comput. Technol. Appl. 5, 530–533 (2014)

    Google Scholar 

  7. Kim, Y., Enke, D.: Developing a rule change trading system for the futures market using rough set analysis. Expert Syst. Appl. 59, 165–173 (2016)

    Article  Google Scholar 

  8. Sathiyamoorthi, V., Bhaskaran, M.: Data preprocessing techniques for pre-fetching and caching of web data through proxy server. Int. J. Comput. Sci. Netw. Secur. 11, 92–98 (2011)

    Google Scholar 

  9. Singh, N., Panwar, A., Raw, R.S.: Enhancing the performance of web proxy server through cluster based prefetching techniques. In: International Conference on Advances in Computing, Communications and Informatics, pp. 1158–1165 (2013)

    Google Scholar 

  10. Johann, M., Dom, J., Gil, A., Pont, A.: Exploring the benefits of caching and prefetching in the mobile web. In: 2nd IFIP International Symposium on Wireless Communications and Information Technology in Developing Countries, Pretoria, South Africa (2008)

    Google Scholar 

  11. Singh, A., Singh, A.K.: Web pre-fetching at proxy server using sequential data mining. In: 2012 Third International Conference on Computer Communication Technology, pp. 20–25 (2012)

    Google Scholar 

  12. Chang, J.-H., Lai, C.-F., Wang, M.-S., Wu, T.-Y.: A cloud-based intelligent TV program recommendation system. Int. J. Comput. Electr. Eng. 39, 2379–2399 (2013)

    Article  Google Scholar 

  13. Zissis, D., Xidias, E.K., Lekkas, D.: A cloud based architecture capable of perceiving and predicting multiple vessel behaviour. Appl. Soft Comput. 35, 652–661 (2015)

    Article  Google Scholar 

  14. Alemeye, F., Getahun, F.: Cloud readiness assessment framework and recommendation system. In: AFRICON 2015, Addis Ababa, pp. 1–5 (2015)

    Google Scholar 

  15. Witten, I.H., Frank, E.: Machine learning algorithms in java nuts and bolts: machine (2000)

    Google Scholar 

  16. Patil, T.R., Sherekar, S.S.: Performance analysis of naive bayes and j48 classification algorithm for data classification. Int. J. Comput. Sci. Appl. 6, 256–261 (2013)

    Google Scholar 

  17. Suarez-Tangil, G., Tapiador, J.E., Peris-Lopez, P., Pastrana, S.: Power-aware anomaly detection in smartphones: an analysis of on-platform versus externalized operation. Pervasive Mob. Comput. 18, 137–151 (2015)

    Article  Google Scholar 

  18. Gao, W., Grossman, R., Gu, Y., Yu, P.S.: Why Naive ensembles do not work in cloud computing. In: IEEE International Computing Society, pp. 282–289 (2009)

    Google Scholar 

  19. Pawlak, Z.: Rough sets. Int. J. Comput. Inf. Sci. 11, 341–356 (1982)

    Article  MATH  Google Scholar 

  20. Sulaiman, S., Shamsuddin, S.M., Abraham, A., Sulaiman, S.: Rough set granularity in mobile web pre-caching. In: Eighth International Conference on Intelligent Systems Design and Applications, vol. 1, pp. 587–592 (2008)

    Google Scholar 

  21. Torgeir R. Hvidsten: A tutorial-based guide to the ROSETTA system: a rough set toolkit for analysis of data, pp. 1–44 (2013)

    Google Scholar 

  22. Sabitha, B., Amma, N.G.B., Annapoorani, G., Balasubramanian, P.: Implementation of data mining techniques to perform market analysis. Int. J. Innov. Res. Comput. Commun. Eng. 2, 7003–7008 (2014)

    Google Scholar 

  23. Tiwari, S., Pandit, R., Richhariya, V.: Predicting future trends in stock market by decision tree rough-set based hybrid system with HHMM. Int. J. Electron. Comput. Sci. Eng. 3, 1–10 (2010)

    Google Scholar 

  24. Voges, K.E., Pope, N.K.L.: Rough clustering using an evolutionary algorithm. In: 2012 45th Hawaii International Conference on System Science, pp. 1138–1145 (2012)

    Google Scholar 

  25. Moorthy, N.S.H.N., Poongavanam, V.: The KNIME based classification models for yellow fever virus inhibition. RSC Adv. R. Soc. Chem. 5, 14663–14669 (2015)

    Article  Google Scholar 

  26. Rojas, I., Work-conference, I., Hutchison, D.: Advances in Computational (2013)

    Google Scholar 

Download references

Acknowledgements

This research is supported by Ministry of Higher Education Malaysia (MOHE), Ministry of Science, Technology and Innovation Malaysia (MOSTI) and Universiti Teknologi Malaysia (UTM). This paper is financially supported by E-Science Fund, R.J130000.7928.4S117, PRGS Grant, R.J130000.7828.4L680, GUP Tier 1 UTM, Q.J130000.2528.13H48, FRGS Grant, R.J130000.7828.4F634 and IDG Grant, R.J130000.7728.4J170. The authors would like to express their deepest gratitude to IrCache.net and CICT, UTM for their support in providing the datasets to ensure the success of this research, as well as Soft Computing Research Group (SCRG) for their continuous support and fondness in making this research possible.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sarina Sulaiman .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Hussien, N.S., Sulaiman, S., Shamsuddin, S.M. (2017). Machine Learning Techniques for Prediction of Pre-fetched Objects in Handling Big Data Storage. In: Tan, Y., Takagi, H., Shi, Y. (eds) Data Mining and Big Data. DMBD 2017. Lecture Notes in Computer Science(), vol 10387. Springer, Cham. https://doi.org/10.1007/978-3-319-61845-6_12

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-61845-6_12

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-61844-9

  • Online ISBN: 978-3-319-61845-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics