Skip to main content

Design Issue and Performance Analysis of Data Migration Tool in a Cloud-Based Environment

  • Conference paper
Proceedings of the 4th International Conference on Computer Engineering and Networks

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 355))

  • 821 Accesses

Abstract

With the popularization of Web applications and the emergence of cloud computing technology, database management and storage has evolved from PC to Web-based, even to cloud-based services as well. Also, with big data applications in cloud computing, more many organizations will eventually move their data from Web applications to a cloud-based environment. Except the cloud migration method, the cloud platform must provide an automatic tool in application and data migration for easy implementation. The existing Sqoop data migration tool requires users to be familiar with existing migration commands or move the data manually to a cloud-based database. Hence, the purpose of this paper is to design a new automatic data migration tool named MSCH using the migration of MySQL to HBase as an example to improve on the processing performance of Sqoop. After simulations, the experimental results show that the MSCH migration tool was 33 %, 25 %, and 15 % faster than Sqoop in terms of number of entries. CPU utilization during migration was also reduced by 35 %, 43 %, and 33 %, while memory usage was reduced by 90 %, 72 %, and 74 %, proving that the MSCH data migration tool shortens data migration time and lowers the load on computing resources to enhance the performance in data migration under cloud-based environment. Further studies will be conducted to determine the migration accuracy ratio of MSCH migration tools and Sqoop.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 259.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 329.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 329.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Oriaku C, Lami IA. Holistic view angles of cloud computing services. In: Provisions, 2012: International conference on cyber-enabled distributed computing and knowledge discovery. IEEE New York, NY, USA. 2012. pp. 97–105.

    Google Scholar 

  2. Roitman H, Schenkel R, Grobelnik M. Search and mining entity-relationship data. In: CIKM ’11: Proceedings of the 20th ACM international conference on information and knowledge management, ACM New York, NY, USA. 2011. pp. 2639–40.

    Google Scholar 

  3. The Apache Software Foundation. Apache Sqoop. http://sqoop.apache.org. Accessed 20 Mar 2014.

  4. Mokhtar SA, Ali SHS, Al-Sharafi A, Abdulaziz Aborujilah. Cloud computing in academic institutions. In: ICUIMC ’13: Proceedings of the 7th international conference on ubiquitous information management and communication, ACM New York, NY, USA. 2013.

    Google Scholar 

  5. The Apache Software Foundation. Apache Hadoop. http://hadoop.apache.org. Accessed 12 Apr 2014.

  6. Nikhil RS. Abstraction in hardware system design. ACM Queue. 2011;9(18):1–15.

    Google Scholar 

  7. Brown RA. Hadoop at home: large-scale computing at a small college. In: Proceedings of the 40th ACM technical symposium on computer science education, ACM New York, NY, USA. 2009. pp. 106–10.

    Google Scholar 

  8. Chen R, Chen H. Tiled-MapReduce: efficient and flexible MapReduce. In: Proceedings on multicore with tiling, ACM transactions on architecture and code optimization, vol. 10, no. 1, Article 3. ACM New York, NY, USA. 2013. pp. 3:1–30.

  9. Gao X, Nachankar V, Qiu J. Experimenting lucene index on HBase in an HPC environment. In: HPCDB ’11: Proceedings of the first annual workshop on high performance computing meets databases, ACM New York, NY, USA. 2011. pp. 25–8.

    Google Scholar 

  10. The Apache Software Foundation. The apache software foundation blogging in action. https://blogs.apache.org/sqoop/entry/apache_sqoop_highlights_of_sqoo. Accessed 18 Apr 2014.

  11. Borthakur D, Gray J, Sarma JS, et al. Apache Hadoop goes realtime at facebook. In: SIGMOD ’11: Proceedings of the 2011 ACM SIGMOD international conference on management of data, ACM New York, NY, USA. 2011. pp. 1071–80.

    Google Scholar 

  12. Rashmi, Mehfuz S, Sahoo GB. A five-phased approach for the cloud migration. Int J Emerg Technol Adv Eng. 2012;2(4):286–91.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Shin-Jer Yang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Yang, SJ., Tu, CC., Lin, J. (2015). Design Issue and Performance Analysis of Data Migration Tool in a Cloud-Based Environment. In: Wong, W. (eds) Proceedings of the 4th International Conference on Computer Engineering and Networks. Lecture Notes in Electrical Engineering, vol 355. Springer, Cham. https://doi.org/10.1007/978-3-319-11104-9_87

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-11104-9_87

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-11103-2

  • Online ISBN: 978-3-319-11104-9

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics