Skip to main content

High Performance Computing for Analyzing PB-Scale Data in Nuclear Experiments and Simulations

  • Conference paper
  • First Online:

Abstract

By performance improvement of computers and expansion of experiment facilities, output data having became huge. In near future, the output data will become petabyte (PB)-scale. It will become increasingly important how huge data is analyzed efficiently and derive useful information. To analysis huge data efficiently, we are constructing large-scale data integrated analysis system which treats terabytes-petabytes data. In this system, two elemental technologies, i.e., heterogeneous processor and distributed parallel computing framework with fault-tolerance are implemented. The former and the latter are effective for computation dominant processes and data I/O dominant processes, respectively. First, we have applied acceleration by the heterogeneous processor to experimental data and estimated its performance. The processor accelerated experimental data processing substantially. Next, then we have constructed a prototype of distributed parallel computing system for simulation data and carried out processing test. We have found the notice points for application these elemental techniques.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   109.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   149.00
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   139.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Suzuki, Y., Nishida, A., Araya, F., Kushida, N., Akutsu, T., Teshima, N., Nakajima, K., Kondo, M., Hayashi, S., Aoyagi, T., Nakajima, N. (2009): Development of Three-dimensional Virtual Plant Vibration Simulator on Grid Computing Environment ITBL-IS/AEGIS, Journal of Power and Energy Systems 3, No. 1: 60–71.

    Google Scholar 

  2. Sato, T., Takahashi, T., Yoshikawa, K. eds. (2009): Particle and Nuclear Physics at J-PARC, Lecture Notes in Physics 781: 1–193.

    Google Scholar 

  3. Suzuki, Y., Nakajima, K., Kushida, N., Kino, C., Minami, T., Matsumoto, N., Aoyagi, T., Nakajima, N., Iba, K., Hayashi, N., Ozeki, T., Totsuka, T., Nakanishi, H., Nagayama, Y. (2008): Research and development of fusion grid infrastructure based on atomic energy grid infrastructure (AEGIS). Sixth IAEA Technical Meeting on Control, Data Acquisition, and Remote Participation for Fusion Research (4-8 June 2007, Inuyama, Japan), Fusion Engineering and Design 83:511–515.

    Google Scholar 

  4. Kureta, M., Akimoto, H., Hibiki, T., Mishima, K. (2001): Void Fraction Measurement in Subcooled-Boiling Flow Using High-Frame-Rate Neutron Radiography, Nuclear Technology 136: 241–251.

    Google Scholar 

  5. Kahle, J. A., Day, M. N., Hofstee, H. P., Johns, C. R., Maeurer, T. R., Shippy, D. (2005): Introduction to the Cell multiprocessor, IBM J. Research and Development 49, no. 4/5:589–604.

    Article  Google Scholar 

  6. White, T. (2010): Hadoop: The Definitive Guide (Yahoo! Press).

    Google Scholar 

  7. Ghemawat, S., Gobioff, H., Leung, S.-T. (2003): The Google File System, ACM SIGOPS Operating Systems Review 37, Issue 5:29–43.

    Google Scholar 

  8. Dean, J. and Ghemawat, S. (2004): MapReduce: Simplified Data Processing on Large Clusters, Proceedings of the 6th Symposium on Operating Systems Design and Implementation (December 6–8, 2004, San Francisco, CA).

    Google Scholar 

  9. Shvachko, K., Kuang, H., Radia, S., Chansler, R. (2010): The Hadoop Distributed File System, IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST): 1–10.

    Google Scholar 

  10. O’mally, O. and Murthy, A. C. (2009): Winning a 60 Second Dash with a Yellow Elephant.

    Google Scholar 

  11. Nakamura Miyamura, H., Hayashi, S., Suzuki, Y., Takemiya, H. (2010): Spatio-temporal Mapping-A Technique for Overview Visualization of Time-series Data-set-, The Joint International Conference of the 7th Supercomputing in Nuclear Application and the 3rd Monte Carlo (SNA+MC2010) (October 17–21, 2010, Tokyo, Japan).

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Takayuki Tatekawa .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Tatekawa, T., Teshima, N., Kushida, N., Miyamura, H.N., Kim, G., Takemiya, H. (2011). High Performance Computing for Analyzing PB-Scale Data in Nuclear Experiments and Simulations. In: Resch, M., Wang, X., Bez, W., Focht, E., Kobayashi, H., Roller, S. (eds) High Performance Computing on Vector Systems 2011. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-22244-3_8

Download citation

Publish with us

Policies and ethics