Abstract
By performance improvement of computers and expansion of experiment facilities, output data having became huge. In near future, the output data will become petabyte (PB)-scale. It will become increasingly important how huge data is analyzed efficiently and derive useful information. To analysis huge data efficiently, we are constructing large-scale data integrated analysis system which treats terabytes-petabytes data. In this system, two elemental technologies, i.e., heterogeneous processor and distributed parallel computing framework with fault-tolerance are implemented. The former and the latter are effective for computation dominant processes and data I/O dominant processes, respectively. First, we have applied acceleration by the heterogeneous processor to experimental data and estimated its performance. The processor accelerated experimental data processing substantially. Next, then we have constructed a prototype of distributed parallel computing system for simulation data and carried out processing test. We have found the notice points for application these elemental techniques.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Suzuki, Y., Nishida, A., Araya, F., Kushida, N., Akutsu, T., Teshima, N., Nakajima, K., Kondo, M., Hayashi, S., Aoyagi, T., Nakajima, N. (2009): Development of Three-dimensional Virtual Plant Vibration Simulator on Grid Computing Environment ITBL-IS/AEGIS, Journal of Power and Energy Systems 3, No. 1: 60–71.
Sato, T., Takahashi, T., Yoshikawa, K. eds. (2009): Particle and Nuclear Physics at J-PARC, Lecture Notes in Physics 781: 1–193.
Suzuki, Y., Nakajima, K., Kushida, N., Kino, C., Minami, T., Matsumoto, N., Aoyagi, T., Nakajima, N., Iba, K., Hayashi, N., Ozeki, T., Totsuka, T., Nakanishi, H., Nagayama, Y. (2008): Research and development of fusion grid infrastructure based on atomic energy grid infrastructure (AEGIS). Sixth IAEA Technical Meeting on Control, Data Acquisition, and Remote Participation for Fusion Research (4-8 June 2007, Inuyama, Japan), Fusion Engineering and Design 83:511–515.
Kureta, M., Akimoto, H., Hibiki, T., Mishima, K. (2001): Void Fraction Measurement in Subcooled-Boiling Flow Using High-Frame-Rate Neutron Radiography, Nuclear Technology 136: 241–251.
Kahle, J. A., Day, M. N., Hofstee, H. P., Johns, C. R., Maeurer, T. R., Shippy, D. (2005): Introduction to the Cell multiprocessor, IBM J. Research and Development 49, no. 4/5:589–604.
White, T. (2010): Hadoop: The Definitive Guide (Yahoo! Press).
Ghemawat, S., Gobioff, H., Leung, S.-T. (2003): The Google File System, ACM SIGOPS Operating Systems Review 37, Issue 5:29–43.
Dean, J. and Ghemawat, S. (2004): MapReduce: Simplified Data Processing on Large Clusters, Proceedings of the 6th Symposium on Operating Systems Design and Implementation (December 6–8, 2004, San Francisco, CA).
Shvachko, K., Kuang, H., Radia, S., Chansler, R. (2010): The Hadoop Distributed File System, IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST): 1–10.
O’mally, O. and Murthy, A. C. (2009): Winning a 60 Second Dash with a Yellow Elephant.
Nakamura Miyamura, H., Hayashi, S., Suzuki, Y., Takemiya, H. (2010): Spatio-temporal Mapping-A Technique for Overview Visualization of Time-series Data-set-, The Joint International Conference of the 7th Supercomputing in Nuclear Application and the 3rd Monte Carlo (SNA+MC2010) (October 17–21, 2010, Tokyo, Japan).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Tatekawa, T., Teshima, N., Kushida, N., Miyamura, H.N., Kim, G., Takemiya, H. (2011). High Performance Computing for Analyzing PB-Scale Data in Nuclear Experiments and Simulations. In: Resch, M., Wang, X., Bez, W., Focht, E., Kobayashi, H., Roller, S. (eds) High Performance Computing on Vector Systems 2011. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-22244-3_8
Download citation
DOI: https://doi.org/10.1007/978-3-642-22244-3_8
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-22243-6
Online ISBN: 978-3-642-22244-3
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)