Configure Scheme of Mixed Computer Architecture for FMM Algorithm
Along with the scale expansion of high performance computing, accelerators are increasingly viewed as computer coprocessors that can provide significant computational performance at low price. Thus, research of mixed computer architecture is becoming popular. This paper presents a mixed configurable computer architecture which can run fast multipole method (FMM) algorithm of N-Body problem well. Each sub-procedure of FMM algorithm is implemented and tested on GPU, FPGA and CELL. FMM is optimized on the proposed configure scheme through decomposing its task flow. The probable solution for different task flow is also put forward. The conclusion is significant to the research on the mixed computer architecture of high performance computing.
KeywordsGraphical Processing Unit Field Programmable Gate Array High Performance Computing Computer Architecture Multipole Expansion
Unable to display preview. Download preview PDF.
- 3.Che, S., Li, J., Sheaffer, J.W., et al.: Accelerating Compute -Intensive Applications with GPUs and FPGAs. In: Proc. of the IEEE Symposium on Application Specific Processors, SASP (June 2008)Google Scholar