Abstract
Stream processors, with the stream programming model, have demonstrated significant performance advantages in the domains signal processing, multimedia and graphics applications, and are covering scientific applications. In this paper we examine the applicability of a stream processor to 2-D stencil scientific applications, an important and widely used class of scientific applications, which compute values using neighboring array elements in a fixed stencil pattern. We first map 2-D stencil scientific applications in FORTRAN version to the stream processor in a straightforward way. In a stream processor system, the management of system resources is the programmers’ responsibility. We then present several optimizations, which avail the stream program for 2-D stencil scientific applications, of various aspects of the stream processor architecture. Finally, we analyze the performance of optimized 2-D stencil scientific stream applications, with the presented optimizations. The final stream scientific programs gain from 2.56 to 7.62 times faster than the corresponding FORTRAN programs on a Xeon processor, with the optimizations playing an important role in realizing the performance improvement.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Rixner S (2001) Stream processor architecture. Kluwer Academic Publishers, Boston
Kapasi U, Dally W, Rixner S, Owens J, Khailany B (2002) The imagine stream processor. In: Proceedings of 2002 IEEE international conference on computer design, pp 282–288
Gordon M, Maze D, Amarasinghe S, Thies W, Karczmarek M, Lin J, Meli A, Lamb A, Leger C, Wong J et al (2002) A stream compiler for communication-exposed architectures. ACM SIGARCH Comput Archit News 30(5):291–303
Fatica M, Jameson A, Alonso J STREAMFLO: an Euler solver for streaming architectures, submitted to AIAA conference
Kapasi U, Rixner S, Dally W, Khailany B, Ahn J, Mattson P, Owens J (2003) Programmable stream processors. Computer 36(8):54–62
Das A, Dally WJ, Mattson P (2006) Compiling for stream processing. In: proceedings of the 15th international conference on parallel architectures and compilation techniques PACT ’06. ACM Press, New York, pp 33–42
Buck I, Foley T, Horn D, Sugerman J, Fatahalian K, Houston M, Hanrahan P (2004) Brook for gpus: stream computing on graphics hardware. ACM Trans Graph 23(3):777–786
Kahle JA, Day MN, Hofstee HP, Johns CR, Maeurer TR, Shippy D (2005) Introduction to the cellmultiprocessor. IBM J Res Dev 49(4/5):589–604
Acknowledgments
This work was supported by NSFC (61003075, 61103193,61103011, 61103014).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer Science+Business Media Dordrecht(Outside the USA)
About this paper
Cite this paper
Zhang, Y., Li, G., Zhou, H., Lu, P., Sun, C., Dou, Q. (2013). Mapping and Optimizing 2-D Scientific Applications on a Stream Processor. In: Park, J., Ng, JY., Jeong, HY., Waluyo, B. (eds) Multimedia and Ubiquitous Engineering. Lecture Notes in Electrical Engineering, vol 240. Springer, Dordrecht. https://doi.org/10.1007/978-94-007-6738-6_55
Download citation
DOI: https://doi.org/10.1007/978-94-007-6738-6_55
Published:
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-007-6737-9
Online ISBN: 978-94-007-6738-6
eBook Packages: EngineeringEngineering (R0)