Abstract
We consider the problem of automatic mapping of computation-intensive loop nests onto FPGA hardware. The regular cell array structure of these chips reflects the parallelism in regular loop-like computations. Furthermore, the flexibility of FPGAs allows the cost-effective implementation of reconfigurable high performance processor arrays. So far, there exists no continuous design flow that allows automated generation of FPGA configuration data from a loop nest specified in a high level language. Here, we present a methodology for automatic generation of synthesizable VHDL code specifying a processor array and optimized for FPGA implementation.
Similar content being viewed by others
References
V. Baumgarte, F. M. A. Nückel, M. Vorbach, and M. Weinhardt. PACT XPP-A self-reconfigurable data processing architecture. In Proceedings of the Int. Conf. on Engineering of Reconfigurable Systems and Algorithms (ERSA), 2001.
M. Bednara, O. Beyer, J. Teich, and R. Wanka. Tradeoff analysis and architecture design of a hybrid hardware/software sorter. In ASAP00-Proc. Int. Conf. on Application-Specific Systems, Architectures, and Processors, Boston, U.S.A., pp. 299–308, 2000.
M. Bednara, F. Hannig, and J. Teich. Generation of distributed loop control. In E. F. Deprettere, J. Teich and S. Vassiliadis, eds., Embedded Processor Design Challenges: Systems, Architectures, Modeling, and Simulation-SAMOS (LNCS2268), Springer, 2002.
Chameleon. Wireless base station design using reconfigurable communications processors (http://www.chameleonsystems.com/whitepapers/whitepapers.html). Technical report, Chameleon Systems, Inc., 2000.
P. Feautrier. Automatic parallelization in the polytope model. Technical Report 8, Laboratoire PRiSM, Université des Versailles St-Quentin en Yvelines, 45, avenue des E´tats-Unis, F-78035 Versailles Cedex, 1996.
B. Kienhuis. MatParser: An array dataflow analysis compiler. Technical report, Department EECS, University of California at Berkeley, Cory Hall 524, Berkeley, California, 94720, USA, 2000.
R. Kuhn. Transforming algorithms for single-stage andVLSI architectures. Workshop Interconnection Networks for Parallel and Distributed Processing, 1980.
D. Moldovan. On the design of algorithms for VLSI systolic arrays. In Proceedings of the IEEE, Vol. 71, pp. 113–120, 1983.
J. Teich. A compiler for application-specific processor arrays. Ph.D. thesis, Institut für Mikroelektronik, Universität des Saarlandes, Saarbrücken, Deutschland, 1993.
J. Teich and L. Thiele. Control generation in the design of processor arrays. Int. Journal on VLSI and Signal Processing, 3(2):77–92, 1991.
J. Teich and L. Thiele. Partitioning of processor arrays: A piecewise regular approach. INTEGRATION: The VLSI Journal, 14(3):297–332, 1993.
J. Teich, L. Thiele, and L. Zhang. Scheduling of partitioned regular algorithms on processor arrays with constrained resources. In ASAP96 Proc. Int. Conf. on Application-Specific Systems, Architectures, and Processors, Chicago, U.S.A., pp. 131–144, 1996.
J. Teich, L. Thiele, and L. Zhang. Partitioning processor arrays under resource constraints. Int. Journal of VLSI Signal Processing, 17(1):5–20, 1997.
L. Thiele Scheduling of uniform algorithms with resource constraints. Journal of VLSI Signal Processing, 10:295–310, 1995.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Bednara, M., Teich, J. Automatic Synthesis of FPGA Processor Arrays from Loop Algorithms. The Journal of Supercomputing 26, 149–165 (2003). https://doi.org/10.1023/A:1024447517501
Issue Date:
DOI: https://doi.org/10.1023/A:1024447517501