Abstract
Given n values x1, x2, ... ,xn and an associative binary operation o, the prefix problem is to compute x1 o x2 o ··· o xi, 1≤i≤n. Many combinational circuits for solving the prefix problem, called prefix circuits, have been designed. It has been proved that the size s(D(n)) and the depth d(D(n)) of an n-input prefix circuit D(n) satisfy the inequality d(D(n))+s(D(n))≥2n−2; thus, a prefix circuit is depth-size optimal if d(D(n))+s(D(n))=2n−2. In this paper, we construct a new depth-size optimal prefix circuit SL(n). In addition, we can build depth-size optimal prefix circuits whose depth can be any integer between d(SL(n)) and n−1. SL(n) has the same maximum fan-out ⌈lg n⌉+1 as Snir's SN(n), but the depth of SL(n) is smaller; thus, SL(n) is faster. Compared with another optimal prefix circuit LYD(n), d(LYD(n))+2≥d(SL(n))≥d(LYD(n)). However, LYD(n) may have a fan-out of at most 2 ⌈lg n⌉−2, and the fan-out of LYD(n) is greater than that of SL(n) for almost all n≥12. Because an operation node with greater fan-out occupies more chip area and is slower in VLSI implementation, in most cases, SL(n) needs less area and may be faster than LYD(n). Moreover, it is much easier to design SL(n) than LYD(n).
Similar content being viewed by others
References
S. G. Akl. Parallel Computation: Models and Methods. Prentice-Hall, Upper Saddle River, N.J., 1997.
G. E. Blelloch. Scans as primitive operations. IEEE Transactions on Computers, 38:1526-1538, 1989.
R. P. Brent and H. T. Kung. A regular layout for parallel adders. IEEE Transactions on Computers, C-31:260-264, 1982.
D. A. Carlson and B. Sugla. Limited width parallel prefix circuits. The Journal of Supercomputing, 4:107-129, 1990.
R. Cole and U. Vishkin. Faster optimal parallel prefix sums and list ranking. Information and Control, 81:334-352, 1989.
F. E. Fich. New bounds for parallel prefix circuits. In Proceedings of the 15th Symposium on the Theory of Computing, pp. 100-109, 1983.
C. P. Kruskal, T. Madej, and L. Rudolph. Parallel prefix on fully connected direct connection machines. In Proceedings of the 1986 International Conference on Parallel Processing, pp. 278-284, 1986.
R. E. Ladner and M. J. Fischer. Parallel prefix computation. Journal of the Association for Computing Machinery, 27:831-838, 1980.
S. Lakshmivarahan and S. K. Dhall. Parallel Computing Using the Prefix Problem. Oxford University Press, Oxford, UK, 1994.
S. Lakshmivarahan, C. M. Yang, and S. K. Dhall. On a new class of optimal parallel prefix circuits with (size + depth) = 2n-2 and [log n] < depth < (2 [log n]-3). In Proceedings of the 1987 International Conference on Parallel Processing, pp. 58-65, 1987.
Y. C. Lin. Optimal prefix circuits with fan-out 2. In Proceedings of the International Conference on Algorithms, pp. 175-181, Kaohsiung, Taiwan, December 1996.
A. Nicolau and H. Wang. Optimal schedule for parallel prefix computation with bounded resources. In Proceedings of the 3rd ACM SIGPLAN Symposium on Principles & Practice Parallel Programming, pp. 1-10, 1991.
M. Snir. Depth-size trade-offs for parallel prefix computation. Journal of Algorithms, 7:185-201, 1986.
H. Wang, A. Nicolau, and K. S. Siu. The strict time lower bound and optimal schedules for parallel prefix with resource constraints. IEEE Transactions on Computers, 45:1257-1271, 1996.
N. H. E. Weste and K. Eshraghian. Principles of CMOS VLSI Design: A System Perspective, 2nd ed. Addison-Wesley, Reading, Mass., 1993.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Lin, YC., Shih, CC. A New Class of Depth-Size Optimal Parallel Prefix Circuits. The Journal of Supercomputing 14, 39–52 (1999). https://doi.org/10.1023/A:1008147229964
Issue Date:
DOI: https://doi.org/10.1023/A:1008147229964