Skip to main content
Log in

A New Class of Depth-Size Optimal Parallel Prefix Circuits

  • Published:
The Journal of Supercomputing Aims and scope Submit manuscript

Abstract

Given n values x1, x2, ... ,xn and an associative binary operation o, the prefix problem is to compute x1 o x2 o ···  o xi, 1≤i≤n. Many combinational circuits for solving the prefix problem, called prefix circuits, have been designed. It has been proved that the size s(D(n)) and the depth d(D(n)) of an n-input prefix circuit D(n) satisfy the inequality d(D(n))+s(D(n))≥2n−2; thus, a prefix circuit is depth-size optimal if d(D(n))+s(D(n))=2n−2. In this paper, we construct a new depth-size optimal prefix circuit SL(n). In addition, we can build depth-size optimal prefix circuits whose depth can be any integer between d(SL(n)) and n−1. SL(n) has the same maximum fan-out ⌈lg n⌉+1 as Snir's SN(n), but the depth of SL(n) is smaller; thus, SL(n) is faster. Compared with another optimal prefix circuit LYD(n), d(LYD(n))+2≥d(SL(n))≥d(LYD(n)). However, LYD(n) may have a fan-out of at most 2 ⌈lg n⌉−2, and the fan-out of LYD(n) is greater than that of SL(n) for almost all n≥12. Because an operation node with greater fan-out occupies more chip area and is slower in VLSI implementation, in most cases, SL(n) needs less area and may be faster than LYD(n). Moreover, it is much easier to design SL(n) than LYD(n).

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. S. G. Akl. Parallel Computation: Models and Methods. Prentice-Hall, Upper Saddle River, N.J., 1997.

    Google Scholar 

  2. G. E. Blelloch. Scans as primitive operations. IEEE Transactions on Computers, 38:1526-1538, 1989.

    Google Scholar 

  3. R. P. Brent and H. T. Kung. A regular layout for parallel adders. IEEE Transactions on Computers, C-31:260-264, 1982.

    Google Scholar 

  4. D. A. Carlson and B. Sugla. Limited width parallel prefix circuits. The Journal of Supercomputing, 4:107-129, 1990.

    Google Scholar 

  5. R. Cole and U. Vishkin. Faster optimal parallel prefix sums and list ranking. Information and Control, 81:334-352, 1989.

    Google Scholar 

  6. F. E. Fich. New bounds for parallel prefix circuits. In Proceedings of the 15th Symposium on the Theory of Computing, pp. 100-109, 1983.

  7. C. P. Kruskal, T. Madej, and L. Rudolph. Parallel prefix on fully connected direct connection machines. In Proceedings of the 1986 International Conference on Parallel Processing, pp. 278-284, 1986.

  8. R. E. Ladner and M. J. Fischer. Parallel prefix computation. Journal of the Association for Computing Machinery, 27:831-838, 1980.

    Google Scholar 

  9. S. Lakshmivarahan and S. K. Dhall. Parallel Computing Using the Prefix Problem. Oxford University Press, Oxford, UK, 1994.

    Google Scholar 

  10. S. Lakshmivarahan, C. M. Yang, and S. K. Dhall. On a new class of optimal parallel prefix circuits with (size + depth) = 2n-2 and [log n] < depth < (2 [log n]-3). In Proceedings of the 1987 International Conference on Parallel Processing, pp. 58-65, 1987.

  11. Y. C. Lin. Optimal prefix circuits with fan-out 2. In Proceedings of the International Conference on Algorithms, pp. 175-181, Kaohsiung, Taiwan, December 1996.

    Google Scholar 

  12. A. Nicolau and H. Wang. Optimal schedule for parallel prefix computation with bounded resources. In Proceedings of the 3rd ACM SIGPLAN Symposium on Principles & Practice Parallel Programming, pp. 1-10, 1991.

  13. M. Snir. Depth-size trade-offs for parallel prefix computation. Journal of Algorithms, 7:185-201, 1986.

    Google Scholar 

  14. H. Wang, A. Nicolau, and K. S. Siu. The strict time lower bound and optimal schedules for parallel prefix with resource constraints. IEEE Transactions on Computers, 45:1257-1271, 1996.

    Google Scholar 

  15. N. H. E. Weste and K. Eshraghian. Principles of CMOS VLSI Design: A System Perspective, 2nd ed. Addison-Wesley, Reading, Mass., 1993.

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Lin, YC., Shih, CC. A New Class of Depth-Size Optimal Parallel Prefix Circuits. The Journal of Supercomputing 14, 39–52 (1999). https://doi.org/10.1023/A:1008147229964

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1023/A:1008147229964

Navigation