Abstract
Tandem builds single-fault-tolerant computer systems. At the hardware level, the system is designed as a loosely coupled multi-processor with fail-fast modules connected via dual paths. It is designed for online diagnosis and maintenance. A range of CPUs may be inter- connected via a hierarchical fault-tolerant local network. A variety of peripherals needed for online transaction processing are attached via dual ported controllers. A novel disc subsystem allows a choice between low cost-per-Mbyte and low cost-per-access. System software provides processes and messages as the basic structuring mechanism. Processes provide software modularity and fault isolation. Process pairs tolerate hardware and transient software failures. Applications are structured as requesting processes making remote procedure calls to server processes. Process server classes utilize multi-processors. The resulting process abstractions provide a distributed system which can utilize thousands of processors. Networking protocols such as SNA, OSI, and a proprietary network are built atop this base. A relational database provides distributed data and distributed transactions. An application generator allows users to develop fault-tolerant applications as though the system were a conventional computer. The resulting system has price/performance competitive with conventional systems.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bartlett, J., “A NonStop Kernel,” Proceedings of the Eighth Symposium on Operating System Principles, pp. 22–29, Dec. 1981.
Borr, A., “Transaction Monitoring in ENCOMPASS,” Proc. 7Th VLDB, September 1981. Also Tandem Computers TR 81. 2.
Borr, A., “Robustness to Crash in a Distributed Database: A Non Shared-Memory Multi-processor Approach,” Proc. 9th VLDB, Sept. 1984. Also Tandem Computers TR 84. 2.
Burman, M. “Aspects of a High Volume Production Online Banking System”, Proc. Int. Workshop on High Performance Transaction Systems, Asilomar, Sept. 1985.
Anon., “Tandem Makes a Good Thing Better”, Electronics, pp. 34–38, April 14, 1986.
Gray, J., “Why Do Computers Stop and What Can We Do About It?”, Tandem Technical Report TR85.7, 1985, Cupertino, CA.
Horst, R. and Metz, S., “New System Manages Hundreds of Transactions/Second.” Electronics, pp. 147–151, April 19, 1984. Also Tandem Computers TR 84. 1
Horst, R., Chou, T., “The Hardware Architecture and Linear Expansion of Tandem NonStop Systems” Proceedings of 12th International Symposium on Computer Architecture, June 1985. or Tandem Technical Report 85. 3
Lamport, L., Shostak, R., Pease, M., “Then Byzantine Generals Problem”, ACM Transactions on Programming Languages and Systems, Vol. 4, No. 3, July 1982.
Mourad, S. and Andrews, D., “The Reliability of the IBM/XA Operating System”, Digest of 15th Annual Int. Sym. on Fault-Tolerant Computing, June 1985. IEEE Computer Society Press.
Introduction to Tandem Computer Systems, Tandem Part No. 82503, March 1985, Cupertino, CA. “System Description Manual”, Tandem Part No. 82507, Cupertino, CA. “Expand(tm) Reference Manual” Tandem Part Nö. 82370, Cupertino, CA. 76 “Introduction to Pathway”, Tandem Computers Inc., Part No: 82339 - A00, Cupertino, CA.
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1987 Springer-Verlag/Wien
About this paper
Cite this paper
Bartlett, J., Gray, J., Horst, B. (1987). Fault Tolerance in Tandem Computer Systems. In: Avižienis, A., Kopetz, H., Laprie, JC. (eds) The Evolution of Fault-Tolerant Computing. Dependable Computing and Fault-Tolerant Systems, vol 1. Springer, Vienna. https://doi.org/10.1007/978-3-7091-8871-2_3
Download citation
DOI: https://doi.org/10.1007/978-3-7091-8871-2_3
Publisher Name: Springer, Vienna
Print ISBN: 978-3-7091-8873-6
Online ISBN: 978-3-7091-8871-2
eBook Packages: Springer Book Archive