Performance, reliability, and performability of a hybrid RAID array and a comparison with traditional RAID1 arrays

Thomasian, Alexander; Tang, Yujie

doi:10.1007/s10586-012-0216-9

Performance, reliability, and performability of a hybrid RAID array and a comparison with traditional RAID1 arrays

Published: 21 June 2012

Volume 15, pages 239–253, (2012)
Cite this article

Cluster Computing Aims and scope Submit manuscript

Alexander Thomasian^1,2 &
Yujie Tang^1,3

536 Accesses
10 Citations
Explore all metrics

Abstract

We describe a hybrid mirrored disk organization patented by LSI Logic Corp. and compare its performance, reliability, and performability with traditional mirrored RAID1 disk organizations and RAID(4+ℓ), ℓ≥1. LSI RAID has the same level of redundancy as mirrored disks, but also utilizes parity coding. Unlike RAID1, which cannot tolerate all two disk failures, LSI RAID similarly to RAID6 is 2 Disk Failure Tolerant (2DFT), but in addition it can tolerate almost all three disk failures, while RAID1 organizations are generally 1DFT. We list analytic expressions for the reliability of various RAID1 organizations and use enumeration when the reliability expression cannot be obtained analytically. An asymptotic expansion method based on disk unreliabilities is used for an easy comparison of RAID reliabilities. LSI RAID performance is evaluated with the Read-Modify-Write (RMW) and ReConstruct Write (RCW) methods to update parities. The combination of the two methods is used to balance data and parity disk loads, which results in maximizing the I/O throughput. The analysis shows that LSI RAID has an inferior performance with respect to basic mirroring in processing an OLTP workload, but it outperforms RAID6. LSI RAID in spite of its higher Mean Time to Data Loss (MTTDL) is outperformed by other RAID1 organizations as far as its performability is concerned, i.e., the number of I/Os carried out by the disk array operating at maximum I/Os Per Second (IOPS) until data loss occurs. A survey of RAID1 organizations and distributed replicated systems is also included.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Mirrored and hybrid disk arrays and their reliability

Article 03 March 2018

H $$^{2}$$ -RAID: A Novel Hybrid RAID Architecture Towards High Reliability

A High-Performance and High-Reliability RAIS5 Storage Architecture with Adaptive Stripe

Notes

Figure 5 in [36] is different in that the system reliability is plotted versus decreasing disk reliabilities, not normalized time.
We have corrected the first part of (7) in [35].
The final result given here is slightly different from (9) in [35].

Abbreviations

BM:: Basic Mirroring
CD:: Chained Declustering
CRAID:: Clustered RAID
Ddisk:: Data disk
DoutD:: Data out-Degree
GRD:: Group Rotate Declustering
HRAID:: Hierarchical RAID
HST:: Head Settling Time
ID:: Interleaved Declustering
IOPS:: I/Os per Second
kDFT:: k Disk Failure Tolerant
LSE:: Latent Sector Error
MDS:: Maximum Distance Separable
MTTDL:: Mean Time to Data Loss
MTTF:: Mean Time to Failure
OLTP:: OnLine Transaction Processing
OSM:: Orthogonal Striping and Mirroring
PCM:: Permanent Customer Model
Pdisk:: Parity disk
PinD:: Parity in-Degree
RAID:: Redundant Array of Independent Disks
RCW:: ReConstruct Write
RMD:: Rotated Mirrored Declustering
RMW:: Read-Modify-Write
RPM:: Rotations Per Minute
RS:: Reed-Solomon code
SADA:: Self-Adaptive Disk Array
SSPiRAL:: Survivable Storage using Parity in Redundant Array Layouts
VSM:: Vacationing Server Model
XOR:: eXclusive OR

References

Alvarez, G.A., Burkhard, W.A., Stockmeyer, L.J., Cristian, F.: Declustered disk array architectures with optimal and near-optimal parallelism. In: Proc. 25th Ann’l Int’l Symp. on Computer Architecture (ISCA 1998), Barcelona, Spain, June, pp. 109–120 (1998)
Google Scholar
Amer, A., Long, D.D.E., Paris, J.-F., Schwarz, T.: Increased reliability with SSPiRAL data layouts. In: Proc. 16th Int’l Symp. on Modeling, Analysis, and Simulation of Computer and Telecomm. Systems (MASCOTS’08), Baltimore, MD, Sept., pp. 189–198 (2008)
Google Scholar
Bachmat, E., Schindler, J.: Analysis of methods for scheduling low priority disk drive tasks. In: Proc. ACM SIGMETRICS Conf. on Measurement and Modeling of Computer Systems, Los Angeles, CA, June, pp. 55–65 (2002)
Google Scholar
Chen, P.M., Lee, E.K., Gibson, G.A., Katz, R.H., Patterson, D.A.: RAID: high-performance, reliable secondary storage. ACM Comput. Surv. 26(2), 145–185 (1994)
Article Google Scholar
Chen, S.-Z., Towsley, D.F.: A performance evaluation of RAID architectures. IEEE Trans. Comput. 45(10), 1116–1130 (1996)
Article MATH Google Scholar
Chen, M.S., Hsiao, H.-I., Li, C.-S., Yu, P.S.: Using rotational mirrored declustering for replica placement in a disk-array-based video server. Multimed. Syst. 5(6), 371–379 (1997)
Article MATH Google Scholar
Dholakia, A., Eleftheriou, E., Hu, X.-Y., Iliadis, I., Menon, J., Rao, K.K.: A new intra-disk redundancy scheme for high-reliability RAID storage systems in the presence of unrecoverable errors. ACM Trans. Storage 4(1), 1 (2008)
Article Google Scholar
Gibson, G.A.: Redundant Disk Arrays: Reliable, Parallel Secondary Storage. MIT Press, Cambridge (1992)
Google Scholar
Hafner, J.L., Deenadhayalan, V., Kanungo, T., Rao, K.K.: Performance metrics for erasure codes in storage systems. In: IBM research report RJ 10231, Almaden, CA, USA, August (2004)
Hafner, J.L.: WEAVER codes: highly fault tolerant erasure codes for storage systems. In: Proc. 4th USENIX Conf. on File and Storage Technologies (FAST’05), San Francisco, CA, December, pp. 211–224 (2005)
Google Scholar
Haverkort, B.R., Marie, R., Rubino, R., Trivedi, K.S.: Performability Modelling: Techniques and Tools. Wiley, New York (2001)
Google Scholar
Hsiao, H.-I., DeWitt, D.J.: Chained declustering: a new availability strategy for multiprocessor database machines. In: Proc. IEEE Int’l Conf. on Data Engineering (ICDE’90), Los Angeles, CA, February, pp. 456–465 (1990)
Google Scholar
Hsiao, H.-I., DeWitt, D.J.: A performance study of three high available data replication strategies. Distrib. Parallel Databases 1(1), 53–80 (1993)
Article Google Scholar
Hwang, K., Jin, H., Ho, R.S.C.: Orthogonal striping and mirroring in distributed RAID for I/O-centric cluster computing. IEEE Trans. Parallel Distrib. Syst. 13(1), 26–44 (2002)
Article Google Scholar
Iliadis, I., Haas, R., Hu, X.-Y., Eleftheriou, E.: Disk scrubbing versus intradisk redundancy for RAID storage systems. ACM Trans. Storage 7(2), 5 (2011)
Article Google Scholar
Menon, J., Mattson, D.: Comparison of sparing alternatives for disk arrays. In: Proc. 19th Ann’l Int’l Symp. on Computer Architecture (ISCA 1992), Gold Coast, Australia, May, pp. 318–329 (1992)
Google Scholar
Merchant, A., Yu, P.S.: Analytic modeling and comparisons of striping strategies for replicated disk arrays. IEEE Trans. Comput. 44(3), 419–433 (1995)
Article MATH Google Scholar
Merchant, A., Yu, P.S.: Analytic modeling of clustered RAID with mapping based on nearly random permutation. IEEE Trans. Comput. 45(3), 367–373 (1996)
Article MATH Google Scholar
Muntz, R.R., Lui, J.C.S.: Performance analysis of disk arrays under failure. In: 6th Int’l Conf. on Very Large Data Bases, Brisbane, Queensland, Australia, August, pp. 162–173 (1990)
Google Scholar
Paris, J.-F., Schwarz, T.J.E., Long, D.D.E.: Self-adaptive disk arrays. In: Proc. 8th Int’l Symp. on Stabilization, Safety, and Security of Distributed Systems (SSS 2006), Dallas, TX, November, pp. 469–483 (2006)
Chapter Google Scholar
Park, C.-I.: Efficient placement of parity and data to tolerate two disk failures in disk array systems. IEEE Trans. Parallel Distrib. Syst. 6(11), 1177–1184 (1995)
Article Google Scholar
Schroeder, B., Gibson, G.A.: Understanding disk failure rates: what does an MTTF of 1,000, 000 hours mean to you? ACM Trans. Storage 3(3), 8 (2007)
Article Google Scholar
Schroeder, B., Damouras, S., Gill, P.: Understanding latent sector errors and how to protect against them. ACM Trans. Storage 8(3), 8 (2010)
Google Scholar
Shang, P., Wang, J., Zhu, H., Gu, P.: A new placement-ideal layout for multiway replication storage system. IEEE Trans. Comput. 60(8), 1142–1156 (2011)
Article MathSciNet Google Scholar
Teradata: DBC/1012 database computer system manual release 2.0. Document No. C10-0001-02, Teradata Corp., November (1985)
Kari, H.H.: Latent sector faults and reliability of disk arrays. Ph.D. thesis, University of Helsinki, Espoo, Finland (1997)
Lee, E.K., Thekkath, C.A.: Petal: distributed virtual disks. In: Proc. 7th Int’l Conf. on Architectural Support for Programming Languages and Operating Systems (ASPLOS-VII), Cambridge, MA, October, pp. 84–92 (1996)
Chapter Google Scholar
Li, M., Shu, J., Zheng, W.: GRID codes: strip-based erasure codes with high fault tolerance for storage systems. ACM Trans. Storage 4(4), 15 (2009)
Article Google Scholar
Sun, H., Tyan, T., Johnson, S., Elling, R., Talagala, N., Wood, R.B.: Performability analysis of storage systems in practice: methodology and tools. In: Proc. 3rd Int’l Service Availability Symp. (ISAS 2006). Helsinki, Finland, May 2006. Lecture Notes in Computer Science, vol. 4328, pp. 62–75. Springer, Berlin (2006) (Revised selected papers)
Chapter Google Scholar
Thomasian, A., Menon, J.: Performance analysis of RAID5 disk arrays with a vacationing server model for rebuild mode operation. In: Proc. IEEE Int’l Conf. on Data Engineering (ICDE’94), Houston, TX, February, pp. 111–119 (1994)
Chapter Google Scholar
Thomasian, A., Menon, J.: RAID5 performance with distributed sparing. IEEE Trans. Parallel Distrib. Syst. 8(6), 640–657 (1997)
Article Google Scholar
Thomasian, A.: Reconstruct versus read-modify writes in RAID. Inf. Process. Lett. 93(4), 163–168 (2005)
Article MathSciNet MATH Google Scholar
Thomasian, A.: Clustered RAID arrays and their access costs. Comput. J. 48(6), 702–713 (2005)
Article MathSciNet Google Scholar
Thomasian, A.: Mirrored disk routing and scheduling. Clust. Comput. 9(4), 475–484 (2006)
Article Google Scholar
Thomasian, A.: Shortcut method for reliability comparisons in RAID5. J. Syst. Softw. 79(11), 1599–1605 (2006)
Article Google Scholar
Thomasian, A., Blaum, M.: Mirrored disk organization reliability analysis. IEEE Trans. Comput. 55(12), 1640–1644 (2006)
Article Google Scholar
Thomasian, A., Fu, G., Han, C.: Performance of two-disk failure-tolerant disk arrays. IEEE Trans. Comput. 56(6), 799–814 (2007)
Article MathSciNet Google Scholar
Thomasian, A., Xu, J.: Reliability and performance of mirrored disk organizations. Comput. J. 51(6), 615–629 (2008)
Article Google Scholar
Thomasian, A., Blaum, M.: Higher reliability redundant disk arrays: organization, operation, and coding. ACM Trans. Storage 5(3), 7 (2009)
Article Google Scholar
Thomasian, A.: Survey and analysis of disk scheduling methods. Comput. Archit. News 39(2), 8–25 (2011)
Article Google Scholar
Thomasian, A., Xu, J.: RAID level selection for heterogeneous disk arrays. Clust. Comput. 14(2), 115–127 (2011)
Article Google Scholar
Thomasian, A., Tang, Y.: Performance, reliability, and performability aspects of hierarchical RAID. In: Proc. 6th Int’l Conf. on Networking, Architecture, and Storage (NAS 2011), Dalian, China, July, pp. 92–101 (2011)
Chapter Google Scholar
Trivedi, K.S.: Probability and Statistics with Reliability, Queuing, and Computer Science Applications, 2nd edn. Wiley, New York (2001)
Google Scholar
Venkatesan, V., Iliadis, I., Hu, X.-Y., Haas, R., Fragouli, C.: Effect of replica placement on the reliability of large-scale data storage systems. In: Proc. 18th Ann’l IEEE/ACM Int’l Symp. on Modeling, Analysis and Simulation of Computer and Telecomm. Systems (MASCOTS’10), Miami, FL, August, pp. 79–88 (2010)
Google Scholar
Venkatesan, V., Iliadis, I., Fragouli, C., Urbanke, R.: Reliability of clustered vs. declustered replica placement in data storage systems. In: Proc. 19th Ann’l IEEE/ACM Int’l Symp. on Modeling, Analysis and Simulation of Computer and Telecomm. Systems (MASCOTS’11), Raffles Hotel, Singapore, August, pp. 307–317 (2011)
Google Scholar
Venkatesan, V., Iliadis, I., Hass, R.: Reliability of data storage systems under network rebuild bandwidth constraints. In: Proc. 20th Ann’l IEEE/ACM Int’l Symp. on Modeling, Analysis and Simulation of Computer and Telecomm. Systems (MASCOTS’12), Washington, D.C., August, pp. 79–88 (2012)
Google Scholar
Wilner, A.: Multiple drive failure tolerant RAID system. US Patent 6,327,672, December 2001
Xu, L., Bohossian, V., Bruck, J., Wagner, D.G.: Low-density MDS codes and factors of complete graphs. IEEE Trans. Inf. Theory 45(6), 1817–1836 (1999)
Article MathSciNet MATH Google Scholar

Download references

Author information

Authors and Affiliations

Shenzhen Institutes of Advanced Technology (SIAT), Chinese Academy of Sciences, Shenzhen, China
Alexander Thomasian & Yujie Tang
Thomasian & Associates, 17 Meadowbrook Rd., Pleasantville, NY, 10570, USA
Alexander Thomasian
ECE Dept., University of Waterloo, Waterloo, Ontario, Canada
Yujie Tang

Authors

Alexander Thomasian
View author publications
You can also search for this author in PubMed Google Scholar
Yujie Tang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Alexander Thomasian.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Thomasian, A., Tang, Y. Performance, reliability, and performability of a hybrid RAID array and a comparison with traditional RAID1 arrays. Cluster Comput 15, 239–253 (2012). https://doi.org/10.1007/s10586-012-0216-9

Download citation

Received: 15 October 2011
Accepted: 25 May 2012
Published: 21 June 2012
Issue Date: September 2012
DOI: https://doi.org/10.1007/s10586-012-0216-9

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Performance, reliability, and performability of a hybrid RAID array and a comparison with traditional RAID1 arrays

Abstract

Access this article

Similar content being viewed by others

Mirrored and hybrid disk arrays and their reliability

H $$^{2}$$ -RAID: A Novel Hybrid RAID Architecture Towards High Reliability

A High-Performance and High-Reliability RAIS5 Storage Architecture with Adaptive Stripe

Notes

Abbreviations

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Performance, reliability, and performability of a hybrid RAID array and a comparison with traditional RAID1 arrays

Abstract

Access this article

Similar content being viewed by others

Mirrored and hybrid disk arrays and their reliability

H $$^{2}$$ -RAID: A Novel Hybrid RAID Architecture Towards High Reliability

A High-Performance and High-Reliability RAIS5 Storage Architecture with Adaptive Stripe

Notes

Abbreviations

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation