Abstract
Memory failures are quite common in todays technology. When they occur, the whole memory bank has to be replaced, even if only few bytes of memory are faulty. With increasing sizes of memory chips the urge not to waste these ‘not quite properly working’ pieces of equipment becomes bigger and bigger. Operating systems such as Linux already provide mechanisms for memory management which could be utilized to avoid allocating bad memory blocks which have been identified earlier, allowing for a failure-free software operation despite hardware problems. The paper describes problems of detecting memory failures and OS mechanisms that can be used for bad block exclusion. It proposes modifications to Linux kernel allowing a software solution to hardware failures.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Biswas, S., Metodi, T., Chong, F., Kastner, R.: A pageable, defect-tolerant nanoscale memory system. In: IEEE International Symposium on Nanoscale Architectures, NANOSARCH 2007, pp. 85–92 (2007), doi:10.1109/NANOARCH.2007.4400862
De Jonge, J.H., Smeulders, A.J.: Moving Inversions Test Pattern is Thorough, Yet Speedy. International Computer Design (1976)
Demeulemeester, S.: Memtest86, an Advanced Memory Diagnostic Tool (2010), http://www.memtest.org
Elkind, S., Siewiorek, D.: Reliability and performance of error-correcting memory and register arrays. IEEE Transactions on Computers C-29(10), 920–927 (1980), doi:10.1109/TC.1980.1675475
Elm, C., Klein, M., Tavangarian, D.: Automatic On-line Memory Tests in Workstations. In: Records of the IEEE International Workshop on Memory Technology, Design and Testing (1994)
Gu, W., Kalbarczyk, Z., Ravishankar, Iyer, K., Yang, Z.: Characterization of Linux kernel behavior under errors. In: Proceedings of 2003 International Conference on Dependable Systems and Networks, pp. 459–468 (2003), doi:10.1109/DSN.2003.1209956
Hamdioui, S.: Testing static random access memories: defects, fault models, and test pattern. Kluwer Academic Publishers, Dordrecht (2004)
Intel Corporation IA-32 Intel Architecture Software Developer’s Manual. In: System Programming Guide, vol. 3 (2009), http://www.intel.com/products/processor/manuals/
Kozierok, C.: The PC Guide – Memory Errors, Detection and Correction (2001)
Maestro, J., Reviriego, P.: Selection of the optimal memory configuration in a system affected by soft errors. IEEE Transactions on Device and Materials Reliability 9(3), 403–411 (2009), doi:10.1109/TDMR.2009.2023081
Messer, A., Bernadat, P., Fu, G., Chen, D., Dimitrijevic, Z., Lie, D., Mannaru, D., Riska, A., Milojicic, D.: Susceptibility of commodity systems and software to memory soft errors. IEEE Transactions on Computers 53(12), 1557–1568 (2004), doi:10.1109/TC.2004.119
Silberschats, A., Peterson, J.L., Galvin, P.B.: Operating System Concepts. Addison-Wesley Publishing Company, Inc., Reading (1991)
Swift, W.: Memory errors: roll the dice! IEEE Antennas and Propagation Magazine 38(6), 124–125 (1996), doi:10.1109/MAP.1996.556530
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Surmacz, T., Zawistowski, B. (2011). Bad Memory Blocks Exclusion in Linux Operating System. In: Zamojski, W., Kacprzyk, J., Mazurkiewicz, J., Sugier, J., Walkowiak, T. (eds) Dependable Computer Systems. Advances in Intelligent and Soft Computing, vol 97. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-21393-9_17
Download citation
DOI: https://doi.org/10.1007/978-3-642-21393-9_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-21392-2
Online ISBN: 978-3-642-21393-9
eBook Packages: EngineeringEngineering (R0)