Abstract
Due to its higher storage density and lower energy consumption compared to DRAM, persistent memory (PM) holds the potential to address the growing memory demands of applications, such as Deep Neural Network (DNN) training. However, PM also suffers from longer latency and lower bandwidth, making it impractical to completely replace DRAM. The hybrid memory architecture, which combines DRAM and PM, is expected to improve this issue. Nevertheless, it also introduces a challenging problem: the state-of-the-art page placement mechanism designed for DRAM-only systems with NUMA ignores the performance disparities between DRAM and PM, resulting in sub-optimal performance. To improve this problem, we propose PM-Migration, a page placement mechanism tailored for real-time systems with hybrid memory architecture. PM-Migration prioritizes placing frequently accessed pages in DRAM and increases the access frequency of write-intensive pages to leverage the read-write asymmetry of Intel Optane DC persistent memory module (DCPMM), a commercially available PM hardware. It also incorporates a transmission handover strategy to select the transfer engine according to the size of the page and then utilizes DMA technology for migrating pages of size 2 MB. Experimental results demonstrate that PM-Migration provides an average throughput improvement of 1.31\(\times \) to 3.6\(\times \) compared to existing mechanisms proposed for the hybrid memory architecture.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Li, D., Liao, X., Jin, H., Zhou, B., Zhang, Q.: A new disk I/O model of virtualized cloud environment. IEEE Trans. Parallel Distrib. Syst. 24(6), 1129–1138 (2013)
Luo, H., et al.: CLR-DRAM: a low-cost dram architecture enabling dynamic capacity-latency trade-off. In: 2020 ACM/IEEE 47th Annual International Symposium on Computer Architecture (ISCA), pp. 666–679 (2020)
Li, D., Zhang, N., Dong, M., Chen, H., Ota, K., Tang, Y.: PM-AIO: an effective asynchronous I/O system for persistent memory. IEEE Trans. Emerg. Top. Comput. 10(3), 1558–1574 (2022)
Beeler, B.: Intel Optane DC persistent memory module (PMM). https://www.storagereview.com/news/intel-optane-dc-persistent-memory-module-pmm
Weiland, M., et al.: An early evaluation of intel’s optane DC persistent memory module and its impact on high-performance scientific applications. In: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, ser. SC 2019. New York, NY, USA: Association for Computing Machinery (2019). https://doi.org/10.1145/3295500.3356159
Gugnani, S., Kashyap, A., Lu, X.: Understanding the idiosyncrasies of real persistent memory. In: Proceedings of the VLDB Endow, vol. 14, no. 4, pp. 626–639 (2021). https://doi.org/10.14778/3436905.3436921
Awati, R.: Non-uniform memory access (NUMA). https://www.techtarget.com/whatis/definition/NUMA-non-uniform-memory-access
Linux Memory Management Documentation —— Page migration. https://www.kernel.org/doc/html/v5.4/vm/page_migration.html
Yan, Z., Lustig, D., Nellans, D., Bhattacharjee, A.: Nimble page management for tiered memory systems. In: Proceedings of the Twenty-Fourth International Conference on Architectural Support for Programming Languages and Operating Systems. p. 331–345. ASPLOS 2019, Association for Computing Machinery, New York, NY, USA (2019). https://doi.org/10.1145/3297858.3304024
Kim, J., Choe, W., Ahn, J.: Exploring the design space of page management for multi-tiered memory systems. In: USENIX Annual Technical Conference (2021)
AutoNUMA: the other approach to NUMA scheduling. https://lwn.net/Articles/835402/
Vaidyanathan, K., Chai, L., Huang, W., Panda, D.K.: Efficient asynchronous memory copy operations on multi-core systems and I/OAT. In: 2007 IEEE International Conference on Cluster Computing, pp. 159–168 (2007). https://doi.org/10.1109/CLUSTR.2007.4629228
Liu, L., Yang, S., Peng, L., Li, X.: Hierarchical hybrid memory management in OS for tiered memory systems. IEEE Trans. Parallel Distrib. Syst. 30(10), 2223–2236 (2019)
Ying, H.: AutoNUMA: optimize memory placement for memory tiering system. https://lwn.net/Articles/835402/
Marques, M., Kuzmin, I., Barreto, J., Monteiro, J., Rodrigues, R.: Dynamic page placement on real persistent memory systems (2021). https://arxiv.org/abs/2112.12685
NAS parallel benchmarks. https://www.nas.nasa.gov/software/npb.html
Stress-ng (stress next generation). https://github.com/ColinIanKing/stress-ng
Acknowledgment
This work was funded by the Key-Area Research and Development Program of Guangdong Province under grant number 2020B0101650001, by the National Natural Science Foundation of China under grant number 61972164.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Xu, L., Chen, G., Li, D., Luo, H. (2024). PM-Migration: A Page Placement Mechanism for Real-Time Systems with Hybrid Memory Architecture. In: Tari, Z., Li, K., Wu, H. (eds) Algorithms and Architectures for Parallel Processing. ICA3PP 2023. Lecture Notes in Computer Science, vol 14491. Springer, Singapore. https://doi.org/10.1007/978-981-97-0808-6_18
Download citation
DOI: https://doi.org/10.1007/978-981-97-0808-6_18
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-0807-9
Online ISBN: 978-981-97-0808-6
eBook Packages: Computer ScienceComputer Science (R0)