Improving Read Performance with Online Access Pattern Analysis and Prefetching

Tang, Houjun; Zou, Xiaocheng; Jenkins, John; Boyuka, David A.; Ranshous, Stephen; Kimpe, Dries; Klasky, Scott; Samatova, Nagiza F.

doi:10.1007/978-3-319-09873-9_21

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 8632)

Included in the following conference series:

European Conference on Parallel Processing

Abstract

Among the major challenges of transitioning to exascale in HPC is the ubiquitous I/O bottleneck. For analysis and visualization applications in particular, this bottleneck is exacerbated by the write-once-read-many property of most scientific datasets combined with typically complex access patterns. One promising way to alleviate this problem is to recognize the application’s access patterns and utilize them to prefetch data, thereby overlapping computation and I/O. However, current methods for analyzing access patterns are offline-only, lack support for complex access patterns such as high-dimensional strided or composition-based unstructured access patterns, or both. Therefore, we propose an online analyzer capable of detecting both simple and complex access patterns with low computational and memory overhead and high accuracy. By combining our pattern detection with prefetching, we consistently observe run-time reductions of up to 26% across 18 configurations of PIOBench and 4 configurations of a micro-benchmark with both structured and unstructured access patterns.
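
To make the idea of online pattern recognition feeding a prefetcher concrete, the following is a minimal sketch of the simplest case only: a one-dimensional, constant-stride detector. It is not the analyzer proposed in the paper, which targets high-dimensional strided and unstructured patterns. The class name `StrideDetector`, the `confirm_after` threshold, and the byte offsets are all hypothetical choices made for illustration.

```python
from typing import List, Optional


class StrideDetector:
    """Illustrative only: tracks read offsets as they arrive and predicts
    upcoming offsets once a constant stride has repeated often enough."""

    def __init__(self, confirm_after: int = 2) -> None:
        # Number of consecutive identical strides required before predicting
        # (a hypothetical tuning knob, not taken from the paper).
        self.confirm_after = confirm_after
        self.last_offset: Optional[int] = None
        self.stride: Optional[int] = None
        self.repeats = 0

    def observe(self, offset: int) -> None:
        """Record one read request; constant time and constant memory."""
        if self.last_offset is not None:
            delta = offset - self.last_offset
            if delta == self.stride:
                self.repeats += 1
            else:
                # Pattern changed: start counting the new stride from one.
                self.stride = delta
                self.repeats = 1
        self.last_offset = offset

    def predict(self, count: int = 1) -> List[int]:
        """Return the next `count` offsets to prefetch, or [] if no stride
        has been confirmed yet."""
        if (
            self.last_offset is None
            or self.stride is None
            or self.stride == 0
            or self.repeats < self.confirm_after
        ):
            return []
        return [self.last_offset + self.stride * i for i in range(1, count + 1)]


if __name__ == "__main__":
    detector = StrideDetector()
    for offset in (0, 4096, 8192):
        detector.observe(offset)
    # A prefetcher could issue asynchronous reads for these offsets while
    # the application computes on the data it already holds.
    print(detector.predict(2))
```

Because the detector keeps only the last offset, the current stride, and a counter, its overhead per request is constant, which is the property an online (as opposed to offline, trace-based) analyzer needs.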

Author information

Authors and Affiliations

North Carolina State University, Raleigh, NC, 27695, USA
Houjun Tang, Xiaocheng Zou, John Jenkins, David A. Boyuka II, Stephen Ranshous & Nagiza F. Samatova
Oak Ridge National Laboratory, Oak Ridge, TN, 37830, USA
Houjun Tang, Xiaocheng Zou, David A. Boyuka II, Stephen Ranshous, Scott Klasky & Nagiza F. Samatova
Argonne National Laboratory, Argonne, IL, 60439, USA
John Jenkins & Dries Kimpe

Editor information

Editors and Affiliations

CRACS/INESC-TEC and FCUP, Universidade do Porto, Rua do Campo Alegre, 1021, 4169-007, Porto, Portugal
Fernando Silva, Inês Dutra & Vítor Santos Costa

About this paper

Cite this paper

Tang, H. et al. (2014). Improving Read Performance with Online Access Pattern Analysis and Prefetching. In: Silva, F., Dutra, I., Santos Costa, V. (eds) Euro-Par 2014 Parallel Processing. Euro-Par 2014. Lecture Notes in Computer Science, vol 8632. Springer, Cham. https://doi.org/10.1007/978-3-319-09873-9_21

DOI: https://doi.org/10.1007/978-3-319-09873-9_21
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-09872-2
Online ISBN: 978-3-319-09873-9
eBook Packages: Computer Science, Computer Science (R0)
