Skip to main content

Advertisement

Log in

Application of statistics to correlate groundwater chemistry with land use on O‘ahu, Hawai‘i

  • Published:
Environmental Monitoring and Assessment Aims and scope Submit manuscript

Abstract

This study compiles commonly available groundwater chemistry data from the Pearl Harbor Sole Source Aquifer (SSA), Hawai‘i—O‘ahu’s primary drinking water source—and applies hierarchical clustering analysis (HCA), principal component analyses (PCA), piper diagrams, and box plots with geospatial analysis to better define groundwater regions and correlate groundwater chemistry in those regions with land use. Groundwater in this aquifer recharges and flows through chemically similar soil and rocks, such that anthropogenic activities are a primary influence on the chemical variability of the aquifer’s differing regions. Our analyses link specific chemical species in groundwater to land use/cover categories: urban, agriculture, and natural and anthropogenically-induced saline water intrusion. To create distinct statistical groupings with different groundwater chemistry compositions, it was important that the suite of parameters used in the statistical analysis do not covary. In our case, Cl covaried with several major ions; however, by including F, alkalinity, and SiOx that do not covary with Cl in the covariance matrix, we produced improved spatial grouping of HCA clusters and stronger affinities to land use designations. Results show that dominant groundwater chemistry changes with land use along flow paths. These results pertain to areas where groundwater flows from conservation land in high recharge areas of O‘ahu’s mountain ranges to urban and agricultural land use regions: groundwater retains its source characteristics until about 3–6 km into agricultural and urban zoned lands. Ultimately, this study outlines a simple method for water quality regulators to use groundwater chemistry to identify risks of target contaminants based on land use.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

Data availability

The data used in this study are publicly available from a variety of sources, cited in the text with full details in the Reference List.

References

Download references

Acknowledgements

The research was a partnership among HDOH Safe Drinking Water Branch and the University of Hawai‘i (UH) at Mānoa, Hawai‘i Institute of Geophysics and Planetology. This project is part of a broad effort to test the feasibility of using general groundwater chemistry to assess the anthropogenic contamination risk to public drinking water systems and to strengthen the research partnership between UH and HDOH staff to use science to investigate critical groundwater quality and quantity issues.

Funding

Funding was provided by the Hawai‘i Department of Health (HDOH) under a Memorandum of Agreement SDWB-18–001-RW, NSF Hawaiʻi EPSCoR Program through the National Science Foundation’s Research Infrastructure Improvement award (RII) Track-1: ‘Ike Wai: Securing Hawaiʻi’s Water Future Award #OIA-1557349, and the U.S. Department of Energy’s Geothermal Technologies Program Award #DE-EE6729DOE-GTO.

Author information

Authors and Affiliations

Authors

Contributions

All authors contributed to the study’s design and execution. Daniel Dores and Robert Whittier wrote the original manuscript text, with significant revisions by Nicole Lautze and major contributions from Donald Thomas. Figures were prepared by Daniel Dores and Robert Whittier, with editing from Nicole Lautze and Donald Thomas. All authors reviewed the manuscript, an effort led by Nicole Lautze.

Corresponding author

Correspondence to Daniel Dores.

Ethics declarations

Ethical approval

All authors have read, understood, and have complied as applicable with the statement on “Ethical responsibilities of Authors” as found in the Instructions for Authors and are aware that with minor exceptions, no changes can be made to authorship once the paper is submitted.

Conflict of interest

The authors have no competing interests to declare that are relevant to the content of this article.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

Appendix 1. Discussion of the method

A.1 Multiple samples from the same well

Samples used in this study were collected at different times and by different organizations and were analyzed by different laboratories; several sampling locations have multiple samples over time. We recognize that this diversity may affect the robustness of the results and the conclusions drawn. It is a key part of our study to demonstrate the utility of existing data for regional evaluations, thus removing the need for expensive field sampling for each new hydrologic investigation. Additionally, it is important to note that, for this study, our priority was to capture substantial differences in groundwater composition across broad gradients. Therefore, we completed this analysis with all available data that met our vetting process. It was necessary to include datasets from multiple studies, as no single past project had a broad enough reach to meet our needs. Having duplicate samples from different time intervals allowed for greater insight into the results of the coupled HCA-PCA approach as it showed how local groundwater composition can vary through time due to changes in pumping, climate variations, anthropogenic contamination, and other external stressors. We then investigated how those processes affect groundwater chemistry and resulting relationships to other regional groundwater compositions.

A.2 Statistical approach

Considering the two data management methods to prepare the HCA-PCA pipelines, the approach that incorporated F and SiOx into the log-normalized ion data to produce HCA2 and PCA2 appears to be more robust (Fig. 13). HCA1, which utilized the % meq/L data input for the Piper diagram showed six clusters at a Euclidean distance of 100 (Fig. 2). In contrast, HCA2 using the log-normalized ion data and included SiOx and F showed six clusters at a Euclidean distance of 9 (Fig. 4). The shorter Euclidean distance suggests a greater general similarity between the log-normalized data points. The difference of essentially an order of magnitude between the Euclidean distances could be due in part to data management, as HCA1 data was normalized on a percent scale, where HCA2 data was normalized on a log scale. Additionally, there could be some improved clustering results from the separation of the Na+-K+ parameter and inclusion of the SiOx and F variables in HCA2.

Both PCA1 and PCA2 demonstrate a general geochemistry for O‘ahu groundwater. In PCA1, PCA1-1 accounts for the majority of explained variance (0.471) with Na+-K+, Ca2+, and Mg2+ being the most important features to PCA1-1 (0.536, 0.501, and 0.453, respectively). SO42−, alkalinity, and Cl, are secondary features in PCA1-1 (0.332. 0.290, and 0.249, respectively). PCA1-2 of PCA1 accounts for only 0.347 of explained variance, but Alkalinity (0.597) and Cl (0.596) are the key features. The variance in PCA1-1 implies that there is a general variance among the chemical composition of groundwater across the study area, attributable to a variety of common ion concentrations. The variance in PCA1-2 attributable to alkalinity and Cl, key components of seawater, show that a somewhat secondary driver to the differences seen in groundwater compositions across O‘ahu is the presence of saltwater intrusion. It could be argued that PCA1-1 of PCA1 is a measurement of general groundwater chemistry variability, whereas PCA1-2 is a metric of seawater contamination.

PCA2 continues the narrative of a general chemistry for local groundwater, as Ca2+, Mg2+, Na+, K+, and Cl all carried similar weight (between 0.380 and 0.402) for PCA2-1, which accounted for 0.662 of the data variance. PCA2 also showed the importance of incorporating SiOx, alkalinity, and F into the analysis, as they were by far the most important variables of PCA2-2 (0.685, 0.565, and 0.387). The implication from PCA2-2 of PCA2 is that SiOx alkalinity, and F could be important measurements for differentiating local regions of groundwater on O‘ahu as the other ion concentration variables often co-vary. The differentiations offered by these additional parameters indicates a robustness for the HCA2-PCA2 dataset and approach.

When small-batch PCAs were run for each cluster of HCA2 with the log-normalized ions including SiOx and F, the majority of variance was explained for each by their PC1s, with the majority of parameters covarying together. One exception was cluster 2a, in which 0.733 of variance was explained by PCA2-1; within that PCA2-1, SiOx had a weight of 0.998. It seems that for cluster 2a, SiOx concentrations were the defining variable.

HCA2 clusters 2b and 2a appear to be the most unique. Cluster 2a has higher alkalinity, while cluster 2b has higher F concentrations, although there is a large range of F concentrations for all clusters. Cluster 2b and 2a both have higher Ca2+, Mg2+, Cl, Na+, K+, and SO42−, although cluster 2b is always higher than cluster 2a. Again, the PCA partnered with its HCA, in this case, PCA2, shows agreement with the box plot chemistry: clusters 2b and 2a are plotted in distinct and separate regions from the rest of the clusters, which otherwise plot adjacent to one another (Fig. 5). Interestingly, the upper and lower quartile boundaries for each cluster defined by HCA2 are generally closer together than in those clusters defined by HCA1. This could imply that the grouping is more robust for HCA2 and more accurately groups samples that are similar to one another, as discussed earlier in reference to the Euclidean distances and inclusion of additional variables.

Appendix 2. Piper diagrams

Fig. 12
figure 12

Piper diagram 1—color-coded based on HCA with no combined pairs in Piper diagram data (HCA1). Six clusters. Piper diagram completed with %meq/L data for all parameters. Points color-coded based on the groups they formed in the previously described HCA methods

Fig. 13
figure 13

Piper diagram 2—color-coded based on HCA with log-normalized ion data including F‾ and SiOx (HCA2). Six clusters. Points color-coded based on the groups they formed in the previously described HCA methods

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Dores, D., Whittier, R., Lautze, N. et al. Application of statistics to correlate groundwater chemistry with land use on O‘ahu, Hawai‘i. Environ Monit Assess 195, 551 (2023). https://doi.org/10.1007/s10661-023-11030-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s10661-023-11030-1

Keywords

Navigation