Abstract
Vibrio parahaemolyticus (V.p) is a marine pathogenic bacterium that poses a high risk to human health and shellfish industry, yet an effective regional-scale nowcasting model for managing the risk remains lacking. This study presents the first regional-scale model for nowcasting the level of V.p in oysters in the marine environment by developing an ensemble modeling approach. The ensemble modeling approach involves the integration of genetic programming (GP) and deep artificial neural networks (DNN)-based modeling. The new approach was demonstrated by developing three GP-DNN ensemble models for predicting the V.p level in North Carolina, New Hampshire, and the combined region. Specifically, GP was employed to establish nonlinear functions between the V.p level and antecedent conditions of environmental variables. The nonlinear GP functions and current conditions of individual environmental variables were then utilized as inputs into a DNN model, forming a GP-DNN ensemble model. Modeling results indicated that the GP-DNN ensemble models were capable of predicting the V.p level with the correlation coefficient of 0.91, 0.90, and 0.80 for North Carolina, New Hampshire, and the combined region, respectively, demonstrating the impact of distinct environmental conditions in the local areas on accuracy of the combined regional-scale model. Sensitivity analysis results showed that sea surface temperature and sea surface salinity are the two most important environmental predictors for the abundance of V.p in oysters, followed by water level, pH, chlorophyll-a, and turbidity. The findings suggested that the GP-DNN ensemble models could be utilized as effective predictive tools for mitigating the V.p risk.
Similar content being viewed by others
Data availability
All data used in this work are available from the literature.
Code availability
The codes used in this research are available from the authors upon request.
References
Chenar, S. S., & Deng, Z. (2018a). Development of artificial intelligence approach to forecasting oyster norovirus outbreaks along Gulf of Mexico coast. Environment International, 111, 212–223.
Chenar, S. S., & Deng, Z. (2018b). Development of genetic programming-based model for predicting oyster norovirus outbreak risks. Water Research, 128, 20–37.
Chenar, S. S., & Deng, Z. (2021). Hybrid modeling and prediction of oyster norovirus outbreaks. Journal of Water and Health, 19(2), 254–266. https://doi.org/10.2166/wh.2021.251
Daniels, N. A., MacKinnon, L., Bishop, R., Altekruse, S., Ray, B., Hammond, R. M., Thompson, S., Wilson, S., Bean, N. H., & Griffin, P. M. (2000). Vibrio parahaemolyticus infections in the United States, 1973–1998. Journal of Infectious Diseases, 181, 1661–1666.
Fawcett, T. (2006). An introduction to ROC analysis. Pattern Recognition Letters, 27(8), 861–874.
Fernandez-Piquer, J., Bowman, J. P., Ross, T., & Tamplin, M. L. (2011). Predictive models for the effect of storage temperature on Vibrio parahaemolyticus viability and counts of total viable bacteria in Pacific oysters (Crassostrea gigas). Applied and Environmental Microbiology, 77(24), 8687–8695.
Food and Drug Administration (FDA). (2017). National shellfish sanitation program guide for the control of molluscan shellfish–2017 revision. Center for Food Safety and Applied Nutrition, Food and Drug Administration, U.S. Department of Health and Human Services.
Food and Drug Administration (FDA). (2005). Quantitative risk assessment on the public health impact of pathogenic Vibrio parahaemolyticus in raw oysters. Center for Food Safety and Applied Nutrition, Food and Drug Administration, U.S. Department of Health and Human Services.
Froelich, B., Ayrapetyan, M., Fowler, P., Oliver, J. D., & Noble, R. J. (2015). Development of a matrix tool for the prediction of Vibrio species in oysters harvested from North Carolina. Applied and Environmental Microbiology, 81(3), 1111–1119.
He, L., & He, Z. (2008). Water quality prediction of marine recreational beaches receiving watershed baseflow and stormwater runoff in southern California, USA. Water Research, 42, 2563–2573.
LeCun, Y., Bengio, Y., & Hinton, G. (2015). Deep learning. Nature, 521, 436–444. https://doi.org/10.1038/nature14539
Matthews, B. W. (1975). Comparison of the predicted and observed secondary structure of T4 phage lysozyme. Biochimica et Biophysica Acta (BBA) - Protein Structure, 405(2), 442–451.
Mehr, A. D., & Nourani, V. (2018). Season algorithm-multigene genetic programming: A new approach for rainfall-runoff modeling. Water Resources Management, 32(8), 2665–2679.
Muttil, N., & Chau, K.-W. (2006). Neural network and genetic programming for modeling coastal algal blooms. International Journal of Environment and Pollution, 28(3–4), 223–238.
Najafzadeh, M., Noori, R., Afroozi, D., Ghiasi, B., Hosseini-Moghari, S. -M., Mirchi, A., Haghighi, A. T., & Kløve, B. (2021). A comprehensive uncertainty analysis of model-estimated longitudinal and lateral dispersion coefficients in open channels. Journal of hydrology, 603 (Part A), 126850, https://www.sciencedirect.com/science/article/pii/S0022169421009008?via%3Dihub
Namadi, H. P., & Deng, Z. (2021). Modeling and forecasting Vibrio Parahaemolyticus concentrations in oysters. Water Research, 189, 116638. https://doi.org/10.1016/j.watres.2020.116638
Noori, R., Deng, Z., Kiaghadi, A., & Kachoosangid, F. T. (2016). How reliable are ANN, ANFIS, and SVM techniques for predicting longitudinal dispersion coefficient in natural rivers?”. Journal of Hydraulic Engineering, 142(1), 04015039. https://doi.org/10.1061/(ASCE)HY.1943-7900.0001062
Paranjpye, R. N., Nilsson, W. B., Liermann, M., Hilborn, E. D., George, B. J., Li, Q., Bill, B. D., Trainer, V. L., Strom, M. S., & Sandifer, P. A. (2015). Environmental influences on the seasonal distribution of Vibrio parahaemolyticus in the Pacific Northwest of the USA. FEMS Microbiology Ecology, 91(12), fiv121, https://academic.oup.com/femsec/article/91/12/fiv121/2467317
Sætrom, P., Sneve, R., Kristiansen, K. I., Snøve, O., Grünfeld, T., Rognes, T., & Seeberg, E. (2005). Predicting non-coding RNA genes in Escherichia coli with boosted genetic programming. Nucleic Acids Research, 33(10), 3263–3270. https://doi.org/10.1093/nar/gki644
Silva, S., & Almeida, J. (2003). Gplab-a genetic programming toolbox for Matlab. Book Gplab-a genetic programming toolbox for Matlab, City: Citeseer.
Sivapragasam, C., Muttil, N., Muthukumar, S., & Arun, V. M. (2010). Prediction of Algal Blooms Using Genetic Programming., 60(10), 1849–1855. https://doi.org/10.1016/j.marpolbul.2010.05.020
Urquhart, E. A., Zaitchik, B., Guikema, S., Haley, B., Taviani, E., Chen, A., Brown, M., Huq, A., & Colwell, R. (2015). Use of environmental parameters to model pathogenic Vibrios in the Chesapeake Bay. Journal of Environmental Informatics, 26(1), 1–13. https://doi.org/10.3808/jei.201500307
Urquhart, E. A., Jones, S. H., Jong, W. Y., Schuster, B. M., Marcinkiewicz, A. L., Whistler, C. A., & Cooper, V. S. (2016). Environmental conditions associated with elevated Vibrio parahaemolyticus concentrations in Great Bay Estuary. New Hampshire. Plos One, 11, e0155018. https://doi.org/10.1371/journal.pone.0155018
Wang, J., & Deng, Z. (2019). Modeling and predicting fecal coliform bacteria levels in oyster harvest waters along Louisiana Gulf coast. Ecological Indicators, 101, 212–220. https://doi.org/10.1016/j.ecolind.2019.01.013
Williams, T. C., Froelich, B. A., Phippen, B., Fowler, P., Noble, R. T., & Oliver, J. D. (2017). Different abundance and correlational patterns exist between total and presumed pathogenic V. vulnificus and V. parahaemolyticus in shellfish and waters along the North Carolina coast. FEMS Microbiology Ecology, 93(6), fix071, https://doi.org/10.1093/femsec/fix071.
Zhang, Z., Deng, Z., Rusch, K. A., & Walker, N. D. (2015). Modeling system for predicting enterococci levels at Holly Beach. Marine Environmental Research, 109, 140–147. https://doi.org/10.1016/j.marenvres.2015.07.003
Zhang, Z., Deng, Z., & Rusch, K. A. (2012). Development of predictive models for determining Enterococci levels at Gulf Coast beaches. Water Research, 46(2), 465–474. https://doi.org/10.1016/j.watres.2011.11.027
Zimmerman, A., DePaola, A., Bowers, J., Krantz, J., Nordstrom, J., Johnson, C., & Grimes, D. (2007). Variability of total and pathogenic Vibrio parahaemolyticus densities in northern Gulf of Mexico water and oysters. Applied and Environmental Microbiology, 73(23), 7589–7596. https://doi.org/10.1128/AEM.01700-07
Funding
This work was supported by the US NASA (National Aeronautics and Space Administration: award number 80NSSC20M0216) and the Louisiana Board of Regents (LEQSF(2020–23)-Phase3-14).
Author information
Authors and Affiliations
Contributions
PN performed data collection, model development and validation, first draft writing, and graph construction under the direction of ZD who designed the methodology, edited the current version of the manuscript, and applied for funding. All authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Namadi, P., Deng, Z. Deep learning-based ensemble modeling of Vibrio parahaemolyticus concentration in marine environment. Environ Monit Assess 195, 229 (2023). https://doi.org/10.1007/s10661-022-10836-9
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s10661-022-10836-9