Abstract
The upsurge in technical and epidemiological research employing Maximum Entropy (Maxent) establishes this machine-learning algorithm for species distribution modeling (SDM). Although Maxent robustly and accurately predicts the potential distribution of various species in different environments, data quality and varying hyperparameters influence its predictions. Optimizing hyperparameters can compensate for the rigidity of data quality. Addressing this caveat of Maxent, a bipartite approach (tuning and fine-tuning) in increasing model parsimony was developed to optimize the pipeline for range prediction of vectors with limited occurrence records in the Philippines. Tuned models reveal the influence of predictor collinearity on model accuracy, with a Pearson correlation threshold of 0.7 yielding the highest Area Under the Receiving Operator Characteristic Curve (AUC) score, analogous to popularly used methods in SDM. Fine-tuned models show that, contrary to the conventional pipeline, ΔAICc values approaching but not equal to zero produce a combination of hyperparameters (feature classes and regularization multiplier) leading to higher AUC scores. Fine-tuned models are more parsimonious and portray wider distributions than the a priori models generated using the default Maxent settings. This study integrates the best approaches to advance the conventional pipeline for Maxent modeling, substantiating the call for intensive surveying of vectors in a data-poor and high-burden country.
Similar content being viewed by others
Data availability
The data used in this research are available by the corresponding author upon reasonable request.
References
Abiodun GJ, Maharaj R, Witbooi P, Okosun KO (2016) Modeling the influence of temperature and rainfall on the population dynamics of Anopheles arabiensis. Malar J 15(1). https://doi.org/10.1186/s12936-016-1411-6
Afrane YA, Githeko AK, Yan G (2012) The ecology of Anopheles mosquitoes under climate change: case studies from the effects of deforestation in East African highlands. Ann N Y Acad Sci 1249(1):204–210. https://doi.org/10.1111/j.1749-6632.2011.06432.x
Afrane YA, Lawson BW, Githeko AK, Yan G (2005) Effects of microclimatic changes caused by land use and land cover on duration of gonotrophic cycles of Anopheles gambiae (Diptera: Culicidae) in Western Kenya Highlands. J Med Entom 42(6):974–980. https://doi.org/10.1093/jmedent/42.6.974
Afrane YA, Little TJ, Lawson BW, Githeko AK, Yan G (2008) Deforestation and vectorial capacity of Anopheles gambiae giles mosquitoes in malaria transmission, Kenya. Emerg Infect Dis 14(10):1533–1538. https://doi.org/10.3201/eid1410.070781
Aguiar LMS, Bernard E, Ribeiro V, Machado RB, Jones G (2016) Should I stay or should I go? Climate change effects on the future of Neotropical Savannah Bats. Glob Ecol Conserv 5:22–33. https://doi.org/10.1016/j.gecco.2015.11.011
Ahmadi M, Hemami M, Kaboli M, Shabani F (2023) MaxEnt brings comparable results when the input data are being completed; Model parameterization of four species distribution models. Ecol Evol 13(2). https://doi.org/10.1002/ece3.9827
Aiello-Lammens ME, Boria RA, Radosavljevic A, Vilela B, Anderson RP (2015) spThin: an R package for spatial thinning of species occurrence records for use in ecological niche models. Ecography 38:541–545. https://doi.org/10.1111/ecog.01132
Alkishe A, Raghavan RK, Peterson AT (2021) Likely geographic distributional shifts among medically important tick species and tick-associated diseases under climate change in North America: a review. Insects 12(3):225. https://doi.org/10.3390/insects12030225
Austin M (2002) Spatial prediction of species distribution: an interface between ecological theory and statistical modeling. Ecol Mod 157(2–3):101–118. https://doi.org/10.1016/S0304-3800(02)00205-3
Bao R, Li X, Zheng J (2022) Feature tuning improves Maxent predictions of the potential distribution of Pedicularis longiflora Rudolph and its variant. PeerJ 10. https://doi.org/10.7717/peerj.13337
Boutayeb A (2010) The burden of communicable and non-communicable diseases in developing countries. In: Preedy VR, Watson RR (eds) Handbook of disease burdens and quality of life measures. Springer, New York, NY, pp 531–546. https://doi.org/10.1007/978-0-387-78665-0_32
Brown JL, Bennett JR, French CM (2017) SDMtoolbox 2.0: the next generation Python-based GIS toolkit for landscape genetic, biogeographic and species distribution model analyses. PeerJ 5 e4095. https://doi.org/10.7717/peerj.4095
Burnham KP, Anderson DR (2004) Multimodel Inference: understanding AIC and BIC in Model Selection. Sociol Methods Res 33(2):261–304. https://doi.org/10.1177/0049124104268644
Caminade C, McIntyre KM, Jones AE (2018) Impact of recent and future climate change on vector-borne diseases. Ann N Y Acad Sci 1436(1):157–173. https://doi.org/10.1111/nyas.13950
Chen F, Du Y, Niu S, Zhao J (2015) Modeling forest lightning fire occurrence in the Daxinganling Mountains of Northeastern China with MAXENT. Forests 12:1422–1438. https://doi.org/10.3390/f6051422
Chen K, Wang B, Chen C, Zhou G (2022) MaxEnt modeling to predict the current and future distribution of Pomatosace filicula under climate change scenarios on the Qinghai-Tibet Plateau. Plants 5:670. https://doi.org/10.3390/plants11050670
Chua PL, Dorotan MM, Sigua JA, Estanislao RD, Hashizume M, Salazar MA (2019) Scoping review of climate change and health research in the Philippines: a complementary tool in research agenda-setting. Int J Environ Res Pub Health 16(14):2624. https://doi.org/10.3390/ijerph16142624
Clarke-Crespo E, Moreno-Arzate CN, López-González CA (2020) Ecological niche models of four hard tick genera (Ixodidae) in Mexico. Animals 10(4):649. https://doi.org/10.3390/ani10040649
Creley CM, Shilling FM, Muchlinski AE (2019) An ecological niche model to predict range expansion of the Eastern Gray Squirrel in California. Bull S Cal Acad Sci 118(1). https://doi.org/10.3160/0038-3872-118.1.58
Delgado-Jaramillo M, Aguiar LM, Machado RB, Bernard E (2020) Assessing the distribution of a species-rich group in a continental-sized megadiverse country: bats in Brazil. Divers Distrib 26(5):632–643. https://doi.org/10.1111/ddi.13043
Deng SQ, Khater EI, Tambo E, Wang DQ (2023) Editorial: emerging mosquito-borne diseases and novel biocontrol strategies. Front Cell Infect Microbiol 13. https://doi.org/10.3389/fcimb.2023.1143165
Department of Health (2022) Annual Report 2022. Field health services information system. https://doh.gov.ph/publications/serials. Accessed 10 October 2023
Elith J, Phillips SJ, Hastie T, Dudík M, Chee YE, Yates CJ (2010) A statistical explanation of MaxEnt for ecologists. Divers Distrib 1:43–57. https://doi.org/10.1111/j.1472-4642.2010.00725.x
Ennos R, Johnson ML (2018) Statistical and data handling skills in biology, 4th edn. Pearson Education Limited, p 70
Fick SE, Hijmans RJ (2017) WorldClim 2: new 1-km spatial resolution climate surfaces for global land areas. Int J Climatol 37(12):4302–4315. https://doi.org/10.1002/joc.5086
Foley DH, Torres EP, Mueller I (2002) Stream-bank shade and larval distribution of the Philippine malaria vector Anopheles flavirostris. Med Veter Entom 16(4):347–355. https://doi.org/10.1046/j.1365-2915.2002.00382.x
Fourcade Y, Engler JO, Rödder D, Secondi J (2014) Mapping species distributions with Maxent using a geographically biased sample of presence data: a performance assessment of methods for correcting sampling bias. PLoS One 9(5). https://doi.org/10.1371/journal.pone.0097122
Gray J, Dantas-Torres F, Estrada-Peña A, Levin M (2013) Systematics and ecology of the brown dog tick, Rhipicephalus sanguineus. Ticks Tick-Borne Dis 4(3):171–180. https://doi.org/10.1016/j.ttbdis.2012.12.003
Guillera-Arroita G, Lahoz-Monfort JJ, Elith J (2014) Maxent is not a presence-absence method: a comment on Thibaud et al. Meth Ecol Evol 11:1192–1197https://doi.org/10.1111/2041-210x.12252
Hernandez PA, Graham CH, Master LL, Albert DL (2006) The effect of sample size and species characteristics on performance of different species distribution modeling methods. Ecography 29(5):773–785. https://doi.org/10.1111/j.0906-7590.2006.04700.x
Hirzel A, Guisan A (2002) Which is the optimal sampling strategy for habitat suitability modelling. Ecol Model 157(2–3):331–341. https://doi.org/10.1016/S0304-3800(02)00203-X
Kadmon R, Farber O, Danin A (2003) A systematic analysis of factors affecting the performance of climatic envelope models. Ecol App 13(3):853–867. https://doi.org/10.1890/1051-0761(2003)013[0853:ASAOFA]2.0.CO;2
Kass JM, Muscarella R, Galante PJ, Bohl C, Pinilla‐Buitrago GE, Boria RA, Soley‐Guardia M, Anderson RP (2021) ENMeval 2.0: Redesigned for customizable and reproducible modeling of species’ niches and distributions. Meth Ecol Evol 12(9):1602–1608. https://doi.org/10.1111/2041-210x.13628
Khan AM, Li Q, Saqib Z, Khan N, Habib T, Khalid N, Majeed M, Tariq A (2022) Maxent modelling and impact of climate change on habitat suitability variations of economically important chilgoza pine (Pinus gerardiana wall.) in South Asia. Forests 13(5):715. https://doi.org/10.3390/f13050715
Khormi HM, Kumar L (2012) Assessing the risk for dengue fever based on socioeconomic and environmental variables in a geographical information system environment. Geospat Health 6(2):171. https://doi.org/10.4081/gh.2012.135
Li A, Wang J, Wang R, Yang H, Yang W, Yang C, Jin Z (2019) MaxEnt modeling to predict current and future distributions of Batocera lineolata (Coleoptera: Cerambycidae) under climate change in China. Écoscience 1:23–31. https://doi.org/10.1080/11956860.2019.1673604
Li Y, Li M, Li C, Liu Z (2020) Optimized maxent model predictions of climate change impacts on the suitable distribution of Cunninghamia lanceolata in China. Forests 11(3):302. https://doi.org/10.3390/f11030302
Licuanan WY, Cabreira RW, Aliño PM (2019) The Philippines. In: Sheppard C (ed) World Seas: an environmental evaluation, 2nd edn. Academic Press, pp 515–537. https://doi.org/10.1016/B978-0-08-100853-9.00051-8
Limbo-Dizon JE, Almadrones-Reyes KJ, Macabago SAB, Dagamac NHA (2022) Bioclimatic modeling for the prediction of the suitable regional geographical distribution of selected bright-spored myxomycetes in the Philippine archipelago. Biodiversitas 23(5). https://doi.org/10.13057/biodiv/d230506
Loiselle BA, Howell CA, Graham CH, Goerck JM, Brooks T, Smith KG, Williams PH (2003) Avoiding pitfalls of using species distribution models in conservation planning. Conserv Biol 17(6):1591–1600. https://doi.org/10.1111/j.1523-1739.2003.00233.x
Ma J, Guo Y, Gao J, Tang H, Xu K, Liu Q, Xu L (2022) Climate change drives the transmission and spread of vector-borne diseases: an ecological perspective. Biology 11(11):1628. https://doi.org/10.3390/biology11111628
Mamai W, Lobb LN, Bimbilé Somda NS, Maiga H, Yamada H, Lees RS, Gilles JRL (2018) Optimization of mass-rearing methods for Anopheles arabiensis larval stages: effects of rearing water temperature and larval density on mosquito life-history traits. J Econ Entomol. https://doi.org/10.1093/jee/toy213
Manalo JA, Matsumoto J, Takahashi HG, Villafuerte MQ II, Olaguera LMP, Ren G, Cinco TA (2021) The effect of urbanization on temperature indices in the Philippines. Int. J. Climatol 1–18. https://doi.org/10.1002/joc.7276
Marquez AR, Eamens K, Westman M, Šlapeta J (2021) Vector-borne pathogens in ticks and fleas of client-owned dogs in Metro Manila, Philippines. Parasitologia 1(4):247–256. https://doi.org/10.3390/parasitologia1040026
Merow C, Smith MJ, Silander JA (2013) A practical guide to maxent for modeling species’ distributions: what it does, and why inputs and settings matter. Ecography 36(10):1058–1069. https://doi.org/10.1111/j.1600-0587.2013.07872.x
Morales NS, Fernández IC, Baca-González V (2017) Maxent’s parameter configuration and small samples: are we paying attention to recommendations? A systematic review. PeerJ 5. https://doi.org/10.7717/peerj.3093
Muscarella R, Galante PJ, Soley-Guardia M, Boria RA, Kass JM, Uriarte M, Anderson RP (2014) ENMeval: an R package for conducting spatially independent evaluations and estimating optimal model complexity for Maxent ecological niche models. Methods Ecol Evol 5(11):1198–1205. https://doi.org/10.1111/2041-210X.12261
Naimi N, Hamm NAS, Groen TA, Skidmore AK, Toxopeus AG (2014) Where is positional uncertainty a problem for species distribution modeling? Ecography 37(2):191–203. https://doi.org/10.1111/j.1600-0587.2013.00205.x
Obsomer V, Defourny P, Coosemans M (2012) Predicted distribution of major malaria vectors belonging to the Anopheles dirus complex in Asia: ecological niche and environmental influences. PLoS One 7(11):e50475. https://doi.org/10.1371/journal.pone.0050475
[PAGASA] Philippine Atmospheric, Geophysical and Astronomical Services Administration (2023) Climate of the Philippines. GOV.PH. https://www.pagasa.dost.gov.ph/information/climate-philippines. Accessed 10 Oct 2023
Parola P, Socolovschi C, Jeanjean L, Bitam I, Fournier PE, Sotto A, Raoult D (2008) Warmer weather linked to tick attack and emergence of severe rickettsioses. PLoS Negl Trop Dis 2(11):e338. https://doi.org/10.1371/journal.pntd.0000338
Pascoe EL, Nava S, Labruna MB, Paddock CD, Levin ML, Marcantonio M, Foley JE (2022) Predicting the northward expansion of tropical lineage Rhipicephalus sanguineus sensu lato ticks in the United States and its implications for medical and veterinary health. PLoS One 17(8):e0271683. https://doi.org/10.1371/journal.pone.0271683
Pearce J, Ferrier S (2000) Evaluating the predictive performance of habitat models developed using logistic regression. Ecol Model 133(3):225–245. https://doi.org/10.1016/S0304-3800(00)00322-7
Pearson RG (2010) Species’ distribution modeling for conservation educators and practitioners. Lessons Conserv 3:54–89
Pearson RG, Raxworthy CJ, Nakamura M, Townsend Peterson A (2006) Predicting species distributions from small numbers of occurrence records: a test case using cryptic geckos in Madagascar. J Biogeogr 34(1):102–117. https://doi.org/10.1111/j.1365-2699.2006.01594.x
Peters DPC, McVey DS, Elias EH, Pelzel‐McCluskey AM, Derner JD, Burruss ND, Schrader TS, Yao J, Pauszek SJ, Lombard J, Rodriguez LL (2020) Big data–model integration and AI for vector‐borne disease prediction. Ecosphere 6. https://doi.org/10.1002/ecs2.3157
Peterson AT, Soberon J, Pearson RG, Anderson RP, Martınez-Meyer E, Nakamura M, Araujo M (2011) Ecological Niches and geographic distributions. Monographs in population biology 49. Princeton University Press, Princeton, New Jersey, USA
Phillips SJ (2017) A brief tutorial on Maxent. Retrieved from http://biodiversityinformatics.amnh.org/open_source/maxent/. Accessed 1 May 2023
Phillips SJ, Dudík M (2008) Modeling of species distributions with Maxent: new extensions and a comprehensive evaluation. Ecography 31(2):161–175. https://doi.org/10.1111/j.0906-7590.2008.5203.x
Phillips SJ, Anderson RP, Schapire RE (2006) Maximum entropy modeling of species geographic distributions. Ecol Model 190(3–4):231–259. https://doi.org/10.1016/j.ecolmodel.2005.03.026
Pramanik M, Singh P, Dhiman RC (2021) Identification of bio‑climatic determinants and potential risk areas for Kyasanur forest disease in Southern India using MaxEnt modeling approach. BMC Infect Dis 21. https://doi.org/10.1186/s12879-021-06908-9
Qin A, Liu B, Guo Q, Bussmann RW, Ma F, Jian Z, Xu G, Pei S (2017) Maxent modeling for predicting impacts of climate change on the potential distribution of Thuja sutchuenensis Franch., an extremely endangered conifer from southwestern China. Glob Ecol Conserv 10:139–146. https://doi.org/10.1016/j.gecco.2017.02.004
R Core Team (2013) R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. ISBN 3–900051–07–0, URL http://www.R-project.org/
Radosavljevic A, Anderson RP (2013) Making better Maxent models of species distributions: complexity, overfitting and evaluation. J Biogeogr 41:629–643. https://doi.org/10.1111/jbi.12227
Rocklov J, Dubrow R (2020) Climate change: an enduring challenge for vector-borne disease prevention and Control. Nat Immunol 21(5):479–483. https://doi.org/10.1038/s41590-020-0648-y
Sahin EK (2020) Assessing the predictive capability of ensemble tree methods for landslide susceptibility mapping using XGBoost, gradient boosting machine, and random forest. SN App Sci 2. https://doi.org/10.1007/s42452-020-3060-1
Salazar F, Malijan R, Braganza J, Angeles J, Ammugauan M, Aguila A, Macam O, Segundo U, Ebol A, Hugo C, Torno M, Aure W, Velayudhan R, Espino F (2021) Operationalization of insecticide resistance monitoring amongst malaria vectors in the Philippines. Medwin Publishers 5(2). https://doi.org/10.23880/phoa-16000186. Accessed 1 May 2023
Salvacion A, Magcale-Macandog D, Sta. Cruz P, Saludes R, Pangga I, Cumagun C (2018) Exploring spatial patterns of trends in monthly rainfall and temperature in the Philippines based on climate research unit grid. Spat Inf Res. https://doi.org/10.1007/s41324-018-0189-8
Shcheglovitova M, Anderson RP (2013) Estimating optimal complexity for ecological niche models: a jackknife approach for species with small sample sizes. Ecol Model 269:9–17. https://doi.org/10.1016/j.ecolmodel.2013.08.011
Stockwell DR, Peterson AT (2002) Effects of sample size on accuracy of species distribution models. Ecol Model 148(1):1–13. https://doi.org/10.1016/S0304-3800(01)00388-X
Tan LP, Hamdan RH, Hassan BNH, Reduan MFH, Okene IAA, Loong SK, Lee SH (2021) Rhipicephalus tick: a contextual review for Southeast Asia. Pathogens 10(7):821. https://doi.org/10.3390/pathogens10070821
Tang X, Yuan Y, Li X, Zhang J (2021) Maximum entropy modeling to predict the impact of climate change on Pine wilt disease in China. Front Plant Sci 12. https://doi.org/10.3389/fpls.2021.652500
Tesfamariam BG, Gessesse B, Melgani F (2022) MaxEnt-based modeling of suitable habitat for rehabilitation of Podocarpus forest at landscape-scale. Environ Syst Res 11(4). https://doi.org/10.1186/s40068-022-00248-6
Torres EP, Foley DH, Bryan JH (2006) Molecular systematics of the Philippine malaria vector Anopheles flavirostris. Med Vet Entomol 20(1):44–52. https://doi.org/10.1111/j.1365-2915.2006.00610.x
Valderrama L, Ayala S, Reyes C, González CR (2021) Modeling the potential distribution of the malaria vector Anopheles (ano.) pseudopunctipennis Theobald (diptera: Culicidae) in arid regions of Northern Chile. Front Pub Health 9. https://doi.org/10.3389/fpubh.2021.611152
Vignali S, Barras AG, Arlettaz R, Braunisch V (2020) SDMtune: an R package to tune and evaluate species distribution models. Ecol Evol 10:11488–11506. https://doi.org/10.1002/ece3.6786
Warren DL, Seifert SN (2011) Ecological niche modeling in Maxent: the importance of model complexity and the performance of model selection criteria. Ecol App 21:335–342. https://doi.org/10.1890/10-1171.1
Wiltshire KH, Tanner JE (2020) Comparing maximum entropy modelling methods to inform aquaculture site selection for novel seaweed species. Ecol Model 429:109071. https://doi.org/10.1016/j.ecolmodel.2020.109071
Wisz MS, Hijmans RJ, Li J, Peterson AT, Graham CH, Guisan A (2008) Effects of sample size on the performance of species distribution models. Divers Distribut 14:763–773. https://doi.org/10.1111/j.1472-4642.2008.00482.x
Yan H, He J, Xu X, Yao X, Wang G, Tang L, Feng L, Zou L, Gu X, Qu Y, Qu L (2021) Prediction of potentially suitable distributions of Codonopsis pilosula in China based on an optimized Maxent model. Front Ecol Evol 9. https://doi.org/10.3389/fevo.2021.773396
Zeng Y, Low BW, Yeo DCJ (2016) Novel methods to select environmental variables in MaxEnt: a case study using invasive crayfish. Ecol Model 341:5–13. https://doi.org/10.1016/j.ecolmodel.2016.09.019
Zhang J, Jiang F, Li G, Qin W, Li S, Gao H, Cai Z, Lin G, Zhang T (2019) Maxent modeling for predicting the spatial distribution of three raptors in the Sanjiangyuan National Park, China. Ecol Evol 9(11):6643–6654. https://doi.org/10.1002/ece3.5243
Zhao Z, Xiao N, Shen M, Li J (2022) Comparison between optimized MaxEnt and random forest modeling in predicting potential distribution: a case study with Quasipaa boulengeri in China. Sci Total Environ 842:156867–156867. https://doi.org/10.1016/j.scitotenv.2022.156867
Zhu G, Qiao H (2016) Effect of the Maxent model’s complexity on the prediction of species potential distributions. Biodivers Sci 24(10):1189–1196. https://doi.org/10.17520/biods.2016265
Funding
None.
Author information
Authors and Affiliations
Contributions
Germaine Comia-Geneta: data collection, data curation, data processing, writing – original draft; Simon Justin Reyes-Haygood: data collection, data curation, writing – original draft; Nicole Louise Salazar-Golez: data collection, writing – original draft; Nicole Alessandra Seladis-Ocampo: data analysis, methodology, writing – original draft; Merlin Rei Samuel-Sualibios: data collection, methodology, processing and analysis of data; Nikki Heherson A. Dagamac: conceptualization, supervision, writing – review and editing; Don Enrico Buebos-Esteve: methodology, processing and analysis of the data, supervision, writing – review and editing.
Corresponding author
Ethics declarations
The authors confirm that this article is original research and has not been published or presented previously in any journal or conference in any language in whole or in part.
Ethics approval
Not Applicable.
Consent to participate
Not applicable.
Consent for publication
All of the authors consented to publish this manuscript.
Competing interests
The authors declare no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Comia-Geneta, G., Reyes-Haygood, S.J., Salazar-Golez, N.L. et al. Development of a novel optimization modeling pipeline for range prediction of vectors with limited occurrence records in the Philippines: a bipartite approach. Model. Earth Syst. Environ. (2024). https://doi.org/10.1007/s40808-024-02005-3
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s40808-024-02005-3