
Development of a novel optimization modeling pipeline for range prediction of vectors with limited occurrence records in the Philippines: a bipartite approach

  • Original Article
  • Journal: Modeling Earth Systems and Environment

Abstract

The upsurge in technical and epidemiological research employing Maximum Entropy (Maxent) has established this machine-learning algorithm in species distribution modeling (SDM). Although Maxent robustly and accurately predicts the potential distribution of various species in different environments, data quality and varying hyperparameters influence its predictions. Optimizing hyperparameters can compensate for the rigidity of data quality. To address this caveat of Maxent, a bipartite approach (tuning and fine-tuning) for increasing model parsimony was developed to optimize the pipeline for range prediction of vectors with limited occurrence records in the Philippines. Tuned models reveal the influence of predictor collinearity on model accuracy, with a Pearson correlation threshold of 0.7 yielding the highest Area Under the Receiver Operating Characteristic Curve (AUC) score, consistent with methods popularly used in SDM. Fine-tuned models show that, contrary to the conventional pipeline, ΔAICc values approaching but not equal to zero produce a combination of hyperparameters (feature classes and regularization multiplier) that leads to higher AUC scores. Fine-tuned models are more parsimonious and portray wider distributions than the a priori models generated using the default Maxent settings. This study integrates the best approaches to advance the conventional pipeline for Maxent modeling, substantiating the call for intensive surveying of vectors in a data-poor and high-burden country.
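
The two quantities named in the abstract, a Pearson collinearity threshold and ΔAICc, can be illustrated with a minimal sketch. This is not the authors' pipeline: the greedy filtering order, the helper names (`drop_collinear`, `delta_aicc`), the predictor names, the candidate labels, and all numbers are hypothetical. The AICc expression is the standard small-sample form, AICc = 2k − 2 ln L + 2k(k + 1)/(n − k − 1), with k parameters and n occurrence records.

```python
import math

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)

def drop_collinear(predictors, threshold=0.7):
    """Keep a predictor only if |r| < threshold against every predictor kept so far.

    Assumption of this sketch: earlier entries take priority over later ones.
    """
    kept = []
    for name, values in predictors.items():
        if all(abs(pearson(values, predictors[k])) < threshold for k in kept):
            kept.append(name)
    return kept

def aicc(log_likelihood, k, n):
    """Small-sample corrected AIC for k parameters and n occurrence records."""
    if n - k - 1 <= 0:
        raise ValueError("AICc is undefined when n <= k + 1")
    return 2 * k - 2 * log_likelihood + (2 * k * (k + 1)) / (n - k - 1)

def delta_aicc(candidates):
    """Map each candidate to AICc minus the lowest AICc, sorted ascending."""
    scores = {name: aicc(*args) for name, args in candidates.items()}
    best = min(scores.values())
    ranked = sorted(scores.items(), key=lambda item: item[1])
    return {name: score - best for name, score in ranked}

# Hypothetical predictor values sampled at the same five locations.
predictors = {
    "bio1": [1, 2, 3, 4, 5],
    "bio5": [2, 4, 6, 8, 10.5],   # nearly a linear copy of bio1
    "bio12": [5, 1, 4, 2, 3],
}
kept = drop_collinear(predictors, threshold=0.7)

# Hypothetical candidates: label -> (log-likelihood, parameters k, records n).
candidates = {
    "LQ_rm1": (-250.0, 5, 30),
    "LQH_rm2": (-248.0, 8, 30),
    "L_rm1": (-255.0, 3, 30),
}
deltas = delta_aicc(candidates)
print(kept)
print(deltas)
```

In this toy run the filter discards `bio5` because it is almost perfectly correlated with `bio1`, and the candidates are ranked by ΔAICc. The conventional pipeline would stop at the ΔAICc = 0 candidate, whereas the abstract reports that candidates with ΔAICc close to, but not equal to, zero gave higher AUC scores in the authors' fine-tuned models.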

Data availability

The data used in this research are available from the corresponding author upon reasonable request.

Funding

None.

Author information

Contributions

Germaine Comia-Geneta: data collection, data curation, data processing, writing – original draft; Simon Justin Reyes-Haygood: data collection, data curation, writing – original draft; Nicole Louise Salazar-Golez: data collection, writing – original draft; Nicole Alessandra Seladis-Ocampo: data analysis, methodology, writing – original draft; Merlin Rei Samuel-Sualibios: data collection, methodology, processing and analysis of data; Nikki Heherson A. Dagamac: conceptualization, supervision, writing – review and editing; Don Enrico Buebos-Esteve: methodology, processing and analysis of the data, supervision, writing – review and editing.

Corresponding author

Correspondence to Don Enrico Buebos-Esteve.

Ethics declarations

The authors confirm that this article is original research and has not been published or presented previously in any journal or conference in any language in whole or in part.

Ethics approval

Not applicable.

Consent to participate

Not applicable.

Consent for publication

All of the authors consented to publish this manuscript.

Competing interests

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (DOCX 601 KB)

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

About this article

Cite this article

Comia-Geneta, G., Reyes-Haygood, S.J., Salazar-Golez, N.L. et al. Development of a novel optimization modeling pipeline for range prediction of vectors with limited occurrence records in the Philippines: a bipartite approach. Model. Earth Syst. Environ. (2024). https://doi.org/10.1007/s40808-024-02005-3

  • DOI: https://doi.org/10.1007/s40808-024-02005-3
