Predictive capacity of nine algorithms and an ensemble model to determine the geographic distribution of tree species

Juan Carlos Montoya-Jiménez, José René Valdez-Lazalde, Gregorio Ángeles-Perez, Héctor Manuel de los SantosPosadas, Gustavo Cruz-Cárdenas

Research output: Contribution to journalArticlepeer-review

7 Scopus citations

Abstract

The different models that predict the distribution of species are a useful tool for the evaluation and monitoring of forest resources as they facilitate the planning of their management in a changing climate environment. Recently, a significant number of algorithms have been proposed for this purpose, making it difficult to select the most appropriate to use. The evaluation of performance and predictive stability of these models can elucidate this problem. Distribution data of 17 pine species with high economic importance for Mexico were collected and distribution models were carried out. We carried out a pre-modeling design to select the prediction variables (climatic, edaphic and topographic), after which nine algorithms and an ensemble model were contrasted against one another. The true skill statistic (TSS) and the area under the curve (AUC) were used to evaluate the predictive performance of the models, and the coefficient of variation of the predictions was used to evaluate their stability. The number of predictive variables in the final models fluctuated from 6 to 12; the mean diurnal range and the maximum temperature of warmest month were included in the models for most species. Random forests, the ensemble model, generalized additive models and MaxEnt were the ones that best described the distribution of the species (AUC> 0.92 and TSS> 0.72); the opposite was found in Bioclim and Domain (AUC<0.75 and <0.82; and TSS<0.5 and <0.55). Support vector machine, Mahalanobis distance, generalized linear models and boosted regression trees obtained intermediate settings. The coefficient of variation indicated that Bioclim, Domain and Support vector machine have low predictive stability (CV>0.055); on the contrary, Maxent and the ensemble model attained high predictive stability (CV<0.015). The ensemble model obtained greater performance and predictive stability in the predictions of the distribution of the 17 species of pines. The differences found in performance and predictive stability of the algorithms suggest that the ensemble model has the potential to model the distribution of tree species.

Original languageEnglish
Pages (from-to)363-371
Number of pages9
JournalIForest
Volume15
DOIs
StatePublished - 2022

Keywords

  • AUC
  • BRT
  • Ensemble Model
  • GAM
  • MaxEnt
  • Random Forests
  • SVM
  • TSS

Fingerprint

Dive into the research topics of 'Predictive capacity of nine algorithms and an ensemble model to determine the geographic distribution of tree species'. Together they form a unique fingerprint.

Cite this