Dealing with overprediction in species distribution models: How adding distance constraints can improve model accuracy

2020 ◽  
Vol 431 ◽  
pp. 109180 ◽  
Author(s):  
Poliana Mendes ◽  
Santiago José Elías Velazco ◽  
André Felipe Alves de Andrade ◽  
Paulo De Marco
2019 ◽  
Author(s):  
Truly Santika ◽  
Michael F. Hutchinson ◽  
Kerrie A. Wilson

ABSTRACTPresence-only data used to develop species distribution models are often biased towards areas that are frequently surveyed. Furthermore, the size of calibration area with respect to the area covered by the species occurrences has been shown to affect model accuracy. However, existing assessments of the effect of data inadequacy and calibration size on model accuracy have predominately been conducted using empirical studies. These studies can give ambiguous results, since the data used to train and test the model can both be biased.These limitations were addressed by applying simulated data to assess how inadequate data coverage and the size of calibration area affect the accuracy of species distribution models generated by MaxEnt and BIOCLIM. The validity of four presence-only performance measures, Contrast Validation Index (CVI), Boyce index, AUC and AUCratio, was also assessed.CVI, AUC and AUCratio ranked the accuracy of univariate models correctly according to the true importance of their defining environmental variable, a desirable property of an accuracy measure. Contrastingly, Boyce index failed to rank the accuracy of univariate models correctly and a high percentage of irrelevant variables produced models with a high Boyce index.Inadequate data coverage and increased calibration area reduced model accuracy by reducing the correct identification of the dominant environmental determinant. BIOCLIM outperformed MaxEnt models in predicting the true distribution of simulated species with a symmetric dominant response. However, MaxEnt outperformed BIOCLIM in predicting the true distribution of simulated species with skew and linear dominant responses. Despite this, the standard performance measures consistently overestimated the performance of MaxEnt models and showed them as always having higher model accuracy than the BIOCLIM models.It has been acknowledged that research should be directed towards testing and improving species distribution modelling tools, particularly how to handle the inevitable bias and scarcity of species occurrence data. Simulated data, as demonstrated here, provides a powerful approach to comprehensively test the performance of modelling tools and to disentangle the effects of data properties and modelling options on model accuracy. This may be impossible to achieve using real-world data.


2008 ◽  
Vol 45 (2) ◽  
pp. 599-609 ◽  
Author(s):  
Omri Allouche ◽  
Ofer Steinitz ◽  
Dotan Rotem ◽  
Arik Rosenfeld ◽  
Ronen Kadmon

2019 ◽  
Vol 9 (1) ◽  
Author(s):  
Tali Magory Cohen ◽  
Roi Dor

Abstract Estimating the potential distribution of invasive species has been primarily achieved by employing species distribution models (SDM). Recently introduced joint species distribution models (JSDM) that include species interactions are expected to improve model output. Here we compare the predictive ability of SDM and JSDM by modelling the distribution of one of the most prolific avian invaders in the world, the common myna (Acridotheres tristis), in a recent introduction in Israel. Our results indicate that including information on the local species composition did not improve model accuracy, possibly because of the unique characteristics of this species that include broad environmental tolerance and behavior flexibility. However, the JSDM provided insights into co-occurrence patterns of common mynas and their local heterospecifics, suggesting that at this time point, there is no evidence of species exclusion by common mynas. Our findings suggest that the invasion potential of common mynas depends greatly on urbanization and less so on the local species composition and reflect the major role of anthropogenic impact in increasing the distribution of avian invaders.


2021 ◽  
Vol 13 (8) ◽  
pp. 1495
Author(s):  
Jehyeok Rew ◽  
Yongjang Cho ◽  
Eenjun Hwang

Species distribution models have been used for various purposes, such as conserving species, discovering potential habitats, and obtaining evolutionary insights by predicting species occurrence. Many statistical and machine-learning-based approaches have been proposed to construct effective species distribution models, but with limited success due to spatial biases in presences and imbalanced presence-absences. We propose a novel species distribution model to address these problems based on bootstrap aggregating (bagging) ensembles of deep neural networks (DNNs). We first generate bootstraps considering presence-absence data on spatial balance to alleviate the bias problem. Then we construct DNNs using environmental data from presence and absence locations, and finally combine these into an ensemble model using three voting methods to improve prediction accuracy. Extensive experiments verified the proposed model’s effectiveness for species in South Korea using crowdsourced observations that have spatial biases. The proposed model achieved more accurate and robust prediction results than the current best practice models.


Sign in / Sign up

Export Citation Format

Share Document