Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

The general construction protocol that was used for models has been previously published (Sarkar et al. 2010, ; Labay et al. 2011), so the description here will be cursory. A wide variety of machine-learning algorithms have been used for SDM construction (reviewed in Elith et al. 2006). This project used a maximum entropy algorithm incorporated in the Maxent software package (Phillips & Dudik 2008, Phillips et al. 2006; Phillips & Dudik 2008) because it directly provides probabilistic output (unlike the genetic algorithm of GARP (Stockwell 1999)) that can be used without further treatment for subsequent analyses, and because a variety of recent studies have concluded that its performance is superior to those of other methods (Elith et al. 2006, ; Wisz et al. 2008). Maxent was parameterized following published recommendations (Phillips et al. 2006), with models replicated 100 times withholding randomly in each replicate 40% of localities as "test" records, with the remaining 60% serving as model "training" records. Model performance was evaluated using a (threshold-independent) receiver operating characteristic (ROC) analysis and 11 internal binomial analyses of "raining" and "˜test" occurrence omission. The ROC analysis characterizes model performance at all possible thresholds using the area under the curve (AUC), a measure of model performance independent of any threshold (Hanley & McNeil 1982). An optimal model with perfect discrimination would have an AUC of 1 while a model that predicted species occurrences at random would have an AUC of 0.5 (Hanley & McNeil 1982).

...

Occurrence data input consists of FoTX records. Records with > one km potential georeferencing error (radius, see Georeferencing and Geographic Units) were excluded to assure input occurrences closely corresponded in spatial resolution to environmental layers used in modeling. This spatial error threshold of one km approximately matches the grid cell resolution of 30 arc-seconds (which approximates one km at the Equator), but is slightly larger than the longitudinal boundary of the average cell size (0.73 km2) due to geographic projection at the latitude of Texas. However, the maximum entropy algorithm used for analysis (see Model Construction above) has been shown not to be affected by spatial errors in occurrence datasets with standard deviations up to five km (Hernandez et al. 2006, Wisz et al. 2008). Occurrence records before 1950 were similarly excluded so that occurrence data were temporally congruent with climatic variables used (see Table 1 below). Finally, since model performance stabilizes with respect to accuracy of prediction at about 10 records when using the maximum entropy model construction algorithm (Phillips & Dudik 2008, Phillips et al. 2006; Phillips & Dudik 2008), models were produced only for those species for which we had a minimum of 10 occurrences corresponding to at least 10 unique cells on the environmental layer grids.

...

Layer CategoryLayer TypeDescriptionSource
TopologicalContinuousaspect1km DEM
TopologicalContinuousslope1km DEM
TopologicalContinuouscompound topological index (ln(acc.flow/tan[slope]))1km DEM
TopologicalContinuousaltitude1km DEM
ClimateContinuousannual mean temperatureWordclim variable 1
ClimateContinuousmean diurnal range (mean of monthly (max temp - min temp))Wordclim variable 2
ClimateContinuousisothermality (P2/P7)(*100)Wordclim variable 3
ClimateContinuous(temperature seasonality (sd *100)Wordclim variable 4
ClimateContinuousmax temperature of warmest monthWordclim variable 5
ClimateContinuousmin temperature of coldest monthWordclim variable 6
ClimateContinuoustemperature annual range (P5-P6)Wordclim variable 7
ClimateContinuousannual precipitationWordclim variable 12
ClimateContinuousprecipitation of wettest monthWordclim variable 13
ClimateContinuousprecipitation of driest monthWordclim variable 14
ClimateContinuousprecipitation seasonality (coefficient of variation)Wordclim variable 15
ClimateContinuousprecipitation of wettest quarterWordclim variable 16
ClimateContinuousprecipitation of driest quarterWordclim variable 17
ClimateContinuousprecipitation of warmest quarterWordclim variable 18
ClimateContinuousprecipitation of coldest quarterWordclim variable 19
GeographicCategoricalmajor river basinsTexas Water Development Board
GeographicCategorical8-digit hydrologic unit code (HUC)Texas Water Development Board
HydrologicContinuouscumulative drainageNational Hydrology Dataset plus
HydrologicContinuousmean annual flowNational Hydrology Dataset plus
HydrologicContinuousmean annual velocityNational Hydrology Dataset plus


References

Guisan A, Thuiller W (2005) Predicting species distribution: offering more than simple habitat models. Ecology Letters 8: 993-1009.

Elith J, Graham CH, Anderson RP, Dudik M, Ferrier SElith, J., C. H. Graham, R. P. Anderson, M. Dudik, S. Ferrier, et al. , others ( 2006) . Novel methods improve prediction of species' distributions from occurrence data. Ecography 29:129-151.

Margules C, Sarkar S (2007) Systematic conservation planning. Systematic conservation planning.

Hernandez PA, Graham CH, Master LL, Albert DL (2006) González, C., O. Wang, S. E. Strutz, C. González-Salazar, V. Sánchez-Cordero, et al. 2010. Climate change and risk of Leishmaniasis in North America: Predictions from ecological niche models of vector and reservoir species. PLoS Neglected Tropical Diseases 4: e585.

Guisan, A, and W. Thuiller. 2005. Predicting species distribution: offering more than simple habitat models. Ecology Letters 8:993-1009.

Hanley, J. A., and B. J. McNeil. 1982. The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology 143:29.

Hernandez, P. A., C. H. Graham, L. L. Master, and D. L. Albert. 2006. The effect of sample size and species characteristics on performance of different species distribution modeling methods. Ecography 29:773-785.

Pawar S, Koo MS, Kelley C, Ahmed MF, Chaudhuri S, et al. (2007) Conservation assessment and prioritization of areas in Northeast India: priorities for amphibians and reptiles. Biological Conservation 136: 346-361Illoldi-Rangel, P., T. Fuller, M. Linaje, C. Pappas, V. Sánchez-Cordero, et al. 2008. Solving the maximum representation problem to prioritize areas for the conservation of terrestrial mammals at risk in Oaxaca. Diversity and Distributions 14:493-508.

Labay, B. J., A. E. Cohen, B. Sissel, D. A. Hendrickson, F. D. Martin, and S. Sarkar. , 2011. Assessing historical fish community composition using surveys, historical collection data, and species distribution models. PLoS ONE 6, e25145.

Illoldi-Rangel P, Fuller T, Linaje M, Pappas C, Sánchez-Cordero V, et al. (2008) Solving the maximum representation problem to prioritize areas for the conservation of terrestrial mammals at risk in Oaxaca. Diversity and distributions 14: 493-508.

González C, Wang O, Strutz SE, González-Salazar C, Sánchez-Cordero V, et al. (2010) Climate Change and Risk of Leishmaniasis in North America: Predictions from Ecological Niche Models of Vector and Reservoir Species.

Wisz MS, Hijmans RJ, Li J, Peterson AT, Graham CH, et al. (2008) Effects of sample size on the performance of species distribution models. Diversity and Distributions 14: 763-773.

Sarkar S, Sánchez-Cordero V, Londoño M, Fuller T (2009) : e25145.

Margules, C. R., and S. Sarkar. 2007. Systematic conservation planning. Cambridge University Press, Cambridge, UK

Pawar, S., M. S. Koo, C. Kelley, M. F. Ahmed, S. Chaudhuri, et al. 2007. Conservation assessment and prioritization of areas in Northeast India: priorities for amphibians and reptiles. Biological Conservation 136:346-361.

Phillips, S. J., R. P. Anderson, and P. E. Schapire. 2006. Maximum entropy modeling of species geographic distributions. Ecological Modelling 190:231-259.

Phillips, S. J., and M. Dudik. 2008. Modeling of species distributions with Maxent: new extensions and a comprehensive evaluation. Ecography 31:161-175.

Sarkar, S., V. Sánchez-Cordero, M. Londoño, and T. Fuller. 2009. Systematic conservation assessment for the Mesoamerica, Chocó, and Tropical Andes biodiversity hotspots: a preliminary analysis. Biodiversity and Conservation 18:1793-1828.

Phillips SJ, Dudik M (2008) Modeling of species distributions with Maxent: new extensions and a comprehensive evaluation. Ecography 31: 161-175.

Phillips SJ, Anderson RP, Schapire RE (2006) Maximum entropy modeling of species geographic distributions. Ecological Modelling 190: 231-259.

Stockwell D (1999) Sarkar, S., S. E. Strutz, D. M. Frank, C. Rivaldi, B. Sissel, et al. 2010. Chagas disease risk in Texas. PLoS Neglected Tropical Diseases 4: e836. Accessed 8 October 2010.

Stockwell, D. 1999. The GARP modelling system: problems and solutions to automated spatial prediction. International Journal of Geographical Information Science 13:143-158.

Hanley JA, McNeil BJ (1982) The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology 143: 29.

Sarkar S, Strutz SE, Frank DM, Rivaldi C, Sissel B, et al. (2010) Chagas Disease Risk in Texas. PLoS Neglected Tropical Diseases 4: e836. Accessed 8 October 2010Wisz, M. S., R. J. Hijmans, J. Li, A. T. Peterson, C. H. Graham, et al. 2008. Effects of sample size on the performance of species distribution models. Diversity and Distributions 14:763-773.