Research Paper |
Corresponding author: Patrick J. Comer ( pat_comer@natureserve.org ) Academic editor: Gonzalo Navarro-Sánchez
© 2022 Patrick J. Comer, Jon C. Hak, Daryn Dockter, Jim Smith.
This is an open access article distributed under the terms of the CC0 Public Domain Dedication.
Citation:
Comer PJ, Hak JC, Dockter D, Smith J (2022) Integration of vegetation classification with land cover mapping: lessons from regional mapping efforts in the Americas. Vegetation Classification and Survey 3: 29-43. https://doi.org/10.3897/VCS.67537
|
Aims: Natural resource management and biodiversity conservation rely on inventories of vegetation that span multiple management or political jurisdictions. However, while remote sensing data and analytical tools have enabled production of maps at increasing spatial resolution and reliability, there are limited examples where national or continental-scaled maps are produced to represent vegetation at high thematic detail. We illustrate two examples that have bridged the gap between traditional land cover mapping and modern vegetation classification. Study area: Our two case studies include national (USA) and continental (North and South America) vegetation and land cover mapping. These studies span conditions from subpolar to tropical latitudes of the Americas. Methods: Both case studies used a supervised modeling approach with the International Vegetation Classification (IVC) to produce maps that provide for greater thematic detail. Georeferenced locations for these vegetation types are used by machine learning algorithms to train a predictive model and generate a distribution map. Results: The USA LANDFIRE (Landscape Fire and Resource Management Planning Tools Project) case study illustrates how a history of vegetation-based classification and availability of key inputs can come together to generate standard map products covering more than 9.8 million km2 that are unsurpassed anywhere in the world in terms of spatial and thematic resolution. That being said, it also remains clear that mapping at the thematic resolution of the IVC Group and finer resolution require very large and spatially balanced inputs of georeferenced samples. Even with extensive prior data collection efforts, these remain a key limitation. The NatureServe effort for the Americas - encompassing 22% of the global land surface - demonstrates methods and outputs suitable for worldwide application at continental scales. Conclusions: Continued collection of input data used in the case studies could enable mapping at these spatial and thematic resolutions around the globe.
Abbreviations: CART = Classification and Regression Tree; CONUS = Conterminous United States; DSWE = Dynamic Surface Water Extent; EPA = United States Environmental Protection Agency; FGDC = Federal Geographic Data Committee; IVC = International Vegetation Classification; LANDFIRE = Landscape Fire and Resource Management Planning Tools Project; LFRDB = LANDFIRE Reference Database; LiDAR = Light Detection and Ranging; NDVI = Normalized Difference Vegetation Index; NLCD = National Land Cover Database; USNVC = United States National Vegetation Classification; USA = United States of America; WWF = World Wildlife Fund or Worldwide Fund for Nature.
America, distribution modeling, Random Forest, vegetation classification, vegetation mapping
Natural resource management and biodiversity conservation often rely on inventories of vegetation that span multiple management or political jurisdictions. However, while remote sensing data and analytical tools have enabled production of maps at increasing spatial resolution and accuracy, there are limited examples where maps are produced at large national or continental scale to represent vegetation at high thematic detail. For example, the U.S. National Land Cover Database (NLCD) (
As vegetation classification has advanced, the potential to map far more ecologically distinct map classes presents important opportunities to address pressing societal needs (
We will review two case studies of both national and continental land cover mapping that have bridged the gap between traditional land cover mapping and modern vegetation classification. We trace recent development of terrestrial ecological classification in the Americas with specific reference to land cover mapping applications at regional to national and continental scales. This experience assisted in development of the International Vegetation Classification (IVC) (
The two case studies span conditions from subpolar to tropical latitudes of the Americas. One, limited to the United States, aimed to map the current distribution of the Group level of the IVC hierarchy (Table
U.S. National Vegetation Classification Hierarchy, including example classification units. The number of natural types documented within each hierarchical level from the conterminous United States (as of March 2021).
Level No. | Level Name | Defining Characteristics | No. Types | Example |
---|---|---|---|---|
1 | Class | Life Form Physiognomy | 6 | Grassland & Shrubland |
2 | Subclass | Global Physiognomy | 13 | Temperate & Boreal Grassland & Shrubland |
3 | Formation | Global Physiognomy | 36 | Temperate Grassland & Shrubland |
4 | Division | Continental Floristics | 50 | Great Plains Grassland & Shrubland |
5 | Macrogroup | Subcontinental Floristics | 143 | Great Plains Tallgrass Prairie |
6 | Group | Regional Floristics | 327 | Northern Great Plains Tallgrass Prairie |
7 | Alliance | Subregional Floristics | 1,174 | Schizachyrium scoparium - Bouteloua curtipendula Northern Grassland |
8 | Association | Local Floristics | 6,108 | Schizachyrium scoparium - Bouteloua curtipendula - Hesperostipa spartea - (Pascopyrum smithii) Grassland |
Our first case study reviews the experience of the U.S. Landscape Fire and Resource Management Planning Tools Project (LANDFIRE) that, since the mid 2000s, has produced a series of moderate-to-high resolution national map products to facilitate strategic decision support to both wildfire and wildlife habitat managers. Resultant map layers describe vegetation composition and structure and can be compared to expected conditions to indicate alteration to expected natural wildfire regimes (
Our second case study describes continental-scale mapping that encompasses temperate and tropical latitudes of North America and all of South America. Mapping methods such as those used by LANDFIRE were adapted to the hemisphere to provide a more thematically detailed view than had been previously attained. The intent of this effort was to support conservation status assessment of ecosystem types occurring within and across national borders (
Both case studies used a supervised modeling approach which include a priori classification of vegetation types as the basis for mapping (
Many terrestrial ecosystems across temperate North America support natural wildfire regimes of varying frequency and intensity. The multi-agency LANDFIRE effort was established in 2001 to produce a series of moderate-to-high resolution map products, along with state-and-transition models, to characterize expected and actual vegetation condition with regards to natural disturbances like wildfire. All products of the effort are intended to facilitate strategic decisions by both wildfire and wildlife habitat managers.
Beginning with a vegetation-based classification standard, conceptual and quantitative state-and-transition models describe expected succession and disturbance pathways, as well as characteristic fuels, for a given vegetation type. Spatial models, called biophysical settings, aim to depict the likely historical location of each type, given natural disturbance regimes. These models were based on the terrestrial ecological systems classification developed by NatureServe (
The LANDFIRE map legend for existing vegetation and land cover encompasses the continuum from natural to ruderal and cultural vegetation types. The hierarchical structure of the USNVC includes broad units at upper levels defined by vegetation physiognomy, followed by progressively narrow units at lower levels defined by vegetation floristic composition (Federal Geographic Data Committee [FGDC]
While the USNVC Group level provided a useful classification of natural vegetation units for the map legend, additional map legend categories were used to provide robust map product. First, the revised USNVC includes “ruderal” units that are defined as including plant assemblages with no natural analog. These commonly result from prior land conversion and subsequent abandonment, so they encompass what are often referred to as “old fields” and secondary forests where exotic species and/or native species are present in abundances not found where prior human influence is less discernable. Several ruderal vegetation units, approximating the Group level, were documented for use in LANDFIRE map products.
Second, a series of map class modifiers were used to facilitate mapping structural variants within each USNVC Group-based map class informed by the using the National Land Cover Database (NLCD;
Modern mapping methods include use of georeferenced sample locations – each labeled to the intended map units they represent – to train models that will combine predictor layers to generate a vegetation map. Due the the very large number of georeferenced samples needed for national land cover mapping at thematic levels like the USNVC Group, LANDFIRE produced algorithmic tools called “autokeys” for processing vegetation sample plot data for subsequent modeling and mapping. The autokey algorithm scans the content of each sample plot to detect species presence and abundance as well as structural categories to determine to which map legend class the sample belongs. It then applies the appropriate label for use in subsequent modeling steps. Autokeys were designed and implemented within regions determined by clustering ecologically similar ecoregions modifed from US Forest Service (
Expert ecologists reviewed and hand labeled nearly 18,000 samples to assess autokey performance. The total number of samples labeled by autokeys varied by region, from a high of 80,148 in the Rocky Mountains to 3,517 in the North Coast region. For most regions, the proportion of plots used in assessment that were reviewed by experts was 4% to 8%. Validation statistics for each map legend category were used throughout the development and final evaluation of each autokey. The overall validation statistic is a useful measure of how well each autokey performed across all types. It is the number of matches between expert and autokey labels divided by the total number of expert plots × 100, and the overall validation statistic was calculated for each autokey. Overall agreement for the USNVC Group keys ranged from a high of 90% (Texas-Oklahoma Hill Prairie) to a low of 39.9% (Coastal Plain). In most cases, lower performance occurred where substantial proportions of the landscape are dominated by ruderal vegetation, and distinguishing among very similar vegetation types using sample plots becomes more challenging.
Although over 500,000 vegetation samples were labeled through autokeys, there were still hundreds of thousands of samples with insufficient quantitative information to run through the autokeys. These often included documented locations from local natural resource inventories where an existing classification was used to label the location without including vegetation composition and structure. A series of classification crosswalks were used to reconcile these differences and label samples to the intended unit on the LANDFIRE map legend. In the 2016 LANDFIRE map, over one million samples were processed either by autokeys or through expert labeling across the CONUS.
A key factor in ecosystem modeling is determining the boundaries within which to build and apply models for existing vegetation types. Initial prototyping found that Omernik Level III Ecoregions (
The modeling process began by removing sample plots in recently disturbed areas collected prior to the disturbance using LANDFIRE annual disturbance products. Spectral outliers were identified by summing Landsat bands one through six for each class and sample plots, those plots greater than two standard deviations from the mean were removed. The resultant filtered plots were used to model lifeform and vegetation structure. The spectral test was performed separately for vegetation types prior to witholding samples for map validation.
Vegetation structure is important for fire behavior fuel models. Therefore, existing vegetation products were designed to nest by lifeform. For example, pixels identified as tree in the lifeform mask will be assigned a tree cover, height, and vegetation type. The lifeform modeling process began with an initial output using the filtered sample plots. The initial lifeform model output was improved through an iterative process by adding expert-labeled training samples based on desktop review of aerial photos to correct obvious mapping errors.
Using USNVC Group concepts as a guide, sample plots were separated into three lifeforms: tree, shrub, and herbaceous vegetation types, as well as barren or sparsely vegetated types (<10% total cover). Plots were further separated into wetland vs non-wetland categories, and alpine vs non-alpine categories where they existed. Classification tree models were generated with the See5 algorithm using raster predictor variables (Table
Dataset Name | Units of Analysis | Range of Values | Source (citation) |
---|---|---|---|
Elevation | Meter | -113 to 4415 | 3DEP DEM ( |
Aspect | Degrees | 0 to 359° | 3DEP DEM Derivative ( |
Percent Slope | % | 0 to 85 | 3DEP DEM Derivative ( |
Topographic Position Index (300 & 2000) | Index | approx. 900–2,330, 400–3150 | 3DEP DEM Derivative ( |
Landsat Imagery (Seasonal) ca. 2016 | Radiance | 6 (0 to 255 per band) | Processed Landsat scenes courtesy of the U.S. Geological Survey |
Tasseled Cap (Seasonal) ca. 2016 | Index | 3 (-8,000 to 24,000 per band) | Processed Landsat scenes courtesy of the U.S. Geological Survey |
NDVI 5-year statistics (Min, Max, Median, Max-Median) | Index | -1.0 to 1.0 | Processed Landsat scenes courtesy of the U.S. Geological Survey |
Climate (Precipitation, Temperature) | Milimeters, Degree | 1,390 to 65,534, 26,000 to 49,494 | Gradients ( |
Soils (Percent Sand, Silt, Clay, Organic Matter, and pH) | % | 0 to 100, 0 to 9 | gSSURGO (USDA. 2016) |
The draft thematic map was edited using rulesets based on geography or topography, or manual pixel reclassification with hand-drawn polygons based on expert opinion and review. Draft maps were also revised by removing problematic plots identified during the modeling process, reclassifying plots to a better fit, or adding sample plots based on expert opinion to correct modeling errors and improve mapping of problematic classes. Draft maps were reviewed by regional experts with NatureServe and staff from state agencies. Wildland Urban Interface maps produced by the Forest Service (
Vegetation percent cover and height training samples were derived from field estimates of vegetation cover and height. In addition, tree canopy percent cover estimates were calculated from the percentage of Light Detection and Ranging (LiDAR) point cloud above 3 m and tree height estimates were derived from the 90th percentile of LiDAR returns. Sample plots were separated into three lifeforms: tree, shrub, and herbaceous cover and height. Regression tree models predicting percent cover and height of dominant vegetation were generated with the Cubist algorithm using the following predictor layers identified in Table
Several masks were developed to identify open water, barren land, sparse vegetation, developed, and agricultural lands. Open water was identified using custom modeling methods based on the Landsat Level 3 Dynamic Surface Water Extent (DSWE) Science Product from the U.S. Geological Survey. Fragmented segments along streams and rivers were connected using National Hydrography Dataset (
The mapping process for existing vegetation based on the USNVC Group concepts generated a 30 m pixel resolution map raster with 499 natural, ruderal, and cultural map classes (Figure
The LANDFIRE Program implemented a map assessment approach that utilized existing information because no resources were available to collect additional data. The goal was to provide assessment results in tandem with data delivery. The assessment sampling strategy for the LANDFIRE 2016 Remap randomly withdrew 10% of the available plots for each of the terrestrial ecological systems classification developed by NatureServe (
Confusion tables were created for each of the six LANDFIRE GeoArea delivery packages across CONUS by cross-tabulating the autokey USNVC Group assignment for each assessment plot against the LANDFIRE USNVC Group assignment for map pixels at the plot location. Category agreement focused tables were then generated from each GeoArea contingency table. No stratification, spatial buffering, or category weighting was used (Table
The assessment sample was based on plots previously available to the program so the sample size and distribution reflected the overall plot numbers and categorical distribution present in any GeoArea. Across the continent the assessment sample was not sufficient for most of the mapped categories. While the agreement results were not high, there was variation in the results across GeoAreas and across categories within each GeoArea. No consistent error patterns were identified, although there is some indication that forest types tend to have lower error rates than shrub and herbaceous types. The opportunities for comparing category error rates across GeoAreas are limited by the sample sizes. For example, Southern Rocky Mountain Ponderosa Pine was mapped in both the Southwest and Northwest GeoAreas but the limited extent in the Northwest GeoArea resulted in too small an assessment set for this GeoArea. Results were not specifically linked to the number of categories assessed. For example, while the Southeast GeoArea had the lowest number of assessed categories and the lowest percentage of assessed categories with more than 70% agreement, the North Central had the lowest percentage of assessed categories with more than 50% agreement. Readers should note that these are absolute errors. If the plot assignment did not match the map assignment exactly it was designated as an error, so errors between floristically similar groups are counted the same as errors between floristically dissimilar groups. Users can review results for USNVC Groups of specific interest to fully understand the results of the assessment analysis.
To understand the results and ramifications for the USNVC Group-based map, a small portion of the Category Agreement Table for the Northwest GeoArea is presented in Table
For example, the most prevalent misclassification for Intermountain Basins Dry Tall Sagebrush Shrubland & Steppe was Intermountain Low & Black Sagebrush Shrubland & Steppe, followed by Mesic Tall Sagebrush Shrubland & Steppe. These types can occur immediately adjacent to each other across the western landscapes where they are found, and share substantial floristic composition, while the Dry-Mesic Spruce - Fir Forest & Woodland is much less similar so those errors may be more substantial depending on the application. This type of variation in agreement results was common across the GeoAreas, so users can review results for USNVC Groups of specific interest to fully understand the results of the assessment analysis.
Map users should also note that, in addition to the issues with sample size and distribution, these results do not indicate the scope of misclassifications, e.g., how much area within a GeoArea had agreement greater than or less than 50% or 70%.
Assessment results for USNVC Groups with sufficient samples (n > 30) by GeoArea.
GeoArea | No. USNVC Groups with assessment plots | No. USNVC Groups with >30 assessment plots | Proportion of USNVC Groups with >30 assessment plots with >70% agreement between map and plot designation | Proportion of USNVC Groups with >30 assessment plots with >50% agreement between map and plot designation |
---|---|---|---|---|
Northwest | 132 | 38 | 13% | 34% |
North Central | 95 | 24 | 8% | 29% |
Northeast | 139 | 48 | 10% | 40% |
Southwest | 173 | 61 | 15% | 41% |
South Central | 122 | 17 | 35% | 59% |
Southeast | 50 | 19 | 5% | 42% |
Portions of the Northwest GeoArea Category Agreement Table for USNVC Group.
USNVC Name | Row Total (pixels) | Row Agreement | Primary Within Row Mismatch | Secondary Within Row Mismatch | Tertiary Within Row Mismatch |
---|---|---|---|---|---|
Columbia Plateau Western Juniper Woodland & Savanna | 89 | 75.28% | 6480 Columbia Plateau Western Juniper Shrubland; 4 Incorrect Pixels | 6288 Intermountain Mountain Big Sagebrush Shrubland & Steppe; 3 Incorrect Pixels | 6145 Central Rocky Mountain Lower Montane Foothill & Valley Grassland; 3 Incorrect Pixels |
Intermountain Basins Dry Tall Big Sagebrush Shrubland & Steppe | 740 | 56.49% | 6285 Intermountain Low & Black Sagebrush Shrubland & Steppe; 58 Incorrect Pixels | 6287 Intermountain Mesic Tall Sagebrush Shrubland & Steppe; 36 Incorrect Pixels | 6070 Rocky Mountain Subalpine Dry-Mesic Spruce - Fir Forest & Woodland; 28 Incorrect Pixels |
Vancouverian & Rocky Mountain Montane Wet Meadow & Marsh | 35 | 22.86% | 6239 Western Montane-Subalpine Riparian & Seep Shrubland; 4 Incorrect Pixels | 6070 Rocky Mountain Subalpine Dry-Mesic Spruce - Fir Forest & Woodland; 3 Incorrect Pixels | 6330 Northern Rocky Mountain Lowland & Foothill Riparian Forest; 2 Incorrect Pixels |
Accelerating landscape change threatens biodiversity worldwide, so documented trends in the extent of ecosystems provide a foundation that can be used for conservation action. However, a comprehensive ecosystem classification of sufficient thematic detail to support these types of analyses has been lacking across the Americas. While a number of ecosystem classification maps exist at regional (
This second case study includes a project area of approximately 32.6 million km2 or nearly 22% of the global land surface, excluding the Boreal and Arctic regions of North America. The aim was to produce both “potential” and “current” distribution maps for major terrestrial ecosystem types that would be suitable for continental-scale assessment and planning, and also include units suitable for on-the-ground conservation action. The “potential distribution” includes biophysical conditions where each type might occur today had there not been any prior intensive human intervention. “Current distribution” then accounts for those areas of intensive intervention and conversion, as of approximately 2010. For this effort an effective minimum map unit size, or mapped pixel resolution, ranged from 270 m to 450 m.
Mapping across the hemisphere brings challenges of working with a high diversity of vegetation types and uneven availability of modeling inputs. Different modeling approaches and lower levels of thematic and spatial resolution in map products may be useful. Building from experience in the United States, we developed new spatial models of potential distributions of vegetation Macrogroups as defined by the International Vegetation Classification or IVC (
Table
Next, we screened the patch sizes of a given type, patches > 10 km2 in area provided the pool of source areas for sample selection. Selection of the 10 km2 is again an expert judgment, having evaluated existing maps and concluded that sampling from types depicted in smaller areas risked introducing substantial error. We acknowledge that this risks exclusion of naturally rare ecosystem types, but we judged this risk was warranted given the quality of existing map information for this purpose. This pool of map polygons encompasses 95% of natural landscapes. Stratified random sample selection was weighted by continent-wide area of each type using the log10 (area)*100, providing a sample total weighted towards types of lesser area. A total of 595,951 georeferenced samples were generated for the Americas, with an additional 70,380 held aside for map validation.
Explanatory variables, represented as map surfaces, included a series of biophysical factors, such as bioclimate, landform, slope, and aspect, as well as surface flow accumulation (Table
Bioclimates, as modeled by
Map sources of sample points for model development – from institutions and publications - with emphasis on Latin America and Caribbean (MMU = minimum map unit).
Mapping Region | Map Sources | Source MMU | Sample Points |
---|---|---|---|
Caribbean |
|
1 ha | 80,539 |
Mexico | Mexican National Institute of Statistics and Geography (INEGI), TNC, ProNatura-Yucatan | 5 ha | 41,731 |
MesoAmerica | TNC, ProNatura-Yucatan | 5 ha | 56,372 |
South America | TNC, NatureServe, World Wildlife Fund | 1000 ha | 416,309 |
Inputs to mapping vegetation types (as needed for modeling, each layer was rescaled to summarize variable per 90 m pixel).
Dataset Name | Data Type | Range of Values | Spatial Resolution | Institutional Source (or citation) |
---|---|---|---|---|
Climate | raster | 125 | 1 km2 |
|
Slope | raster | 89 | 90 m × 90 m | NatureServe, from SRTM digital elevation |
Aspect | raster | 1–360° | 90 m × 90 m | NatureServe, from SRTM digital elevation |
Landform | raster | 11 | 90 m × 90 m | NatureServe, from SRTM digital elevation |
Lithology | raster | 9–40 | 450 m × 450 m |
|
Soils | raster | 259 | 90 m × 90 m | CanVec (Natural Resources Canada) |
Surface Flow Accumulation | raster | 156 | 90 m × 90 m | HydroSHEDS ( |
EarthSat NatureVue Imagery | raster | 3 (0–255 per band) | 150 m × 150 m | ESRI |
Map Samples | raster | 683,119 | 90 m × 90 m |
LANDFIRE, |
Hexagon Grid | vector | 320,561 | 96 km2 | NatureServe, DGGRID, |
We used a sequential mapping process where maps derived for multiple broader levels of the IVC classification hierarchy were then used as input to modeling distributions of types defined at lower hierarchical levels. In this application, the first thematic level for inductive modeling was the IVC Division (Level 4 from Table
Numbers of mapped classification units by region and level of ecological classification (including types with regionally overlapping distributions).
Region | Number of IVC Divisions | Number of IVC Macrogroups |
---|---|---|
Caribbean | 4 | 14 |
Mexico | 32 | 55 |
Central America | 20 | 33 |
South America | 80 | 190 |
Over-prediction of more common (over rare, or low sample size) land cover types is a common source of error in CART-based inductive modeling of land cover (
As noted above, during initial sample data collection from map sources, georeferenced samples of each vegetation type were gathered and set aside for use in map validation. These samples were gathered for types that had existing polygons in regional/local source maps > 10 km2 in size in South America and > 5 hectares for temperate and tropical North America. Of the 315 Macrogroup map classes in North and South America, 284 had sufficient samples to be quantitatively assessed.
Once map edits were finalized for the 90 m products, validation samples were used to score the degree of agreement between samples and map classes for each map class at three spatial scales. Circular buffers around each sample encompassed 1-km2 (within 6 pixels of center) and 5-km2 (within 28 pixels of center). A point sample was defined from the centroid of each pixel of the 6 × 6 neighborhood of the 90 m product and is equivalent to the 270 m version of each map. Overlay of these samples on the final map product generated tabular summaries to determine whether or not the mapped class present matched the type labeled to each sample; i.e., the same types co-occur within the buffered area. While truly independent samples could not be acquired to evaluate a spatial model depicting “potential/historical” extent of these vegetation types, this technique provides one initial measure of map quality, and serves as a primary input to decisions regarding use of the map for type-by-type assessment. Thus, the percentage of agreement between validation samples and maps can indicate the degree of map reliability for use with a practical minimum map unit of 270 m vs. 1 km2 vs. 5 km2.
Table
Summary validation statistics for 284 (315 total mapped) assessed macrogroups in North and South America.
Validation Sample Resolution | No. with 90–100% Agreement | No. with 80–90% Agreement | No. with 70–80% Agreement | No. with 60–70% Agreement | No. with 50–60% Agreement | No. with <50% Agreement |
---|---|---|---|---|---|---|
270 m (point) n = 284 | 8 | 11 | 16 | 39 | 52 | 158 |
*1 km2 n = 131 | 44 | 32 | 14 | 8 | 9 | 12 |
5 km2 n = 284 | 160 | 50 | 23 | 20 | 15 | 16 |
The inclusion of the 1 km2 was limited to the North American portion of the map product for two reasons. First, the sample sizes available for CONUS in North America was substantially higher than that for adjacent countries. Secondly, the inclusion of the 1 km2 allowed the examination of the gradient of model performance over a spatial gradient of neighborhoods. Using the 1-km2 validation sample area in North America only, 44 types scored at 90–100% agreement, 32 types scored 80–90% agreement, 14 types scored 70–80% agreement, 8 types scored 60–70% agreement, and 9 types scored 50–60% agreement. A total of 12 types (10% of all assessed map classes) scored < 50% agreement. For 1 km2 samples, the total sample agreement was 85% and the median level of map class agreement for the types assessed was 88%.
Using the 5 km2 validation sample area, 160 types scored at 90–100% agreement, 50 types scored 80–90% agreement, 23 types scored 70–80% agreement, 20 types scored 60–70% agreement and 15 types scored 50–60% agreement. A total of 16 types (6% of all assessed map classes) scored below 50% agreement. For 5-km2 samples, the total sample agreement was 85% and the median level of map class agreement was 92%.
These results indicate that map reliability is limited on a per pixel basis (at 270 m pixels), but within relatively small clusters of adjacent pixels, the reliability of the map increases for most map classes.
There is scientific value in documenting the location and trends in the extent and condition of ecosystem types to inform public policy and conservation action. These two case studies illustrate what can be accomplished with the systematic application of robust, hierarchically-structured vegetation-based classification and machine learning tools that utilize georeferenced sample locations and robust predictor maps.
The USA LANDFIRE case study illustrates where a deep history of vegetation-based classification and investments in key inputs to mapping (e.g., georeferenced samples, remote sensing data, sophisticated algorithms) can come together to generate standard map products covering more than 9.8 million km2 of U.S. land that are unsurpassed, in terms of spatial and thematic resolution, anywhere in the world. That being said, it also remains clear that mapping at thematic resolutions of the USNVC Group and finer require very large and spatially balanced inputs of georeferenced samples, and even with the extensive prior investments, these remain a key limitation affecting the quality of map outputs. While one can reasonably say that “we know enough” about vegetation types at “mid” scales of the classification hierarchy (e.g., the USNVC Group), sufficient numbers of georeferenced samples that depict the full spectrum of those classification units is lacking across their range of distribution. Efforts such as LANDFIRE provide knowledge of where these gaps exist so that new data collection could maximize its effect on future map iterations.
The NatureServe effort for the Americas - encompassing 22% of the global land surface - demonstrates methods and outputs suitable for worldwide application at continental scales; albeit more challenging in parts of the globe with a more limited history of ecosystem classification and mapping, and more limited availability of predictor layers. Along with this mapping approach, the rich text, tabular, and map data set accompanying that study provide a foundation for deepened analysis and conservation action across the Americas. Continued collection of the input data used in the case studies could enable mapping at these spatial and thematic resolutions around the globe.
Data associated with the LANDFIRE case study are available from www.landfire.gov; Data associated with the NatureServe case study are accessible from: https://transfer.natureserve.org/download/Longterm/Ecosystem_Americas/Maps/
P.J.C. was team member in both case studies, and led the manuscript preparation; J.C.H. developed and implemented map production of the second NatureServe case study; D.D. completed primary mapping tasks in the LANDFIRE case study, and provided manuscript text and review; J.S. coordinated with P.J.C. on classification-related efforts for LANDFIRE case study, and contributed text for the manuscript. All authors critically revised the manuscript.
Work by KBR was performed under USGS contract 140G0121D0001. Any use of trade, firm, or product names is for descriptive purposes only and does not imply endorsement by the U.S. Government. The LANDFIRE portion was funded by U.S. Geological Survey (USGS) Earth Resources Observation and Science (EROS) Center.
Spreadsheet includes listing of classification units, hyperlinks to descriptions, and statistics on each type as represented in map products.