Long Database Report |
Corresponding author: Gwendolyn Peyre ( gf.peyre@uniandes.edu.co ) Academic editor: Melisa A. Giorgis
© 2022 Gwendolyn Peyre, Daniel Montesinos, Daniela Giraldo, Antonio Galán de Mera, Barbara Ruthsatz, Federico Luebert, Marcela Ontivero, Nicolas García, Miguel Álvarez, Rosa Isela Meneses, Pablo Lozano, Daniela León, Maximilian Weigend, Fabien Anthelme, Milagro Palma, Carolina Rodriguez.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Peyre G, Montesinos D, Giraldo D, Galán de Mera A, Ruthsatz B, Luebert F, Ontivero M, García N, Álvarez M, Meneses RI, Lozano P, León D, Weigend M, Anthelme F, Palma M, Rodriguez C (2022) VegAndes: the vegetation database for the Latin American highlands. Vegetation Classification and Survey 3: 287-296. https://doi.org/10.3897/VCS.95750
|
In the era of Big Data, Latin American countries and biomes remain underrepresented. To remediate this issue, promoting repositories for biodiversity data focused on Latin America is a main priority. VegAndes -Dpt the vegetation database for the Latin American highlands (GIVD: SA-00-005), is a novel dataset for georeferenced and standardized information on vascular pants in the region. The database compiles 5,340 vegetation plots sampled above the montane treeline and below the permanent snowline in 11 Latin American countries and spanning over seven decades. VegAndes currently encompasses 5,804 taxon names, corresponding to 3,858 accepted names, as well as 136 syntaxon names. The database is nested within a scientific consortium of Latin American experts on highland vegetation and piloted from the University of the Andes (Colombia). Because the VegAndes data can support multi-scale studies in botany, ecology and biogeography, the database makes an essential contribution to biodiversity research and management perspectives in Latin America.
Taxonomic reference: TROPICOS (preferential source, www.tropicos.org/), World Flora Online (secondary source, www.worldfloraonline.org/).
database, flora, highland, Mesoamerica, phytosociology, plot data, South America, vegetation
Mountains only encompass 13% of the global land’s surface, yet they disproportionally contribute to global biodiversity and provide key ecosystem services to billions (
With 550,000 km2 of highlands, calculated as the surface above the potential treeline (
Scientific interest for Latin American mountain ecology and biogeography, has quickly increased over the last decades, mostly because they i) contribute to more than half of the 85,000 species pool that conform the Latin American vascular flora (
In this study, we present VegAndes, the vegetation database for the Latin American highlands, a novel dataset to address the dramatic data-gap in vegetation surveys in Latin American mountains and provide strong bases for future studies in plant ecology and biogeography. The VegAndes initiative emerges from consortium discussions involving expert botanists and vegetation scientists dedicated to highland research in Latin America. It builds on local and regional efforts, and compiles vegetation plot information that is methodologically and taxonomically standardized into a single repository housed at the University of the Andes in Bogotá (Colombia). Both the consortium and dataset are expected to grow with regards to data volume and outreach in the short to mid-term, with the publication of the VegAndes webpage.
The study area encompasses the highlands of all Latin American mountains, from Mexico to Chile and Argentina, with a focus on the Andes (Figure
In the north, Mesoamerican highlands are distributed over the Mexican transition zone and Central American mountain ranges down to the Panama Isthmus (Lat 20°N–11°N). They are often isolated and encompass a variety of substrates and climates. For instance, Mexican mountains are mainly volcanic and are subject to strong temperature seasonality, whereas Panama and Costa Rica mountains are principally sedimentary and experience from overall low seasonality and humid climates (
On the altitudinal gradient, Latin American highlands typically encompass the subalpine (supratropical) belt above the timberline, the alpine (orotropical) belt, subnival belt (cryotropical) and nival belt (ice-athermic). However, each belt’s surface and elevation range vary substantially with latitude and position within a mountain range. A recent study on alpine areas worldwide places the potential treeline roughly at a minimum of 1,000–1,500 m a.s.l. in the Patagonian Andes and a maximum of 3,500–4,000 m a.s.l. near the equator (
Topographic map of Latin America, with focus on mountain areas and their division into four main biogeographical provinces, Mesoamerica (photo: Pico de Orizaba, Mexico, credit G.P.), Northern Andes (photo: Nevado Santa Isabel, Colombia, credit G.P.), Central Andes (photo: Macusani, Peru, credit A.G.M.), and Southern Andes. The latter province can be divided into two main regions, the Mediterranean Andes (photo: Tacora, Chile, credit F.L.) and Patagonian Andes (photo: El Calafate, Argentina, credit A.G.M.). The map indicates names for those countries with highlands covered by the VegAndes vegetation data.
The database structure was programmed in SQL, following VegPáramo (
The Primary table contains the plot data, with unique values to represent species occurrence and/or cover within plots. The species list uses the initial taxon names provided by the plot authors as well as a taxon ID assigned to each name. Non-vascular plants were excluded from the datasets. All non-identified taxa above the genus level were also discarded, and those at or below the genus level were included and assigned a codified taxon ID including the source name and unique number. Useful particles to assess the certainty of taxonomic determination such as cf., aff. and gr., were recorded. All names will subsequently be updated pending future taxonomic determinations. VegAndes allows for presence-absence data as well as several cover-scales: the
The Geo-environmental table resumes the geographic and environmental information recorded for the VegAndes plots. It contains the plot surface area (m2), elevation (m a.s.l), slope (°) and aspect. It also includes the plot geolocation, both as decimal coordinates and UTM grid cell (in Datum WGS84). Whenever a plot is precisely georeferenced, with a ≤ 10 m error as provided by modern GPS, the decimal coordinates are considered exact. Nevertheless, if the original plot georeferencing is imprecise (10 m–10 km), then its decimal coordinates refer to the centroid of the smallest resolution UTM grid cell available. Plots with very imprecise georeferencing, above 10 km, were not registered into VegAndes. Finally, the table includes the plot’s precise location (often a toponym), municipality, region and country, as well as optional observations on terrain or additional features.
The Vegetation-structure table reports all attributes on plot physiognomy and plant community labels. It includes information on the total, vascular and non-vascular vegetation covers (in %), bare soil (in %), as well as height (in cm) and cover (in %) for the following vegetation strata (if applicable): trees, high and low shrubs, herbs and prostrate strata. Furthermore, this table accounts for three vegetation classification systems. First, it offers a simple classification into azonal (azo) and zonal (zo) vegetation, the latter with their respective altitudinal belt as subalpine (zo-sub), alpine (zo-alp) and subnival-nival (zo-niv). Second, the table records the syntaxon name that was initially given by the plot’s authors and a syntaxon ID assigned to each name, which can be further checked by the Syntaxon list (see details below). Last, it shows the plant community informal name, combining structural qualifiers and the name of up to two dominant and/or diagnostic plant species, for example “Bunchgrassland with Espeletia grandiflora and Eryngium humboldtii”.
The Data-source table contains the information related to the primary or secondary (when applicable) source of the vegetation plot data. Both published and unpublished sources are considered, but unpublished ones, such as private datasets and academic theses, were carefully and expertly revised beforehand. The table includes the source’s full author list, publication date, sampling date, title, and journal/book complete reference. Currently, 53 data-sources are included in VegAndes (86 when counting VegPáramo’s multi-source input), of which 11 are unpublished datasets (see Suppl. material
Finally, VegAndes presents two additional secondary tables in the form of the Taxon list and Syntaxon list that are not connected to the Primary table by the VegAndes plot ID, but are instead meant to check name validity for taxa, via the taxon ID (Primary table), and syntaxa, through the syntaxon ID (Vegetation-structure table), respectively. The Taxon list was built based on the initial names contained in the Primary table. The global botanic platforms TROPICOS (www.tropicos.org) and World Flora Online (www.worldfloraonline.org) were used as preferential and secondary sources respectively to assess each name’s validity. Whenever a taxon is classified as doubtful or unresolved by both platforms, the initial name is retained. Similarly, hybrid taxa are kept as such and considered as accepted names. The Taxon list also includes family names and higher clades to allow for easy data-searches in: (i) Angiosperms, including Magnoliids, Monocots and Eudicots; (ii) Gymnosperms; (iii) Lycophytes; (iv) Monilophytes. It is expected that the Taxon list will undergo annual synonymy revisions and grow as new data is added to VegAndes. The Syntaxon list is in its initial stages and compiles all existing information on syntaxonomy as displayed in the VegAndes data-sources. As a result, this table contains hierarchical syntaxon terms for classes, orders, alliances and associations. However, there has been no revision of the complete classification to date, so it is advised to handle this information as highly provisional. At present, 5,804 taxon names, corresponding to 3,858 accepted names, as well as 136 syntaxon names are recorded in the VegAndes lists.
VegAndes is registered under Global Index of Vegetation-Plot Database (GIVD) – ID: SA-00-005 and it currently contains 5,340 vegetation plots. The temporal distribution of the data includes the 1950–2022 period, with 75% of the sampling taking place after the year 2000 (Suppl. material
The VegAndes plots are spread over the Latin American highlands from Lat. 20°N to 39°S in a fairly uniform manner, except for the Patagonian Andes region in the Southern Andes (Table
Plot contributions (as absolute numbers and percentages) from each country and biogeographical province to the VegAndes dataset.
Countries | Mesoamerica | Northern Andes | Central Andes | Southern Andes | Total |
---|---|---|---|---|---|
Mexico | 94 | 94 (1.8%) | |||
Guatemala | 18 | 18 (0.3%) | |||
Costa Rica | 1 | 1 (0.02%) | |||
Panama | 8 | 8 (0.1%) | |||
Venezuela | 312 | 312 (5.8%) | |||
Colombia | 1748 | 1748 (32.7%) | |||
Ecuador | 1023 | 1023 (19.2%) | |||
Peru | 57 | 810 | 867 (16.2%) | ||
Bolivia | 270 | 270 (5.1%) | |||
Chile | 121 | 169 | 290 (5.4%) | ||
Argentina | 709 | 709 (13.4%) | |||
Total | 121 (2.3%) | 3140 (58.8%) | 1201 (22.5%) | 878 (16.4%) |
Distribution of the VegAndes plots in Latin America. (A) Plot density in 100 km × 100 km grid cells throughout the Latin American highlands, and position of the main biogeographical transitions. (B) Number of plots along the latitudinal and altitudinal gradient (colors represent the four main biogeographical provinces, from North to South: Mesoamerica, Northern Andes, Central Andes and Southern Andes).
Most VegAndes data comes from the Northern Andes (almost 60% of the plot total) and especially from Colombia and Ecuador (together summing up to 52%). The Central Andes come second with almost 23% of the plot total, essentially from Peru (16%). Certain areas contribute disproportionately to the dataset, for instance the Nevados sector in Colombia (Lat. 4.5°N – Long. 75.5°W) with 472 plots in 10,000 km2, and to lesser extent the Moquegua mountains in Peru (Lat. 16.5°S – Long. 70.5°W) with 385 plots in 10,000 km2. Differences in sampling efforts are usually due to expert location and ease of access. For example, the guerrilla conflict in Colombia substantially limited fieldwork in certain mountain areas over the last decades, many of which have only become accessible after the peace treaty of 2016 (
From an altitudinal standpoint, the VegAndes plot data can serve as simple proxy and give approximations about the current timberline position and highland elevation range throughout Latin America (Figure
VegAndes contains 3,858 vascular plant species across 151 plant families, which in turn account for (i) 85% Angiosperms, including 67% Eudicots, 15% Monocots and 3% Magnoliids; (ii) 2% Gymnosperms; (iii) 2% Lycophytes; and (iv) 11% Monilophytes. According to the VegAndes data, dominant families in the Latin American highlands included Asteraceae, Poaceae, Orchidaceae, Fabaceae and Cyperaceae, summing up 40% of the total floristic inventory (Figure
A comparison of species richness patterns of the Latin American highlands based on VegAndes data shows complex patterns (Figure
Distribution of the families of vascular plants in the VegAndes database. (A) Relative contribution of the dominant ten families to total species richness according to the VegAndes data (in blue) and the global vascular flora (species numbers obtained from the World Flora Online, www.worldfloraonline.org, accessed July 2022, in yellow). (B) Family richness in 100 × 100 km grid cells throughout the Latin American highlands.
With the emergence of global vegetation initiatives covering wide spatio-temporal frames, such as sPlot, the global vegetation database (
G.P. planned the research, all authors contributed the data. G.P. and D.G. compiled the data, G.P. built the database and led the writing, while all authors critically revised the manuscript.
We would like to thank the LMI-BioInca initiative for funding D.G and C.R. We are also grateful to Viviana Camelo, Sophia Lopez, and Nicolas Salcedo for their help with data handling.
Summary of the 53 data sources included in the VegAndes database and their plot content (*.pdf)
Temporal distribution of the vegetation plots contained in VegAndes (5,340 plots) based on their source (publication year or dataset description). Plot numbers are expressed as independent values (gray) as well as cumulative values (orange) per decade (*.pdf)