Research Article |
Corresponding author: Lucy H. Wells ( l.h.wells@sms.ed.ac.uk ) Academic editor: Janet Franklin
© 2025 Lucy H. Wells, Kyle G. Dexter, R. Toby Pennington, Ítalo Antônio Cotta Coutinho, Desiree Ramos, Oliver L. Phillips, Tim Baker, Casey M. Ryan.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Wells LH, Dexter KG, Pennington RT, Coutinho ÍAC, Ramos D, Phillips OL, Baker T, Ryan CM (2025) Satellite remote sensing can operationalise the IUCN Global Ecosystem Typology in the biome-diverse north-east of Brazil. Frontiers of Biogeography 18: e145498. https://doi.org/10.21425/fob.18.145498
|
Accurate biome delineation is difficult where biomes occupy the same climatic space, as is the case for tropical dry forest and savanna. The resulting confusion limits our ability to understand and manage impacts of global change on these biomes. To address this, we developed an unsupervised, repeatable method to delineate biomes and their component functional ecosystems, based on landscape-level vegetation structure measured using remote sensing and an understanding of the ecology of the region. This approach contrasts with previous definitions, based on climate differences amongst savanna, dry forest and rain forest.
Using the heterogeneous north-east Brazil, where several biomes interdigitate, as a case study, a hierarchical functional ecosystem classification is proposed that aligns with both the IUCN Global Ecosystem Typology (GET) and previous work. Based on fuzzy clustering of remotely sensed vegetation attributes, seven groups were found, identified as rain forest, cerrado (savanna) and five caatinga vegetation groups. These groups broadly align with the literature, for example, sedimentary and arboreal caatinga. These groups align with three ‘Ecosystem Functional Groups’ (EFGs) described by the IUCN GET and, additionally, suggest there is a new, fourth EFG in the region: non-pyric shrublands. Random Forest models showed soil pH was the most important environmental variable distinguishing these vegetation groups.
These results suggest a remotely sensed structure-based approach is an effective method for operationalising the IUCN GET. North-East Brazil – where many EFGs are interdigitated – serves as a challenging case study and, therefore, we hope our approach will have generality for other regions globally.
There are seven vegetation groups in northeast Brazil, including savanna, rain forest and five types of caatinga.
Most of these vegetation groups align with the IUCN Global Ecosystem Typology 2.0, but non-pyric shrubland (caatinga) vegetation may represent a new Ecosystem Functional Group.
Soil pH is the strongest determinant of vegetation distribution in northeast Brazil.
Remote sensing can provide objective, spatially explicit information on vegetation types in the region, largely consistent with previous vegetation classifications.
Accurate biome mapping is vital for management, as biomes differ in ecosystem function and consequently require different management.
Biome, Brazil, caatinga, IUCN, remote sensing, soil, vegetation structure
Biomes are a key concept in ecology and biogeography and are generally now understood to be vegetation units that occupy a large geographical area across continents and have distinct ecosystem functioning (
Classically, biome definitions and classifications have often been climate-based, with an emphasis on precipitation and temperature to delineate global biomes (
The IUCN have recently designed a global, hierarchical classification system, the Global Ecosystem Typology 2.0 (GET), to provide a global foundation for ecosystem assessments, sustainable management and conservation (
The IUCN GET tries to address many of the critiques of biome classification noted above, presenting a hierarchical approach to global vegetation delimitation which emphasises the processes that shape ecosystem properties and the interactions between processes and vegetation form. It aims to meet six criteria: 1) incorporate ecosystem functions and ecological processes; 2) encapsulate characteristic biota of ecosystems; 3) conceptual consistency at the global scale; 4) scalability; 5) provide spatially explicit units; and 6) parsimony (
Here, we identify vegetation structural groups using remotely-sensed metrics that describe ‘vegetation expression’. This is an unsupervised, bottom-up approach that does not make any a priori assumptions about what vegetation types are found in the region (see
We focus on NE Brazil because of its well-known heterogeneity of vegetation types and biomes, which has made it a useful case study to test biome conceptualisations and mapping in previous work (
The study was undertaken in NE Brazil, including the edges of the Amazon and Atlantic Rain Forest regions and parts of the cerrado savanna, but consisting primarily of the Caatinga Region. In the scientific literature, “caatinga” refers to both the region and various types of vegetation. In the Brazilian lexicon, the Atlantic Forest, Amazon Forest, Cerrado and Caatinga were previously referred to as Domains and are now referred to as Biomes (
NE Brazil has globally high levels of species endemism and floristic compositional heterogeneity (
The characteristics of tropical biomes in terms of attributes describing vegetation expression were identified via a literature review of all biomes potentially present within the region of interest. Remote sensing products were identified and obtained that could describe a subset of these attributes (Table
Sources of vegetation attribute data used in the clustering analysis and justification for their inclusion.
Vegetation Attribute | Data Source | Justification |
---|---|---|
Above ground woody biomass density (AGB, Mg/ha) | Globbiomass AGB data product for the year 2010 (+/- 1 year) ( |
Above ground woody biomass is highly correlated to variation in caatinga physiognomies ( |
AGB Heterogeneity | Coefficient of Variation (CV) of the Globbiomass AGB product ( |
Captures spatial variation, to help distinguish areas which may have uniform biomass in comparison to those with high heterogeneity. This may capture distinct vegetation formations like cerrado savanna, which have high variability in tree cover. |
Seasonality | Calculated from NDVI data from MODIS MOD13A1 V6.1 product, between 10/06/2000 and 10/06/2022 ( |
Deciduousness of caatinga is conspicuous, with leaf flushing at the start of the wet season, but there is phenological variation across caatinga vegetation. For example, caatinga on sedimentary soils is less strongly influenced by rainfall ( |
Wet Season Leaf Area Index (LAIwet). LAI represents half the total area of green elements of the canopy per unit horizontal ground area. This incorporates all the canopy layers, including the understorey. | Data from the Copernicus Global Land Service, over the period 2000–2020, at 1 km resolution. December to mid-June were considered to be the wet season. All images within these months were used to make a mean wet season LAI data layer. | Inclusion of LAI in different seasons aimed to capture both bulk difference in total leaf area and responses to seasonal conditions. |
Dry Season Leaf Area Index (LAIdry) | As above, but for mid-June to November. | This layer aimed to represent the LAI in the driest part of the seasonal cycle of each pixel. |
Fire Count (no. fires in 21 years) | Calculated from burned area data from the MODIS MCD64A1.061 product between 01/01/2001 and 11/01/2021, with the raw resolution of 500 m ( |
A proxy for the presence of a grassy underlayer, which is otherwise difficult to determine directly using remote sensing. It is assumed areas with a grassy understorey represent a regime with fire disturbance. This also aims to reduce the reliance upon tree structure in defining dry tropical biomes, which should reduce misclassification of grassy biomes as forest ( |
Canopy Height (m) | Global canopy height product for 2019. The product was made by interpolating GEDI LiDAR estimates of canopy height (RH95) with 2019 Landsat analysis-ready time-series data. The data was accessed via Google Earth Engine ( |
|
All data were resampled to 1 km resolution using bilinear interpolation, calculated using the ’resample’ function in the R package ‘raster’ (
Exclusion of non-natural vegetation is important to prevent conflation of human-dominated areas with naturally different structural vegetation groups. We thus removed areas with non-natural land covers using the MapBiomas land cover product for 2015 (along with water covered areas; MapBiomas 2022). However, there remains potential for areas with some human modification to be present in the analysis and, for land use change to have occurred over the time period examined (2000–2022) – see Suppl. material
Fuzzy spatial clustering was used to investigate the degree to which pixels belonged to multiple groups. This was implemented by using the geocmeans package in R, version 0.3.3 (
To understand the determinants of each cluster, hypothesised explanatory variables were selected that have previously been used to explain biome distribution in NE Brazil (e.g.
The IUCN GET identifies five types of ecological drivers of biome distribution, three of which are encapsulated in our random forest analysis as variables ecologically relevant to NE Brazil (resource drivers, ambient environment and human activities). Given that only a few drivers shape the properties of ecosystems (
Random forest classification (
We reviewed the literature to identify the vegetation types that we expected the classification to detect. In order to describe the vegetation groups found by the clustering, these structural descriptions of differences between vegetation groups were cross referenced with additional sources of information: i) previous descriptions of the geographical distribution of the vegetation types compared to our groups (Fig.
Comparison of the vegetation structural groups identified by c-means clustering of vegetation attributes (A) with the IUCN Global Ecosystem Typology 2.0 Ecosystem Functional Groups in NE Brazil (B). The maps of the EFGs in B is produced by the aggregation of the structural groups in A, as per the lower panel, rather than the GET indicative maps. Scrubby caatinga, heterogeneous caatinga and hyper-seasonal caatinga represent an additional EFG not yet described by the IUCN GET which we term ‘non-pyric shrublands’ as they consist of vegetation that does not burn, but which structurally resembles a shrubland or short-statured forest rather than a tall canopy, closed forest.
Attributes of the vegetation in each group in NE Brazil. The scaled mean vegetation attribute for each vegetation group is shown, as a percentage of the maximum of each vegetation attribute. The vegetation attributes were (clockwise from top) aboveground woody biomass, aboveground woody biomass heterogeneity (bio.cv), fire frequency (burn), mean Leaf area index (LAI) in the dry season (LAI.dry), mean LAI in the wet season (LAI.wet), NDVI seasonality (seas) and canopy height (height).
The structural groups identified in the clustering were compared to the descriptions of EFGs in the IUCN GET. We focused on ‘T1.1 Tropical subtropical lowland rain forests’, ‘T1.2 Tropical subtropical dry forests and thickets’, ‘T3.1 Seasonally dry tropical shrublands’ and ‘T4.2 Pyric tussock savannas’. We considered the ecological equivalence between our vegetation groups and the IUCN EFG descriptions, with a particular focus upon the links between vegetation structure and ecosystem processes emphasised in each EFG. Furthermore, we considered the results of IBGE’s analysis of EFGs in Brazil (
The silhouette index, explained inertia and Xie-Beni indices gave varying results for the optimum values of k, the number of clusters (Suppl. material
We also conducted the clustering analysis using a hard clustering approach, but found little difference between fuzzy and hard clustering results. In addition, we also tested whether using PCAs of the vegetation attributes as the clustering input altered our results and found that it did not. Details of these analyses are found in Suppl. material
The seven groups are mapped in Fig.
Random forest classification was highly accurate at predicting cluster membership, based on the predictors listed in section Random forest models to quantify the determinants of vegetation groups. The random forest model had an accuracy value of 81% for the held-out ‘test’ data set. The per group error rate from the random forest confusion matrix was below 24% for all groups, except the unclassified group, which had an error rate of 95%. Based on the importance values (which quantify the mean decrease in Gini coefficient when that variable is randomised; Fig.
The partial dependence plots (PDPs; Fig.
The hyper-seasonal caatinga group generally was more sensitive to environmental variables compared to heterogeneous caatinga, although they exhibit similar trends. This could suggest the hyper-seasonal caatinga group has stricter environmental requirements and is less likely to be found in unfavourable conditions.
Variable importance plot illustrating the importance of explanatory variables in determining the structural groups. Values indicate the mean decrease in Gini Index after removal of each variable from the random forest classification. Shapes indicate the type of variable; cross = soil variables, circle = climatic variables, square = human-related variables, triangle = geology.
Partial dependence plots showing the effect of the three most important variables predicating vegetation type in northeast Brazil. A. Soil pH; B. Precipitation in the wettest month; C. Precipitation seasonality (coefficient of variation of monthly precipitation). The y axis shows the log odds of a pixel being in each vegetation type, given varying levels of the focal predictor variable, with all other predictors held at the mean of the dataset.
Our review of the literature identified nine vegetation groups we would expect to find in this region: rain forest, cerrado savanna, restinga, campos rupestres, cerradão and four groups of caatinga vegetation: crystalline, sedimentary, karst and arboreal (Fig.
Comparison of vegetation groups in NE Brazil according to the literature and structural groups identified in this work by unsupervised classification of vegetation attributes. Solid lines indicate clearly identified parallels between the literature definition and our structural groups and dotted lines identify more tenuous relationships between the structural groups and literature definitions.
We found the following links between the previous descriptions of vegetation in NE Brazil and our vegetation structural groups. One group clearly represented rainforest (of both the Atlantic and Amazon Regions). Cerrado and campos rupestres were represented together in the cerrado group and the remaining five groups represented different formations of caatinga. These caatinga groups were named arboreal caatinga, scrubby caatinga, transition caatinga, hyper-seasonal caatinga and heterogeneous caatinga. Arboreal caatinga is equivalent to the caatinga type of the same name as described in literature. Scrubby caatinga is equivalent to sedimentary caatinga and restinga. The hyper-seasonal caatinga and heterogeneous caatinga may collectively comprise the crystalline caatinga, given their geographical distribution, characteristics of high seasonality and shallow soils with high soil pH, which are typical of crystalline caatinga (
Detailed descriptions of the vegetation groups can be found in Suppl. material
What we term scrubby caatinga broadly matches the description and geographic distribution of sedimentary caatinga and restinga in past work (
This group was identified as a distinct type of caatinga in this analysis and as part of the crystalline caatinga when compared to literature. Hyper-seasonal caatinga was the most frequent caatinga vegetation group found in the study region, comprising 21% of the clustered area. Hyper-seasonal caatinga had spatial distribution in the central north-east of the study area, within the core Caatinga Region (
This group aligns as a subset of the crystalline caatinga, along with hyper-seasonal caatinga. We describe this group as heterogeneous caatinga because it has low, but spatially heterogeneous biomass (biomass cv: 1.4 ± 0.5), likely due to intermittent large trees or patchy tree distribution within the 1 km2 grids used. This may suggest that this group represents human-disturbed caatinga. It is located alongside areas masked as ‘non-natural’ land cover. As human disturbance also exists to some degree in ‘natural’ land covers, heterogeneous caatinga may represent a human – disturbed form of caatinga.
Arboreal caatinga is often found in areas of transition between biomes. Structurally, it could be described as ‘tall caatinga’, with a canopy of intermediate height between other caatingas and rain forest (
The fifth group is similar to arboreal caatinga as it is forest-like, having an intermediate biomass (56 ± 14 Mg/ha) and biomass heterogeneity (0.3 ± 0.1). It also has taller canopy height (4.7 ± 1.0 m), similar to arboreal caatinga and cerrado, as opposed to the other caatinga groups. However, its NDVI seasonality (0.7 ± 0.2) is high, being just slightly less than hyper-seasonal caatinga (1.0 ± 0.1) (Fig.
The cerrado is the main extent of savanna in Latin America, found towards the south and west of the study region. This vegetation group was identified as cerrado primarily because it is the only group which experiences substantial fire, suggesting the presence of a flammable grassy layer. Pixels experienced fire on average 1.3 ± 0.4 times from 2001–2021. The group is distributed in the west of the study region, with a significant patch in central Bahia, within the Chapada Diamantina (Fig.
This group has high values of all the vegetation abundance attributes (biomass of 74 ± 13 Mg/ha and canopy height of 13 ± 3.5 m), a mostly evergreen canopy with the lowest NDVI seasonality (0.15 ± 0.05) (Fig.
A small number of pixels were not aligned to any structural group; 0.24% of the clustered area. These pixels displayed medium values of the vegetation attributes and low value for fire (Suppl. material
The vegetation groups were assigned to the IUCN GET EFG groups present in the study region by comparing the descriptions of the EFGs (
This group corresponds to EFG ‘T1.1 Tropical and Subtropical Lowland Rainforest’, due to its similarities in terms of ecological traits to the IUCN description – such as high biomass and LAI and absence of grasses – and the drivers including less seasonal precipitation and moist soil (
This group corresponds to the ‘T4.2 Pyric tussock savannas’ EFG, which is described as dominated by C4 grasses, with variable tree cover and sub-decadal fire regimes (
Arboreal caatinga and transition caatinga have a forest-like structure, with a higher biomass and taller canopy than the three other caatinga groups we found. Therefore, we have placed these groups in the ‘T1.2 Tropical and subtropical dry forests and thickets’ EFG (
In contrast, scrubby, hyper-seasonal and heterogeneous caatinga do not easily align to an EFG within the GET. They structurally align with the GET description of ‘T3.1 Seasonally dry tropical shrublands’ (low open forests less than 6 m tall, with spatially heterogeneous vegetation (
We find seven structural vegetation groups, which overall match well with previous descriptions of the vegetation of the region. Broadly, these mappable, locally relevant vegetation groups can be classified within three IUCN EFGs or biomes: T1.2 Tropical subtropical dry forests and thickets, T4.2 Pyric tussock savannas and T1.1 Tropical-subtropical lowland rainforests. However, we also find three vegetation groups that are similar to each other, but not well aligned with the GET. The attributes of these three groups may indicate a new IUCN GET EFG category is required. The distribution of all the vegetation groups is well explained by our hypothesised drivers, with an accuracy of 81% for predicting the test dataset. Overall vegetation group appears mostly to be determined by soil pH, with only a secondary role for climate, highlighting how these biomes do not fit with a climatically deterministic view of vegetation classification (e.g. the classic Holdridge Life Zone system (
The clustering method identifies seven ecologically meaningful groups with different physiognomies. These structural groups align with previously recognised vegetation groups in the region – categorised using a variety of methods including floristic classifications (
The number of groups identified is suitable for capturing the heterogeneity of the region without being overly complex for interpretation at a regional scale. For further discussion of the effectiveness of our method, see Suppl. material
Random forest classification showed that soil pH was the most important determinant of vegetation structural groups. Precipitation in the wettest month and precipitation seasonality were the next most important variables. These results broadly agree with the argument that climate is not the main determinant of vegetation type in NE Brazil and align with the findings of several authors who describe edaphic factors as important drivers of vegetation in NE Brazil (
The importance of soil pH as a determinant of vegetation type, may be due to its relationship with soil fertility. Hyper-seasonal caatinga and heterogeneous caatinga (collectively comprising crystalline caatinga) were associated with higher soil pH (Fig.
Variables describing precipitation were the most important climatic determinants of vegetation structural group. This makes sense in a dry region where water availability is important for plants and where elevational variation is relatively limited. The IUCN EFG descriptions for the groups present in the region all describe water availability as an important driver for these groups (
A difficulty in the analysis is that SoilGrids data do not include some relevant variables, including phosphorus and aluminium content which are known to be important determinants of tree species composition (
We have identified both coherence and challenges with the existing IUCN’s GET EFGs, with the key issues surrounding the complexities of classifying dry forest, thicket and shrubland. We find that the transition caatinga and arboreal caatinga groups align well with the ‘T1.2 Tropical subtropical dry forests & thickets’ EFG, a finding supported by
In the IUCN GET, dry forest and thickets are described as closed canopy forest, with seasonally high LAI and deciduous or semi-deciduous phenology and absence of grasses (
Scaled mean vegetation structural attributes for the IUCN Global Ecosystem Typology’s Ecosystem Functional Groups in NE Brazil. Axes show the percentage for that structural vegetation group of the maximum data point for each vegetation attribute. The vegetation attributes were (clockwise from top) aboveground woody biomass, canopy height (height), mean Leaf area index (LAI) in the dry season, mean Leaf area index (LAI) in the wet season, burn count (burn), aboveground woody biomass heterogeneity (bio.cv) and NDVI seasonality (seas). Scrubby caatinga, heterogeneous caatinga and hyper-seasonal caatinga are placed in the Non-pyric shrublands group (blue). Arboreal caatinga and transition caatinga are placed in T1.2 Tropical-subtropical Dry Forests EFG (red) and the cerrado and rain forest structural groups in T4.2 Pyric tussock savannas (yellow) and T1.1 Tropical-subtropical Lowland rain forests (green) EFGs, respectively.
Campos rupestres is a vegetation type in eastern Brazil that does align closely with EFG T3.1 (shrublands). This vegetation type has a different floristic and functional composition from the vegetation in our three caatinga groups. Most importantly, plant species in campos rupestres show many fire-resistance and fire-tolerance traits (
Given the discussion above, we suggest a new EFG should be incorporated into the GET to describe scrubby caatinga, hyper-seasonal caatinga and heterogeneous caatinga. However, it is beyond the scope of this paper to definitively determine the name of this new EFG and its location within the higher biome level of the Typology, given the regional nature of this study in comparison to the international scope of the IUCN Typology. Solely from our NE Brazil perspective, we suggest that the new EFG should be ‘T1.5 Tropical-subtropical non-pyric thickets and shrublands’. This is because of the lack of fire and the short stature, but closed canopy of our caatinga groups (Fig.
The new EFG which we propose would occupy the drier end of the seasonally dry tropical forest spectrum (sensu
Overall, we find that the remote sensing-driven approach which we develop here can be used to identify structural groups in a complex region. Using an understanding of the drivers of EFG distribution including disturbance regime, soil and climatic factors, the structural groups from an unsupervised clustering can largely be grouped into EFGs as described by the IUCN GET. Therefore, this approach is suitable for fulfilling the fifth criterion of the GET (
This work, therefore, adds to the growing body of work implementing the IUCN GET in a range of ecosystems (
There is ongoing work within the IUCN assessing risks to ecosystems, through the Red List of Ecosystems, to help conservation prioritisation (
This research has been supported by the Natural Environment Research Council (NERC) through a SENSE CDT studentship (grant no. NE/T00939X/1) and SECO: Resolving the Current and Future Carbon Dynamics of the Dry Tropics (grant no. NE/T01279X/1). DR was supported by FAPESP (grants #2017/17380-1 and #2022/02323-0). Thanks to Samuel Bowers for his help in writing Google Earth Engine code used for developing the seasonality and burn count data layers. This publication has been prepared using European Union’s Copernicus Land Monitoring Service information; https://land.copernicus.eu/en/products/vegetation/leaf-area-index-v2-0-1km. We also thank the editor and three reviewers for their valuable comments which resulted in a much improved analysis and manuscript.
Lucy Wells: Conceptualisation, Methodology, Analysis, Interpretation, Writing – original draft, review and editing, Visualisation. Kyle Dexter: Conceptualisation, Methodology, Interpretation, Writing – review and editing. Toby Pennington: Conceptualisation, Interpretation, Writing – review and editing. Ítalo Coutinho: Fieldwork, Writing – review. Desiree Ramos: Fieldwork, Writing – review. Oliver Phillips: Conceptualisation, Writing – review. Tim Baker: Conceptualisation. Casey Ryan: Conceptualisation, Methodology, Analysis, Interpretation, Writing – review and editing.
The data is shared via the University of Edinburgh Open Access DataShare platform, with the DOI of: https://doi.org/10.7488/ds/7879.
Suppl. appendices S1–S4, figures S1–S5, tables S1–S8 (.docx)