Research Article |
Corresponding author: Neda Bihamta Toosi ( n.bihamtaitoosi@na.iut.ac.ir ) Academic editor: Javier Martínez-López
© 2023 Ali Reza Soffianian, Neda Bihamta Toosi, Ali Asgarian, Hervé Regnauld, Sima Fakheran, Lars T. Waser.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Soffianian AR, Toosi NB, Asgarian A, Regnauld H, Fakheran S, Waser LT (2023) Evaluating resampled and fused Sentinel-2 data and machine-learning algorithms for mangrove mapping in the northern coast of Qeshm island, Iran. Nature Conservation 52: 1-22. https://doi.org/10.3897/natureconservation.52.89639
|
Mangrove forests, as an essential component of the coastal zones in tropical and subtropical areas, provide a wide range of goods and ecosystem services that play a vital role in ecology. Mangroves are globally threatened, disappearing, and degraded. Consequently, knowledge on mangroves distribution and change is important for effective conservation and making protection policies. Developing remote sensing data and classification methods have proven to be suitable tools for mapping mangrove forests over a regional scale. Here, we scrutinized and compared the performance of pixel-based and object-based methods under Support Vector Machine (SVM) and Random Forest (RF) algorithms in mapping a mangrove ecosystem into four main classes (Mangrove tree, mudflat, water, and sand spit) using resampled and fused Sentinel-2 images. Additionally, landscape metrics were used to identify the differences between spatial patterns obtained from different classification methods. Results showed that pixel-based classifications were influenced heavily by the effect of salt and pepper noise, whereas in object-based classifications, boundaries of land use land cover (LULC) polygons were smoother and visually more appealing. Object-based classifications, with an excellent level of kappa, distinguished mudflat and sand spit from each other and from mangrove better than the pixel-based classifications which obtained a fair-to-good level of kappa. RF and SVM performed differently under comparable circumstances. The results of landscape metrics comparison presented that the classification methods can be affected on quantifying area and size metrics. Although the results supported the idea that fused Sentinel images may provide better results in mangrove LULC classification, further research needs to develop and evaluate various image fusion approaches to make use of all Sentinel’s fine resolution images. Our results on the mapping of mangrove ecosystems can contribute to the improvement of management and conservation strategies for these ecosystems being impacted by human activities.
image fusion, landscape metric, mangroves, object-based classification, Sentinel-2
Mangroves offer a considerable array of ecosystem goods and services including habitat and nursery for plant and animal, biodiversity, water quality maintenance, storm buffering, flood and flow control, fisheries, recreation, tourism, and so forth (
As our literature review on English articles published since 2011 (Fig.
Furthermore, the algorithm by which a classifier is developed is also another key factor affecting the classification results. These algorithms are divided into parametric and nonparametric groups. Nonparametric methods allow training data to more robustly participate in the process of image classification and have a higher potency than nonparametric ones (
Differences in spatial patterns identified in thematic maps affect the results of land use metrics (
The study area of this research encompasses part of mangrove ecosystems located in the northwest of Qeshm Island in the Persian Gulf, Iran. This island is situated 20 km from the Bandar-Abbas Port in the Strait of Hormuz (Hormozgan Province) and spans over 26°40'N–27°00'N longitude and 55°20'E–55°50'E latitude. With an area of approximately 1490 km2, Qeshm is the largest island in the Persian Gulf (2.5 times larger than the second biggest Island: Bahrain Country). About 90% of Iranian mangrove forests are located in the margins of the island’s northwestern estuaries (
The workflow of this study is designed to utilize two Sentinel-2 MIS images (table 1). The onboard MSI consists of 13 spectral bands with four bands at 10 m, six bands at 20 m and three bands at 60 m spatial resolution (
Spatial Resolution (m) | Band Number | S2A | S2B | ||
---|---|---|---|---|---|
Central Wavelength (nm) | Bandwidth (nm) | Central Wavelength (nm) | Bandwidth (nm) | ||
10 | 2 | 496.6 | 98 | 492.1 | 98 |
3 | 560.0 | 45 | 559 | 46 | |
4 | 664.5 | 38 | 665 | 39 | |
8 | 835.1 | 145 | 833 | 133 | |
20 | 5 | 703.9 | 19 | 703.8 | 20 |
6 | 740.2 | 18 | 739.1 | 18 | |
7 | 782.5 | 28 | 779.7 | 28 | |
8a | 864.8 | 33 | 864 | 32 | |
11 | 1613.7 | 143 | 1610.4 | 141 | |
12 | 2202.4 | 242 | 2185.7 | 238 | |
60 | 1 | 443.9 | 27 | 442.3 | 45 |
9 | 945.0 | 26 | 943.2 | 27 | |
10 | 1373.5 | 75 | 1376.9 | 76 |
One Sentinel-2 MIS image acquired at the time of the highest tidal level (December 10, 2017) was used to produce a radiometric mask distinguishing the study area from the surrounding bare lands and the other one obtained at the time of the lowest tidal level (November 15, 2017) was used for mangrove LULC classification. In this study 10 m and 20 m resolution bands were utilized for classification.
In addition to imagery data, a total of 170 GPS reference points (Garmin 629sc) were obtained close to the image acquisition time to evaluate the classification accuracy during the field survey. In doing so, a special attention was given to the positional error which was frequently regarded as one of the main sources of uncertainty in assessing the classification accuracy (
To enhance the quality of images and perform more accurate image classifications, atmospheric and radiometric correction was conducted by the Fast Line-of-sight Atmospheric Analysis of Hypercubes (FLAASH) module in ENVI 5.4. This module relies on MODTRAN (Moderate resolution atmospheric Transmission) which is a radiative atmospheric propagation model for the spectral range of 200–100,000 nm (
(1)
This task enhances the quality and interpretation capability of the images and, in turn, may produce more accurate classification results (
In this study, the method of
NDWI (see section 2.3.1), Normalized Difference Vegetation Index (NDVI), and Normalized Difference Built-up Index (NDBI) were produced to support the image bands through mangrove LULC classification. Several studies used the NDVI to improve monitoring of the mangroves (
(2)
(3)
Image segmentation splits the image into small polygons in alignment with the objects of interest in the study area (
A number of 356 polygons were sampled through visual interpretation of the color composites of the Sentinel images and a Spot image scene acquired on December 20, 2016. In doing so, a special attention was also given to the mean reflectance of each segment whereby those which most correspond to the pre-defined spectral signature of the respective LULC class were chosen as sample polygons. Sample polygons of each LULC class were randomly split into two groups, each containing 78 samples for mangrove tree, 54 for mudflat, 25 for sand spit, and 21 for water. One group which was used to train the algorithms was evaluated in terms of the divergence between the classes and the other group was used for accuracy assessment. Given the promising results of non-parametric algorithms (see the introduction section), RF and SVM as the two well-known and commonly used classifiers in this case (
SVM was pioneered by Fisher in 1936 to reduce the error rate of discriminating training data based on the Statistical Learning Theory and then expanded by
RF (also known as random decision forests) was first introduced by
Two-thirds of training data are used for generating trees (known as in bag samples) and the remaining one-third for cross-validation which evaluates the performance of the RF model (
A review of the literature shows that the metrics most commonly used to study forest ecosystems include metrics of area, shape, proximity, and diversity (
Descriptions of landscape metrics used in the study (
Category | Name | Acronym/Units | Description |
---|---|---|---|
Area | Mean Patch Area | AREA-MN/ Meter | Description Average area of all patches in the SU (ha) |
Percentage of Landscape | PLAND/ Percent | PLAND made up of the corresponding class | |
Total Area | TA/ Hectares | TA includes any internal background present. | |
Edge Density | ED/ Meters per hectare | ED equals the sum of the lengths (m) of all edge segments involving the corresponding patch type. | |
Shape | Area Weighted Mean Patch Fractal Dimension | FRAC-AM/ None | FRAC reflects shape complexity across a range of spatial scales |
Subdivision | Patch density | PD/ Number per 100 hectares | PD of a certain class divided by the total landscape area n patches per ha |
Number of patches | NP/ None | NP of the corresponding patch type (class). | |
Aggregation | Euclidean nearest neighbor distance | ENN-MN/ Meters | MNN increases when the mean distance of patches from each type is increased. |
Landscape shape Index | LSI/ None | A perimeter-to-area ratio that measures the overall geometric complexity of the landscape (landscape level) | |
Contagion | Contagion | CONTAG/ Percent | Higher CONTAG values represents a landscape with less dispersed and more contiguous units. |
Diversity | Shannon’s Diversity Index | SHID/ None | SHDI increases with the increase in evenness and richness of patch types. |
The accuracy of object-based classification depends significantly on the quality of image segmentation. In this research, segments were generated through frequent trial-and-error and visual interpretation of the outputs. Accordingly, the scale factor was assigned to a value of 5. The influence weight of NDVI was considered twice greater than other bands (2:1). The shape and compactness factors were also kept constant as 0.1. The resampled and fused Sentinel-2 images were then classified using pixel- and object-oriented RF and SVM algorithms. Table
Algorithm | LULC class | Area (ha) | |||
---|---|---|---|---|---|
Pixel-based classification | Object-based classification | ||||
Resampled image | Fused image | Resampled image | Fused image | ||
RF | Wet area | 15536.4 | 15099.2 | 13184.5 | 13612.6 |
Mangrove | 9249.8 | 9032.1 | 8804.4 | 8543.8 | |
Mudflat | 13974.4 | 15184.0 | 16845.1 | 16957.2 | |
Water | 26906.4 | 26351.6 | 26832.8 | 26553.2 | |
SVM | Wet area | 12389.3 | 14073.7 | 12872.8 | 13184.5 |
Mangrove | 8979.6 | 9293.2 | 8846.1 | 8804.4 | |
Mudflat | 17039.3 | 15549.7 | 16935.7 | 16845.1 | |
Water | 27258.7 | 26750.3 | 27012.1 | 26832.8 |
The results of accuracy assessment for each classification are summarized in fig. 5 (due to limited space and for easier comparison of the results, confusion matrices are not shown). In this study, object-based classifications produced more satisfactory results than pixel-based classifications. Kappa coefficient and overall accuracy values obtained from pixel-based classifications were relatively low, below 72.2% and 80.8%, respectively, whereas these values were estimated to be much higher for object-based classifications. None of the object-based classifications represented kappa coefficient and overall accuracy values of less than 85% and 90%, respectively. According to the kappa classification scheme proposed by
The producer’s and user’s accuracy statistics were also calculated to find out whether the classifications succeeded in mapping each LULC class. The producer’s accuracy, as an indicator depicting how well a class is categorized (the accuracy of classification) (
At the patch level, mangrove patches extracted from the pixel-based and object-oriented methods were investigated for both RF and SVM algorithms. The results of the metrics of NP and PD showed that in the pixel-based method the total number of patches and mangrove patch density was higher than in the object-oriented method and a significant difference was observed between the two algorithms (Fig.
Comparison of A number of patch (NP) B patch density (PD) C landscape shape index (LSI), and D Mean Patch Area (AREA-MN) metrics for mangrove patches resulting from fusion image processing using two pixel-based (PB) and object-oriented (OB) methods with two random forest (RF) and support vector machine (SVM) algorithms.
At the class and landscape level, metrics were compared for classification maps obtained from the object-oriented method with the RF algorithm and the pixel-based method with the SVM algorithm, which had the highest accuracy (fig. 7). The results of PLAND metric showed that the percentage of mangrove and water class area was similar in both methods, but the mudflat class area was larger for the pixel-based method with SVM algorithm than the object-oriented method with RF algorithm and vice versa for barren class. The results of the NP metric showed that the number of spots for all classes in the pixel-based method with the SVM algorithm was much higher than the object-oriented method. In both methods, the highest number of patches was related to the mudflat class and the lowest was attributed to the water class. The fractal dimension metric was related to the complexity of the patches, which approaches two if the shape is very complex. The results of fractal dimension metric showed that the value of this metric for the object-oriented method with RF algorithm was less than the pixel-based method with SVM algorithm and the highest value of fractal dimension in both methods was related to the water class. The lowest values for the object-oriented and pixel-based methods were related to the barren and mangrove methods, respectively. The results of ENN-MN metric showed that the amount of this metric for the pixel-based method was less than the object-oriented method. The lowest ENN-MN values in the object-oriented method belonged to the mudflat class and in the pixel-based method to the barren class.
Comparison of landscaping metric results between the classification map obtained from the object-oriented method with random forest (RF) algorithm and the classification map obtained from the pixel-based method with the support vector machine (SVM) algorithm at the class level A percentage of landscape (PLAND) B number of patch (NP) C area Weighted Mean Patch Fractal Dimension (FRAC-MN) D Euclidean nearest neighbor distance (ENN-MN).
At the landscape level, the results of the metrics are displayed in Table
Comparison of landscape metrics for two classification maps using the object-oriented and pixel-based methods at the landscape level.
LID | TA | NP | PD | ED | AREA-MN | FRAC-AM | ENN-MN | CONTAG | SHDI |
---|---|---|---|---|---|---|---|---|---|
RF object-based | 65659.31 | 1356 | 2.06 | 48.72 | 48.42 | 1.20 | 99.35 | 48.37 | 1.30 |
SVM object- based | 65659.32 | 26982 | 40.65 | 101.37 | 2.44 | 1.37 | 30.30 | 46.11 | 1.27 |
This study compared the performance of different combinations of satellite images and classification techniques to provide insights for future studies on mangrove LULC classification. In terms of image selection, a growing propensity has emerged recently in using free Sentinel 1/2 images (
In Sentinel images, fusion can provide an opportunity to take full advantage of both spectral and spatial power of the image bands (
Another challenging issue which has increasingly caught the attention of remote sensing experts is the difference between the performance of pixel-based and object-based methods in LULC classification. Studies in this case showed that there is a glaring disparity between the classification results achieved under these methods and that the superiority of them over each other may depend on a number of factors such as the classification extent, spatial pattern and number of LULC classes (
As given in the introduction section, the majority of studies on LULC classification concluded that non-parametric algorithms can provide better classification results than the parametric ones. Drawing on previous research carried out in this research area, the performances of RF and SVM as two well-known classification algorithms were evaluated in this research. As shown in fig. 5, none of the algorithms scored best under all circumstances. In object-based classifications, RF was found to be the foremost classifier while SVM performed better in pixel-based classifications. Similar to these results,
Mangroves are globally threatened, disappearing and degraded. Monitoring and evaluating changes in the trends of mangrove distributions and dynamics is urgent for the conservation of these ecosystems. The results of this research showed that object-based classification can better distinguish mangrove trees from mudflat and sand spit, while no solid conclusion was drawn in terms of the application of classification algorithms. Moreover, although further investigations are necessary to determine the potential of fused Sentinel images in LULC classification, our results showed that the accuracy of mangrove LULC classification can be well improved by object-oriented classification of fused Sentinel images. Mangrove ecosystems generally consist of the same LULC classes as those categorized in this research. Therefore, further research in our study area or other mangrove ecosystems is needed to strengthen or support the results obtained by this research and, in turn, inform future research on mangrove ecosystems. The findings of this research can contribute to the improvement of management and conservation strategies for these ecosystems being impacted by human activities.
This work was supported by the Isfahan University of Technology, Department of Natural Resources.