Research Article |
Corresponding author: William Magnuson ( bill@inpa.gov.br ) Academic editor: Pierre-Yves Henry
© 2014 William Magnuson.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0) which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Magnuson W (2014) Uncertainty and the design of in-situ biodiversity-monitoring programs. Nature Conservation 8: 77-94. doi: 10.3897/natureconservation.8.5929
|
There are many techniques to deal with uncertainty when modeling data. However, there are many forms of uncertainty that cannot be dealt with mathematically that have to be taken into account when designing a biodiversity monitoring system. Some of these can be minimized by careful planning and quality control, but others have to be investigated during monitoring, and the scale and methods adjusted when necessary to meet objectives. Sources of uncertainty include uncertainty about stakeholders, who will monitor, what to sample, where to sample, causal relationships, species identifications, detectability, distributions, relationships with remote sensing, biotic concordance, complementarity, validity of stratification, and data quality and management. Failure to take into account any of these sources of uncertainty about how the data will be used can make monitoring nothing more than monitoring for the sake of monitoring, and I make recommendations as to how to reduce uncertainties. Some form of standardization is necessary, despite the multiple sources of uncertainty, and experience from RAPELD and other monitoring schemes indicates that spatial standardization is viable and helps reduce many sources of uncertainty.
Biodiversity, stakeholder, sampling, identification, detectability, distribution, remote sensing, concordance, complementarity, stratification, data quality
There are many sources of uncertainty in scientific research, some of which can be modeled mathematically, but some sources of uncertainty are considered non-probabilistic, and the best way to deal with these is controversial (e.g.
Ecologists are generally most worried about uncertainty in relation to their field of research. Taxonomists worry about the correctness of identifications, modelers worry about the accuracy of parameter estimates in their models, sociologists are concerned with uncertainties about the contributions of different stakeholders, geneticists try to reduce uncertainty about gene flow, etc. However, biodiversity managers have to deal with all sources of uncertainty simultaneously, and the importance of different forms of uncertainty will vary depending on the objectives of management. I have therefore adopted a very broad concept of uncertainty, and make recommendations as to how they can be reduced or quantified by planning during implementation of field infrastructure and quality control during data collection.
Here we will describe some of the sorts of uncertainty that we had to take into account when developing the RAPELD system of biodiversity monitoring (
The basic sampling unit in the RAPELD system is a 1 km transect combined with one or more permanent plots that are usually 250 m long in the largest dimension. That is, the system was designed for the relatively large landscapes managed by most municipal, state and federal governments, and is often not appropriate for the small-scale landscapes studied by many academic biologists. Uncertainty about the scale at which users would apply the results was the prime reason for designing a modular system.
Although there is large variation in the sizes of RAPELD modules, most users use a standard 25 km2 grid with 30 uniformly distributed plots for intensive studies in long-term ecological research sites located near major research institutions, and standard 5 km2 (5 km × 1 km) modules with 10 uniformly distributed plots for RAP studies or long-term studies distributed over large areas (http://ppbio.inpa.gov.br/inventarios/modular). Uniformly distributed plots are 250 m long and the center line follows the altitudinal contours (http://ppbio.inpa.gov.br/instalacao/parcelas), a design that generally allows greater precision of models that relate biodiversity parameters to environmental variables. Plots for special strata, such as streams and riparian zones, are distributed in proportion to their occurrence in the landscape.
Most biologists specialize in a limited range of taxa (e.g. vascular plants) or processes (e.g. pollination), but decision makers have to take into account the needs of many different stakeholders, who may be interested in subjects as varied as the effect of large carnivores on domestic animals, bacterial metagenomics, ecosystem carbon storage and traditional uses of biodiversity. Reduction of these varied interests to a production-line mathematical model with limited inputs and outputs is usually not feasible, especially when a major uncertainty is whether we are addressing the right question (
A major difficulty, perhaps the major difficulty, with the interpretation of data collected in monitoring exercises is that the biologists have focused on their favorite group and not collected the data in such a way that it can be integrated with information generated on other biological groups and presumed environmental drivers. Different taxa provide different information about the distribution of biodiversity, and there is often heated discussion about the appropriate group to study (
When we started our studies, discussing possible sampling designs only resulted in endless discussions as to which design optimized for a particular question was the “right” design. We found that the only way to obtain integrated data collection was to provide standardized infrastructure that could be used by most researchers to answer a wide variety of questions.
There is a great difference between planning for an individual study of a limited range of organisms and planning a monitoring system for a wide range of taxa over very large areas. We did not appreciate this at the beginning, and it only became obvious to us as we saw what worked for a wide range of researchers over large areas, and what was mainly useful for specific studies. This dichotomy has been recognized by many researchers responsible for nationwide monitoring of biodiversity (e.g.
Individual researchers tend to consider their study site to be primordially of interest in relation to their current research question. However, that piece of land may have a multitude of other values for the local people (
Stakeholder roles also depend on their position in the hierarchy. Multinational bodies try to influence decisions by changing national policies. Most funding agencies for long-term monitoring are national, but international organizations may provide short-term funding. Federal and state governmental agencies generally try to manipulate people´s behavior through the legal system. Academics are involved in planning and analysis, and most of the long-term monitoring has to be done by local people or students. All of these categories have fuzzy boundaries, and the relative interest of these groups is likely to change depending on unpredictable factors, such as employment opportunities, market demand and global climate change. Nevertheless, a monitoring scheme has to take into account the different roles of stakeholders. Probably the most difficult aspect of developing the RAPELD system was ensuring that different actors in different levels of the political system would be satisfied with their role, and the roles of other actors (
Biological relationships with distance are not linear (
Different stakeholders have different human and financial resources, but very few have the capacity to undertake detailed studies over large areas. Therefore, we needed a system that would allow integration of a large number of stakeholders with different technological tools at their disposal. Our infrastructure is suitable for use by local people with no formal education, and their participation is often vital because much biodiversity is hidden from the eyes of casual visitors (
Students are the main researchers in most RAPELD sites, but many of the surveys carried out in the modules around the Santo Antônio hydro-electric dam were undertaken by parataxonomists who had been trained in another state. We found that there was a trade-off in sources of uncertainty. Monitoring by students and volunteers increases uncertainty as to the frequency and quality of monitoring. Relying on surveyors specifically contracted for the task, as was the case in Santo Antônio, increases uncertainty as to whether funding will be sufficiently reliable to meet labor-law requirements. By concentrating on field infrastructure, we were able to take advantage of different forms of financing for monitoring in different places and time periods.
Costs of monitoring could not be too great if different stakeholders with limited economic resources were to be involved, but the system had to house high-technology systems, such as eddy-flux towers, when available. While it is not possible to foresee all stakeholders, it is possible to provide field infrastructure that most will need, such as access trails and permanent plots.
Because of its complexity, all aspects of biodiversity cannot be measured directly and decisions are made based on surrogates, which are usually maps derived from remote-sensing data, but may be simply the representation of one biological group by another. For instance, vascular plants are often used to identify “habitats”, “ecoregions” or “ecosystems” that purportedly represent boundaries to the distribution of other organisms, such as insects, mammals or fish (e.g. Olson et al. 2001,
Biologists generally stratify and collect only where they “know” that certain types of organisms occur (
We were initially uncertain about both the questions that stakeholders would want to answer and the remote-sensing technology that would become available. However, most political decisions are made on scales of tens to hundreds of linear kilometers, and few researchers have the resources to use remote-sensing tools with pixel sizes of a few meters. Therefore, we designed a system with relatively large sampling units (250 m long plots and 5 km long transects) that would allow the use of a wide variety of remote-sensing products available today, and will allow the use of even more in the future as products with smaller pixel sizes come on line.
Many monitoring programs have fixed targets, such as the Alberta Biodiversity Monitoring Institute (ABMI –
Integration can be obtained by associating individual sites with larger initiatives for particular targets. For instance, RAPELD plots are included in the RAINFOR (e.g.
Most biologists now carry GPS devices, and geographic coordinates are the backbone of the Darwin-core system for digitalizing the information in biological collections. However, precise information about collection locations, or a single point representing the headquarters of park personnel in a large reserve are generally not sufficient to evaluate search effort or relate biological data to potential abiotic drivers. When we installed RAPELD modules in areas that had been intensively studied by other monitoring programs, we encountered many difficulties in avoiding disturbance to their plots, because the other programs did not have precise coordinates delimitating their field infrastructure.
RAPELD modules provide researchers with extremely detailed information on the location of trails, plots and large sessile organisms, such as trees. As RAPELD plots are long and thin, it is easy for researchers to locate their organisms quickly using only a compass and a measuring tape. All trails are marked at 50 m intervals (100 m intervals in some older sites), so researchers and local assistants can record relatively precise locations even when they do not have GPS equipment. This has been especially important for the use of RAPELD in environmental-impact studies, because reduction in area occupied is often a more sensitive measure of impact than attempts to estimate absolute numbers of organisms by mark-recapture methods (See “Uncertainty about detection” below).
Different aspects of biodiversity and the environment are usually studied by different researchers, and very often it is difficult to see what these researchers have in common. “Integrated” projects usually involve extensive discussion about how the funds will be divided and the general locations of study sites. However, when the time comes for analysis, it is usually impossible to integrate the different studies that were done on different temporal and spatial scales, even though all were contained within the same geographic envelope. Often, researchers from different disciplines have completely different concepts of what represents replication and independence of observations.
By installing a system of transects, and especially plots, that can be used by a wide range of disciplines, we were able to integrate many studies that previously had been considered too disparate for interdisciplinary studies (see also
Narrow plots that follow altitudinal contours have less internal variability in environmental predictor variables than conventional square plots, or long thin plots that are not oriented along contour lines (
Most studies in RAPELD plots have investigated the relationships between topographic, soil or hydrological variables and organisms (e.g.
Field work is increasingly being considered unfashionable (
Different observers can result in different diversities, and different levels of biotic complementarity, even for comparisons of the same site. Therefore, we have invested heavily in surveyor mobility, so that different researchers can exchange experiences and compare identifications. Field workshops are much more efficient than learning by reading, and field courses are a large part of our investment <http://ppbio.inpa.gov.br/extensao>. In any case, as many voucher specimens and photographs should be taken as are financially and ethically feasible. Passive sampling by traps may reduce observer biases, and some taxa can only be efficiently sampled with traps. However, passive sampling is often inefficient in comparison to active sampling (e.g.
Marked plants (live herbariums) may allow re-evaluation of identifications in the future. Production of printed and internet field guides helps maintain stability of identifications across sites, and within the same site through time. For the first RAPELD site, we produced guides to frogs, lizards, the predominant understory angiosperms and ferns (
Conservation decisions are made based on the distribution of taxa, but distribution is defined as much by the area that a species does not occur as by where it occurs. False absences may lead to bad scientific decisions (
Estimates of species detectability and occupancy generally require repeated surveys of sampling units, though in some cases space can be substituted for time. To be able to use those techniques, it is important that detailed information is available about where organisms were collected, and the effort expended to detect them. This is relatively easy to do with the spatially standardized sampling units used in the RAPELD system. Precise coordinates are often available for specimens collected in conventional surveys, but researchers who do not collect in spatially standardized units usually do not report sampling effort, especially if no specimen was collected. Spatially standardized units have allowed the evaluation of occupancy in RAPELD modules, and will allow long-term changes in occupancy to be evaluated. Even in the case that the researcher is confident that they record all the species within a sampling unit (a rare occurrence in the field, but common in researcher imaginations), quantifying detectability greatly increases the confidence that other researchers and managers will have in the results.
Complementarity is a core concept in systematic conservation planning (
There are other options for planning, such as the use of complementarity of species assemblages based on multivariate ordination techniques (e.g.
For monitoring, data is generally a more important product than scientific publications (
We were initially uncertain as to the best way to make data available. There are many database programs available, and information-technology specialists are always willing to develop another one. However, use of a single database for all the data is not viable for diverse monitoring data (
The information-technology revolution is recent, and none of our major field researchers had formal training in data management. Worse still, most university programs still do not offer specific courses in the principles of data management to biologists, though some do offer courses in the use of specific database programs. Data management is not easy or intuitive, and it is a critical phase in research that can effectively nullify all the planning that has gone into data collection. In the end, we found that the major uncertainties were related to whether (1) researchers would be motivated to make their data available, (2) whether researchers had already lost critical data in the field, (3) whether researchers had sufficient training to effectively deposit data, and (4) whether we could find the resources to undertake the capacity building and data verification necessary. Every major project should have a full time data manager responsible for training and data screening, but few do.
We adopted the Ecological Metadata Language (EML) used in the Metacat system (
I have covered only a few of the sources of uncertainty that a field monitoring scheme has to deal with, but they illustrate the complexity of the problem. We are monitoring because we are uncertain, so uncertainty has to be a central issue in any monitoring scheme. However, biodiversity monitoring is more complex than monitoring physical phenomena, such as weather, because there are so many definitions of biodiversity, and it has different values for different segments of society (see
When we first implemented RAPELD, we opted for a hierarchical design, with regular sampling at the local level, but sampling sites limited by logistical considerations over larger areas. Although there was little theoretical support for it at the time, it has been useful to respond to a wide range of questions of interest to decision makers (
We were even uncertain about the questions that researchers will want to answer in the future using the data we are collecting today. Therefore, rather than adopting a taxon- or question-oriented approach, we focused on a spatial design that allows flexibility in questions and taxa studied, while allowing the landscape-geographical approaches required by conservation biology. It is a compromise, but a necessary one. RAPELD sites are being used to answer many specific questions, so the trade-off is generally not between monitoring and answering specific questions. It is between investing in field infrastructure and planning now, rather than in short-term studies that are good for researcher curricula, but that contribute little to long-term, wide-scale conservation planning. At the moment, many citizen monitoring programs, combined with good data availability, are contributing more to our understanding of global phenomena, such as climate change, than are more scientific programs (
The ideas expressed in this paper were developed by too many people to cite here, but all are participants in the Instituto Nacional de Ciência Tecnologia - Centro de Estudos Integrados da Biodiversidade Amazônica (INCT-CENBAM), and the Brazilian Program for Biodiversity Research (PPBio). I thank Klaus Henle for the opportunity to participate in the workshop and the stimulus for the important interactions through EU Bon (EU FP project grant number 308454).