Evaluating community-wide temporal sampling in passive acoustic monitoring: A comprehensive study of avian vocal patterns in subtropical montane forests

Shih-Hung Wu; Jerome Chie-Jen Ko; Ruey-Shing Lin; Chia-Hao Chang-Yang; Hsueh-Wen Chang

doi:10.12688/f1000research.141951.2

Home Browse Evaluating community-wide temporal sampling in passive acoustic monitoring:...

ALL Metrics

Views

Downloads

Get PDF

Get XML

Export

▬

✚

Research Article

Revised

Evaluating community-wide temporal sampling in passive acoustic monitoring: A comprehensive study of avian vocal patterns in subtropical montane forests

[version 2; peer review: 1 approved, 2 approved with reservations]

Shih-Hung Wu^1,2, Jerome Chie-Jen Ko^2,3, Ruey-Shing Lin², Chia-Hao Chang-Yang¹, Hsueh-Wen Chang¹

Shih-Hung Wu^1,2, Jerome Chie-Jen Ko^2,3, [...] Ruey-Shing Lin², Chia-Hao Chang-Yang¹, Hsueh-Wen Chang¹

PUBLISHED 23 Jan 2024

Author details Author details

¹ Taiwan Biodiversity Research Institute, Nantou, 552, Taiwan
² Department of Biological Sciences, National Sun Yat-sen University, Kaohsiung, 804, Taiwan
³ Institute of Ecology and Evolutionary Biology, National Taiwan University, Taipei, 106, Taiwan

Shih-Hung Wu
Roles: Conceptualization, Data Curation, Formal Analysis, Investigation, Methodology, Project Administration, Resources, Software, Validation, Visualization, Writing – Original Draft Preparation, Writing – Review & Editing

Jerome Chie-Jen Ko
Roles: Conceptualization, Investigation, Methodology, Validation, Writing – Review & Editing

Ruey-Shing Lin
Roles: Conceptualization, Funding Acquisition, Supervision, Writing – Review & Editing

Chia-Hao Chang-Yang
Roles: Supervision, Writing – Review & Editing

Hsueh-Wen Chang
Roles: Supervision, Writing – Review & Editing

OPEN PEER REVIEW

REVIEWER STATUS

This article is included in the Ecology and Global Change gateway.

This article is included in the Automated Biodiversity Monitoring Methods collection.

Abstract

Background

From passive acoustic monitoring (PAM) recordings, the vocal activity rate (VAR), vocalizations per unit of time, can be calculated and is essential for assessing bird population abundance. However, VAR is subject to influences from a range of factors, including species and environmental conditions. Identifying the optimal sampling design to obtain representative acoustic data for VAR estimation is crucial for research objectives. PAM commonly uses temporal sampling strategies to decrease the volume of recordings and the resources needed for audio data management. Yet, the comprehensive impact of this sampling approach on VAR estimation remains insufficiently explored.

Methods

In this study, we used vocalizations extracted from recordings of 12 bird species, taken at 14 PAM stations situated in subtropical montane forests over a four-month period, to assess the impact of temporal sampling on VAR across three distinct scales: short-term periodic, diel, and hourly. For short-term periodic sampling analysis, we employed hierarchical clustering analysis (HCA) and the coefficient of variation (CV). Generalized additive models (GAMs) were utilized for diel sampling analysis, and we determined the average difference in VAR values per minute for the hourly sampling analysis.

Results

We identified significant day and species-specific VAR fluctuations. The survey season was divided into five segments; the earliest two showed high variability and are best avoided for surveys. Data from days with heavy rain and strong winds showed reduced VAR values and should be excluded from analysis. Continuous recordings spanning at least seven days, extending to 14 days is optimal for minimizing sampling variance. Morning chorus recordings effectively capture the majority of bird vocalizations, and hourly sampling with frequent, shorter intervals aligns closely with continuous recording outcomes.

Conclusions

While our findings are context-specific, they highlight the significance of strategic sampling in avian monitoring, optimizing resource utilization and enhancing the breadth of monitoring efforts.

Keywords

passive acoustic monitoring, vocal activity rate, temporal sampling, Aves

Corresponding authors: Chia-Hao Chang-Yang, Hsueh-Wen Chang

Competing interests: No competing interests were disclosed.

Grant information: The author(s) declared that no grants were involved in supporting this work.

Copyright: © 2024 Wu SH et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

How to cite: Wu SH, Ko JCJ, Lin RS et al. Evaluating community-wide temporal sampling in passive acoustic monitoring: A comprehensive study of avian vocal patterns in subtropical montane forests [version 2; peer review: 1 approved, 2 approved with reservations]. F1000Research 2024, 12:1299 (https://doi.org/10.12688/f1000research.141951.2) First published: 11 Oct 2023, 12:1299 (https://doi.org/10.12688/f1000research.141951.1) Latest published: 23 Jan 2024, 12:1299 (https://doi.org/10.12688/f1000research.141951.2)

Revised Amendments from Version 1

In accordance with the valuable feedback provided by two peer reviewers, we have revised certain textual and graphical elements within the abstract and the main body of the manuscript. These modifications have been implemented to enhance the readability of our paper.

See the authors' detailed response to the review by Laurel B Symes
See the authors' detailed response to the review by Jerry S Cole

1. Introduction

Biodiversity is paramount for the sustainable progression of human society and environmental preservation. It contributes either directly or indirectly to all 17 of the Sustainable Development Goals (SDGs) (Blicharska et al., 2019). Monitoring biodiversity is a crucial endeavor that facilitates the comprehension of the current state, alterations, and trends of biodiversity and evaluates the efficacy of interventions aimed at mitigating biodiversity loss (Pereira & Davidcooper, 2006). Birds serve as an ideal indicator taxon for monitoring terrestrial biodiversity due to their detectability, identifiability, diversity, widespread distribution, and migratory characteristics (Fraixedas et al., 2020). Bird monitoring can shed light on the effects of habitat loss, deforestation, climate change, invasive species, light pollution, and illegal hunting (Xu et al., 2019; Northrup et al., 2019; Dueñas et al., 2021; La Sorte et al., 2022; Crespo et al., 2020; Negret et al., 2021), while also highlight the beneficial outcomes of conservation efforts (Cazalis et al., 2020).

Beyond human observations, various technologies including radar, thermal imaging, and passive acoustics have been employed for manual or automatic bird monitoring (Lahoz-Monfort & Magrath, 2021). In recent years, PAM has seen an upsurge in its use for bird monitoring and research (Hoefer et al., 2023). Thanks to decreasing costs, autonomous recording units (ARUs) can now be extensively deployed across diverse environments, recording considerable volumes of soundscape data. These data provide rich biological information and allow continuous, automated monitoring, thereby significantly increasing both the temporal and spatial coverage of these efforts (Ross et al., 2023; Pérez-Granados et al., 2021; Sugai et al., 2019; Shonfield & Bayne, 2017). However, the labor-intensive and time-consuming process of identifying species sounds within soundscape data presents a significant bottleneck for PAM utilization. The rise of automated identification tools, such as BirdNET and SILIC, are steadily alleviating this issue (Kahl et al., 2021; Wu et al., 2022). Despite these advancements, handling large volumes of audio files remains a considerable challenge due to the high energy requirements for extended ARU field operations, the need for extensive storage space, and lengthy analysis time (Zwerts et al., 2021).

Sampling has been employed as an effective and commonly used method to decrease the amount of recording data. Sampling design can be categorized into four temporal scales: intra-annual, short-term periodic, diel, and hourly. An intra-annual sampling design implies recording during one or several time periods within a year. Many bird species exhibit more frequent vocalization during the breeding season; hence, the majority of studies prefer to conduct PAM surveys within this timeframe of a year (Campos-Cerqueira & Aide, 2016; Bateman et al., 2021; Duchac et al., 2020). Alternatively, certain studies opt for acoustic surveys during the non-breeding season, a period characterized by relatively stable detectability and community composition (Metcalf et al., 2021).

Short-term periodic sampling design is often employed in light of limited ARU availability, necessitating a rotational system among diverse survey locations. Typically, after being operated for a predetermined number of days at each location, the ARU is relocated to a subsequent site. This rotation ensures a comprehensive collection of crucial soundscape data from each location throughout the survey season (Jahn et al., 2022; Machado et al., 2017). However, avian vocal activity can vary significantly over time (Pérez-Granados & Schuchmann, 2020), and the rotation of devices leads to asynchronous data collection, potentially increasing variability. Therefore, concentrating the rotation of devices during periods when avian vocal activity shows relatively minor temporal variations can help mitigate this variability, enhancing the comparability of data across different sites. Furthermore, the duration of each deployment during the rotation process is a critical factor affecting sound data collection. Longer deployment durations can dilute data collected under extreme conditions, such as typhoons, but also increase the volume and processing costs of the data. Choosing an appropriate deployment duration to reduce the impact of extreme events while minimizing deployment time presents a significant challenge.

The diel sampling design is typically framed based on the behavioral patterns of the target species. For instance, as Passeriformes frequently vocalize during the dawn and dusk choruses, the recordings are concentrated around these periods (Alvarez-Berríos et al., 2016; Deichmann et al., 2017; Rumelt et al., 2021). For nocturnal birds, recordings are conducted during the night (Jahn et al., 2022; Wood et al., 2020). When the research objective targets one or a few bird species with similar vocal activity patterns, prior research on the vocal behavior of these species is crucial. It aids in planning the recording schedule to coincide with the peak vocalization periods of the target species. However, when the goal encompasses a wide variety of species with different habits, the challenge lies in optimizing recording times to capture the vocal peaks of most species within limited resources.

Hourly sampling design can be categorized into coverage (the proportion of recorded time within an hour) and dispersion (the number of recording segments within an hour). Examples of such strategies might include recording a one-minute segment (Diepstraten & Willie, 2021) or a fifteen-minute segment (Pérez-Granados et al., 2021) within an hour, or perhaps recording one minute every ten minutes (Ducrettet et al., 2020; Melo et al., 2021) or every fifteen minutes (Yoo et al., 2020), and even recording fifteen minutes every half hour (Favaro et al., 2021). There is a trade-off challenge in balancing reduced coverage with adequate acoustic data collection. Regarding dispersion, the critical question is which recording schedule, whether dispersed or concentrated, more accurately reflects the actual scenario. Furthermore, it's important to understand how different bird species respond to varying levels of dispersion. Both coverage and dispersion are significant factors influencing the design of hourly sampling designs.

The vocal activity rate (VAR) of birds, defined as the quantity of vocalizations per unit of time, can be derived from PAM data. VAR is a pivotal metric in acoustic surveys, enabling the estimation of bird abundance or density, which is crucial for monitoring avian population (Pérez-Granados et al., 2019a; Pérez-Granados & Traba, 2021). However, avian vocal activity is modulated by an intricate blend of both exogenous and endogenous factors. This results in diverse vocal patterns that vary by species, sex, age, temporal factors, environmental conditions, site-specific characteristics, habitat types, and social contexts (Catchpole & Slater 2003; Marques et al. 2013; Bruni et al., 2014; Digby et al., 2014; Symes et al., 2022). The VAR is substantially influenced by its temporal sampling design. For studies targeting individual species or a limited group, the temporal sampling can be customized to their specific behaviors (Pérez-Granados et al., 2019b). Yet, when the scope encompasses an entire avian community, an optimally structured survey should capture the most prevalent species and a significant proportion of the less common ones (Franklin et al., 2021). Past research has underscored the profound impact of the recording schedule on assessments of avian community richness and composition. For example, prolonging recording durations generally augments species detection, especially for less common species (Wood et al., 2021; Symes et al., 2022). Concentrating recordings during specific time, like dawn, often captures more species but may miss those from distinct functional groups (Shaw et al., 2022). Nevertheless, the effects of temporal sampling designs on the VAR of individual species within a community remain under-investigated.

This study aims to evaluate the impact of various temporal sampling methodologies on the identification of VAR patterns in a biotic community, with a focus on avian communities in subtropical montane forests. Our objectives include: (a) to investigate the influence of three different time-scale sampling designs – short-term periodic, diel, and hourly – on the perceived VAR patterns; and (b) to provide strategic recommendations for optimal temporal sampling strategies to maximize the utility of limited research resources. By aligning the best practices of sampling strategies with available resources, we believe our findings will promote efficient and effective passive acoustic monitoring, thereby contributing to the conservation of avian communities and terrestrial biodiversity.

2. Methods

2.1 Study area

This study was conducted in the southern sector of Yushan National Park (YSNP) situated in central Taiwan, encompassing an expanse greater than 100,000 hectares (Underlying data: Figure S1 (Wu et al., 2023)). YSNP is named after Yushan or Jade Mountain, renowned for its highest peak in Northeast Asia with an elevation of 3,952 meters. This national park is pivotal in sustaining high-altitude ecosystems, transitioning from subtropical zones at its base to alpine zones at higher altitudes. The mean annual precipitation recorded is around 3,600 mm. Altitudinal variation influences the average annual temperatures: approximately 20°C at 1,000 meters, around 10°C at 2,500 meters, and roughly 5°C beyond 3,500 meters (Minister of the Interior, 2022).

2.2 Soundscape data collection

For the data collection, 14 PAM stations were deployed within the study area. These stations spanned a distance of approximately 10 kilometers along the Southern Cross-Island Highway, from Meishan (23°15′51″N, 120°49′33″E) to Yakou (23°15′51″N, 120°57′28″E) (Underlying data: Figure S1 (Wu et al., 2023)). The altitude of the initial station, SCIH07, was 1,500 meters, and subsequent stations were set up at 100-meter elevation increments, terminating at an elevation of 2,800 meters with station SCIH20. The distance between any two adjacent stations ranged from 750 and 1,850 meters. This elevation gradient encapsulated four distinctive vegetation types: submontane evergreen broad-leaved forest, montane evergreen broad-leaved forest, montane mixed coniferous-broadleaved forest, and upper montane coniferous forest (Minister of the Interior, 2022) (Underlying data: Table S1 (Wu et al., 2023)).

Each PAM station was equipped with a Song Meter Mini recorder (Wildlife Acoustic Inc.) designed to capture soundscape data. These devices, anchored to trees at an average height of 1.5 meters, operated continuously throughout the day between March and June 2021. This period corresponds with the breeding season of the region’s montane forest avian species. The recording configurations were set to mono mode, capturing audio in a 16-bit WAV format with a sampling rate of 44.1 kHz. To facilitate subsequent analytical procedures, the recordings were segmented and stored in three-minute durations.

2.3 Target species

To gain a comprehensive understanding of individual species’ status within a community, we utilized ecological guilds as our primary criterion for selecting target species. Ding (1993) cataloged 59 avian species recognized as montane forest breeders and categorized them into 12 distinct ecological guilds within YSNP: raptorial carnivores (RC), ground graminivores (GG), ground omnivores (GO), ground insectivores (GI), bush insectivores (BI), tree fruitivores (TF), tree omnivores (TO), tree insectivores (TI), bole gleaners (BG), bole peckers (BP), tree hoverers (TH), and air flycatchers (AF).

To identify our research’s target species, a singular species was selected as a representative from each guild. When a particular guild included more than two species, we used trait data from Tsai et al. (2020) regarding Taiwan’s breeding birds to inform our selection. The species that manifested an altitudinal distribution most congruent with our study’s objectives was then chosen.

As a result, we designated 12 bird species as our primary focus, including Collared Owlet (Taenioptynx brodiei), Large-billed Crow (Corvus macrorhynchos), Taiwan Bush Warbler (Locustella alishanensis), Grey-chinned Minivet (Pericrocotus solaris), Taiwan Vivid Niltava (Niltava vivida), Eurasian Nuthatch (Sitta europaea), Taiwan Rosefinch (Carpodacus formosanus), Taiwan Yuhina (Yuhina brunneiceps), Taiwan Shortwing (Brachypteryx goodfellowi), Ashy Wood-Pigeon (Columba pulchricollis), Green-backed Tit (Parus monticolus), and Gray-headed Woodpecker (Picus canus). Each of these species was emblematic of the 12 aforementioned guilds (Underlying data: Table S1 (Wu et al., 2023)).

2.4 Vocal detection and performance evaluation

We selected SILIC, an automated wildlife sound identification tool recently developed based on the YOLOv5 object detection model and spectrogram images (Wu et al., 2022), for detecting bird vocalizations in our study. This choice was primarily motivated by two key attributes of SILIC. Firstly, it can recognize the vocalizations of 141 bird species native to Taiwan, encompassing all of our 12 target species. Secondly, SILIC offers a unique capability to detect each vocalization’s exact start and end time within an audio recording at the millisecond level, rather than merely identifying the presence or absence of a certain vocalization within a broad time frame. This feature enables precise computation of the VAR, a crucial metric for our research.

SILIC categorizes sounds into ‘sound classes’ rather than by species. Of the 12 target species we studied, each had 1 to 5 sound classes in SILIC, including ‘song’, ‘call’, ‘drumming’, and ‘unknown’ (a classification with an undetermined function). As song is common during many bird species’ breeding, it was our primary choice. If multiple song classes were provided by SILIC, we consulted experts to select the prevalent one. For species without a song or if the song was understated and hard to discern, we chose a frequently observed sound class. Accordingly, ‘song’ represented 9 of the 12 species, ‘call’ 2, and ‘unknown’ 1. The spectrograms representing the selected sound classes are provided in the Underlying data: Figure S2 (Wu et al., 2023).

We utilized SILIC to extract vocalizations of our target species from the soundscape recordings. Each three-minute recording was segmented into three-second spectrogram clips and analyzed using a one-second sliding window. Due to the overlapping nature of the sliding window, one vocalization might be detected multiple times. For detections of the same species within a single recording, if the intersection area of two overlapping bounding boxes (each bounding box representing a specific vocalization in the spectrogram) divided by the area of the smaller bounding box exceeded 0.25 (Underlying data: Figure S3 (Wu et al., 2023)) or if the intersection area divided by the union area exceeded 0.1 (Underlying data: Figure S4 (Wu et al., 2023)), the two bounding boxes (vocalizations) were combined.

A random sample of 100 detected vocalizations for each species, each tagged with a confidence score (ranging from 0 to 1, indicating the level of certainty that the vocalization belongs to a particular species), was manually reviewed. This set of manually reviewed detections constituted our test dataset for evaluating the performance of SILIC on our soundscape recordings. We created a confusion matrix consisting of four parameters: true positives (TP), true negatives (TN), false positives (FP), and false negatives (FN). Subsequently, we calculated precision as TP/(TP+FP) and recall as TP/(TP+FN). Additionally, we computed the area under the receiver operating characteristic curve (AUC) and the average precision (AP), the latter being equivalent to the area under the Precision-Recall curve. Detailed calculations of these performance metrics can be found in the Underlying data: Appendix S2 (Wu et al., 2023). Finally, we identified the confidence scores corresponding to the maximum F1-score values and designated these as the threshold scores for each species. Detected vocalizations exceeding these confidence score thresholds were classified as positive detections and were incorporated in subsequent analyses.

2.5 Temporal sampling designs and statistical analyses

2.5.1 Environment factors

As previously stated, bird vocal activity is influenced by multiple factors. In addition to species and seasonality, it is also affected by environmental factors such as climatic conditions, topographic features, and vegetation type. Some of these factors, like topographic features and vegetation type, may remain constant throughout the survey period, while others may vary over time. When investigating the impact of temporal sampling on avian vocalization activity, it is crucial to collect as much information as possible about these environmental factors to comprehensively explore their influence on the VAR. We collected data on each station’s altitude (in meters) and vegetation types (as described in the Soundscape data collection section). Daily climate data, including corrected precipitation (Rainfall; mm), wind speed at two meters (WindSpeed; m/s), temperature at two meters (Temp; °C), relative humidity at two meters (RH; %), and surface pressure (SP; kPa), were downloaded from the NASA/POWER CERES/MERRA2 Native Resolution Daily Data website (https://power.larc.nasa.gov/data-access-viewer/).

We employed a forward stepwise selection approach to develop four GAMs to evaluate the relationship between the response variable, daily vocal activity rate (VAR_d, representing the quantity of vocalizations per day), and nine exploratory variables. VAR_d was calculated as the total daily vocalization count for each station and bird species separately. The initial model contained only categorical variables ‘Species’ and ‘Vegetation’, as well as their interaction effect. In the second model, a smooth term for ‘Altitude’ for each ‘Species’ was incorporated. The third model included five additional smooth terms for meteorological variables, namely ‘Rainfall’, ‘WindSpeed’, ‘Temp’, ‘RH’, and ‘SP’. Finally, a smooth term for the day of year (DOY) for each ‘Species’ was included in the final model.

As VAR_d is discrete and highly positively skewed count data, we chose a negative binomial distribution function and a log link function. The collinearity of explanatory variables was assessed using variance inflation factors (VIF), with variables demonstrating values greater than five being discarded, following Zuur et al. (2007). A Thin Plate Regression Splines (TPRS) smoother was employed for each smooth term as it can capture a wide variety of functional forms, making them suitable for modeling complex nonlinear relationships without having to specify the functional form in advance. The performance of the models was assessed by calculating the deviance explained and Akaike’s Information Criterion (AIC). The analyses were conducted using the “mgcv” package (version 1.9-0) in R statistical software (version 4.3.1).

2.5.2 Short-term periodic sampling

In a single survey season, should there be an insufficient number of ARUs for concurrent recordings across all sampling sites, necessitating rotation, discrepancies in temporal setup could introduce biases. Such inconsistencies might undermine the reliability of subsequent analyses. For example, shifts in breeding statuses over time might influence vocalization frequencies (Slagsvold, 1977). Thus, during experimental design, it is imperative to target a timeframe wherein temporal variations in avian vocalizations are minimal. Essentially, intervals should be chosen where daily vocal patterns remain sufficiently stable to complete rotation recording within that period. Given the time constraints associated with ARU rotation, reducing deployment durations at individual sites might allow more rotation cycles. Yet, exceedingly short recording spans might risk data bias (Pérez-Granados et al., 2019b). In the segment discussing short-term periodic sampling, we address two facets: (1) Time window: pinpointing intervals throughout the PAM survey season when bird acoustic activity is relatively uniform; and (2) Survey duration: ascertaining the required days for a PAM survey to garner representative data.

For the time window, we employed HCA to group dates with similar vocal activity patterns. We first calculated daily VAR (VAR_d) separately for each station and species, and normalized the data using Z-score normalization to achieve a mean value of zero and a standard deviation of one. Subsequently, we conducted HCA to identify dates with Euclidean distance and Ward’s linkage. The root-mean-square error (RMSE, the square root of the sum of squared distances between any two samples within a cluster divided by the number of samples) and the within-cluster maximum distance (WCMD, the greatest distance between any two samples within a cluster) were computed as indicators of data homogeneity within clusters. The clustering analysis and metric calculations were performed using Python (version 3.10.9), with the SciPy (version 1.10.0) and Scikit-learn (version 1.2.1) packages. Visualization was accomplished using the Matplotlib package (version 3.7.0).

For the survey duration, we utilized the CV of daily VAR (VAR_d) values across various consecutive survey durations as an evaluation metric to assess the influence of different consecutive survey durations on the reliability of VAR_d values. Initially, we defined the consecutive survey durations between one to 14 days for test based on the PAM deployment durations used in previous works (Machado et al., 2017; Wood et al., 2020; Metcalf et al., 2021; Rumelt et al., 2021; Jahn et al., 2022). For each duration, starting from day one, we extracted the corresponding data, calculating the VAR_d values and their mean. Upon completion of calculations, we moved forward day by day and repeated the extraction and calculation process for the same duration until all days were covered. Finally, from the obtained average VAR_d values, we calculated their standard deviation and mean. The ratio of these two values yielded the CV. We applied this calculation to each homogeneous cluster obtained from the Time window analysis, allowing a comparative assessment of CV differences across clusters.

2.5.3 Diel sampling

To conserve both the storage capacity of audio files and the time required for analysis, recordings are typically sampled during peak vocalization periods of target species within a day. We divided the day into 24-hour segments. Using a GAM, we evaluated the association between an hourly VAR response variable (VAR_h, denoting the quantity of vocalizations per hour) and three explanatory variables: species (categorical), DOY (smooth term), and hour (smooth term). Due to the same rationales applied in the environmental factor analysis, we employed a negative binomial distribution with a log link function and a TPRS smoother. The collinearity of the explanatory variables was evaluated using the VIF; variables with VIF values greater than five were excluded. To represent the 24 hours in a day, a value of 24 basis dimensions (k) was utilized. All analyses were performed using the “mgcv” package (version 1.9-0) in the R statistical software (version 4.3.1).

2.5.4 Hourly sampling

As we conducted continuous 24-hour recordings each day, we possessed comprehensive recording data, enabling us to simulate different hourly sampling designs and examine their impacts on the VAR value per minute (VAR_m). We concentrated on two main dimensions: coverage (representing the ratio of time recorded in an hour) and dispersion (indicating the number of recording intervals within an hour, with an X:Y format signifying cycles of X minutes of recording followed by Y minutes of inactivity). Based on seven coverage patterns, we simulated 21 unique sampling combinations for each hour, as elaborated in Table 1. For every species, date, and hour, we derived the mean VAR_m values from both continuous recordings and various sampling strategies, subsequently determining the difference between these values. In the end, we computed the average difference in VAR_m values across different species and sampling designs. A reduced mean difference indicates closer alignment between the VAR_m values from continuous recording and a particular sampling design.

Table 1. Hourly sampling combinations from seven coverage designs.

This table lists 21 distinct sampling combinations derived from seven temporal coverage designs. ‘Temporal coverage’ indicates the fraction of an hour recorded. In the ‘Sampling combinations’ column, the format X:Y designates cycles of X minutes of recording (ON) succeeded by Y minutes of pause (OFF). These sampling designs were simulated to evaluate their effects on the VAR_m.

Temporal coverage	Sampling combinations
1/2	1:1, 5:5, 15:15, 30:30
1/3	1:2, 5:10, 10:20, 20:40
1/6	1:5, 2:10, 5:25, 10:50
1/10	1:9, 3:27, 6:54
1/15	1:14, 2:28, 4:56
1/30	1:29, 2:58
1/60	1:59

3. Results

3.1 Vocal detection

Between March 1 and June 30, 2021, spanning 122 days, a total of 789,986 three-minute audio files were collected across 14 sampling stations, approximately totaling 39 thousand hours. During this period, two stations, SCIH11 and SCIH18, failed to record audio files from mid-May to mid-June due to memory card issues. These two stations were subsequently excluded from further analysis. Of the remaining 12 stations, due to deployment scheduling, equipment operation, and battery management issues, at least one station had days where recorded data did not reach 23.5 hours for a total of 10 days. Data from these specific days (i.e., DOY 60–62, 141, 168, and 177–181) were also omitted from further analysis. Hence, data from 12 stations over 112 days continued for subsequent analysis.

We employed SILIC (Version exp29) for automated sound detection. Upon manual inspection by experienced bird surveyors of the 1,200 randomly sampled entries detected, 424 were confirmed as true detections, while 776 were false detections. AUC scores for each species, derived from the test set, ranged from 0.87 to 1.0, and AP scores ranged from 0.85 to 1.0. This demonstrates the excellent detection performance of SILIC within the scope of this study, making it apt for further analysis.

We selected the confidence score at which the precision score for each species was not less than 0.95 as the threshold to minimize the occurrence of false positives. Sound detection results with a confidence score greater than or equal to the threshold were screened for subsequent analysis. In total, 8,202,731 vocalizations from 12 species were detected, with the Taiwan Yuhina having the highest count at 2,863,838, and the Gray-headed Woodpecker the lowest at 23,312. Detailed data on vocalizations detected by SILIC can be found in the Underlying data: a compressed file “VAR_m_all_columns.zip” and a summary as presented in Table S2 (Wu et al. (2023)). For comprehensive information on the test datasets, threshold values, and various performance metrics for each species, please refer to the Underlying data: Table S3 (Wu et al. (2023)). Precision and recall curves are provided in the Underlying data: Figure S5 (Wu et al. (2023)).

3.2 Environment factors

Collinearity tests revealed that the variance inflation factor (VIF) for all exploratory variables was below 5, hence all variables were retained for forward stepwise GAM modeling and fitting. When only using species and vegetation types as predictors, the deviance explained is 50.1%. Incorporating altitude as a smoothed term, differentiated by species, increased the deviance explained to 62.9%. Subsequently, by adding five climatic variables as smoothed terms, the deviance explained rose to 68%. Finally, introducing DOY while distinguishing among species further increased the deviance explained to 73.2%. This demonstrates that each variable contributes to the prediction of daily vocal activity rate (VAR_d), as shown in Table 2.

Table 2. Model selection results for predicting daily vocal activity rate with GAMs.

This table summarizes the outcomes of a forward stepwise variable selection procedure, detailing the Akaike Information Criterion (AIC), adjusted R², and proportion of deviance explained. The GAM predicts the daily vocal activity rate (vocalizations per day) based on interactions between Species and Vegetation types. The model includes smoothed effects of Altitude (meters, varying by Species), corrected precipitation (Rainfall, mm), wind speed at 2 meters (WindSpeed, m/s), temperature at 2 meters (Temp, °C), relative humidity at 2 meters (RH, %), surface pressure (SP, kPa), and Day of the Year (DOY, varying by species). Weather data were sourced from the NASA/POWER CERES/MERRA2 Native Resolution Daily Data repository, accessible at https://power.larc.nasa.gov/data-access-viewer/. Vegetation data were provided by the Minister of the Interior (2022).

Model (predictors)	AIC	R²(adj.)	Deviance
Null	206430	0	0%
Species * Vegetation	192263	0.358	50.1%
Species * Vegetation + s (Altitude, by=Species)	186863	0.456	62.9%
Species * Vegetation + s (Altitude, by=Species) + s (Rainfall) + s (WindSpeed) + s (Temp) + s (RH) + s (SP)	184246	0.438	68.0%
Species * Vegetation + s (Altitude, by=Species) + s (Rainfall) + s (WindSpeed) + s (Temp) + s (RH) + s (SP) + s (DOY, by=Species)	181310	0.474	73.2%

The model with the lowest AIC, encompassing all nine exploratory variables, was chosen. Most fixed effects and all smoothed terms (including altitude, DOY, and climatic variables) were found to have a significant impact on the predictive capacity of the model, as detailed in the Underlying data: Table S4 (Wu et al., 2023).

While all 12 species exhibited significant correlations between DOY and the VAR_d, the patterns of these relationships were not consistent across species. Some species displayed a strong positive correlation in the early stages of the survey period, which shifted to a pronounced negative correlation in later stages. Conversely, other species demonstrated the opposite pattern (refer to the Underlying data: Figure S6 (Wu et al., 2023)). Similar inconsistencies between species were observed in relation to altitude (see the Underlying data: Figure S7 (Wu et al., 2023)).

When rainfall was less than 40 mm, there was no apparent influence on VAR_d. However, beyond this threshold, VAR_d showed a rapid decrease, with effects diminishing after approximately 60 mm. Wind speeds of up to 3.0 m/s had no discernible effect on VAR_d, but rates declined sharply beyond this speed. For temperatures up to 20°C, VAR_d gradually increased as temperatures rose, with no apparent effects beyond this threshold. Relative humidity had a slight negative effect on VAR_d once it exceeded 80%. As for atmospheric pressure, the impact on VAR_d shifted from negative to positive as pressure moved from low to high. For more detailed information, please refer to the Underlying data: Figure S8 (Wu et al., 2023).

3.3 Short-term periodic sampling

Upon examining the dendrogram derived from HCA (Figure 1), it becomes evident that at a Euclidean distance of 35, the entire survey period can be partitioned into five distinct clusters. Within each cluster, the VAR_d patterns of different bird species at each PAM station are similar among days within the same cluster but differ from those in other clusters. Notably, Clusters 3, 4, and 5 exhibit substantially lower root-mean-square error (RMSE) and within-cluster maximum distance (WCMD) values compared to Clusters 1 and 2, indicating more homogeneous distributions of VAR_d within these clusters (Table 3). An observation of the DOY distribution within the five clusters reveals a largely sequential pattern over time, with only two exceptions (DOY 83 and 119) both falling within Cluster 5. Clusters 1 and 2 encompass 12 and 18 days respectively, approximately aligning with the first and latter halves of March. Conversely, Clusters 3, 4, and 5, each containing no fewer than 25 days, roughly correspond to the months of April, May, and June, respectively (Figure 2).

Figure 1. Hierarchical clustering of dates by vocal activity patterns.

Dendrogram derived from hierarchical clustering of dates based on daily vocal activity. At a Euclidean distance threshold of 35 (indicated by the grey dashed line), five clusters are discerned (numbered circles).

Table 3. Day of year (DOY) clusters from hierarchical clustering analysis Using Daily Vocal Activity Rate.

This table details the five clusters identified based on daily vocal activity rates. The root-mean-square error (RMSE), within-cluster maximum distance (WCMD), number of days, and specific DOYs are provided for each cluster.

Cluster	RMSE	WCMD	No. of days	DOYs
1	12.48	24.36	12	63~74
2	12.00	24.33	18	75~82, 84~93
3	9.18	18.26	25	94~118
4	8.04	17.55	29	120~149
5	7.78	18.59	28	83, 119, 150~176

Figure 2. Short-term periodic representation of vocal activity clusters and rainfall.

The outer ring indicates vocal activity clusters distinguished by colors: 1 (red), 2 (orange), 3 (blue), 4 (green), and 5 (yellow). Day of Year (DOY) is shown clockwise from the top on the ring's periphery. Within this ring, dashed lines represent rainfall (in mm) from the SCIH13 PAM station, centrally located in the survey area. Two outlier dates in Cluster 5 correspond to rainfall events. The inner ring denotes the months: March to June.

In Figure 3, the CV of mean VAR_d values spanning one to fourteen consecutive recording days is presented. It is important to note that the sample size (number of days) varies among clusters, particularly with Clusters 1 and 2 having significantly fewer samples compared to the other three clusters. Clusters with a smaller sample size may yield lower CV values compared to those with a larger sample size. Therefore, our discussion focuses primarily on Clusters 3, 4, and 5, which have relatively larger and similar sample sizes. Among the three, Cluster 4 exhibits the lowest CV values, where the decline becomes less pronounced after recording for more than seven days. Conversely, the decrease in CV values for Clusters 3 and 5 persists until recordings reach 14 days, with their final CV values still slightly exceeding that of Cluster 4 when the recording duration is set at seven days.

Figure 3. Variation in daily vocal activity rate among clusters.

The graph displays the Coefficient of Variation (CV) against consecutive survey days (X-axis) for distinct clusters identified through hierarchical clustering (represented by colored lines). The number of days in each cluster is indicated in parentheses next to the cluster number in the legend. The Y-axis marks the CV values. Interpreting CV values for clusters 1 and 2 should be cautious, particularly for extended survey durations. Their limited days might result in artificially low CV values, potentially underestimating variation.

3.4 Diel sampling

Collinearity analysis revealed that the variance inflation factor (VIF) for all explanatory variables was less than 5. Thus, all variables were retained for GAM modeling. The fit of the GAM indicated a deviance explained of up to 80.8% (adjusted R² = 0.70). All species exhibited a significant influence on VAR_h with respect to the hour (p < 0.001). Dawn and dusk are defined as approximately an hour before and after sunrise and sunset, respectively. Given the study’s duration of four months, sunrise times oscillated between approximately 5 am and 6 am, while sunset times ranged from around 6 pm to 7 pm. Consequently, dawn is represented from 4 am to 7 am, and dusk from 5 pm to 8 pm.

Observations from Figure 4 regarding the hourly impact on VAR_h reveal that, except for the Collared Owlet and the Taiwan Bush Warbler, the remaining 10 species exhibited a significant positive influence on VAR_h during dawn. This influence gradually declined during the day and swiftly transitioned from a positive to a negative impact at dusk. Throughout the night, a consistent, highly negative influence was observed. The Collared Owlet displayed rapid fluctuations in its influence on VAR_h during dawn and dusk, transitioning from a mild positive effect during the day to a negative one, and maintaining a mildly negative influence at night. The Taiwan Bush Warbler transitioned from a strong negative impact on VAR_h during the late night to a positive one, peaking just before dawn and then declining. During the day, it transitioned from a mild positive to a negative influence, and finally, it exhibited intense fluctuations during dusk, soaring from a negative to a pronounced positive influence before plummeting to a strong negative impact.

Figure 4. Diurnal patterns of hourly vocal activity for twelve target bird species.

Each panel displays the GAM-predicted relationship between the hour of the day and the hourly vocal activity rate (VAR_h) for a specific species: (a) Collared Owlet, (b) Large-billed Crow, (c) Taiwan Bush Warbler, (d) Grey-chinned Minivet, (e) Taiwan Vivid Niltava, (f) Eurasian Nuthatch, (g) Taiwan Rosefinch, (h) Taiwan Yuhina, (i) Taiwan Shortwing, (j) Ashy Wood-Pigeon, (k) Green-backed Tit, and (l) Gray-headed Woodpecker. The y-axis represents the smooth effect of hour on VAR_h for each species. The solid blue line represents the predicted deviation with a 95% confidence interval (blue dashed lines). A reference line is shown at Y=0 (red dashed line). A deep gray shade indicates nighttime, while dawn (approx. 4 AM to 7 AM) and dusk (approx. 5 PM to 8 PM) are highlighted in light gray, reflecting variations due to sunrise and sunset times over the study's four-month span. Accompanying each plot are the estimated degrees of freedom (edf) and significance codes (*** p < 0.001, ** p < 0.01, * p < 0.05).

3.5 Hourly sampling

Analysis of 21 distinct combinations of coverage and dispersion revealed that higher proportions of recording time, coupled with shorter and more dispersed recording segments, result in VAR_m values from sampling more closely aligning with those from continuous recordings. This trend was consistent across all target species, as illustrated in Figure 5.

Figure 5. Comparison of vocal activity rates across sampling designs for twelve target bird species.

The graph showcases the mean differential VAR_m (vocalizations per minute) between continuous recordings and various sampling methods for each species. Each subplot presents six species differentiated by unique colors. The x-axis lists sampling designs, ordered by decreasing coverage (proportion of the hour recorded) and dispersion (denoted as X:Y, indicating X minutes of recording followed by Y minutes of pause). A reduced mean differential suggests a closer match between the VAR_m from the sampling method and the continuous recording.

4. Discussion

In this study, we selected a group of twelve bird species inhabiting subtropical montane forests, each species representing a distinct ecological guild. It is critical to acknowledge that this selection constitutes merely a small fraction of the entire avian community and the expansive soundscape. Moreover, our study is focused exclusively on a single type of vocalization for each species. Given the fact that different vocal types—such as songs versus alarm calls—may reflect different statuses of a species (Catchpole & Slater, 2003), the selection of vocal type could potentially impact the sampling design we propose. To minimize this effect, where practical, songs were predominantly chosen as the vocal type for the species under study. This approach implies that our findings are particularly tailored to optimize sampling designs for monitoring breeding species populations within the study area. Consequently, researchers should carefully consider the suitability of their monitoring objectives in light of their selected sampling design. Despite these constraints, the extensive data collected in this study provide valuable insights into the considerable variability in avian vocal activity rates across three distinct temporal scales, highlighting the importance of temporal sampling design in studies predominantly utilizing PAM.

4.1 Species and environmental factors

Our study elucidates how vocal activity in subtropical forest-dwelling birds is influenced by species, temporal factors, and external environmental conditions. We show that the VAR pattern is strongly affected by individual species’ interaction with vegetation, altitude and DOY. Climate also plays a significant role, impacting VAR across all species. These findings emphasize the significant challenges posed by utilizing PAM to infer the population status and trends of a specific species. These challenges become even more pronounced when the monitoring effort is directed at multi-species bird assemblages, especially when constrained by equipment and time. Consequently, choosing the most appropriate recording sampling design is crucial to ensure data representativeness and comparability.

4.2 Short-term periodic sampling

Throughout a bounded time period, like the breeding season examined here, researchers often consider the entire duration as a closed population, executing repeated data collections and comparative analyses within this window (Baillie, 1991). However, our analysis of short-term periodic sampling revealed pronounced species-specific temporal variations in vocal data over the course of the time sequence. We partitioned the survey season into five clusters, each reflecting relatively consistent vocal patterns. The vocal behaviors within the first two clusters displayed notable intra-cluster variability, potentially attributable to temporal nuances in breeding phases and vocalization rates among different species (Slagsvold, 1977). Given the observed variability in vocal behaviors early in the season, we recommend extra care when starting acoustic surveys at the beginning of the breeding season. In our study, Clusters 3 and 4 were identified as the optimal periods for acoustic surveys, as these intervals exhibited the lowest variation in vocalization activity across species. Moreover, these consecutive time frames totaled nearly eight weeks, offering greater flexibility in managing the rotation of recording devices.

For intervals characterized by minimal daily vocalization frequency shifts, we propose that a continuous 14-day recording effectively diminishes sampling variance. During phases with even more consistent patterns, such as the fourth cluster identified in this study (circa May), a seven-day recording suffices. Moreover, we advise discarding data from days with rainfall exceeding 40 mm and average wind speeds surpassing 3.0 m/s. These environmental conditions have been empirically demonstrated to considerably dampen vocal activity rates, a phenomenon corroborated in other avian studies (Vokurková et al., 2018; Robbins, 1981).

4.3 Diel and hourly sampling

Regarding diel sampling, even though our primary emphasis was on diurnal birds peaking in morning vocalizations, certain species, like the Taiwan Bush Warbler in this study, and approximately 30% of North American birds vocalize nocturnally (La, 2012). We therefore recommend sampling approximately one hour before and after sunrise to coincide with the morning chorus. If resources permit, recording sessions could commence from midnight to encompass vocal peaks of nocturnal species (Schaaf et al., 2023; Pérez-Granados & Schuchmann, 2020; Odom & Mennill, 2010).

In terms of hourly sampling, our findings align with studies on marine mammals, indicating that longer recording durations combined with shorter, more spaced-out intervals, yield vocal activity rates similar to continuous recordings (Thomisch et al., 2015). The scope of our investigation was confined to examining the effects of dispersion at the minute level, without delving into higher temporal units such as hours or days. This specific focus was dictated by the prevailing limitation that most ARUs are currently only programmable at the minute level. Additionally, there is a noted scarcity in prior research that has formulated dispersion sampling strategies for intervals extending beyond minutes. Investigations in the future, exploring higher temporal scales, might yield further insights. These could potentially enhance the understanding of sampling methodologies that are capable of reducing the duration of recordings, while simultaneously maintaining the integrity and quality of the data collected.

5. Conclusions

This study underscores the significance of optimizing temporal and sampling design in PAM from ecological and conservation perspectives. Through such optimization, we not only ensure efficient use of limited resources but also broaden the scope of the monitoring project in terms of temporal, spatial, and taxonomic. Such refinements enhance our understanding of avian community structures and their responses to environmental changes. Based on the findings of this study, within a similar research scope, we recommend the following guidelines for temporal sampling strategies:

1. Conduct acoustic surveys during the mid-breeding season (April to May), where VAR variability is relatively low.
2. Single survey sessions should last a minimum of seven consecutive days, with 14 days being ideal, to substantially reduce sampling variability.
3. Concentrating recording times around the morning (one hour before and after sunrise) can greatly improve detection rates within limited resources. If resources allow, recording can start as early as midnight to include species that peak in vocal activity at night.
4. Employ a schedule of recording for one minute followed by a five-minute rest (a time coverage of 1/6). This schedule, for most bird species, yields data closer to continuous recording compared to a 30-minute recording with a 30-minute rest interval (time coverage of 1/2), while only requiring one-third of the data volume.
5. Avoid recording or using data during weather conditions with rainfall greater than 40 mm, wind speeds exceeding 3 m/s, and temperatures below 20°C, as these significantly reduce VAR.

However, it is essential to emphasize that our study was specifically conducted on 12 breeding bird species within subtropical montane forests. Consequently, applying these findings to other ecosystems or to a broader range of avian taxa demands careful consideration. Moreover, when the focus of monitoring narrows down to a single species, it becomes crucial to devise a sampling strategy that aligns with the distinctive behaviors of that species. We advocate for future studies to build upon our foundational research, venturing into diverse ecological landscapes and including a broader spectrum of bird species.

When temporal sampling enables more economical collection of acoustic data while ensuring its representativeness, researchers around the world will have the opportunity to collaborate in seeking a consistent and cost-effective temporal sampling standard. This not only facilitates cross-dataset research but also supports manageable data sizes for global-scale or decadal long-term data compilations. Such efforts are instrumental in addressing macro-issues like climate change and promoting sustainable development for humanity.

Data availability

Underlying data

Zenodo: Underlying data for ‘Evaluating community-wide temporal sampling in passive acoustic monitoring: A comprehensive study of avian vocal patterns in subtropical montane forests’, https://www.doi.org/10.5281/zenodo.8304104 (Wu et al., 2023).

This project contains the following underlying data:

• Supplementary Material_v2.pdf. (This file encompasses Figures S1 to S9, Tables S1 to S4, and Appendix S2 which details the calculations for performance metrics.)
• VAR_m_all_columns.zip. (This file comprises columns such as site_name, longitude (WGS84), latitude (WGS84), altitude (meters above sea level), vegetation types, date, hour, minute, Day of Year (DOY), scientific_name, and Vocal Activity Rate per minute (VAR_m). The vocalization data is aggregated by site, species, and time (down to the minute).

Data are available under the terms of the Creative Commons Attribution 4.0 International license (CC-BY 4.0)

References

Alvarez-Berríos N, Campos-Cerqueira M, Hernández-Serna A, et al.: Impacts of Small-Scale Gold Mining on Birds and Anurans Near the Tambopata Natural Reserve, Peru, Assessed Using Passive Acoustic Monitoring. Trop. Conserv. Sci. 2016; 9(2): 832–851. Publisher Full Text
Baillie SR: Monitoring terrestrial breeding bird populations.Goldsmith B, editor. Monitoring for Conservation and Ecology. Netherlands: Springer; 1991; pp. 112–132. Publisher Full Text
Bateman HL, Riddle SB, Cubley ES: Using Bioacoustics to Examine Vocal Phenology of Neotropical Migratory Birds on a Wild and Scenic River in Arizona. Birds. 2021; 2(3): 261–274. Publisher Full Text
Blicharska M, Smithers RJ, Mikusiński G, et al.: Biodiversity’s contributions to sustainable development. Nat. Sustain. 2019; 2(12): 1083–1093. Publisher Full Text
Bruni A, Mennill DJ, Foote JR: Dawn chorus start time variation in a temperate bird community: Relationships with seasonality, weather, and ambient light. J. Ornithol. 2014; 155(4): 877–890. Publisher Full Text
Campos-Cerqueira M, Aide TM: Improving distribution data of threatened species by combining acoustic monitoring and occupancy modelling. Methods Ecol. Evol. 2016; 7(11): 1340–1348. Publisher Full Text
Catchpole CK, Slater PJ: Bird song: Biological themes and variations. Cambridge University Press; 2003.
Cazalis V, Princé K, Mihoub J-B, et al.: Effectiveness of protected areas in conserving tropical forest birds. Nat. Commun. 2020; 11(1): 4461. PubMed Abstract | Publisher Full Text | Free Full Text
Crespo J, Solís I, Barba E: Illegal Bird Hunting in Eastern Spain: A Declining Trend, But Still Worrying. Ardeola. 2020; 68(1). Publisher Full Text
Deichmann JL, Hernández-Serna A, Delgado CJA, et al.: Soundscape analysis and acoustic monitoring document impacts of natural gas exploration on biodiversity in a tropical forest. Ecol. Indic. 2017; 74: 39–48. Publisher Full Text
Diepstraten J, Willie J: Assessing the structure and drivers of biological sounds along a disturbance gradient. Glob. Ecol. Conserv. 2021; 31: e01819. Publisher Full Text
Digby A, Towsey M, Bell BD, et al.: Temporal and environmental influences on the vocal behaviour of a nocturnal bird. J. Avian Biol. 2014; 45(6): 591–599. Publisher Full Text
Ding T-S: Avian community ecology of mature forests in Mt. Yushan [Master Thesis]. National Taiwan University.1993.
Duchac LS, Lesmeister DB, Dugger KM, et al.: Passive acoustic monitoring effectively detects Northern Spotted Owls and Barred Owls over a range of forest conditions. Condor. 2020; 122: duaa017. Publisher Full Text
Ducrettet M, Forget P-M, Ulloa JS, et al.: Monitoring canopy bird activity in disturbed landscapes with automatic recorders: A case study in the tropics. Biol. Conserv. 2020; 245: 108574. Publisher Full Text
Dueñas M-A, Hemming DJ, Roberts A, et al.: The threat of invasive species to IUCN-listed critically endangered species: A systematic review. Glob. Ecol. Conserv. 2021; 26: e01476. Publisher Full Text
Favaro L, Cresta E, Friard O, et al.: Passive acoustic monitoring of the endangered African Penguin (Spheniscus demersus) using autonomous recording units and ecoacoustic indices. Ibis. 2021; 163(4): 1472–1480. Publisher Full Text
Fraixedas S, Lindén A, Piha M, et al.: A state-of-the-art review on birds as indicators of biodiversity: Advances, challenges, and future directions. Ecol. Indic. 2020; 118: 106728. Publisher Full Text
Franklin MJM, Major RE, Bradstock RA: How much survey effort is required to assess bird assemblages in fire-prone eucalypt forests using acoustic recorders? Wildl. Res. 2021; 48: 414–421. Publisher Full Text
Hoefer S, McKnight DT, Allen-Ankins S, et al.: Passive acoustic monitoring in terrestrial vertebrates: A review. Bioacoustics. 2023; 32: 506–531. Publisher Full Text
Jahn P, Ross J, MacKenzie D, et al.: Acoustic monitoring and occupancy analysis: Cost-effective tools in reintroduction programmes for roroa-great spotted kiwi. N. Z. J. Ecol. 2022; 46(1): 3466. Publisher Full Text
Kahl S, Wood CM, Eibl M, et al.: BirdNET: A deep learning solution for avian diversity monitoring. Eco. Inform. 2021; 61: 101236. Publisher Full Text
La Sorte FA, Horton KG, Johnston A, et al.: Seasonal associations with light pollution trends for nocturnally migrating bird populations. Ecosphere. 2022; 13(3). Publisher Full Text
La VT: Diurnal and nocturnal birds vocalize at night: A review. Condor. 2012; 114(2): 245–257. Publisher Full Text
Lahoz-Monfort JJ, Magrath MJL: A Comprehensive Overview of Technologies for Species and Habitat Monitoring and Conservation. Bioscience. 2021; 71(10): 1038–1062. PubMed Abstract | Publisher Full Text | Free Full Text
Machado RB, Aguiar L, Jones G: Do acoustic indices reflect the characteristics of bird communities in the savannas of Central Brazil? Landsc. Urban Plan. 2017; 162: 36–43. Publisher Full Text
Marques TA, Thomas L, Martin SW, et al.: Estimating animal population density using passive acoustics. Biol. Rev. 2013; 88(2): 287–309. PubMed Abstract | Publisher Full Text | Free Full Text
de Melo TN , Cerqueira MC, D’Horta FM, et al.: Impacts of a large hydroelectric dam on the Madeira River (Brazil) on floodplain avifauna. Acta Amazon. 2021; 51(4): 298–310. Publisher Full Text
Metcalf OC, Barlow J, Marsden S, et al.: Optimizing tropical forest bird surveys using passive acoustic monitoring and high temporal resolution sampling. Remote Sens. Ecol. Conserv. 2021; 8: 45–56. Publisher Full Text
Minister of the Interior: The 4th Overall Review of Yushan National Park Plan. Minister of the Interior; 2022.
Negret PJ, Maron M, Fuller RA, et al.: Deforestation and bird habitat loss in Colombia. Biol. Conserv. 2021; 257: 109044. Publisher Full Text
Northrup JM, Rivers JW, Yang Z, et al.: Synergistic effects of climate and land-use change influence broad-scale avian population declines. Glob. Chang. Biol. 2019; 25(5): 1561–1575. PubMed Abstract | Publisher Full Text
Odom KJ, Mennill DJ: A Quantitative Description of the Vocalizations and Vocal Activity of the Barred Owl. Condor. 2010; 112(3): 549–560. Publisher Full Text
Pereira H, Davidcooper H: Towards the global monitoring of biodiversity change. Trends Ecol. Evol. 2006; 21(3): 123–129. Publisher Full Text
Pérez-Granados C, Bota G, Giralt D, et al.: Vocal Activity Rate (VAR) index: A useful method to infer terrestrial bird abundance with acoustic monitoring. Ibis. 2019a; 161: 901–907. Publisher Full Text
Pérez-Granados C, Gómez-Catasús J, Bustillo-de la Rosa D, et al.: Effort needed to accurately estimate Vocal Activity Rate index using acoustic monitoring: A case study with a dawn-time singing passerine. Ecol. Indic. 2019b; 107: 105608. Publisher Full Text
Pérez-Granados C, Schuchmann K-L: Monitoring the annual vocal activity of two enigmatic nocturnal Neotropical birds: The Common Potoo (Nyctibius griseus) and the Great Potoo (Nyctibius grandis). J. Ornithol. 2020; 161(4): 1129–1141. Publisher Full Text
Pérez-Granados C, Schuchmann K-L, Marques MI: Passive acoustic monitoring of the Ferruginous Pygmy-Owl (Glaucidium brasilianum) over a complete annual cycle: Seasonality and monitoring recommendations. Stud. Neotropical Fauna Environ. 2021; 1–8. Publisher Full Text
Pérez-Granados C, Traba J: Esti mating bird density using passive acoustic monitoring: A review of methods and suggestions for further research. Ibis. 2021; 163: 765–783. Publisher Full Text
Robbins CS: Bird activity levels related to weather. Stud. Avian Biol. 1981; 6: 301–310.
Ross SRP-J, O’Connell DP, Deichmann JL, et al.: Passive acoustic monitoring provides a fresh perspective on fundamental ecological questions. Funct. Ecol. 2023; 37(4): 959–975. Publisher Full Text
Rumelt RB, Basto A, Mere Roncal C: Automated audio recording as a means of surveying tinamous (Tinamidae) in the Peruvian Amazon. Ecol. Evol. 2021; 11(19): 13518–13531. PubMed Abstract | Publisher Full Text | Free Full Text
Schaaf AA, Boullhesen M, Akmentins MS: Vocal activity of the rufous nightjar (Antrostomus rufus) in subtropical Yungas Andean forests of Argentina. Neotrop. Biodivers. 2023; 9(1): 59–63. Publisher Full Text
Shaw T, Schönamsgruber S, Cordeiro Pereira JM, et al.: Refining manual annotation effort of acoustic data to estimate bird species richness and composition: The role of duration, intensity, and time. Ecol. Evol. 2022; 12(11): e9491. PubMed Abstract | Publisher Full Text | Free Full Text
Shonfield J, Bayne EM: Autonomous recording units in avian ecological research: Current use and future applications. Avian Conserv. Ecol. 2017; 12(1): art14. Publisher Full Text
Slagsvold T: Bird Song Activity in Relation to Breeding Cycle, Spring Weather, and Environmental Phenology. Ornis Scand. 1977; 8(2): 197. Publisher Full Text
Sugai LSM, Silva TSF, Ribeiro JW, et al.: Terrestrial passive acoustic monitoring: Review and perspectives. Bioscience. 2019; 69(1): 15–25. Publisher Full Text
Symes LB, Kittelberger KD, Stone SM, et al.: Analytical approaches for evaluating passive acoustic monitoring data: A case study of avian vocalizations. Ecol. Evol. 2022; 12(4): e8797. PubMed Abstract | Publisher Full Text | Free Full Text
Thomisch K, Boebel O, Zitterbart DP, et al.: Effects of subsampling of passive acoustic recordings on acoustic metrics. J. Acoust. Soc. Am. 2015; 138(1): 267–278. PubMed Abstract | Publisher Full Text
Tsai P-Y, Ko C-J, Hsieh C, et al.: A trait dataset for Taiwan’s breeding birds. Biodivers. Data J. 2020; 8: e49735. PubMed Abstract | Publisher Full Text | Free Full Text
Vokurková J, Motombi FN, Ferenc M, et al.: Seasonality of vocal activity of a bird community in an Afrotropical lowland rain forest. J. Trop. Ecol. 2018; 34(1): 53–64. Publisher Full Text
Wood CM, Kahl S, Chaon P, et al.: Survey coverage, recording duration and community composition affect observed species richness in passive acoustic surveys. Methods Ecol. Evol. 2021; 12(5): 885–896. Publisher Full Text
Wood CM, Klinck H, Gustafson M, et al.: Using the ecological significance of animal vocalizations to improve inference in acoustic monitoring programs. Conserv. Biol. 2020; 35: 336–345. PubMed Abstract | Publisher Full Text
Wu S-H, Chang H-W, Lin R-S, et al.: SILIC: A cross database framework for automatically extracting robust biodiversity information from soundscape recordings based on object detection and a tiny training dataset. Eco. Inform. 2022; 68: 101534. Publisher Full Text
Wu S-H, Ko JC-J, Lin R-S, et al.: Underlying data for Evaluating community-wide temporal sampling in passive acoustic monitoring: A comprehensive study of avian vocal patterns in subtropical montane forests. [Dataset]. 2023. Publisher Full Text
Xu Y, Si Y, Wang Y, et al.: Loss of functional connectivity in migration networks induces population decline in migratory birds. Ecol. Appl. 2019; 29(7): e01960. PubMed Abstract | Publisher Full Text | Free Full Text
Yoo S, Kim H, Lee J, et al.: Seasonal and diurnal patterns of population vocal activity in avian brood parasites. Ibis. 2020; 162(3): 1001–1011. Publisher Full Text
Zuur AF, Ieno EN, Smith GM: Analysing ecological data. Springer; 2007; vol. 680. . Publisher Full Text
Zwerts JA, Stephenson PJ, Maisels F, et al.: Methods for wildlife monitoring in tropical forests: Comparing human observations, camera traps, and passive acoustic sensors. Conserv. Sci. Pract. 2021; 3(12). Publisher Full Text

Comments on this article Comments (0)

Version 2

VERSION 2 PUBLISHED 11 Oct 2023

Author details Author details

Jerome Chie-Jen Ko
Roles: Conceptualization, Investigation, Methodology, Validation, Writing – Review & Editing

Ruey-Shing Lin
Roles: Conceptualization, Funding Acquisition, Supervision, Writing – Review & Editing

Chia-Hao Chang-Yang
Roles: Supervision, Writing – Review & Editing

Hsueh-Wen Chang
Roles: Supervision, Writing – Review & Editing

Competing interests

No competing interests were disclosed.

Grant information

The author(s) declared that no grants were involved in supporting this work.

Article Versions (2)

version 2

Revised

Published: 23 Jan 2024, 12:1299

https://doi.org/10.12688/f1000research.141951.2

version 1

Published: 11 Oct 2023, 12:1299

https://doi.org/10.12688/f1000research.141951.1

© 2024 Wu SH et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Download

Export To

metrics

	Views	Downloads
F1000Research	-	-
PubMed Central Data from PMC are received and updated monthly.	-	-

Citations

SEE MORE DETAILS

CITE

how to cite this article

Wu SH, Ko JCJ, Lin RS et al. Evaluating community-wide temporal sampling in passive acoustic monitoring: A comprehensive study of avian vocal patterns in subtropical montane forests [version 2; peer review: 1 approved, 2 approved with reservations]. F1000Research 2024, 12:1299 (https://doi.org/10.12688/f1000research.141951.2)

NOTE: If applicable, it is important to ensure the information in square brackets after the title is included in all citations of this article.

track

receive updates on this article

Track an article to receive email alerts on any updates to this article.

Open Peer Review

Current Reviewer Status: ?

Key to Reviewer Statuses VIEW HIDE

ApprovedThe paper is scientifically sound in its current form and only minor, if any, improvements are suggested

Approved with reservations A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.

Not approvedFundamental flaws in the paper seriously undermine the findings and conclusions

Version 2

VERSION 2

PUBLISHED 23 Jan 2024

Revised

Views

Reviewer Report 22 Apr 2024

Jerry S Cole, The Institute for Bird Populations, Petaluma, California, USA

Approved

https://doi.org/10.5256/f1000research.161617.r239937

Summary: The authors have done their best to address each of the concerns that I raised in my first review of the submission. I thank them for their thoroughness and thoughtful responses. I think the paper has been improved to ... Continue reading

CITE

Report a concern

Respond or Comment

Version 1

VERSION 1

PUBLISHED 11 Oct 2023

Views

Reviewer Report 04 Mar 2024

Bárbara Freitas, National Museum of Natural Sciences (Ringgold ID: 16625), Madrid, Community of Madrid, Spain

Approved with Reservations

https://doi.org/10.5256/f1000research.155438.r224078

In the manuscript ‘Evaluating community-wide temporal sampling in passive acoustic monitoring: A comprehensive study of avian vocal patterns in subtropical montane forests’, the authors aimed to investigate the impact of different temporal sampling designs on the detected vocal activity rate of 12 bird species. The authors showed that there is substantial variability in these species’ vocal activity rates across three distinct temporal scales (seasonal, diel, and hourly) and then provided specific recommendations for each one of them and for different environmental conditions.
Although this study is an interesting case study that provides valuable insights into optimizing temporal sampling designs for Passive Acoustic Monitoring, it needs a more thorough discussion. The recommendations provided appear overly broad and exaggerated, considering the specific scope of this study centred on 12 bird species within subtropical montane forests. The generalizations made may not adequately align with the specific context of this research. This limitation is only briefly mentioned in the conclusion. In this regard, the manuscript could substantially improve by elaborating further on the limitations and potential biases of the study design. Addressing the study's constraints, such as the limited species selection and specific geographical location, would provide a more nuanced understanding of the scope of the conclusions.
Furthermore, the manuscript would significantly benefit from discussing the potential applicability of the findings in different ecological contexts. Thus, authors should consider expanding upon how these findings might translate to other ecosystems or a more extensive range of avian taxa, or even other classes.
Finally, some recommendations regarding sampling durations and environmental conditions should be further elucidated. Clarifying the reasoning behind specific recommendations would enhance the applicability and understanding of these suggestions.

General comments

Throughout the manuscript, there is a notable overuse of acronyms, which impedes readability. This hinders comprehension and creates difficulty for readers. Reducing unnecessary acronyms and ensuring consistent explanations for those used is crucial to enhance the overall readability and understanding of the manuscript. By minimizing the reliance on acronyms and providing clear explanations, the text can become more accessible and easier for readers to navigate
In the abstract, certain acronyms are introduced but not subsequently utilized, contributing to confusion. Please also check if all acronyms are explained within the text.

Abstract
There is some important information missing: the place where the data was collected, in which season, and the number of days of the earliest two segments that showed high variability.

Introduction
The authors state that birds serve as an ‘ideal indicator taxon for monitoring terrestrial biodiversity due to their detectability, identifiability, diversity, widespread distribution, and migratory characteristics’. However, many of these characteristics are also present in other taxa. Emphasis should be given to the detectability and identifiability parts, especially through sound and the availability of automatic detectors for this class.
The statement regarding the use of sampling lacks references to support it. Authors should explain better and elaborate more on the exogenous and endogenous factors that modulate avian vocal activity. As it is now, it is not clear to the reader.

Methods
The authors do not indicate if this study was or was not preregistered. This must be provided, according to the journal guidelines: "Authors must include a statement to indicate if they did or did not preregister the research with or without a data analysis plan at an independent registry"
Target species section – it is not clear which criteria the authors followed to define a species as representative of each guild: which parameters were they taking into account? Regarding the selection criteria when two or more species were available for the same guilt, the authors explain that the species ‘that manifested an altitudinal distribution most congruent with’ the study objectives was chosen, but this is vague. The authors should be more specific and explain the criteria with examples. If, for example, high detectability was a factor used to select species, this should be mentioned. Lastly, Table S1 was indicated in the last part of this section but this table does not mention the 12 guilds.
The reader would benefit from having more information about the SILIC software. For example, the five classes ‘song’, ‘call’, ‘drumming’, and ‘unknown’ are labelled by SILIC or they are just grouped and then labelled by the authors? What is the aim or need of combining bounding boxes of the same species within a single recording?
It should be indicated the mean of the confidence score of the random sample of 100 detected vocalizations. the definition of AUC should be added as well as an explanation on why this and AP were used.
The reference for YOLOv5 should be added.

Results
The number of files and equivalent time for the used for subsequent analyses (from 12 stations over 112 days) should be indicated.

Discussion
This section is oversimplified and the recommendations lack robust baseline support.
The authors made use of words such as behaviors, internal and external factors without really specifying what they refer to.
The authors suggest sampling around one hour before and after sunrise to capture the morning chorus, even proposing starting recordings at midnight. However, it's crucial to note that these recommendations might lack empirical validation for this particular field site, so the manuscript should highlight the need for further testing or verification.
Regarding diel sampling, results should be put into context with many more different studies on birds and justification for these findings should be discussed. For example, see (Darras et al. 2019)¹

Minor comments

Fig 2- rainfall instead of Corrected Precipitation - names should be the same throughout the text
Fig S1 – This figure could be improved by adding a small map of the geographic position of Taiwan in the globe. Please add in the legend what orange means and that the map also depicts the relief.
Fig S2 – please include axis in each spectrogram. The legend is not necessary to have in all of them (frequency or time), but the axis is fundamental for visualization of the spectrogram.
Fig S5, S6, S7, S8 – Readability and understanding of the figures could be improved if the name of the species would be written as title of each graph. Also, precision and recall should be explained.
Table 1 – Please provide a column with the temporal coverage in minutes, to allow for easier understanding.
Table S3 – Please specify what positive and negative mean. If relative to detections this should be specific in the column name.

Is the work clearly and accurately presented and does it cite the current literature?

Partly
Is the study design appropriate and is the work technically sound?

Yes
Are sufficient details of methods and analysis provided to allow replication by others?

Partly
If applicable, is the statistical analysis and its interpretation appropriate?

Yes
Are all the source data underlying the results available to ensure full reproducibility?

No
Are the conclusions drawn adequately supported by the results?

Partly

References

1. Darras K, Batáry P, Furnas BJ, Grass I, et al.: Autonomous sound recording outperforms human observation for sampling birds: a systematic map and user guide.Ecol Appl. 2019; 29 (6): e01954 PubMed Abstract | Publisher Full Text

Competing Interests: No competing interests were disclosed.

Reviewer Expertise: Bioacoustics, animal communication, passive acoustic monitoring, speciation

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.

CITE

Report a concern

Respond or Comment

Views

Reviewer Report 02 Jan 2024

Laurel B Symes, Cornell University, Ithaca, New York, USA

Approved with Reservations

https://doi.org/10.5256/f1000research.155438.r224421

The goal of this paper is to evaluate how the measured vocal activity rate of birds differs depending on the temporal sampling strategy. Evaluating how detection results differ with sampling strategy is useful, particularly across different species and habitats globally, and can inform decisions about whether it is possible to use a generalized sampling strategies across a range of scenarios, or whether sampling strategies need to vary with season, latitude, habitat type, geography and other factors.

Overall, the paper seems like a solid evaluation of sampling strategies. Currently, the introduction is strongly focused on the temporal aspects of sampling. In the analysis and figures, the authors devote substantial attention to impacts of environmental co-variates, which may well make sense, but could use some set-up and introduction (or could be downplayed or eliminated). In addition, the authors could provide some additional synthesis/interpretation of their findings in the context of designing PAM-based studies. While the authors provide some specific guidance in the discussion, people seeking actionable information for study design may appreciate even more clear direction (such as headings for sections that discuss recommendations for hourly sampling, diel sampling, etc., and/or a summary in the conclusion that focuses on the key recommendations).

Major feedback:
>From the intro “Sampling design can be categorized into four temporal scales: annual, seasonal, diel, and hourly. An annual sampling design implies recording during one or several time periods within a year.”
Do you have a reference for the way that these four types of sampling are defined? I would have anticipated that annual sampling compared one year against another (rather than comparing seasons within a year) and that seasonal sampling would have compared one season to another vs “after being operated for a predetermined number of days at each location, the ARU is relocated to a subsequent site” (which to me would generally imply sampling periods shorter than a season).

>Hourly sampling design can be categorized into coverage (the proportion of recorded time within an hour) and dispersion (the number of recording segments within an hour).
Dispersion can affect sampling designs other than hourly as well (for example, if only certain days are sampled). You might consider suggest treating the different temporal scales as one set and then separately discussing dispersion (how thoroughly each of those temporal scales are sampled), whether or not you assess how dispersion of sampling affects outcomes at scales above hourly.

Some more specifics below:
Abstract:
The goal stated in the abstract background (the last sentence) is unclear. Could the authors rephrase the same to indicate clearly what they are attempting to do here? For example, why does the effective temporal sampling design matter?
You need to introduce what vocal activity rate means prior to discussing the ideal design for VAR data.

Introduction:
Having read your introduction – which reads very clearly, I think you might want to rephrase your abstract in the context of trying to identify the right sampling design or approach & why this is important for different ecological research projects/ecosystems/taxonomic groups.

I think your objectives should be switched – a) trying to identify how sampling designs impact vocal activity rate patterns and b) provide recommendations.

Based on the key questions asked – your introduction should introduce sampling design followed by a few sentences on vocal activity rate/analysis of acoustic data – and then you could ask your two key questions in your study area. [sampling design -> why this is important for vocal activity rates/patterns -> ask your questions – how does sampling design impact vocal activity rate calculations?]
Methods:
Is VAR_d a species-specific measure or is it the total number of vocalizations per day for that site irrespective of the species? Please rephrase/clarify.
I am unclear on Table 1 – were all these different sampling strategies deployed/implemented in the field or were these post-hoc analysis based on sub-sampling data?
What does Figure 1 mean ecologically? HCA statistically models Euclidean distance, but what does it mean that day 94 is closely associated with day 110 for example?

>gender,
In this context, “sex” may be preferrable to “gender”.

>Concentrating recordings during specific time, like dawn, often capture more species
Concentrating recordings during specific time, like dawn, often capture*s* more species

>For species without a song or the song was
For species without a song or *if* the song was

>Secondly, SILIC offers a unique capability to detect each vocalization’s exact start and end time within an audio recording, rather than merely identifying the presence or absence of a certain vocalization within a broad time frame.
How precisely is this approach able to pinpoint start time? Is this different from other software options that assign call identity within a 3 or 5 second window?

>In the discussion, you might address how choosing only one vocalization type (generally song) might influence results, vs something like alarm calls, which might have different temporal distribution

>What is ‘corrected’ precipitation? (vs measured precipitation)

>Accordingly, ‘song’ represented 9 of the 12 species, ‘call’ 2, and ‘unknown’ 1.
In supplemental materials, indicate which spectrograms corresponded to which vocalization type

>spanning 122 working days
I would remove “working” unless it has specific meaning. (often ‘working’ is used to mean Mon-Fri vs weekend days)

> Due to the fewer days encompassed by Clusters 1 and 2, when calculating the CV values for a higher number of days, there might be a tendency to obtain relatively lower CV values owing to the smaller sample sizes.
I found this sentence confusing. I think that it means that clusters 1 and 2 are smaller and should not be compared to clusters that contain higher numbers of days?

>Figure 2 is visually appealing but may be more complicated to interpret than a horizonal four-month graph. If the figure is retained, the authors might consider having time flow clockwise rather than counterclockwise.

Is the work clearly and accurately presented and does it cite the current literature?

Yes
Is the study design appropriate and is the work technically sound?

Yes
Are sufficient details of methods and analysis provided to allow replication by others?

Yes
If applicable, is the statistical analysis and its interpretation appropriate?

Yes
Are all the source data underlying the results available to ensure full reproducibility?

Yes
Are the conclusions drawn adequately supported by the results?

Yes

Competing Interests: No competing interests were disclosed.

Reviewer Expertise: Bioacoustics, biodiversity, speciation

CITE

Report a concern

Author Response 01 Mar 2024

Shih-Hung Wu, Department of Biological Sciences, National Sun Yat-sen University, Kaohsiung, 804, Taiwan

01 Mar 2024

Author Response
Dear Dr. Laurel B Symes,
Thank you for your insightful review of our manuscript. We value your comments on evaluating vocal activity rates and sampling strategies across various species and ... Continue reading
Dear Dr. Laurel B Symes,
Thank you for your insightful review of our manuscript. We value your comments on evaluating vocal activity rates and sampling strategies across various species and habitats. In response to your suggestions, we have revised the manuscript accordingly and provide detailed responses to each of your comments below:

The goal of this paper is to evaluate how the measured vocal activity rate of birds differs depending on the temporal sampling strategy. Evaluating how detection results differ with sampling strategy is useful, particularly across different species and habitats globally, and can inform decisions about whether it is possible to use a generalized sampling strategies across a range of scenarios, or whether sampling strategies need to vary with season, latitude, habitat type, geography and other factors.

Overall, the paper seems like a solid evaluation of sampling strategies. Currently, the introduction is strongly focused on the temporal aspects of sampling. In the analysis and figures, the authors devote substantial attention to impacts of environmental co-variates, which may well make sense, but could use some set-up and introduction (or could be downplayed or eliminated). In addition, the authors could provide some additional synthesis/interpretation of their findings in the context of designing PAM-based studies. While the authors provide some specific guidance in the discussion, people seeking actionable information for study design may appreciate even more clear direction (such as headings for sections that discuss recommendations for hourly sampling, diel sampling, etc., and/or a summary in the conclusion that focuses on the key recommendations).

We have included a checklist of considerations in the Conclusions section to offer specific guidance for researchers conducting PAM studies in similar domains. (Section 5, Para. 1, Line 4)

Major feedback:
>From the intro “Sampling design can be categorized into four temporal scales: annual, seasonal, diel, and hourly. An annual sampling design implies recording during one or several time periods within a year.”
Do you have a reference for the way that these four types of sampling are defined? I would have anticipated that annual sampling compared one year against another (rather than comparing seasons within a year) and that seasonal sampling would have compared one season to another vs “after being operated for a predetermined number of days at each location, the ARU is relocated to a subsequent site” (which to me would generally imply sampling periods shorter than a season).

Indeed, we recognize that our use and definitions of 'annual sampling' and 'seasonal sampling' in the study may have been confusing for readers. We have decided to revise these terms to 'intra-annual sampling' and 'short-term periodic sampling' to avoid such ambiguity. (Abstract - Methods; Section 1, Para. 3, Line 2; Section 1, Para. 4, Line 1; Section 2.5.2, Para. 1, Line 8; Section 3.3; Section 4.2, Para. 1, Line 3)

>Hourly sampling design can be categorized into coverage (the proportion of recorded time within an hour) and dispersion (the number of recording segments within an hour).
Dispersion can affect sampling designs other than hourly as well (for example, if only certain days are sampled). You might consider suggest treating the different temporal scales as one set and then separately discussing dispersion (how thoroughly each of those temporal scales are sampled), whether or not you assess how dispersion of sampling affects outcomes at scales above hourly.

The scope of our investigation was confined to examining the effects of dispersion at the minute level, without delving into higher temporal units such as hours or days. This specific focus was dictated by the prevailing limitation that most ARUs are currently only programmable at the minute level. Additionally, there is a noted scarcity in prior research that has formulated dispersion sampling strategies for intervals extending beyond minutes. Investigations in the future, exploring higher temporal scales, might yield further insights. These could potentially enhance the understanding of sampling methodologies that are capable of reducing the duration of recordings, while simultaneously maintaining the integrity and quality of the data collected. We have added a section in the Discussion regarding the exploration of dispersion sampling strategies across different temporal scales. (Section 4.3, Para. 2, Line 3)

Some more specifics below:
Abstract:
The goal stated in the abstract background (the last sentence) is unclear. Could the authors rephrase the same to indicate clearly what they are attempting to do here? For example, why does the effective temporal sampling design matter?
You need to introduce what vocal activity rate means prior to discussing the ideal design for VAR data.

We have supplemented the abstract with a definition of VAR and its significance in the context of our study. (Abstract - Background)

Introduction:
Having read your introduction – which reads very clearly, I think you might want to rephrase your abstract in the context of trying to identify the right sampling design or approach & why this is important for different ecological research projects/ecosystems/taxonomic groups.

We have rewritten the background section of the abstract to clearly articulate the primary objectives of our study. (Abstract - Background)

I think your objectives should be switched – a) trying to identify how sampling designs impact vocal activity rate patterns and b) provide recommendations.

We have made revisions in accordance with your suggestions. (Section 1, Para. 8, Line 2)

Based on the key questions asked – your introduction should introduce sampling design followed by a few sentences on vocal activity rate/analysis of acoustic data – and then you could ask your two key questions in your study area. [sampling design -> why this is important for vocal activity rates/patterns -> ask your questions – how does sampling design impact vocal activity rate calculations?]

We have made revisions in accordance with your suggestions. (Section 1, Para. 4, Line 4; Para. 5, Line 4; Para. 6, Line 5)

Methods:
Is VAR_d a species-specific measure or is it the total number of vocalizations per day for that site irrespective of the species? Please rephrase/clarify.

VAR_d was calculated as the total daily vocalization count for each station and bird species separately. We have added a section to make the narrative more comprehensive and complete. (Section 2.5.1, Para. 2, Line 3)

I am unclear on Table 1 – were all these different sampling strategies deployed/implemented in the field or were these post-hoc analysis based on sub-sampling data?

Our study is based on continuous field recording data, and it simulates the outcomes and impacts of data acquisition under different temporal sampling scenarios. We have added explanatory text in the relevant sections to clarify this approach. (Section 2.5.4, Para. 1, Line 1; Explanation of Table 1.)

What does Figure 1 mean ecologically? HCA statistically models Euclidean distance, but what does it mean that day 94 is closely associated with day 110 for example?

Within each cluster, the VAR_d patterns of different bird species at each PAM station are similar among days within the same cluster but differ from those in other clusters. We have added explanatory text in the relevant section. (Section 3.3, Para. 1, Line 2)

>gender,
In this context, “sex” may be preferrable to “gender”.

Revised. (Section 1, Para. 7, Line 5)

>Concentrating recordings during specific time, like dawn, often capture more species
Concentrating recordings during specific time, like dawn, often capture*s* more species

Revised. (Section 1, Para. 7, Line 13)

>For species without a song or the song was
For species without a song or *if* the song was

Revised. (Section 2.4, Para. 2, Line 4)

>Secondly, SILIC offers a unique capability to detect each vocalization’s exact start and end time within an audio recording, rather than merely identifying the presence or absence of a certain vocalization within a broad time frame.
How precisely is this approach able to pinpoint start time? Is this different from other software options that assign call identity within a 3 or 5 second window?

SILIC can detect each vocalization at the millisecond level. We have made revisions in Section 2.4, Para. 1, Line 4.

>In the discussion, you might address how choosing only one vocalization type (generally song) might influence results, vs something like alarm calls, which might have different temporal distribution

We have revised the discussion segment focusing on the potential impacts of species and vocalization type on our research findings. (Section 4, Para. 1, Line 1)

>What is ‘corrected’ precipitation? (vs measured precipitation)

The corrected precipitation from NASA/POWER CERES/MERRA2 Native Resolution Daily Data is estimated through algorithms (climate models) that interpret satellite observations. These observations are then often adjusted or 'corrected' using various atmospheric data and models to provide more accurate estimates.

>Accordingly, ‘song’ represented 9 of the 12 species, ‘call’ 2, and ‘unknown’ 1.
In supplemental materials, indicate which spectrograms corresponded to which vocalization type

We have added information on the type of vocalization to each corresponding spectrogram. (Supplementary Material_v2.pdf)

>spanning 122 working days
I would remove “working” unless it has specific meaning. (often ‘working’ is used to mean Mon-Fri vs weekend days)

Revised. (Section 3.1, Para. 1, Line 1)

> Due to the fewer days encompassed by Clusters 1 and 2, when calculating the CV values for a higher number of days, there might be a tendency to obtain relatively lower CV values owing to the smaller sample sizes.
I found this sentence confusing. I think that it means that clusters 1 and 2 are smaller and should not be compared to clusters that contain higher numbers of days?

We have revised our discussion to more accurately address the differences in the CV values among clusters with varying sample sizes. (Section 3.3, Para. 2, Line 1)

>Figure 2 is visually appealing but may be more complicated to interpret than a horizonal four-month graph. If the figure is retained, the authors might consider having time flow clockwise rather than counterclockwise.

We have adjusted the figure to rotate clockwise, and we have also included Figure S9 in the Supplementary Material, which presents the data with a horizontal time axis. (Figure 2; Figure S9)
Dear Dr. Laurel B Symes,
Thank you for your insightful review of our manuscript. We value your comments on evaluating vocal activity rates and sampling strategies across various species and habitats. In response to your suggestions, we have revised the manuscript accordingly and provide detailed responses to each of your comments below:

The goal of this paper is to evaluate how the measured vocal activity rate of birds differs depending on the temporal sampling strategy. Evaluating how detection results differ with sampling strategy is useful, particularly across different species and habitats globally, and can inform decisions about whether it is possible to use a generalized sampling strategies across a range of scenarios, or whether sampling strategies need to vary with season, latitude, habitat type, geography and other factors.

Overall, the paper seems like a solid evaluation of sampling strategies. Currently, the introduction is strongly focused on the temporal aspects of sampling. In the analysis and figures, the authors devote substantial attention to impacts of environmental co-variates, which may well make sense, but could use some set-up and introduction (or could be downplayed or eliminated). In addition, the authors could provide some additional synthesis/interpretation of their findings in the context of designing PAM-based studies. While the authors provide some specific guidance in the discussion, people seeking actionable information for study design may appreciate even more clear direction (such as headings for sections that discuss recommendations for hourly sampling, diel sampling, etc., and/or a summary in the conclusion that focuses on the key recommendations).

We have included a checklist of considerations in the Conclusions section to offer specific guidance for researchers conducting PAM studies in similar domains. (Section 5, Para. 1, Line 4)

Major feedback:
>From the intro “Sampling design can be categorized into four temporal scales: annual, seasonal, diel, and hourly. An annual sampling design implies recording during one or several time periods within a year.”
Do you have a reference for the way that these four types of sampling are defined? I would have anticipated that annual sampling compared one year against another (rather than comparing seasons within a year) and that seasonal sampling would have compared one season to another vs “after being operated for a predetermined number of days at each location, the ARU is relocated to a subsequent site” (which to me would generally imply sampling periods shorter than a season).

Indeed, we recognize that our use and definitions of 'annual sampling' and 'seasonal sampling' in the study may have been confusing for readers. We have decided to revise these terms to 'intra-annual sampling' and 'short-term periodic sampling' to avoid such ambiguity. (Abstract - Methods; Section 1, Para. 3, Line 2; Section 1, Para. 4, Line 1; Section 2.5.2, Para. 1, Line 8; Section 3.3; Section 4.2, Para. 1, Line 3)

>Hourly sampling design can be categorized into coverage (the proportion of recorded time within an hour) and dispersion (the number of recording segments within an hour).
Dispersion can affect sampling designs other than hourly as well (for example, if only certain days are sampled). You might consider suggest treating the different temporal scales as one set and then separately discussing dispersion (how thoroughly each of those temporal scales are sampled), whether or not you assess how dispersion of sampling affects outcomes at scales above hourly.

The scope of our investigation was confined to examining the effects of dispersion at the minute level, without delving into higher temporal units such as hours or days. This specific focus was dictated by the prevailing limitation that most ARUs are currently only programmable at the minute level. Additionally, there is a noted scarcity in prior research that has formulated dispersion sampling strategies for intervals extending beyond minutes. Investigations in the future, exploring higher temporal scales, might yield further insights. These could potentially enhance the understanding of sampling methodologies that are capable of reducing the duration of recordings, while simultaneously maintaining the integrity and quality of the data collected. We have added a section in the Discussion regarding the exploration of dispersion sampling strategies across different temporal scales. (Section 4.3, Para. 2, Line 3)

Some more specifics below:
Abstract:
The goal stated in the abstract background (the last sentence) is unclear. Could the authors rephrase the same to indicate clearly what they are attempting to do here? For example, why does the effective temporal sampling design matter?
You need to introduce what vocal activity rate means prior to discussing the ideal design for VAR data.

We have supplemented the abstract with a definition of VAR and its significance in the context of our study. (Abstract - Background)

Introduction:
Having read your introduction – which reads very clearly, I think you might want to rephrase your abstract in the context of trying to identify the right sampling design or approach & why this is important for different ecological research projects/ecosystems/taxonomic groups.

We have rewritten the background section of the abstract to clearly articulate the primary objectives of our study. (Abstract - Background)

I think your objectives should be switched – a) trying to identify how sampling designs impact vocal activity rate patterns and b) provide recommendations.

We have made revisions in accordance with your suggestions. (Section 1, Para. 8, Line 2)

Based on the key questions asked – your introduction should introduce sampling design followed by a few sentences on vocal activity rate/analysis of acoustic data – and then you could ask your two key questions in your study area. [sampling design -> why this is important for vocal activity rates/patterns -> ask your questions – how does sampling design impact vocal activity rate calculations?]

We have made revisions in accordance with your suggestions. (Section 1, Para. 4, Line 4; Para. 5, Line 4; Para. 6, Line 5)

Methods:
Is VAR_d a species-specific measure or is it the total number of vocalizations per day for that site irrespective of the species? Please rephrase/clarify.

VAR_d was calculated as the total daily vocalization count for each station and bird species separately. We have added a section to make the narrative more comprehensive and complete. (Section 2.5.1, Para. 2, Line 3)

I am unclear on Table 1 – were all these different sampling strategies deployed/implemented in the field or were these post-hoc analysis based on sub-sampling data?

Our study is based on continuous field recording data, and it simulates the outcomes and impacts of data acquisition under different temporal sampling scenarios. We have added explanatory text in the relevant sections to clarify this approach. (Section 2.5.4, Para. 1, Line 1; Explanation of Table 1.)

What does Figure 1 mean ecologically? HCA statistically models Euclidean distance, but what does it mean that day 94 is closely associated with day 110 for example?

Within each cluster, the VAR_d patterns of different bird species at each PAM station are similar among days within the same cluster but differ from those in other clusters. We have added explanatory text in the relevant section. (Section 3.3, Para. 1, Line 2)

>gender,
In this context, “sex” may be preferrable to “gender”.

Revised. (Section 1, Para. 7, Line 5)

>Concentrating recordings during specific time, like dawn, often capture more species
Concentrating recordings during specific time, like dawn, often capture*s* more species

Revised. (Section 1, Para. 7, Line 13)

>For species without a song or the song was
For species without a song or *if* the song was

Revised. (Section 2.4, Para. 2, Line 4)

>Secondly, SILIC offers a unique capability to detect each vocalization’s exact start and end time within an audio recording, rather than merely identifying the presence or absence of a certain vocalization within a broad time frame.
How precisely is this approach able to pinpoint start time? Is this different from other software options that assign call identity within a 3 or 5 second window?

SILIC can detect each vocalization at the millisecond level. We have made revisions in Section 2.4, Para. 1, Line 4.

>In the discussion, you might address how choosing only one vocalization type (generally song) might influence results, vs something like alarm calls, which might have different temporal distribution

We have revised the discussion segment focusing on the potential impacts of species and vocalization type on our research findings. (Section 4, Para. 1, Line 1)

>What is ‘corrected’ precipitation? (vs measured precipitation)

The corrected precipitation from NASA/POWER CERES/MERRA2 Native Resolution Daily Data is estimated through algorithms (climate models) that interpret satellite observations. These observations are then often adjusted or 'corrected' using various atmospheric data and models to provide more accurate estimates.

>Accordingly, ‘song’ represented 9 of the 12 species, ‘call’ 2, and ‘unknown’ 1.
In supplemental materials, indicate which spectrograms corresponded to which vocalization type

We have added information on the type of vocalization to each corresponding spectrogram. (Supplementary Material_v2.pdf)

>spanning 122 working days
I would remove “working” unless it has specific meaning. (often ‘working’ is used to mean Mon-Fri vs weekend days)

Revised. (Section 3.1, Para. 1, Line 1)

> Due to the fewer days encompassed by Clusters 1 and 2, when calculating the CV values for a higher number of days, there might be a tendency to obtain relatively lower CV values owing to the smaller sample sizes.
I found this sentence confusing. I think that it means that clusters 1 and 2 are smaller and should not be compared to clusters that contain higher numbers of days?

We have revised our discussion to more accurately address the differences in the CV values among clusters with varying sample sizes. (Section 3.3, Para. 2, Line 1)

>Figure 2 is visually appealing but may be more complicated to interpret than a horizonal four-month graph. If the figure is retained, the authors might consider having time flow clockwise rather than counterclockwise.

We have adjusted the figure to rotate clockwise, and we have also included Figure S9 in the Supplementary Material, which presents the data with a horizontal time axis. (Figure 2; Figure S9)
Competing Interests: No competing interests were disclosed. Close
Report a concern
Respond or Comment

COMMENTS ON THIS REPORT

Author Response 01 Mar 2024

Shih-Hung Wu, Department of Biological Sciences, National Sun Yat-sen University, Kaohsiung, 804, Taiwan

01 Mar 2024

Author Response
Dear Dr. Laurel B Symes,
Thank you for your insightful review of our manuscript. We value your comments on evaluating vocal activity rates and sampling strategies across various species and ... Continue reading
Dear Dr. Laurel B Symes,
Thank you for your insightful review of our manuscript. We value your comments on evaluating vocal activity rates and sampling strategies across various species and habitats. In response to your suggestions, we have revised the manuscript accordingly and provide detailed responses to each of your comments below:

The goal of this paper is to evaluate how the measured vocal activity rate of birds differs depending on the temporal sampling strategy. Evaluating how detection results differ with sampling strategy is useful, particularly across different species and habitats globally, and can inform decisions about whether it is possible to use a generalized sampling strategies across a range of scenarios, or whether sampling strategies need to vary with season, latitude, habitat type, geography and other factors.

Overall, the paper seems like a solid evaluation of sampling strategies. Currently, the introduction is strongly focused on the temporal aspects of sampling. In the analysis and figures, the authors devote substantial attention to impacts of environmental co-variates, which may well make sense, but could use some set-up and introduction (or could be downplayed or eliminated). In addition, the authors could provide some additional synthesis/interpretation of their findings in the context of designing PAM-based studies. While the authors provide some specific guidance in the discussion, people seeking actionable information for study design may appreciate even more clear direction (such as headings for sections that discuss recommendations for hourly sampling, diel sampling, etc., and/or a summary in the conclusion that focuses on the key recommendations).

We have included a checklist of considerations in the Conclusions section to offer specific guidance for researchers conducting PAM studies in similar domains. (Section 5, Para. 1, Line 4)

Major feedback:
>From the intro “Sampling design can be categorized into four temporal scales: annual, seasonal, diel, and hourly. An annual sampling design implies recording during one or several time periods within a year.”
Do you have a reference for the way that these four types of sampling are defined? I would have anticipated that annual sampling compared one year against another (rather than comparing seasons within a year) and that seasonal sampling would have compared one season to another vs “after being operated for a predetermined number of days at each location, the ARU is relocated to a subsequent site” (which to me would generally imply sampling periods shorter than a season).

Indeed, we recognize that our use and definitions of 'annual sampling' and 'seasonal sampling' in the study may have been confusing for readers. We have decided to revise these terms to 'intra-annual sampling' and 'short-term periodic sampling' to avoid such ambiguity. (Abstract - Methods; Section 1, Para. 3, Line 2; Section 1, Para. 4, Line 1; Section 2.5.2, Para. 1, Line 8; Section 3.3; Section 4.2, Para. 1, Line 3)

>Hourly sampling design can be categorized into coverage (the proportion of recorded time within an hour) and dispersion (the number of recording segments within an hour).
Dispersion can affect sampling designs other than hourly as well (for example, if only certain days are sampled). You might consider suggest treating the different temporal scales as one set and then separately discussing dispersion (how thoroughly each of those temporal scales are sampled), whether or not you assess how dispersion of sampling affects outcomes at scales above hourly.

The scope of our investigation was confined to examining the effects of dispersion at the minute level, without delving into higher temporal units such as hours or days. This specific focus was dictated by the prevailing limitation that most ARUs are currently only programmable at the minute level. Additionally, there is a noted scarcity in prior research that has formulated dispersion sampling strategies for intervals extending beyond minutes. Investigations in the future, exploring higher temporal scales, might yield further insights. These could potentially enhance the understanding of sampling methodologies that are capable of reducing the duration of recordings, while simultaneously maintaining the integrity and quality of the data collected. We have added a section in the Discussion regarding the exploration of dispersion sampling strategies across different temporal scales. (Section 4.3, Para. 2, Line 3)

Some more specifics below:
Abstract:
The goal stated in the abstract background (the last sentence) is unclear. Could the authors rephrase the same to indicate clearly what they are attempting to do here? For example, why does the effective temporal sampling design matter?
You need to introduce what vocal activity rate means prior to discussing the ideal design for VAR data.

We have supplemented the abstract with a definition of VAR and its significance in the context of our study. (Abstract - Background)

Introduction:
Having read your introduction – which reads very clearly, I think you might want to rephrase your abstract in the context of trying to identify the right sampling design or approach & why this is important for different ecological research projects/ecosystems/taxonomic groups.

We have rewritten the background section of the abstract to clearly articulate the primary objectives of our study. (Abstract - Background)

I think your objectives should be switched – a) trying to identify how sampling designs impact vocal activity rate patterns and b) provide recommendations.

We have made revisions in accordance with your suggestions. (Section 1, Para. 8, Line 2)

Based on the key questions asked – your introduction should introduce sampling design followed by a few sentences on vocal activity rate/analysis of acoustic data – and then you could ask your two key questions in your study area. [sampling design -> why this is important for vocal activity rates/patterns -> ask your questions – how does sampling design impact vocal activity rate calculations?]

We have made revisions in accordance with your suggestions. (Section 1, Para. 4, Line 4; Para. 5, Line 4; Para. 6, Line 5)

Methods:
Is VAR_d a species-specific measure or is it the total number of vocalizations per day for that site irrespective of the species? Please rephrase/clarify.

VAR_d was calculated as the total daily vocalization count for each station and bird species separately. We have added a section to make the narrative more comprehensive and complete. (Section 2.5.1, Para. 2, Line 3)

I am unclear on Table 1 – were all these different sampling strategies deployed/implemented in the field or were these post-hoc analysis based on sub-sampling data?

Our study is based on continuous field recording data, and it simulates the outcomes and impacts of data acquisition under different temporal sampling scenarios. We have added explanatory text in the relevant sections to clarify this approach. (Section 2.5.4, Para. 1, Line 1; Explanation of Table 1.)

What does Figure 1 mean ecologically? HCA statistically models Euclidean distance, but what does it mean that day 94 is closely associated with day 110 for example?

Within each cluster, the VAR_d patterns of different bird species at each PAM station are similar among days within the same cluster but differ from those in other clusters. We have added explanatory text in the relevant section. (Section 3.3, Para. 1, Line 2)

>gender,
In this context, “sex” may be preferrable to “gender”.

Revised. (Section 1, Para. 7, Line 5)

>Concentrating recordings during specific time, like dawn, often capture more species
Concentrating recordings during specific time, like dawn, often capture*s* more species

Revised. (Section 1, Para. 7, Line 13)

>For species without a song or the song was
For species without a song or *if* the song was

Revised. (Section 2.4, Para. 2, Line 4)

>Secondly, SILIC offers a unique capability to detect each vocalization’s exact start and end time within an audio recording, rather than merely identifying the presence or absence of a certain vocalization within a broad time frame.
How precisely is this approach able to pinpoint start time? Is this different from other software options that assign call identity within a 3 or 5 second window?

SILIC can detect each vocalization at the millisecond level. We have made revisions in Section 2.4, Para. 1, Line 4.

>In the discussion, you might address how choosing only one vocalization type (generally song) might influence results, vs something like alarm calls, which might have different temporal distribution

We have revised the discussion segment focusing on the potential impacts of species and vocalization type on our research findings. (Section 4, Para. 1, Line 1)

>What is ‘corrected’ precipitation? (vs measured precipitation)

The corrected precipitation from NASA/POWER CERES/MERRA2 Native Resolution Daily Data is estimated through algorithms (climate models) that interpret satellite observations. These observations are then often adjusted or 'corrected' using various atmospheric data and models to provide more accurate estimates.

>Accordingly, ‘song’ represented 9 of the 12 species, ‘call’ 2, and ‘unknown’ 1.
In supplemental materials, indicate which spectrograms corresponded to which vocalization type

We have added information on the type of vocalization to each corresponding spectrogram. (Supplementary Material_v2.pdf)

>spanning 122 working days
I would remove “working” unless it has specific meaning. (often ‘working’ is used to mean Mon-Fri vs weekend days)

Revised. (Section 3.1, Para. 1, Line 1)

> Due to the fewer days encompassed by Clusters 1 and 2, when calculating the CV values for a higher number of days, there might be a tendency to obtain relatively lower CV values owing to the smaller sample sizes.
I found this sentence confusing. I think that it means that clusters 1 and 2 are smaller and should not be compared to clusters that contain higher numbers of days?

We have revised our discussion to more accurately address the differences in the CV values among clusters with varying sample sizes. (Section 3.3, Para. 2, Line 1)

>Figure 2 is visually appealing but may be more complicated to interpret than a horizonal four-month graph. If the figure is retained, the authors might consider having time flow clockwise rather than counterclockwise.

We have adjusted the figure to rotate clockwise, and we have also included Figure S9 in the Supplementary Material, which presents the data with a horizontal time axis. (Figure 2; Figure S9)
Dear Dr. Laurel B Symes,
Thank you for your insightful review of our manuscript. We value your comments on evaluating vocal activity rates and sampling strategies across various species and habitats. In response to your suggestions, we have revised the manuscript accordingly and provide detailed responses to each of your comments below:

The goal of this paper is to evaluate how the measured vocal activity rate of birds differs depending on the temporal sampling strategy. Evaluating how detection results differ with sampling strategy is useful, particularly across different species and habitats globally, and can inform decisions about whether it is possible to use a generalized sampling strategies across a range of scenarios, or whether sampling strategies need to vary with season, latitude, habitat type, geography and other factors.

Overall, the paper seems like a solid evaluation of sampling strategies. Currently, the introduction is strongly focused on the temporal aspects of sampling. In the analysis and figures, the authors devote substantial attention to impacts of environmental co-variates, which may well make sense, but could use some set-up and introduction (or could be downplayed or eliminated). In addition, the authors could provide some additional synthesis/interpretation of their findings in the context of designing PAM-based studies. While the authors provide some specific guidance in the discussion, people seeking actionable information for study design may appreciate even more clear direction (such as headings for sections that discuss recommendations for hourly sampling, diel sampling, etc., and/or a summary in the conclusion that focuses on the key recommendations).

We have included a checklist of considerations in the Conclusions section to offer specific guidance for researchers conducting PAM studies in similar domains. (Section 5, Para. 1, Line 4)

Major feedback:
>From the intro “Sampling design can be categorized into four temporal scales: annual, seasonal, diel, and hourly. An annual sampling design implies recording during one or several time periods within a year.”
Do you have a reference for the way that these four types of sampling are defined? I would have anticipated that annual sampling compared one year against another (rather than comparing seasons within a year) and that seasonal sampling would have compared one season to another vs “after being operated for a predetermined number of days at each location, the ARU is relocated to a subsequent site” (which to me would generally imply sampling periods shorter than a season).

Indeed, we recognize that our use and definitions of 'annual sampling' and 'seasonal sampling' in the study may have been confusing for readers. We have decided to revise these terms to 'intra-annual sampling' and 'short-term periodic sampling' to avoid such ambiguity. (Abstract - Methods; Section 1, Para. 3, Line 2; Section 1, Para. 4, Line 1; Section 2.5.2, Para. 1, Line 8; Section 3.3; Section 4.2, Para. 1, Line 3)

>Hourly sampling design can be categorized into coverage (the proportion of recorded time within an hour) and dispersion (the number of recording segments within an hour).
Dispersion can affect sampling designs other than hourly as well (for example, if only certain days are sampled). You might consider suggest treating the different temporal scales as one set and then separately discussing dispersion (how thoroughly each of those temporal scales are sampled), whether or not you assess how dispersion of sampling affects outcomes at scales above hourly.

The scope of our investigation was confined to examining the effects of dispersion at the minute level, without delving into higher temporal units such as hours or days. This specific focus was dictated by the prevailing limitation that most ARUs are currently only programmable at the minute level. Additionally, there is a noted scarcity in prior research that has formulated dispersion sampling strategies for intervals extending beyond minutes. Investigations in the future, exploring higher temporal scales, might yield further insights. These could potentially enhance the understanding of sampling methodologies that are capable of reducing the duration of recordings, while simultaneously maintaining the integrity and quality of the data collected. We have added a section in the Discussion regarding the exploration of dispersion sampling strategies across different temporal scales. (Section 4.3, Para. 2, Line 3)

Some more specifics below:
Abstract:
The goal stated in the abstract background (the last sentence) is unclear. Could the authors rephrase the same to indicate clearly what they are attempting to do here? For example, why does the effective temporal sampling design matter?
You need to introduce what vocal activity rate means prior to discussing the ideal design for VAR data.

We have supplemented the abstract with a definition of VAR and its significance in the context of our study. (Abstract - Background)

Introduction:
Having read your introduction – which reads very clearly, I think you might want to rephrase your abstract in the context of trying to identify the right sampling design or approach & why this is important for different ecological research projects/ecosystems/taxonomic groups.

We have rewritten the background section of the abstract to clearly articulate the primary objectives of our study. (Abstract - Background)

I think your objectives should be switched – a) trying to identify how sampling designs impact vocal activity rate patterns and b) provide recommendations.

We have made revisions in accordance with your suggestions. (Section 1, Para. 8, Line 2)

Based on the key questions asked – your introduction should introduce sampling design followed by a few sentences on vocal activity rate/analysis of acoustic data – and then you could ask your two key questions in your study area. [sampling design -> why this is important for vocal activity rates/patterns -> ask your questions – how does sampling design impact vocal activity rate calculations?]

We have made revisions in accordance with your suggestions. (Section 1, Para. 4, Line 4; Para. 5, Line 4; Para. 6, Line 5)

Methods:
Is VAR_d a species-specific measure or is it the total number of vocalizations per day for that site irrespective of the species? Please rephrase/clarify.

VAR_d was calculated as the total daily vocalization count for each station and bird species separately. We have added a section to make the narrative more comprehensive and complete. (Section 2.5.1, Para. 2, Line 3)

I am unclear on Table 1 – were all these different sampling strategies deployed/implemented in the field or were these post-hoc analysis based on sub-sampling data?

Our study is based on continuous field recording data, and it simulates the outcomes and impacts of data acquisition under different temporal sampling scenarios. We have added explanatory text in the relevant sections to clarify this approach. (Section 2.5.4, Para. 1, Line 1; Explanation of Table 1.)

What does Figure 1 mean ecologically? HCA statistically models Euclidean distance, but what does it mean that day 94 is closely associated with day 110 for example?

Within each cluster, the VAR_d patterns of different bird species at each PAM station are similar among days within the same cluster but differ from those in other clusters. We have added explanatory text in the relevant section. (Section 3.3, Para. 1, Line 2)

>gender,
In this context, “sex” may be preferrable to “gender”.

Revised. (Section 1, Para. 7, Line 5)

>Concentrating recordings during specific time, like dawn, often capture more species
Concentrating recordings during specific time, like dawn, often capture*s* more species

Revised. (Section 1, Para. 7, Line 13)

>For species without a song or the song was
For species without a song or *if* the song was

Revised. (Section 2.4, Para. 2, Line 4)

>Secondly, SILIC offers a unique capability to detect each vocalization’s exact start and end time within an audio recording, rather than merely identifying the presence or absence of a certain vocalization within a broad time frame.
How precisely is this approach able to pinpoint start time? Is this different from other software options that assign call identity within a 3 or 5 second window?

SILIC can detect each vocalization at the millisecond level. We have made revisions in Section 2.4, Para. 1, Line 4.

>In the discussion, you might address how choosing only one vocalization type (generally song) might influence results, vs something like alarm calls, which might have different temporal distribution

We have revised the discussion segment focusing on the potential impacts of species and vocalization type on our research findings. (Section 4, Para. 1, Line 1)

>What is ‘corrected’ precipitation? (vs measured precipitation)

The corrected precipitation from NASA/POWER CERES/MERRA2 Native Resolution Daily Data is estimated through algorithms (climate models) that interpret satellite observations. These observations are then often adjusted or 'corrected' using various atmospheric data and models to provide more accurate estimates.

>Accordingly, ‘song’ represented 9 of the 12 species, ‘call’ 2, and ‘unknown’ 1.
In supplemental materials, indicate which spectrograms corresponded to which vocalization type

We have added information on the type of vocalization to each corresponding spectrogram. (Supplementary Material_v2.pdf)

>spanning 122 working days
I would remove “working” unless it has specific meaning. (often ‘working’ is used to mean Mon-Fri vs weekend days)

Revised. (Section 3.1, Para. 1, Line 1)

> Due to the fewer days encompassed by Clusters 1 and 2, when calculating the CV values for a higher number of days, there might be a tendency to obtain relatively lower CV values owing to the smaller sample sizes.
I found this sentence confusing. I think that it means that clusters 1 and 2 are smaller and should not be compared to clusters that contain higher numbers of days?

We have revised our discussion to more accurately address the differences in the CV values among clusters with varying sample sizes. (Section 3.3, Para. 2, Line 1)

>Figure 2 is visually appealing but may be more complicated to interpret than a horizonal four-month graph. If the figure is retained, the authors might consider having time flow clockwise rather than counterclockwise.

We have adjusted the figure to rotate clockwise, and we have also included Figure S9 in the Supplementary Material, which presents the data with a horizontal time axis. (Figure 2; Figure S9)
Competing Interests: No competing interests were disclosed. Close
Report a concern

Views

Reviewer Report 19 Dec 2023

Jerry S Cole, The Institute for Bird Populations, Petaluma, California, USA

Approved with Reservations

https://doi.org/10.5256/f1000research.155438.r224082

Summary: The authors have produced a timely and well written article that helps to further advance our understanding of how VAR differs across species and in response to environmental conditions. I particularly enjoyed the section that used clustering to divide the full sampling season into distinct segments. It would be nice to see a bit more development of the discussion section, for instance suggestions for what additional avenues researchers might explore other than just replicating this study with other species in other locations. Thanks for the interesting read!

Data Availability:
Not sure if I missed this but I don't see a location where the raw data - for instance the output summary of number of bird detections per minute for each species on a given sampling day and hour. I assume that at a minimum the raw output from the classifier would be needed to reproduce the data analyses reported here. I see the supplementary material with plots and overall number of detections, but that would not be enough to reproduce the statistical analyses.

_________
Introduction

I enjoyed this introduction. Does a nice job to lead in to the aims of the study.

Paragraph (P) 3 - Might consider saying "more frequent vocalization" rather than "enhanced vocalization" to be more specific about what exactly is meant - if by "enhanced" you mean "more frequent and louder".
____________
Results

Section 3.1, P2 - Does "1,200 randomly sampled entries" mean detections by SILIC, or just random sections of the recordings? I think this is probably clarified later in the sentence, but something to think about.

Fig. 1 and 2 - Really enjoyed this method of defining distinct sampling periods through a cluster analysis.
__________
Discussion

General - Should researchers analyze data from only a single survey season cluster period? If so, which period?And what does this mean for multi-species analysis?

P2 - Not sure what "interplay of internal ..." means. Does that mean the biology of the bird is partly responsible for differing VAR?

__________
Conclusions

Seems like a good way to wrap it up. I do wonder about how this might apply to a full community of birds, but I suppose that is beyond the scope of this work.

Is the work clearly and accurately presented and does it cite the current literature?

Yes
Is the study design appropriate and is the work technically sound?

Yes
Are sufficient details of methods and analysis provided to allow replication by others?

Yes
If applicable, is the statistical analysis and its interpretation appropriate?

Yes
Are all the source data underlying the results available to ensure full reproducibility?

Partly
Are the conclusions drawn adequately supported by the results?

Yes

Competing Interests: No competing interests were disclosed.

Reviewer Expertise: Bioacoustics, ecology, ornithology, automated sound classification

CITE

Report a concern

Author Response 01 Mar 2024

Shih-Hung Wu, Department of Biological Sciences, National Sun Yat-sen University, Kaohsiung, 804, Taiwan

01 Mar 2024

Author Response
Dear Dr. Jerry S. Cole,
We greatly appreciate your positive remarks and thoughtful suggestions, which will be instrumental in refining our paper. Here are our responses aligned with your comments:

... Continue reading
Dear Dr. Jerry S. Cole,
We greatly appreciate your positive remarks and thoughtful suggestions, which will be instrumental in refining our paper. Here are our responses aligned with your comments:

Summary: The authors have produced a timely and well written article that helps to further advance our understanding of how VAR differs across species and in response to environmental conditions. I particularly enjoyed the section that used clustering to divide the full sampling season into distinct segments. It would be nice to see a bit more development of the discussion section, for instance suggestions for what additional avenues researchers might explore other than just replicating this study with other species in other locations. Thanks for the interesting read!

We have added a visionary suggestion at the end of the 'Conclusions' section, where we express our hope for researchers worldwide to collaboratively develop a universal standard for temporal sampling. This initiative aims to enhance the transferability of PAM data and to bolster the capability for data integration and analysis on a global scale. (Section 5, Para. 3)

Data Availability:
Not sure if I missed this but I don't see a location where the raw data - for instance the output summary of number of bird detections per minute for each species on a given sampling day and hour. I assume that at a minimum the raw output from the classifier would be needed to reproduce the data analyses reported here. I see the supplementary material with plots and overall number of detections, but that would not be enough to reproduce the statistical analyses.

Thank you for the reminder. We have now updated the compressed file 'VAR_m_all_columns.zip' in the "Underlying Data" to include the data on the number of calls per minute for each species at each PAM station, along with the associated environmental variable data. (Section 3.1, Para. 3, Line 5)

_________
Introduction

I enjoyed this introduction. Does a nice job to lead in to the aims of the study.

Paragraph (P) 3 - Might consider saying "more frequent vocalization" rather than "enhanced vocalization" to be more specific about what exactly is meant - if by "enhanced" you mean "more frequent and louder".

We have revised our manuscript in accordance with your suggestions. (Section 1, Para. 3, Line 3)

____________
Results

Section 3.1, P2 - Does "1,200 randomly sampled entries" mean detections by SILIC, or just random sections of the recordings? I think this is probably clarified later in the sentence, but something to think about.

As detailed in Section 2.4, Para. 4, for the evaluation of the SILIC model, we randomly selected 100 detections per species from the results identified by SILIC. Each sample includes a confidence score assigned by SILIC. This approach allows us to assess the performance of the SILIC model and to determine an appropriate confidence score threshold for each bird species relevant to our study. Subsequently, we filtered the total detection results to include only those with a confidence score equal to or exceeding the set threshold score. This process provided us with the necessary data on bird vocal activity for our research. We have revised the text to make the meaning more complete and clear. (Section 3.1, Para. 2, Line 2)

Fig. 1 and 2 - Really enjoyed this method of defining distinct sampling periods through a cluster analysis.

Thank you for acknowledging the validity of our analytical approach.

__________
Discussion

General - Should researchers analyze data from only a single survey season cluster period? If so, which period?And what does this mean for multi-species analysis?

In our study, Clusters 3 and 4 were identified as the optimal periods for acoustic surveys, as these intervals exhibited the lowest variation in vocalization activity across species. Moreover, these consecutive time frames totaled nearly eight weeks, offering greater flexibility in managing the rotation of recording devices. We have added an explanatory text in Section 4.2, Para. 1, Line 8.

P2 - Not sure what "interplay of internal ..." means. Does that mean the biology of the bird is partly responsible for differing VAR?

We wish to clarify that vocal activity is influenced by a multitude of factors including species, temporal aspects, and external environmental conditions. We have accordingly revised the wording of this paragraph in Section 4.1, Para. 1, Line 1.

__________
Conclusions

Seems like a good way to wrap it up. I do wonder about how this might apply to a full community of birds, but I suppose that is beyond the scope of this work.

We intend to continue exploring the impact of temporal sampling on other species and environments in future work. Additionally, we look forward to results shared by researchers from other regions, which would contribute to the development of more broadly applicable universal standards.
Dear Dr. Jerry S. Cole,
We greatly appreciate your positive remarks and thoughtful suggestions, which will be instrumental in refining our paper. Here are our responses aligned with your comments:

Summary: The authors have produced a timely and well written article that helps to further advance our understanding of how VAR differs across species and in response to environmental conditions. I particularly enjoyed the section that used clustering to divide the full sampling season into distinct segments. It would be nice to see a bit more development of the discussion section, for instance suggestions for what additional avenues researchers might explore other than just replicating this study with other species in other locations. Thanks for the interesting read!

We have added a visionary suggestion at the end of the 'Conclusions' section, where we express our hope for researchers worldwide to collaboratively develop a universal standard for temporal sampling. This initiative aims to enhance the transferability of PAM data and to bolster the capability for data integration and analysis on a global scale. (Section 5, Para. 3)

Data Availability:
Not sure if I missed this but I don't see a location where the raw data - for instance the output summary of number of bird detections per minute for each species on a given sampling day and hour. I assume that at a minimum the raw output from the classifier would be needed to reproduce the data analyses reported here. I see the supplementary material with plots and overall number of detections, but that would not be enough to reproduce the statistical analyses.

Thank you for the reminder. We have now updated the compressed file 'VAR_m_all_columns.zip' in the "Underlying Data" to include the data on the number of calls per minute for each species at each PAM station, along with the associated environmental variable data. (Section 3.1, Para. 3, Line 5)

_________
Introduction

I enjoyed this introduction. Does a nice job to lead in to the aims of the study.

Paragraph (P) 3 - Might consider saying "more frequent vocalization" rather than "enhanced vocalization" to be more specific about what exactly is meant - if by "enhanced" you mean "more frequent and louder".

We have revised our manuscript in accordance with your suggestions. (Section 1, Para. 3, Line 3)

____________
Results

Section 3.1, P2 - Does "1,200 randomly sampled entries" mean detections by SILIC, or just random sections of the recordings? I think this is probably clarified later in the sentence, but something to think about.

As detailed in Section 2.4, Para. 4, for the evaluation of the SILIC model, we randomly selected 100 detections per species from the results identified by SILIC. Each sample includes a confidence score assigned by SILIC. This approach allows us to assess the performance of the SILIC model and to determine an appropriate confidence score threshold for each bird species relevant to our study. Subsequently, we filtered the total detection results to include only those with a confidence score equal to or exceeding the set threshold score. This process provided us with the necessary data on bird vocal activity for our research. We have revised the text to make the meaning more complete and clear. (Section 3.1, Para. 2, Line 2)

Fig. 1 and 2 - Really enjoyed this method of defining distinct sampling periods through a cluster analysis.

Thank you for acknowledging the validity of our analytical approach.

__________
Discussion

General - Should researchers analyze data from only a single survey season cluster period? If so, which period?And what does this mean for multi-species analysis?

In our study, Clusters 3 and 4 were identified as the optimal periods for acoustic surveys, as these intervals exhibited the lowest variation in vocalization activity across species. Moreover, these consecutive time frames totaled nearly eight weeks, offering greater flexibility in managing the rotation of recording devices. We have added an explanatory text in Section 4.2, Para. 1, Line 8.

P2 - Not sure what "interplay of internal ..." means. Does that mean the biology of the bird is partly responsible for differing VAR?

We wish to clarify that vocal activity is influenced by a multitude of factors including species, temporal aspects, and external environmental conditions. We have accordingly revised the wording of this paragraph in Section 4.1, Para. 1, Line 1.

__________
Conclusions

Seems like a good way to wrap it up. I do wonder about how this might apply to a full community of birds, but I suppose that is beyond the scope of this work.

We intend to continue exploring the impact of temporal sampling on other species and environments in future work. Additionally, we look forward to results shared by researchers from other regions, which would contribute to the development of more broadly applicable universal standards.
Competing Interests: No competing interests were disclosed. Close
Report a concern
Respond or Comment

COMMENTS ON THIS REPORT

Author Response 01 Mar 2024

Shih-Hung Wu, Department of Biological Sciences, National Sun Yat-sen University, Kaohsiung, 804, Taiwan

01 Mar 2024

Author Response
Dear Dr. Jerry S. Cole,
We greatly appreciate your positive remarks and thoughtful suggestions, which will be instrumental in refining our paper. Here are our responses aligned with your comments:

... Continue reading
Dear Dr. Jerry S. Cole,
We greatly appreciate your positive remarks and thoughtful suggestions, which will be instrumental in refining our paper. Here are our responses aligned with your comments:

Summary: The authors have produced a timely and well written article that helps to further advance our understanding of how VAR differs across species and in response to environmental conditions. I particularly enjoyed the section that used clustering to divide the full sampling season into distinct segments. It would be nice to see a bit more development of the discussion section, for instance suggestions for what additional avenues researchers might explore other than just replicating this study with other species in other locations. Thanks for the interesting read!

We have added a visionary suggestion at the end of the 'Conclusions' section, where we express our hope for researchers worldwide to collaboratively develop a universal standard for temporal sampling. This initiative aims to enhance the transferability of PAM data and to bolster the capability for data integration and analysis on a global scale. (Section 5, Para. 3)

Data Availability:
Not sure if I missed this but I don't see a location where the raw data - for instance the output summary of number of bird detections per minute for each species on a given sampling day and hour. I assume that at a minimum the raw output from the classifier would be needed to reproduce the data analyses reported here. I see the supplementary material with plots and overall number of detections, but that would not be enough to reproduce the statistical analyses.

Thank you for the reminder. We have now updated the compressed file 'VAR_m_all_columns.zip' in the "Underlying Data" to include the data on the number of calls per minute for each species at each PAM station, along with the associated environmental variable data. (Section 3.1, Para. 3, Line 5)

_________
Introduction

I enjoyed this introduction. Does a nice job to lead in to the aims of the study.

Paragraph (P) 3 - Might consider saying "more frequent vocalization" rather than "enhanced vocalization" to be more specific about what exactly is meant - if by "enhanced" you mean "more frequent and louder".

We have revised our manuscript in accordance with your suggestions. (Section 1, Para. 3, Line 3)

____________
Results

Section 3.1, P2 - Does "1,200 randomly sampled entries" mean detections by SILIC, or just random sections of the recordings? I think this is probably clarified later in the sentence, but something to think about.

As detailed in Section 2.4, Para. 4, for the evaluation of the SILIC model, we randomly selected 100 detections per species from the results identified by SILIC. Each sample includes a confidence score assigned by SILIC. This approach allows us to assess the performance of the SILIC model and to determine an appropriate confidence score threshold for each bird species relevant to our study. Subsequently, we filtered the total detection results to include only those with a confidence score equal to or exceeding the set threshold score. This process provided us with the necessary data on bird vocal activity for our research. We have revised the text to make the meaning more complete and clear. (Section 3.1, Para. 2, Line 2)

Fig. 1 and 2 - Really enjoyed this method of defining distinct sampling periods through a cluster analysis.

Thank you for acknowledging the validity of our analytical approach.

__________
Discussion

General - Should researchers analyze data from only a single survey season cluster period? If so, which period?And what does this mean for multi-species analysis?

In our study, Clusters 3 and 4 were identified as the optimal periods for acoustic surveys, as these intervals exhibited the lowest variation in vocalization activity across species. Moreover, these consecutive time frames totaled nearly eight weeks, offering greater flexibility in managing the rotation of recording devices. We have added an explanatory text in Section 4.2, Para. 1, Line 8.

P2 - Not sure what "interplay of internal ..." means. Does that mean the biology of the bird is partly responsible for differing VAR?

We wish to clarify that vocal activity is influenced by a multitude of factors including species, temporal aspects, and external environmental conditions. We have accordingly revised the wording of this paragraph in Section 4.1, Para. 1, Line 1.

__________
Conclusions

Seems like a good way to wrap it up. I do wonder about how this might apply to a full community of birds, but I suppose that is beyond the scope of this work.

We intend to continue exploring the impact of temporal sampling on other species and environments in future work. Additionally, we look forward to results shared by researchers from other regions, which would contribute to the development of more broadly applicable universal standards.
Dear Dr. Jerry S. Cole,
We greatly appreciate your positive remarks and thoughtful suggestions, which will be instrumental in refining our paper. Here are our responses aligned with your comments:

Summary: The authors have produced a timely and well written article that helps to further advance our understanding of how VAR differs across species and in response to environmental conditions. I particularly enjoyed the section that used clustering to divide the full sampling season into distinct segments. It would be nice to see a bit more development of the discussion section, for instance suggestions for what additional avenues researchers might explore other than just replicating this study with other species in other locations. Thanks for the interesting read!

We have added a visionary suggestion at the end of the 'Conclusions' section, where we express our hope for researchers worldwide to collaboratively develop a universal standard for temporal sampling. This initiative aims to enhance the transferability of PAM data and to bolster the capability for data integration and analysis on a global scale. (Section 5, Para. 3)

Data Availability:
Not sure if I missed this but I don't see a location where the raw data - for instance the output summary of number of bird detections per minute for each species on a given sampling day and hour. I assume that at a minimum the raw output from the classifier would be needed to reproduce the data analyses reported here. I see the supplementary material with plots and overall number of detections, but that would not be enough to reproduce the statistical analyses.

Thank you for the reminder. We have now updated the compressed file 'VAR_m_all_columns.zip' in the "Underlying Data" to include the data on the number of calls per minute for each species at each PAM station, along with the associated environmental variable data. (Section 3.1, Para. 3, Line 5)

_________
Introduction

I enjoyed this introduction. Does a nice job to lead in to the aims of the study.

Paragraph (P) 3 - Might consider saying "more frequent vocalization" rather than "enhanced vocalization" to be more specific about what exactly is meant - if by "enhanced" you mean "more frequent and louder".

We have revised our manuscript in accordance with your suggestions. (Section 1, Para. 3, Line 3)

____________
Results

Section 3.1, P2 - Does "1,200 randomly sampled entries" mean detections by SILIC, or just random sections of the recordings? I think this is probably clarified later in the sentence, but something to think about.

As detailed in Section 2.4, Para. 4, for the evaluation of the SILIC model, we randomly selected 100 detections per species from the results identified by SILIC. Each sample includes a confidence score assigned by SILIC. This approach allows us to assess the performance of the SILIC model and to determine an appropriate confidence score threshold for each bird species relevant to our study. Subsequently, we filtered the total detection results to include only those with a confidence score equal to or exceeding the set threshold score. This process provided us with the necessary data on bird vocal activity for our research. We have revised the text to make the meaning more complete and clear. (Section 3.1, Para. 2, Line 2)

Fig. 1 and 2 - Really enjoyed this method of defining distinct sampling periods through a cluster analysis.

Thank you for acknowledging the validity of our analytical approach.

__________
Discussion

General - Should researchers analyze data from only a single survey season cluster period? If so, which period?And what does this mean for multi-species analysis?

In our study, Clusters 3 and 4 were identified as the optimal periods for acoustic surveys, as these intervals exhibited the lowest variation in vocalization activity across species. Moreover, these consecutive time frames totaled nearly eight weeks, offering greater flexibility in managing the rotation of recording devices. We have added an explanatory text in Section 4.2, Para. 1, Line 8.

P2 - Not sure what "interplay of internal ..." means. Does that mean the biology of the bird is partly responsible for differing VAR?

We wish to clarify that vocal activity is influenced by a multitude of factors including species, temporal aspects, and external environmental conditions. We have accordingly revised the wording of this paragraph in Section 4.1, Para. 1, Line 1.

__________
Conclusions

Seems like a good way to wrap it up. I do wonder about how this might apply to a full community of birds, but I suppose that is beyond the scope of this work.

We intend to continue exploring the impact of temporal sampling on other species and environments in future work. Additionally, we look forward to results shared by researchers from other regions, which would contribute to the development of more broadly applicable universal standards.
Competing Interests: No competing interests were disclosed. Close
Report a concern

Comments on this article Comments (0)

Version 2

VERSION 2 PUBLISHED 11 Oct 2023

Open Peer Review

Reviewer Status

Reviewer Reports

	Invited Reviewers
	1	2	3
Version 2 (revision) 23 Jan 24	read
Version 1 11 Oct 23	read	read	read

Jerry S Cole, The Institute for Bird Populations, Petaluma, USA
Laurel B Symes, Cornell University, Ithaca, USA
Bárbara Freitas, National Museum of Natural Sciences (Ringgold ID: 16625), Madrid, Spain

Comments on this article

All Comments(0)

Add a comment

Browse by related subjects

Back to all reports

Reviewer Report

15 Views

22 Apr 2024 | for Version 2

Jerry S Cole, The Institute for Bird Populations, Petaluma, California, USA

15 Views Cite this report Responses(0)

Approved

Competing Interests

No competing interests were disclosed.

Reviewer Expertise

Bioacoustics, ecology, ornithology, automated sound classification

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

Respond to this report

Responses (0)

Back to all reports

Reviewer Report

13 Views

04 Mar 2024 | for Version 1

Bárbara Freitas, National Museum of Natural Sciences (Ringgold ID: 16625), Madrid, Community of Madrid, Spain

13 Views Cite this report Responses(0)

Approved With Reservations

Is the work clearly and accurately presented and does it cite the current literature?

Partly
Is the study design appropriate and is the work technically sound?

Yes
Are sufficient details of methods and analysis provided to allow replication by others?

Partly
If applicable, is the statistical analysis and its interpretation appropriate?

Yes
Are all the source data underlying the results available to ensure full reproducibility?

No
Are the conclusions drawn adequately supported by the results?

Partly

References

Competing Interests

No competing interests were disclosed.

Reviewer Expertise

Bioacoustics, animal communication, passive acoustic monitoring, speciation

Respond to this report

Responses (0)

Back to all reports

Reviewer Report

51 Views

02 Jan 2024 | for Version 1

Laurel B Symes, Cornell University, Ithaca, New York, USA

51 Views Cite this report Responses(1)

Approved With Reservations

Is the work clearly and accurately presented and does it cite the current literature?

Yes
Is the study design appropriate and is the work technically sound?

Yes
Are sufficient details of methods and analysis provided to allow replication by others?

Yes
If applicable, is the statistical analysis and its interpretation appropriate?

Yes
Are all the source data underlying the results available to ensure full reproducibility?

Yes
Are the conclusions drawn adequately supported by the results?

Yes

Competing Interests

No competing interests were disclosed.

Reviewer Expertise

Bioacoustics, biodiversity, speciation

Respond to this report

Responses (1)

Author Response

01 Mar 2024

Shih-Hung Wu, Department of Biological Sciences, National Sun Yat-sen University, Kaohsiung, 804, Taiwan

Dear Dr. Laurel B Symes,
Thank you for your insightful review of our manuscript. We value your comments on evaluating vocal activity rates and sampling strategies across various species and habitats. In response to your suggestions, we have revised the manuscript accordingly and provide detailed responses to each of your comments below:

The goal of this paper is to evaluate how the measured vocal activity rate of birds differs depending on the temporal sampling strategy. Evaluating how detection results differ with sampling strategy is useful, particularly across different species and habitats globally, and can inform decisions about whether it is possible to use a generalized sampling strategies across a range of scenarios, or whether sampling strategies need to vary with season, latitude, habitat type, geography and other factors.

Overall, the paper seems like a solid evaluation of sampling strategies. Currently, the introduction is strongly focused on the temporal aspects of sampling. In the analysis and figures, the authors devote substantial attention to impacts of environmental co-variates, which may well make sense, but could use some set-up and introduction (or could be downplayed or eliminated). In addition, the authors could provide some additional synthesis/interpretation of their findings in the context of designing PAM-based studies. While the authors provide some specific guidance in the discussion, people seeking actionable information for study design may appreciate even more clear direction (such as headings for sections that discuss recommendations for hourly sampling, diel sampling, etc., and/or a summary in the conclusion that focuses on the key recommendations).

We have included a checklist of considerations in the Conclusions section to offer specific guidance for researchers conducting PAM studies in similar domains. (Section 5, Para. 1, Line 4)

Major feedback:
>From the intro “Sampling design can be categorized into four temporal scales: annual, seasonal, diel, and hourly. An annual sampling design implies recording during one or several time periods within a year.”
Do you have a reference for the way that these four types of sampling are defined? I would have anticipated that annual sampling compared one year against another (rather than comparing seasons within a year) and that seasonal sampling would have compared one season to another vs “after being operated for a predetermined number of days at each location, the ARU is relocated to a subsequent site” (which to me would generally imply sampling periods shorter than a season).

Indeed, we recognize that our use and definitions of 'annual sampling' and 'seasonal sampling' in the study may have been confusing for readers. We have decided to revise these terms to 'intra-annual sampling' and 'short-term periodic sampling' to avoid such ambiguity. (Abstract - Methods; Section 1, Para. 3, Line 2; Section 1, Para. 4, Line 1; Section 2.5.2, Para. 1, Line 8; Section 3.3; Section 4.2, Para. 1, Line 3)

>Hourly sampling design can be categorized into coverage (the proportion of recorded time within an hour) and dispersion (the number of recording segments within an hour).
Dispersion can affect sampling designs other than hourly as well (for example, if only certain days are sampled). You might consider suggest treating the different temporal scales as one set and then separately discussing dispersion (how thoroughly each of those temporal scales are sampled), whether or not you assess how dispersion of sampling affects outcomes at scales above hourly.

The scope of our investigation was confined to examining the effects of dispersion at the minute level, without delving into higher temporal units such as hours or days. This specific focus was dictated by the prevailing limitation that most ARUs are currently only programmable at the minute level. Additionally, there is a noted scarcity in prior research that has formulated dispersion sampling strategies for intervals extending beyond minutes. Investigations in the future, exploring higher temporal scales, might yield further insights. These could potentially enhance the understanding of sampling methodologies that are capable of reducing the duration of recordings, while simultaneously maintaining the integrity and quality of the data collected. We have added a section in the Discussion regarding the exploration of dispersion sampling strategies across different temporal scales. (Section 4.3, Para. 2, Line 3)

Some more specifics below:
Abstract:
The goal stated in the abstract background (the last sentence) is unclear. Could the authors rephrase the same to indicate clearly what they are attempting to do here? For example, why does the effective temporal sampling design matter?
You need to introduce what vocal activity rate means prior to discussing the ideal design for VAR data.

We have supplemented the abstract with a definition of VAR and its significance in the context of our study. (Abstract - Background)

Introduction:
Having read your introduction – which reads very clearly, I think you might want to rephrase your abstract in the context of trying to identify the right sampling design or approach & why this is important for different ecological research projects/ecosystems/taxonomic groups.

We have rewritten the background section of the abstract to clearly articulate the primary objectives of our study. (Abstract - Background)

I think your objectives should be switched – a) trying to identify how sampling designs impact vocal activity rate patterns and b) provide recommendations.

We have made revisions in accordance with your suggestions. (Section 1, Para. 8, Line 2)

Based on the key questions asked – your introduction should introduce sampling design followed by a few sentences on vocal activity rate/analysis of acoustic data – and then you could ask your two key questions in your study area. [sampling design -> why this is important for vocal activity rates/patterns -> ask your questions – how does sampling design impact vocal activity rate calculations?]

We have made revisions in accordance with your suggestions. (Section 1, Para. 4, Line 4; Para. 5, Line 4; Para. 6, Line 5)

Methods:
Is VAR_d a species-specific measure or is it the total number of vocalizations per day for that site irrespective of the species? Please rephrase/clarify.

VAR_d was calculated as the total daily vocalization count for each station and bird species separately. We have added a section to make the narrative more comprehensive and complete. (Section 2.5.1, Para. 2, Line 3)

I am unclear on Table 1 – were all these different sampling strategies deployed/implemented in the field or were these post-hoc analysis based on sub-sampling data?

Our study is based on continuous field recording data, and it simulates the outcomes and impacts of data acquisition under different temporal sampling scenarios. We have added explanatory text in the relevant sections to clarify this approach. (Section 2.5.4, Para. 1, Line 1; Explanation of Table 1.)

What does Figure 1 mean ecologically? HCA statistically models Euclidean distance, but what does it mean that day 94 is closely associated with day 110 for example?

Within each cluster, the VAR_d patterns of different bird species at each PAM station are similar among days within the same cluster but differ from those in other clusters. We have added explanatory text in the relevant section. (Section 3.3, Para. 1, Line 2)

>gender,
In this context, “sex” may be preferrable to “gender”.

Revised. (Section 1, Para. 7, Line 5)

>Concentrating recordings during specific time, like dawn, often capture more species
Concentrating recordings during specific time, like dawn, often capture*s* more species

Revised. (Section 1, Para. 7, Line 13)

>For species without a song or the song was
For species without a song or *if* the song was

Revised. (Section 2.4, Para. 2, Line 4)

>Secondly, SILIC offers a unique capability to detect each vocalization’s exact start and end time within an audio recording, rather than merely identifying the presence or absence of a certain vocalization within a broad time frame.
How precisely is this approach able to pinpoint start time? Is this different from other software options that assign call identity within a 3 or 5 second window?

SILIC can detect each vocalization at the millisecond level. We have made revisions in Section 2.4, Para. 1, Line 4.

>In the discussion, you might address how choosing only one vocalization type (generally song) might influence results, vs something like alarm calls, which might have different temporal distribution

We have revised the discussion segment focusing on the potential impacts of species and vocalization type on our research findings. (Section 4, Para. 1, Line 1)

>What is ‘corrected’ precipitation? (vs measured precipitation)

The corrected precipitation from NASA/POWER CERES/MERRA2 Native Resolution Daily Data is estimated through algorithms (climate models) that interpret satellite observations. These observations are then often adjusted or 'corrected' using various atmospheric data and models to provide more accurate estimates.

>Accordingly, ‘song’ represented 9 of the 12 species, ‘call’ 2, and ‘unknown’ 1.
In supplemental materials, indicate which spectrograms corresponded to which vocalization type

We have added information on the type of vocalization to each corresponding spectrogram. (Supplementary Material_v2.pdf)

>spanning 122 working days
I would remove “working” unless it has specific meaning. (often ‘working’ is used to mean Mon-Fri vs weekend days)

Revised. (Section 3.1, Para. 1, Line 1)

> Due to the fewer days encompassed by Clusters 1 and 2, when calculating the CV values for a higher number of days, there might be a tendency to obtain relatively lower CV values owing to the smaller sample sizes.
I found this sentence confusing. I think that it means that clusters 1 and 2 are smaller and should not be compared to clusters that contain higher numbers of days?

We have revised our discussion to more accurately address the differences in the CV values among clusters with varying sample sizes. (Section 3.3, Para. 2, Line 1)

>Figure 2 is visually appealing but may be more complicated to interpret than a horizonal four-month graph. If the figure is retained, the authors might consider having time flow clockwise rather than counterclockwise.

We have adjusted the figure to rotate clockwise, and we have also included Figure S9 in the Supplementary Material, which presents the data with a horizontal time axis. (Figure 2; Figure S9)

View more View less

Competing Interests

No competing interests were disclosed.

Back to all reports

Reviewer Report

49 Views

19 Dec 2023 | for Version 1

Jerry S Cole, The Institute for Bird Populations, Petaluma, California, USA

49 Views Cite this report Responses(1)

Approved With Reservations

Is the work clearly and accurately presented and does it cite the current literature?

Yes
Is the study design appropriate and is the work technically sound?

Yes
Are sufficient details of methods and analysis provided to allow replication by others?

Yes
If applicable, is the statistical analysis and its interpretation appropriate?

Yes
Are all the source data underlying the results available to ensure full reproducibility?

Partly
Are the conclusions drawn adequately supported by the results?

Yes

Competing Interests

No competing interests were disclosed.

Reviewer Expertise

Bioacoustics, ecology, ornithology, automated sound classification

Respond to this report

Responses (1)

Author Response

01 Mar 2024

Shih-Hung Wu, Department of Biological Sciences, National Sun Yat-sen University, Kaohsiung, 804, Taiwan

Dear Dr. Jerry S. Cole,
We greatly appreciate your positive remarks and thoughtful suggestions, which will be instrumental in refining our paper. Here are our responses aligned with your comments:

Summary: The authors have produced a timely and well written article that helps to further advance our understanding of how VAR differs across species and in response to environmental conditions. I particularly enjoyed the section that used clustering to divide the full sampling season into distinct segments. It would be nice to see a bit more development of the discussion section, for instance suggestions for what additional avenues researchers might explore other than just replicating this study with other species in other locations. Thanks for the interesting read!

We have added a visionary suggestion at the end of the 'Conclusions' section, where we express our hope for researchers worldwide to collaboratively develop a universal standard for temporal sampling. This initiative aims to enhance the transferability of PAM data and to bolster the capability for data integration and analysis on a global scale. (Section 5, Para. 3)

Data Availability:
Not sure if I missed this but I don't see a location where the raw data - for instance the output summary of number of bird detections per minute for each species on a given sampling day and hour. I assume that at a minimum the raw output from the classifier would be needed to reproduce the data analyses reported here. I see the supplementary material with plots and overall number of detections, but that would not be enough to reproduce the statistical analyses.

Thank you for the reminder. We have now updated the compressed file 'VAR_m_all_columns.zip' in the "Underlying Data" to include the data on the number of calls per minute for each species at each PAM station, along with the associated environmental variable data. (Section 3.1, Para. 3, Line 5)

_________
Introduction

I enjoyed this introduction. Does a nice job to lead in to the aims of the study.

Paragraph (P) 3 - Might consider saying "more frequent vocalization" rather than "enhanced vocalization" to be more specific about what exactly is meant - if by "enhanced" you mean "more frequent and louder".

We have revised our manuscript in accordance with your suggestions. (Section 1, Para. 3, Line 3)

____________
Results

Section 3.1, P2 - Does "1,200 randomly sampled entries" mean detections by SILIC, or just random sections of the recordings? I think this is probably clarified later in the sentence, but something to think about.

As detailed in Section 2.4, Para. 4, for the evaluation of the SILIC model, we randomly selected 100 detections per species from the results identified by SILIC. Each sample includes a confidence score assigned by SILIC. This approach allows us to assess the performance of the SILIC model and to determine an appropriate confidence score threshold for each bird species relevant to our study. Subsequently, we filtered the total detection results to include only those with a confidence score equal to or exceeding the set threshold score. This process provided us with the necessary data on bird vocal activity for our research. We have revised the text to make the meaning more complete and clear. (Section 3.1, Para. 2, Line 2)

Fig. 1 and 2 - Really enjoyed this method of defining distinct sampling periods through a cluster analysis.

Thank you for acknowledging the validity of our analytical approach.

__________
Discussion

General - Should researchers analyze data from only a single survey season cluster period? If so, which period?And what does this mean for multi-species analysis?

In our study, Clusters 3 and 4 were identified as the optimal periods for acoustic surveys, as these intervals exhibited the lowest variation in vocalization activity across species. Moreover, these consecutive time frames totaled nearly eight weeks, offering greater flexibility in managing the rotation of recording devices. We have added an explanatory text in Section 4.2, Para. 1, Line 8.

P2 - Not sure what "interplay of internal ..." means. Does that mean the biology of the bird is partly responsible for differing VAR?

We wish to clarify that vocal activity is influenced by a multitude of factors including species, temporal aspects, and external environmental conditions. We have accordingly revised the wording of this paragraph in Section 4.1, Para. 1, Line 1.

__________
Conclusions

Seems like a good way to wrap it up. I do wonder about how this might apply to a full community of birds, but I suppose that is beyond the scope of this work.

We intend to continue exploring the impact of temporal sampling on other species and environments in future work. Additionally, we look forward to results shared by researchers from other regions, which would contribute to the development of more broadly applicable universal standards.

View more View less

Competing Interests

No competing interests were disclosed.

Alongside their report, reviewers assign a status to the article:

Approved - the paper is scientifically sound in its current form and only minor, if any, improvements are suggested

Approved with reservations - A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.

Not approved - fundamental flaws in the paper seriously undermine the findings and conclusions

[1] Alvarez-Berríos N, Campos-Cerqueira M, Hernández-Serna A, et al.: Impacts of Small-Scale Gold Mining on Birds and Anurans Near the Tambopata Natural Reserve, Peru, Assessed Using Passive Acoustic Monitoring. Trop. Conserv. Sci. 2016; 9(2): 832–851. Publisher Full Text

[2] Baillie SR: Monitoring terrestrial breeding bird populations.Goldsmith B, editor. Monitoring for Conservation and Ecology. Netherlands: Springer; 1991; pp. 112–132. Publisher Full Text

[3] Bateman HL, Riddle SB, Cubley ES: Using Bioacoustics to Examine Vocal Phenology of Neotropical Migratory Birds on a Wild and Scenic River in Arizona. Birds. 2021; 2(3): 261–274. Publisher Full Text

[4] Blicharska M, Smithers RJ, Mikusiński G, et al.: Biodiversity’s contributions to sustainable development. Nat. Sustain. 2019; 2(12): 1083–1093. Publisher Full Text

[5] Bruni A, Mennill DJ, Foote JR: Dawn chorus start time variation in a temperate bird community: Relationships with seasonality, weather, and ambient light. J. Ornithol. 2014; 155(4): 877–890. Publisher Full Text

[6] Campos-Cerqueira M, Aide TM: Improving distribution data of threatened species by combining acoustic monitoring and occupancy modelling. Methods Ecol. Evol. 2016; 7(11): 1340–1348. Publisher Full Text

[7] Catchpole CK, Slater PJ: Bird song: Biological themes and variations. Cambridge University Press; 2003.

[8] Cazalis V, Princé K, Mihoub J-B, et al.: Effectiveness of protected areas in conserving tropical forest birds. Nat. Commun. 2020; 11(1): 4461. PubMed Abstract | Publisher Full Text | Free Full Text

[9] Crespo J, Solís I, Barba E: Illegal Bird Hunting in Eastern Spain: A Declining Trend, But Still Worrying. Ardeola. 2020; 68(1). Publisher Full Text

[10] Deichmann JL, Hernández-Serna A, Delgado CJA, et al.: Soundscape analysis and acoustic monitoring document impacts of natural gas exploration on biodiversity in a tropical forest. Ecol. Indic. 2017; 74: 39–48. Publisher Full Text

[11] Diepstraten J, Willie J: Assessing the structure and drivers of biological sounds along a disturbance gradient. Glob. Ecol. Conserv. 2021; 31: e01819. Publisher Full Text

[12] Digby A, Towsey M, Bell BD, et al.: Temporal and environmental influences on the vocal behaviour of a nocturnal bird. J. Avian Biol. 2014; 45(6): 591–599. Publisher Full Text

[13] Ding T-S: Avian community ecology of mature forests in Mt. Yushan [Master Thesis]. National Taiwan University.1993.

[14] Duchac LS, Lesmeister DB, Dugger KM, et al.: Passive acoustic monitoring effectively detects Northern Spotted Owls and Barred Owls over a range of forest conditions. Condor. 2020; 122: duaa017. Publisher Full Text

[15] Ducrettet M, Forget P-M, Ulloa JS, et al.: Monitoring canopy bird activity in disturbed landscapes with automatic recorders: A case study in the tropics. Biol. Conserv. 2020; 245: 108574. Publisher Full Text

[16] Dueñas M-A, Hemming DJ, Roberts A, et al.: The threat of invasive species to IUCN-listed critically endangered species: A systematic review. Glob. Ecol. Conserv. 2021; 26: e01476. Publisher Full Text

[17] Favaro L, Cresta E, Friard O, et al.: Passive acoustic monitoring of the endangered African Penguin (Spheniscus demersus) using autonomous recording units and ecoacoustic indices. Ibis. 2021; 163(4): 1472–1480. Publisher Full Text

[18] Fraixedas S, Lindén A, Piha M, et al.: A state-of-the-art review on birds as indicators of biodiversity: Advances, challenges, and future directions. Ecol. Indic. 2020; 118: 106728. Publisher Full Text

[19] Franklin MJM, Major RE, Bradstock RA: How much survey effort is required to assess bird assemblages in fire-prone eucalypt forests using acoustic recorders? Wildl. Res. 2021; 48: 414–421. Publisher Full Text

[20] Hoefer S, McKnight DT, Allen-Ankins S, et al.: Passive acoustic monitoring in terrestrial vertebrates: A review. Bioacoustics. 2023; 32: 506–531. Publisher Full Text

[21] Jahn P, Ross J, MacKenzie D, et al.: Acoustic monitoring and occupancy analysis: Cost-effective tools in reintroduction programmes for roroa-great spotted kiwi. N. Z. J. Ecol. 2022; 46(1): 3466. Publisher Full Text

[22] Kahl S, Wood CM, Eibl M, et al.: BirdNET: A deep learning solution for avian diversity monitoring. Eco. Inform. 2021; 61: 101236. Publisher Full Text

[23] La Sorte FA, Horton KG, Johnston A, et al.: Seasonal associations with light pollution trends for nocturnally migrating bird populations. Ecosphere. 2022; 13(3). Publisher Full Text

[24] La VT: Diurnal and nocturnal birds vocalize at night: A review. Condor. 2012; 114(2): 245–257. Publisher Full Text

[25] Lahoz-Monfort JJ, Magrath MJL: A Comprehensive Overview of Technologies for Species and Habitat Monitoring and Conservation. Bioscience. 2021; 71(10): 1038–1062. PubMed Abstract | Publisher Full Text | Free Full Text

[26] Machado RB, Aguiar L, Jones G: Do acoustic indices reflect the characteristics of bird communities in the savannas of Central Brazil? Landsc. Urban Plan. 2017; 162: 36–43. Publisher Full Text

[27] Marques TA, Thomas L, Martin SW, et al.: Estimating animal population density using passive acoustics. Biol. Rev. 2013; 88(2): 287–309. PubMed Abstract | Publisher Full Text | Free Full Text

[28] de Melo TN , Cerqueira MC, D’Horta FM, et al.: Impacts of a large hydroelectric dam on the Madeira River (Brazil) on floodplain avifauna. Acta Amazon. 2021; 51(4): 298–310. Publisher Full Text

[29] Metcalf OC, Barlow J, Marsden S, et al.: Optimizing tropical forest bird surveys using passive acoustic monitoring and high temporal resolution sampling. Remote Sens. Ecol. Conserv. 2021; 8: 45–56. Publisher Full Text

[30] Minister of the Interior: The 4th Overall Review of Yushan National Park Plan. Minister of the Interior; 2022.

[31] Negret PJ, Maron M, Fuller RA, et al.: Deforestation and bird habitat loss in Colombia. Biol. Conserv. 2021; 257: 109044. Publisher Full Text

[32] Northrup JM, Rivers JW, Yang Z, et al.: Synergistic effects of climate and land-use change influence broad-scale avian population declines. Glob. Chang. Biol. 2019; 25(5): 1561–1575. PubMed Abstract | Publisher Full Text

[33] Odom KJ, Mennill DJ: A Quantitative Description of the Vocalizations and Vocal Activity of the Barred Owl. Condor. 2010; 112(3): 549–560. Publisher Full Text

[34] Pereira H, Davidcooper H: Towards the global monitoring of biodiversity change. Trends Ecol. Evol. 2006; 21(3): 123–129. Publisher Full Text

[35] Pérez-Granados C, Bota G, Giralt D, et al.: Vocal Activity Rate (VAR) index: A useful method to infer terrestrial bird abundance with acoustic monitoring. Ibis. 2019a; 161: 901–907. Publisher Full Text

[36] Pérez-Granados C, Gómez-Catasús J, Bustillo-de la Rosa D, et al.: Effort needed to accurately estimate Vocal Activity Rate index using acoustic monitoring: A case study with a dawn-time singing passerine. Ecol. Indic. 2019b; 107: 105608. Publisher Full Text

[37] Pérez-Granados C, Schuchmann K-L: Monitoring the annual vocal activity of two enigmatic nocturnal Neotropical birds: The Common Potoo (Nyctibius griseus) and the Great Potoo (Nyctibius grandis). J. Ornithol. 2020; 161(4): 1129–1141. Publisher Full Text

[38] Pérez-Granados C, Schuchmann K-L, Marques MI: Passive acoustic monitoring of the Ferruginous Pygmy-Owl (Glaucidium brasilianum) over a complete annual cycle: Seasonality and monitoring recommendations. Stud. Neotropical Fauna Environ. 2021; 1–8. Publisher Full Text

[39] Pérez-Granados C, Traba J: Esti mating bird density using passive acoustic monitoring: A review of methods and suggestions for further research. Ibis. 2021; 163: 765–783. Publisher Full Text

[40] Robbins CS: Bird activity levels related to weather. Stud. Avian Biol. 1981; 6: 301–310.

[41] Ross SRP-J, O’Connell DP, Deichmann JL, et al.: Passive acoustic monitoring provides a fresh perspective on fundamental ecological questions. Funct. Ecol. 2023; 37(4): 959–975. Publisher Full Text

[42] Rumelt RB, Basto A, Mere Roncal C: Automated audio recording as a means of surveying tinamous (Tinamidae) in the Peruvian Amazon. Ecol. Evol. 2021; 11(19): 13518–13531. PubMed Abstract | Publisher Full Text | Free Full Text

[43] Schaaf AA, Boullhesen M, Akmentins MS: Vocal activity of the rufous nightjar (Antrostomus rufus) in subtropical Yungas Andean forests of Argentina. Neotrop. Biodivers. 2023; 9(1): 59–63. Publisher Full Text

[44] Shaw T, Schönamsgruber S, Cordeiro Pereira JM, et al.: Refining manual annotation effort of acoustic data to estimate bird species richness and composition: The role of duration, intensity, and time. Ecol. Evol. 2022; 12(11): e9491. PubMed Abstract | Publisher Full Text | Free Full Text

[45] Shonfield J, Bayne EM: Autonomous recording units in avian ecological research: Current use and future applications. Avian Conserv. Ecol. 2017; 12(1): art14. Publisher Full Text

[46] Slagsvold T: Bird Song Activity in Relation to Breeding Cycle, Spring Weather, and Environmental Phenology. Ornis Scand. 1977; 8(2): 197. Publisher Full Text

[47] Sugai LSM, Silva TSF, Ribeiro JW, et al.: Terrestrial passive acoustic monitoring: Review and perspectives. Bioscience. 2019; 69(1): 15–25. Publisher Full Text

[48] Symes LB, Kittelberger KD, Stone SM, et al.: Analytical approaches for evaluating passive acoustic monitoring data: A case study of avian vocalizations. Ecol. Evol. 2022; 12(4): e8797. PubMed Abstract | Publisher Full Text | Free Full Text

[49] Thomisch K, Boebel O, Zitterbart DP, et al.: Effects of subsampling of passive acoustic recordings on acoustic metrics. J. Acoust. Soc. Am. 2015; 138(1): 267–278. PubMed Abstract | Publisher Full Text

[50] Tsai P-Y, Ko C-J, Hsieh C, et al.: A trait dataset for Taiwan’s breeding birds. Biodivers. Data J. 2020; 8: e49735. PubMed Abstract | Publisher Full Text | Free Full Text

[51] Vokurková J, Motombi FN, Ferenc M, et al.: Seasonality of vocal activity of a bird community in an Afrotropical lowland rain forest. J. Trop. Ecol. 2018; 34(1): 53–64. Publisher Full Text

[52] Wood CM, Kahl S, Chaon P, et al.: Survey coverage, recording duration and community composition affect observed species richness in passive acoustic surveys. Methods Ecol. Evol. 2021; 12(5): 885–896. Publisher Full Text

[53] Wood CM, Klinck H, Gustafson M, et al.: Using the ecological significance of animal vocalizations to improve inference in acoustic monitoring programs. Conserv. Biol. 2020; 35: 336–345. PubMed Abstract | Publisher Full Text

[54] Wu S-H, Chang H-W, Lin R-S, et al.: SILIC: A cross database framework for automatically extracting robust biodiversity information from soundscape recordings based on object detection and a tiny training dataset. Eco. Inform. 2022; 68: 101534. Publisher Full Text

[55] Wu S-H, Ko JC-J, Lin R-S, et al.: Underlying data for Evaluating community-wide temporal sampling in passive acoustic monitoring: A comprehensive study of avian vocal patterns in subtropical montane forests. [Dataset]. 2023. Publisher Full Text

[56] Xu Y, Si Y, Wang Y, et al.: Loss of functional connectivity in migration networks induces population decline in migratory birds. Ecol. Appl. 2019; 29(7): e01960. PubMed Abstract | Publisher Full Text | Free Full Text

[57] Yoo S, Kim H, Lee J, et al.: Seasonal and diurnal patterns of population vocal activity in avian brood parasites. Ibis. 2020; 162(3): 1001–1011. Publisher Full Text

[58] Zuur AF, Ieno EN, Smith GM: Analysing ecological data. Springer; 2007; vol. 680. . Publisher Full Text

[59] Zwerts JA, Stephenson PJ, Maisels F, et al.: Methods for wildlife monitoring in tropical forests: Comparing human observations, camera traps, and passive acoustic sensors. Conserv. Sci. Pract. 2021; 3(12). Publisher Full Text

Evaluating community-wide temporal sampling in passive acoustic monitoring: A comprehensive study of avian vocal patterns in subtropical montane forests

Abstract

Background

Methods

Results

Conclusions

Keywords

Revised Amendments from Version 1

1. Introduction

2. Methods

2.1 Study area

2.2 Soundscape data collection

2.3 Target species

2.4 Vocal detection and performance evaluation

2.5 Temporal sampling designs and statistical analyses

Table 1. Hourly sampling combinations from seven coverage designs.

3. Results

3.1 Vocal detection

3.2 Environment factors

Table 2. Model selection results for predicting daily vocal activity rate with GAMs.

3.3 Short-term periodic sampling

Figure 1. Hierarchical clustering of dates by vocal activity patterns.

Table 3. Day of year (DOY) clusters from hierarchical clustering analysis Using Daily Vocal Activity Rate.

Figure 2. Short-term periodic representation of vocal activity clusters and rainfall.

Figure 3. Variation in daily vocal activity rate among clusters.

3.4 Diel sampling

Figure 4. Diurnal patterns of hourly vocal activity for twelve target bird species.

3.5 Hourly sampling

Figure 5. Comparison of vocal activity rates across sampling designs for twelve target bird species.

4. Discussion

4.1 Species and environmental factors

4.2 Short-term periodic sampling

4.3 Diel and hourly sampling

5. Conclusions

Data availability

Underlying data

References

Comments on this article Comments (0)

Open Peer Review

Comments on this article Comments (0)

Open Peer Review

Reviewer Status

Reviewer Reports

Comments on this article

Browse by related subjects

Competing Interests Policy

Stay Updated