<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20190208//EN" "http://jats.nlm.nih.gov/publishing/1.2/JATS-journalpublishing1.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="data-paper" dtd-version="1.2" xml:lang="en">
    <front>
        <journal-meta>
            <journal-id journal-id-type="pmc">F1000Research</journal-id>
            <journal-title-group>
                <journal-title>F1000Research</journal-title>
            </journal-title-group>
            <issn pub-type="epub">2046-1402</issn>
            <publisher>
                <publisher-name>F1000 Research Limited</publisher-name>
                <publisher-loc>London, UK</publisher-loc>
            </publisher>
        </journal-meta>
        <article-meta>
            <article-id pub-id-type="doi">10.12688/f1000research.164537.2</article-id>
            <article-categories>
                <subj-group subj-group-type="heading">
                    <subject>Data Note</subject>
                </subj-group>
                <subj-group>
                    <subject>Articles</subject>
                </subj-group>
            </article-categories>
            <title-group>
                <article-title>A raster-based dataset for spatio-temporal analysis of forest fires in the Amazon rainforest from 2001 to 2020</article-title>
                <fn-group content-type="pub-status">
                    <fn>
                        <p>[version 2; peer review: 1 approved, 2 approved with reservations]</p>
                    </fn>
                </fn-group>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Mahmood</surname>
                        <given-names>Mateen</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Data Curation</role>
                    <role content-type="http://credit.niso.org/">Investigation</role>
                    <role content-type="http://credit.niso.org/">Software</role>
                    <role content-type="http://credit.niso.org/">Validation</role>
                    <role content-type="http://credit.niso.org/">Visualization</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <contrib contrib-type="author" corresp="yes">
                    <name>
                        <surname>Moraga</surname>
                        <given-names>Paula</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Conceptualization</role>
                    <role content-type="http://credit.niso.org/">Funding Acquisition</role>
                    <role content-type="http://credit.niso.org/">Supervision</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <uri content-type="orcid">https://orcid.org/0000-0001-5266-0201</uri>
                    <xref ref-type="corresp" rid="c1">a</xref>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <aff id="a1">
                    <label>1</label>Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, Makkah Province, 23955-6900, Saudi Arabia</aff>
            </contrib-group>
            <author-notes>
                <corresp id="c1">
                    <label>a</label>
                    <email xlink:href="mailto:paula.moraga@kaust.edu.sa">paula.moraga@kaust.edu.sa</email>
                </corresp>
                <fn fn-type="conflict">
                    <p>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>29</day>
                <month>1</month>
                <year>2026</year>
            </pub-date>
            <pub-date pub-type="collection">
                <year>2025</year>
            </pub-date>
            <volume>14</volume>
            <elocation-id>916</elocation-id>
            <history>
                <date date-type="accepted">
                    <day>24</day>
                    <month>1</month>
                    <year>2026</year>
                </date>
            </history>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2026 Mahmood M and Moraga P</copyright-statement>
                <copyright-year>2026</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <self-uri content-type="pdf" xlink:href="https://f1000research.com/articles/14-916/pdf"/>
            <abstract>
                <p>Forest fires are becoming increasingly common worldwide, posing a threat to the environment, economy, and society. Spatiotemporal analysis of forest fires is important to understand their characteristics and causes and to inform decision-making. This type of analysis requires the availability of a number of factors that contribute to fire occurrence, such as land use, environment, climate, and human activities, at high spatial and temporal resolutions. The South American Amazon rainforest covers a large area, and acquiring a useful dataset for analysis requires extensive effort and computer-intensive processing. This study investigates potential data sources, establishes a methodology, and prepares a dataset of attributes useful for spatiotemporal fire analysis. We provide a raster-based dataset that includes fires, land use, environment, and climate factors at a spatial resolution of 500 m and monthly temporal resolution from 2001 to 2020, which facilitates the analysis of forest fires in the Amazon. Moreover, because data sources and implementation procedures are detailed, this work also encourages similar research in other parts of the world.</p>
            </abstract>
            <kwd-group kwd-group-type="author">
                <kwd>Amazon; Fires; Burnt Area; Land Cover; Elevation; Precipitation; Humidity; Temperature</kwd>
            </kwd-group>
            <funding-group>
                <funding-statement>The author(s) declared that no grants were involved in supporting this work.</funding-statement>
            </funding-group>
        </article-meta>
        <notes>
            <sec sec-type="version-changes">
                <label>Revised</label>
                <title>Amendments from Version 1</title>
                <p>This version adds the full data product names and IDs, clarifies the data integration and resolution procedures, and expands the description of the coordinate transformation process. The data processing and technical validation section has been updated to better highlight quality control procedures. It also includes several rephrased and clarifying statements to improve overall clarity.</p>
            </sec>
        </notes>
    </front>
    <body>
        <sec id="sec1" sec-type="intro">
            <title>Introduction</title>
            <p>The alarming increase in the frequency and severity of forest fires around the globe has become a significant threat to forested areas worldwide. These wildfires not only threaten human lives and their properties but also continue to contribute to the reshaping of local and global ecosystems. Because of their varying spatiotemporal nature at multiple scales, they are substantially diverse in their frequency, size, intensity, and pattern.
                <sup>
                    <xref ref-type="bibr" rid="ref1">1</xref>
                </sup> Similarly, the source of ignition is an amalgamation of numerous aspects such as weather, climate, land use, and other causes such as lightning, volcanic eruptions, rockfalls, and combustion material.
                <sup>
                    <xref ref-type="bibr" rid="ref2">2</xref>
                </sup> This constant vulnerability of forests exposed to wildfires is horrifying, but when considered in the context of ecological and socio-economic consequences, it poses a major challenge to fire management authorities and related stakeholders.
                <sup>
                    <xref ref-type="bibr" rid="ref3">3</xref>
                </sup>
            </p>
            <p>To ensure better preparedness and deploy improved preventive measures, the spatio-temporal relations between the probable causes of wildfires and the characteristics of those fire incidents must be analyzed. Such analysis will not only assist with mitigation but may also aid in the prediction and forecasting of future events by better understanding the underlying events propagating fire occurrences.
                <sup>
                    <xref ref-type="bibr" rid="ref4">4</xref>
                </sup> Such in-depth spatio-temporal statistical investigations of these complex interactions require the collection of all available associated attributes, combined from heterogeneous sources (with varying extents, spatial scales, temporal resolutions, file formats, etc.) into a processed unified structure available in the form of common specifications.</p>
            <p>The South American Amazon is one of the largest rainforests in the world
                <sup>
                    <xref ref-type="bibr" rid="ref5">5</xref>
                </sup> and hosts thousands of wildfires annually.
                <sup>
                    <xref ref-type="bibr" rid="ref6">6</xref>
                </sup> Despite numerous studies related to spatio-temporal statistical analysis of forest fires in many regions of the world,
                <sup>
                    <xref ref-type="bibr" rid="ref2">2</xref>,
                    <xref ref-type="bibr" rid="ref4">4</xref>,
                    <xref ref-type="bibr" rid="ref7">7</xref>&#x2013;
                    <xref ref-type="bibr" rid="ref9">9</xref>
                </sup> there remains a notable scarcity of basin-wide, multivariate longitudinal studies for the entire Amazon region. While some research has addressed specific drivers of ignition, existing Amazon-specific studies tend to be limited to sub-regions or specific administrative boundaries.
                <sup>
                    <xref ref-type="bibr" rid="ref3">3</xref>,
                    <xref ref-type="bibr" rid="ref10">10</xref>
                </sup> For a study area of this size, data collection is a time-intensive task, with exhaustive pre-processing requiring cumbersome setups. Hence, the development of an Amazon-wide database that includes all available attributes related to fires, integrated into a common format, is required.</p>
            <p>The aim of this work is to provide a scientific community with a dataset related to spatiotemporal forest fire analysis for the Amazon region. The dataset includes historical data of 20 years (2001-2020) in a monthly temporal resolution for the complete extent of the Amazon region at a spatial scale of 500 m. Because the study area of the entire Amazon rainforest is large, the raw data sources must be at a global or regional level (in South America). Otherwise, data for the same attribute are expected to be gathered from multiple local-level sources, raising concerns regarding data integrity. Global- and regional-level satellite-based raster products were acquired and further clipped for the South American region to compute three types of data: 
                <italic toggle="yes">(a) raw data</italic>, 
                <italic toggle="yes">(b) pre-processed data</italic> and 
                <italic toggle="yes">(c) working data.</italic> A schematic overview of this study is presented in 
                <xref ref-type="fig" rid="f1">
Figure 1</xref>. 
                <italic toggle="yes">Raw data</italic> refer to data file(s) extracted from the accessed data packages (i.e., data layer of the subject attribute, taken out from the data package containing various other attribute layers as well). The extracted attribute layers have varying spatial resolutions, dissimilar spatial extents, different spatial projections, and inconsistent file formats. Raw data are pre-processed to acquire 
                <italic toggle="yes">pre-processed data</italic>, with the attribute layers in a consistent file format and with the same projection system. Finally, all attribute layers are processed to obtain 
                <italic toggle="yes">working data</italic>, with the data extent confined to the Amazon region and with fixed spatial resolution, such that each raster cell of an attribute layer aligns exactly over the raster cell of the other attribute layer.</p>
            <fig fig-type="figure" id="f1" orientation="portrait" position="float">
                <label>
Figure 1. </label>
                <caption>
                    <title>Schematic overview of the data processing process.</title>
                </caption>
                <graphic id="gr1" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/195424/8e7ed734-d448-496a-b237-210e5b5e148c_figure1.gif"/>
            </fig>
            <p>This manuscript presents the complete process of data collection for raster-based attributes of forest fires in the Amazon rainforest, along with a description of the methodological baseline and details of the implementation process. The availability of such a ready-made dataset with a detailed methodology of data collection and computer-intensive preprocessing procedures will be useful to many researchers working in the domain of forest fire analysis. For example, this dataset has been used to map the geographic and temporal distributions of burned areas and risk factors in the Amazon from 2001 to 2020 using an ensemble approach that harnesses a range of machine learning algorithms.
                <sup>
                    <xref ref-type="bibr" rid="ref11">11</xref>
                </sup> Furthermore, this dataset provides encouragement for developing similar datasets tailored to varying study regions, spatial resolutions, and research domains.
                <sup>
                    <xref ref-type="bibr" rid="ref12">12</xref>
                </sup>
            </p>
        </sec>
        <sec id="sec2" sec-type="methods">
            <title>Methods</title>
            <p>The Amazon rainforest has an area of over 5.2 million square kilometers, covers approximately one-third of South America, and extends into eight countries.
                <sup>
                    <xref ref-type="bibr" rid="ref5">5</xref>
                </sup> Within this region, data management authorities in each country generally focus on their own regions. To create a database for the entire extent of the Amazon rainforest and to ensure that all relevant areas of potential importance are included in the study area, we defined the study area for this work as the entire Amazon basin, as shown in 
                <xref ref-type="fig" rid="f2">
Figure 2</xref>. The extent of the study area can be defined as -79.43629, -18.00816: -44.49108, 8.66346 with the coordinate reference system EPSG:4326 - World Geodetic System (WGS) 84 - Geographic. For spatiotemporal modeling, the selection of the data period needs to have a considerable temporal range as well as data availability for the chosen period. A review of the literature related to spatiotemporal modeling of forest fires, as summarized in 
                <xref ref-type="table" rid="T1">
Table 1</xref>, indicates that a period of 5-30 years with monthly or yearly frequency is used for the temporal characterization of forest fires. Keeping in view what is available for the Amazon Rainforest (for the whole region), we decided to proceed with a data period of 20 years from 2001 to 2020, with a monthly frequency as the temporal resolution. The spatial resolution was finalized as 500 m for the final 
                <italic toggle="yes">spatial grid.</italic> This is based not only on the available data for the Amazon Rainforest but also on the computational complexity involved in a study area of approximately 5 million square kilometers.</p>
            <fig fig-type="figure" id="f2" orientation="portrait" position="float">
                <label>
Figure 2. </label>
                <caption>
                    <title>Study area of Amazon rainforest.</title>
                    <p>Amazon boundary obtained from.
                        <sup>
                            <xref ref-type="bibr" rid="ref20">20</xref>
                        </sup>
                    </p>
                </caption>
                <graphic id="gr2" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/195424/8e7ed734-d448-496a-b237-210e5b5e148c_figure2.gif"/>
            </fig>
            <table-wrap id="T1" orientation="portrait" position="float">
                <label>
Table 1. </label>
                <caption>
                    <title>Summary of study characteristics from previous works related to forest fire analysis.</title>
                </caption>
                <table content-type="article-table" frame="hsides">
                    <thead>
                        <tr>
                            <th align="left" colspan="1" rowspan="1" valign="top">Reference</th>
                            <th align="left" colspan="1" rowspan="1" valign="top">Study region</th>
                            <th align="left" colspan="1" rowspan="1" valign="top">Study area</th>
                            <th align="left" colspan="1" rowspan="1" valign="top">Data period</th>
                            <th align="left" colspan="1" rowspan="1" valign="top">
Temporal resolution</th>
                        </tr>
                    </thead>
                    <tbody>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">A
                                <sup>
                                    <xref ref-type="bibr" rid="ref4">4</xref>
                                </sup>
                            </td>
                            <td align="left" colspan="1" rowspan="1" valign="top">Southern France</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">40,000 sq.km</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">1995-2018</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">Monthly</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">B
                                <sup>
                                    <xref ref-type="bibr" rid="ref21">21</xref>
                                </sup>
                            </td>
                            <td align="left" colspan="1" rowspan="1" valign="top">Autazes, Brazil</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">7,632 sq.km</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">1985-2015</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">Monthly</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">C
                                <sup>
                                    <xref ref-type="bibr" rid="ref22">22</xref>
                                </sup>
                            </td>
                            <td align="left" colspan="1" rowspan="1" valign="top">South Korea</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">99,720 sq.km</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">1980-2000</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">Annual</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">D
                                <sup>
                                    <xref ref-type="bibr" rid="ref2">2</xref>,
                                    <xref ref-type="bibr" rid="ref7">7</xref>,
                                    <xref ref-type="bibr" rid="ref23">23</xref>
                                </sup>
                            </td>
                            <td align="left" colspan="1" rowspan="1" valign="top">Catalonia, Spain</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">30,000 sq.km</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">2004-2008</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">Multi-Year
</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">E
                                <sup>
                                    <xref ref-type="bibr" rid="ref8">8</xref>
                                </sup>
                            </td>
                            <td align="left" colspan="1" rowspan="1" valign="top">Castellon, Spain</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">6,632 sq.km</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">2001-2006</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">Multi-Year
</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">F
                                <sup>
                                    <xref ref-type="bibr" rid="ref24">24</xref>
                                </sup>
                            </td>
                            <td align="left" colspan="1" rowspan="1" valign="top">Islamabad, Pakistan</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">158 sq.km</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">2005-2018</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">Multi-Month
</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">G
                                <sup>
                                    <xref ref-type="bibr" rid="ref25">25</xref>
                                </sup>
                            </td>
                            <td align="left" colspan="1" rowspan="1" valign="top">California and Nevada, USA</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">120,000 sq.km</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">1984-2006</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">Multi-Month
</td>
                        </tr>
                    </tbody>
                </table>
            </table-wrap>
            <p>In addition to the study design involving spatial resolution, temporal frequency, and spatial data extent, another equally important aspect is the selection of covariates. These variables can be broadly categorized as attributes related to land use, climate, the environment, topography, and human activities. Land use and land cover (LULC) variables are highly related to forest fires, as the type of land surface not only determines fire ignition but also its propagation. Climatic variables, such as humidity, precipitation, wind speed, and temperature, also influence the occurrence of forest fires. Topographic variables such as elevation, slope, and aspect are also of core importance as they regulate how quickly a fire will move up or down the hills. Finally, human activities also play a critical role in the initiation of forest fires. Hence, variables such as population density, buildings, and the urban-forest interface are of high significance. 
                <xref ref-type="table" rid="T2">
Table 2</xref> summarizes the list of potential forest fire analysis attributes discussed in the literature.</p>
            <table-wrap id="T2" orientation="portrait" position="float">
                <label>
Table 2. </label>
                <caption>
                    <title>Summary of study attributes from previous works related to forest fire analysis.</title>
                </caption>
                <table content-type="article-table" frame="hsides">
                    <thead>
                        <tr>
                            <th align="left" colspan="1" rowspan="1" valign="top">Reference</th>
                            <th align="left" colspan="1" rowspan="1" valign="top">
Description of attributes</th>
                        </tr>
                    </thead>
                    <tbody>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">F
                                <sup>
                                    <xref ref-type="bibr" rid="ref3">3</xref>,
                                    <xref ref-type="bibr" rid="ref8">8</xref>,
                                    <xref ref-type="bibr" rid="ref9">9</xref>,
                                    <xref ref-type="bibr" rid="ref26">26</xref>,
                                    <xref ref-type="bibr" rid="ref27">27</xref>
                                </sup>
                            </td>
                            <td align="left" colspan="1" rowspan="1" valign="top">Land Use Effects/Vegetation Type/Deforestation/Forest Type/Land Cover</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">G
                                <sup>
                                    <xref ref-type="bibr" rid="ref4">4</xref>,
                                    <xref ref-type="bibr" rid="ref9">9</xref>,
                                    <xref ref-type="bibr" rid="ref26">26</xref>
                                </sup>
                            </td>
                            <td align="left" colspan="1" rowspan="1" valign="top">Population Density/Housing Density/Buildings</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">H
                                <sup>
                                    <xref ref-type="bibr" rid="ref3">3</xref>,
                                    <xref ref-type="bibr" rid="ref4">4</xref>,
                                    <xref ref-type="bibr" rid="ref26">26</xref>,
                                    <xref ref-type="bibr" rid="ref27">27</xref>
                                </sup>
                            </td>
                            <td align="left" colspan="1" rowspan="1" valign="top">Elevation, Slope and Aspect</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">I
                                <sup>
                                    <xref ref-type="bibr" rid="ref9">9</xref>
                                </sup>
                            </td>
                            <td align="left" colspan="1" rowspan="1" valign="top">Humidity</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">J
                                <sup>
                                    <xref ref-type="bibr" rid="ref9">9</xref>
                                </sup>
                            </td>
                            <td align="left" colspan="1" rowspan="1" valign="top">Wind Speed</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">K
                                <sup>
                                    <xref ref-type="bibr" rid="ref4">4</xref>,
                                    <xref ref-type="bibr" rid="ref9">9</xref>,
                                    <xref ref-type="bibr" rid="ref26">26</xref>,
                                    <xref ref-type="bibr" rid="ref27">27</xref>
                                </sup>
                            </td>
                            <td align="left" colspan="1" rowspan="1" valign="top">Temperature</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">L
                                <sup>
                                    <xref ref-type="bibr" rid="ref4">4</xref>,
                                    <xref ref-type="bibr" rid="ref9">9</xref>
                                </sup>
                            </td>
                            <td align="left" colspan="1" rowspan="1" valign="top">Precipitation</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">M
                                <sup>
                                    <xref ref-type="bibr" rid="ref8">8</xref>
                                </sup>
                            </td>
                            <td align="left" colspan="1" rowspan="1" valign="top">Isothermality</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">N
                                <sup>
                                    <xref ref-type="bibr" rid="ref4">4</xref>
                                </sup>
                            </td>
                            <td align="left" colspan="1" rowspan="1" valign="top">Protected Zones</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">O
                                <sup>
                                    <xref ref-type="bibr" rid="ref3">3</xref>,
                                    <xref ref-type="bibr" rid="ref8">8</xref>,
                                    <xref ref-type="bibr" rid="ref9">9</xref>,
                                    <xref ref-type="bibr" rid="ref26">26</xref>
                                </sup>
                            </td>
                            <td align="left" colspan="1" rowspan="1" valign="top">Road Density, Distance to Road</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">P
                                <sup>
                                    <xref ref-type="bibr" rid="ref3">3</xref>
                                </sup>
                            </td>
                            <td align="left" colspan="1" rowspan="1" valign="top">Maximum Cumulative Water Deficit</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">Q
                                <sup>
                                    <xref ref-type="bibr" rid="ref3">3</xref>,
                                    <xref ref-type="bibr" rid="ref8">8</xref>
                                </sup>
                            </td>
                            <td align="left" colspan="1" rowspan="1" valign="top">Soil Type/Soil Texture/Soil Permeability</td>
                        </tr>
                    </tbody>
                </table>
            </table-wrap>
            <sec id="sec3">
                <title>Data collection</title>
                <p>From the list of attributes identified from the literature as potentially related to forest-fire analysis (
                    <xref ref-type="table" rid="T2">
Table 2</xref>), not all of them are available for the entire Amazon Rainforest, let alone for the study period 2001-2020. Specifically, variables such as protected zones, isothermality, and maximum cumulative water deficit were only available for certain regions and for a particular time period. Similarly, elevation-related attributes were only available for certain years between the period 2001-2020. In this study, attributes that were available for the complete Amazon region and for the selected time period of 2001-2020, are identified and further acquired, as detailed in 
                    <xref ref-type="table" rid="T3">
Table 3</xref>, with 
                    <italic toggle="yes">Date of Access: 01 May 2022.</italic> This section details the complete data-acquisition process related to each collected attribute.</p>
                <table-wrap id="T3" orientation="portrait" position="float">
                    <label>
Table 3. </label>
                    <caption>
                        <title>Summary of collected attributes related to forest fire analysis, with original temporal resolution of monthly frequency (except Land Cover which is Annual, and Elevation which is One time).</title>
                        <p>These attributes were pre-processed to acquire 
                            <italic toggle="yes">working data</italic> at 500 meters and monthly resolution, for the period of 2001 to 2020.</p>
                    </caption>
                    <table content-type="article-table" frame="hsides">
                        <thead>
                            <tr>
                                <th align="left" colspan="1" rowspan="1" valign="top">S#</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">Variable name</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">Description</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">Spatial resolution</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">
Source</th>
                            </tr>
                        </thead>
                        <tbody>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">1.</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Burnt Area</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Classes (Burnt, Not Burnt, Water)</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">500 meters</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">MODIS
                                    <sup>
                                        <xref ref-type="bibr" rid="ref28">28</xref>
                                    </sup>
                                </td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">2.</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Land Cover (Annual)</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">11 Classes of Land Cover</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">5,600 meters</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">MODIS
                                    <sup>
                                        <xref ref-type="bibr" rid="ref29">29</xref>
                                    </sup>
                                </td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">3.</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Precipitation</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Average rate of precipitation</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">10,000 meters</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">GES-DISC
                                    <sup>
                                        <xref ref-type="bibr" rid="ref30">30</xref>
                                    </sup>
                                </td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">4.</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Soil Moisture</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Model-calculated
</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">37,000 meters</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">CPC
                                    <sup>
                                        <xref ref-type="bibr" rid="ref31">31</xref>
                                    </sup>
                                </td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">5.</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Elevation (One-time)</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Based on Digital Elevation Model</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">1,000 meters</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">EarthEnv
                                    <sup>
                                        <xref ref-type="bibr" rid="ref32">32</xref>
                                    </sup>
                                </td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">6.</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Land Surface Temperature</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Daytime observations</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">5,000 meters</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">MODIS
                                    <sup>
                                        <xref ref-type="bibr" rid="ref33">33</xref>
                                    </sup>
                                </td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">7.</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Specific Humidity</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Model-calculated
</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">1,000 meters</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">GES DISC
                                    <sup>
                                        <xref ref-type="bibr" rid="ref34">34</xref>
                                    </sup>
                                </td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">8.</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Evapotranspiration (ET)</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Model-calculated
</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">1,000 meters</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">GES DISC
                                    <sup>
                                        <xref ref-type="bibr" rid="ref34">34</xref>
                                    </sup>
                                </td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">9.</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Near Surface Wind Speed</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Model-calculated
</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">1,000 meters</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">GES DISC
                                    <sup>
                                        <xref ref-type="bibr" rid="ref34">34</xref>
                                    </sup>
                                </td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">10.</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Near Surface Air Temperature</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Model-calculated
</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">1,000 meters</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">GES DISC
                                    <sup>
                                        <xref ref-type="bibr" rid="ref34">34</xref>
                                    </sup>
                                </td>
                            </tr>
                        </tbody>
                    </table>
                </table-wrap>
                <p>

                    <bold>

                        <italic toggle="yes">Burnt Area (BA)</italic>
</bold>
                </p>
                <p>The data product acquired was MODIS/Terra+Aqua Direct Broadcast Burned Area Monthly L3 Global 500 m SIN Grid V006 MCD64A1 Version 6.1, which is a gridded burnt area product at a resolution of 500 m, available in Hierarchical Data Format (HDF) format. The product provides the date of burn (in the form of the day of the year) for individual cells with additional classes, such as unburnt, missing data, and water. The data product is available for the period 2000 to the present (2022), with global spatial coverage in the form of regional subsets. The layers extracted from the data source are for regions 5 and 6, which cover the Amazon area. The data layer values are in units of a day, with a valid range of data values as between 1-366 (representing the day of the year). Further details related to the product, including the quality assessment and known issues, are available at MODIS MCD64A1 (
                    <ext-link ext-link-type="uri" xlink:href="https://lpdaac.usgs.gov/products/mcd64a1v061/">https://lpdaac.usgs.gov/products/mcd64a1v061/</ext-link>

                    <ext-link ext-link-type="uri" xlink:href="https://lpdaac.usgs.gov/products/mcd64a1v061/">)</ext-link>.</p>
                <p>As the burnt area product is available at the regional level, an additional data processing step for the burnt area product is the merging of two separate regional-level products to cover the entire region of the Amazon basin boundary. Additionally, the data were re-classified to assign a single value of 1 to all burn dates (1-366) to identify the cell with burn data as simply burnt. Hence, working data has four classes (burnt, unburnt, missing, and water) with values (1, 0, -1, and -2), respectively.</p>
                <p>

                    <bold>

                        <italic toggle="yes">Land Cover (LC)</italic>
</bold>
                </p>
                <p>The data product acquired was MODIS/Terra+Aqua Land Cover Type Yearly L3 Global 0.05Deg CMG V006 MCD12C1 Version 6, which consists of three gridded land cover classification schemes at a resolution of 5,600 m, available in the HDF format. The three available classification schemes include 
                    <italic toggle="yes">Maps of the International Geosphere-Biosphere Programme (IGBP)</italic> providing 17 classes, 
                    <italic toggle="yes">University of Maryland (UMD)</italic> providing 16 classes, and 
                    <italic toggle="yes">Leaf Area Index (LAI)</italic> providing 11 classes. LAI classification schemes are extracted from the data product as 11 classes are sufficient for representation of different land covers in terms of Water, Urban, Forest, Grassland, etc., and additional classes available in other schemes are further subdivisions of forests and grassland types. The data product is available for the period 2000 to the present (2022) with global spatial coverage. The details of the land cover classes of the LAI scheme are provided in 
                    <xref ref-type="table" rid="T4">
Table 4</xref>. The name of the layer extracted from the data source is Land Cover Type-3, with a range of data values between classes 0 and 10. Further details related to the product, including the quality assessment and known issues, are available at MODIS MCD12C1 (
                    <ext-link ext-link-type="uri" xlink:href="https://lpdaac.usgs.gov/products/mcd12c1v006/">https://lpdaac.usgs.gov/products/mcd12c1v006/</ext-link>

                    <ext-link ext-link-type="uri" xlink:href="https://lpdaac.usgs.gov/products/mcd12c1v006/">)</ext-link>.</p>
                <table-wrap id="T4" orientation="portrait" position="float">
                    <label>
Table 4. </label>
                    <caption>
                        <title>Class details of Leaf Area Index (LAI) classification scheme, from MODIS.
                            <sup>
                                <xref ref-type="bibr" rid="ref29">29</xref>
                            </sup>
                        </title>
                    </caption>
                    <table content-type="article-table" frame="hsides">
                        <thead>
                            <tr>
                                <th align="left" colspan="1" rowspan="1" valign="top">Class name</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">Value</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">Description</th>
                            </tr>
                        </thead>
                        <tbody>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Water Bodies</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">0</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Permanent water bodies</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Grasslands</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">1</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Dominated by herbaceous annuals (&lt;2 m)</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Shrublands</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">2</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Shrub (1-2 m)</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Broadleaf Croplands</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">3</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Dominated by herbaceous annuals (&lt;2 m) - cultivated with broadleaf crops</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Savannas</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">4</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">From 10% to 60% tree cover (&gt;2 m)</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Evergreen Broadleaf Forests</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">5</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Dominated by evergreen broadleaf and palmate trees (&gt;2 m)</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Deciduous Broadleaf Forests</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">6</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Dominated by deciduous broadleaf trees (&gt;2 m)</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Evergreen Needleleaf Forests</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">7</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Dominated by evergreen conifer trees (&gt;2 m)</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Deciduous Needleleaf Forests</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">8</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Dominated by deciduous needleleaf (larch) tree (&gt;2 m)</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Non-Vegetated Lands</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">9</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Non-vegetated barren (sand, rock, soil) /permanent snow and ice</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Urban and Built-up Lands</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">10</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Impervious surface area including building materials, asphalt, and vehicles</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Unclassified</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">255</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Missing inputs</td>
                            </tr>
                        </tbody>
                    </table>
                </table-wrap>
                <p>

                    <bold>

                        <italic toggle="yes">Precipitation</italic>
</bold>
                </p>
                <p>The data product acquired is Integrated Multi-satellite Retrievals for GPM (Global Precipitation Measurement)-based multi-satellite precipitation product, Version 06 B, available in Hierarchical Data Format version 5 (HDF5) format. The product provides a monthly product of average precipitation rates at a 0.1 &#x00b0;&#x00d7; 0.1 &#x00b0; (approximately 10,000 m at the equator) spatial resolution, estimated from numerous precipitation-relevant satellite passive microwave (PMW) sensors. The dataset is available for 2000&#x2013;2021 with global spatial coverage. The values are represented in 
                    <italic toggle="yes">millimeters per hour</italic> (
                    <italic toggle="yes">mm/hr</italic>), with a scale factor of 1000 and missing values marked with -9999. Thus, a value of 500 indicates 500/1000 mm/h. Further details related to the product are available at the GES-DISC GPM IMERG Final Precipitation L3 
                    <ext-link ext-link-type="uri" xlink:href="https://disc.gsfc.nasa.gov/datasets/GPM_3IMERGM_06/summary">(</ext-link>

                    <ext-link ext-link-type="uri" xlink:href="https://disc.gsfc.nasa.gov/datasets/GPM_3IMERGM_06/summary">https://disc.gsfc.nasa.gov/datasets/GPM_3IMERGM_06/summary</ext-link>

                    <ext-link ext-link-type="uri" xlink:href="https://disc.gsfc.nasa.gov/datasets/GPM_3IMERGM_06/summary">)</ext-link>.</p>
                <p>

                    <bold>

                        <italic toggle="yes">Soil moisture</italic>
</bold>
                </p>
                <p>The data product acquired is a model-calculated (not directly observed) averaged soil moisture water height equivalent, namely CPC Soil Moisture Version 2, available in the GEOTIFF format. The data are a monthly product of 0.5 &#x00b0;&#x00d7; 0.5 &#x00b0;(approximately 37,000 m at the equator) spatial resolution, with data available from 1948 to the present (2022). The spatial coverage of the product is 89.75N&#x2013;89.75S, 0.25E&#x2013;359.75E. The values are represented in 
                    <italic toggle="yes">millimeters</italic> (
                    <italic toggle="yes">mm</italic>), with missing values marked as -9999. Further details related to the product are available at CPC Soil Moisture (
                    <ext-link ext-link-type="uri" xlink:href="https://psl.noaa.gov/data/gridded/data.cpcsoil.html">https:</ext-link>

                    <ext-link ext-link-type="uri" xlink:href="https://psl.noaa.gov/data/gridded/data.cpcsoil.html">//psl.noaa.gov/data/gridded/data.cpcsoil.html</ext-link>

                    <ext-link ext-link-type="uri" xlink:href="https://psl.noaa.gov/data/gridded/data.cpcsoil.html">)</ext-link>.</p>
                <p>In the preprocessing of the Soil Moisture data product, data transformation is implemented as an additional step. As the source data have a spatial offset, not aligning with the reference base map, the data are transformed to correct alignment using the Geospatial Data Abstraction Library (GDAL).
                    <sup>
                        <xref ref-type="bibr" rid="ref13">13</xref>
                    </sup>
                </p>
                <p>

                    <bold>

                        <italic toggle="yes">Elevation</italic>
</bold>
                </p>
                <p>The acquired data product is a global multivariate package related to terrain features, which can serve many large-scale research publications. The data product is based on a 250 m Digital Elevation Model (DEM), available in Tagged Image File Format (TIF) format, from Global Multi-Resolution Terrain Elevation Data 2010 (GMTED2010).
                    <sup>
                        <xref ref-type="bibr" rid="ref14">14</xref>
                    </sup> This data product provides many topographic variables, such as elevation, slope, aspect, northness, elasticity, roughness index, and topographic position index at different resolutions of 1, 10, 50, or 100 km, with global spatial coverage; however, our focus is only on elevation. The Elevation values are represented in 
                    <italic toggle="yes">meters</italic> (
                    <italic toggle="yes">m</italic>). Further details related to this product are available at 
                    <ext-link ext-link-type="uri" xlink:href="https://www.earthenv.org/topography">(</ext-link>

                    <ext-link ext-link-type="uri" xlink:href="https://www.earthenv.org/topography">https://www.earthenv.org/topography</ext-link>

                    <ext-link ext-link-type="uri" xlink:href="https://www.earthenv.org/topography">)</ext-link>.</p>
                <p>

                    <bold>

                        <italic toggle="yes">Land Surface Temperature (LST)</italic>
</bold>
                </p>
                <p>The data product acquired was MODIS/Terra Land-Surface Temperature/Emissivity Monthly Global 0.05Deg CMG MOD11C3 Version 6, which is a monthly Land Surface Temperature &amp; Emissivity (LST&amp;E) value product at a spatial resolution of 0.05 &#x00b0; (approximately 5,600 m), available in the HDF format. The data product provides values for both daytime and nighttime observations, along with other details related to the quality assessment. The data product is available for the period 2000 to the present (2022) with global spatial coverage. The temperature values are represented in 
                    <italic toggle="yes">kelvin</italic> (
                    <italic toggle="yes">K</italic>), with a scale factor of 0.02 and a range of values between 7,500 and 65,535. Thus, the LST value equal to X represents X*0.02 kelvin. Further details related to this product are available at MODIS MOD11C3 
                    <ext-link ext-link-type="uri" xlink:href="https://lpdaac.usgs.gov/products/mod11c3v006/">(</ext-link>

                    <ext-link ext-link-type="uri" xlink:href="https://lpdaac.usgs.gov/products/mod11c3v006/">https://lpdaac.usgs.gov/products/mod11c3v006/</ext-link>

                    <ext-link ext-link-type="uri" xlink:href="https://lpdaac.usgs.gov/products/mod11c3v006/">)</ext-link>.</p>
                <p>

                    <bold>

                        <italic toggle="yes">Specific humidity, Evapotranspiration (ET), wind and air temperature</italic>
</bold>
                </p>
                <p>The acquired data provides a set of parameters related to land surface observations. The data is a simulation-based product of the Noah 3.6.1, model from Famine Early Warning Systems, Network (FEWS NET) Land Data Assimilation System (FLDAS). All the provided variables are available as a monthly product in a 0.10 degree spatial resolution (approximately 1,000 m at the equator) and available (as a layer) in NETCDF file format. The dataset is available for the period from 1982 to the present (2022) with global spatial coverage. The values of Specific Humidity are represented as (
                    <italic toggle="yes">kg/kg</italic>), using a ratio between kilogram of water (moisture) per kilogram of air; whereas Evapotranspiration, Wind and Air Temperature are measured in (
                    <italic toggle="yes">kg/m</italic>
                    <sup>2</sup>
                    <italic toggle="yes">s</italic>), (
                    <italic toggle="yes">m/s</italic>) and 
                    <italic toggle="yes">kelvin</italic> (
                    <italic toggle="yes">K</italic>), respectively. Further details related to the product are available at the GES DISC-FLDAS Noah Land Surface Model L4 (
                    <ext-link ext-link-type="uri" xlink:href="https://disc.gsfc.nasa.gov/datasets/FLDAS_NOAH01_C_GL_M_001/summary">https://disc.gsfc.nasa.gov/datasets/FLDAS_NOAH01_C_GL_M_001/summary</ext-link>

                    <ext-link ext-link-type="uri" xlink:href="https://disc.gsfc.nasa.gov/datasets/FLDAS_NOAH01_C_GL_M_001/summary">)</ext-link>.</p>
                <p>While these model-derived variables (including Soil Moisture) introduce inherent numerical uncertainties compared to direct field observations, they are incorporated to expand the suite of available environmental covariates, providing the multivariate depth necessary for robust spatiotemporal analysis across the Amazon. As the primary objective of this work is the curation and standardization of a basin-wide Amazon dataset, a formal independent uncertainty analysis remains beyond the scope of this data curation effort. By providing these products in a common, analysis-ready format, this work establishes the necessary foundation for future studies to conduct such analytical sensitivity assessments and empirical validations.</p>
            </sec>
            <sec id="sec4">
                <title>Data processing</title>
                <p>All of the various attributes collected in the database have different spatial resolutions, as described in 
                    <xref ref-type="table" rid="T3">
Table 3</xref>. Similarly, not all variables are available at monthly resolution, as Land Cover and Elevation are annual and one-time, respectively. Moreover, all of these variables cover different spatial extents and have dissimilar spatial orientations. To obtain a dataset with all the variables at a fixed spatial extent and resolution, we constructed a spatial grid of 500m resolution covering the Amazon region and obtained the cell values for this raster following the steps described below. Similarly, we executed the process to achieve a monthly temporal resolution for all variables, with the data period from 2001 to 2020.</p>
                <p>To achieve temporal harmonization across these differing frequencies, we adopted a 
                    <italic toggle="yes">temporal expansion</italic> framework where annual and static values are mapped consistently across the corresponding twelve monthly increments of each year. This ensures that every monthly snapshot in the 240-month time series contains a complete suite of environmental covariates, enabling dynamic analyses such as fire risk modeling or temporal trend assessment. This approach maintains temporal sensitivity by allowing dynamic climate variables to fluctuate monthly while the slower-evolving landscape attributes such as Land Cover provide a stable structural context for each year.</p>
                <p>Although the collected data packages for different attributes have heterogeneous specifications, their processing generally follows a common workflow. A methodological baseline of the processing steps is shown in 
                    <xref ref-type="fig" rid="f3">
Figure 3</xref>. Specifically, 
                    <italic toggle="yes">Accessed Data</italic> refers to the downloaded data package from data sources in various formats, such as HDF, HDF5, NETCDF, GEOTIFF, and TIF. Accessed data in source data formats, such as HDF, HDF5, and NETCDF, contained several layers with different attributes, and the layer related to the subject attribute was extracted from this set of layers. Accessed data with the source data formats of GEOTIFF or TIF contained only the required layer that was extracted. These extracted layers are referred to as the 
                    <italic toggle="yes">raw data.</italic>
                </p>
                <fig fig-type="figure" id="f3" orientation="portrait" position="float">
                    <label>
Figure 3. </label>
                    <caption>
                        <title>Overview of methodology for data processing.</title>
                    </caption>
                    <graphic id="gr3" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/195424/8e7ed734-d448-496a-b237-210e5b5e148c_figure3.gif"/>
                </fig>
                <p>To resolve the inherent inconsistencies in spatial orientation and resolution, we developed a standardized processing framework. First, all 
                    <italic toggle="yes">raw data</italic> layers are projected onto EPSG:102033-South America Albers Equal Area Conic, to acquire the 
                    <italic toggle="yes">Projected Layer</italic>. This equal-area coordinate reference system was specifically chosen to maintain geometric fidelity across the vast longitudinal expanse of the Amazon basin,
                    <sup>
                        <xref ref-type="bibr" rid="ref15">15</xref>
                    </sup> minimizing the distortion of area and shape that typically occurs in global projections. To correct for grid misalignments and boundary distortions caused by the differing source orientations of the datasets collected, we introduced a standardized spatial grid as a master template. By mapping all attributes onto this fixed spatial grid, we ensured that every pixel across all variables represents the exact same geographical footprint, thereby eliminating spatial offsets and ensuring seamless interoperability between datasets. </p>
                <p>The projected layers are at either a global or regional-level (based on the specifications of the data source), and to confine them all to the Amazon Basin boundary, these layers were further clipped using a shapefile-based (vector) Amazon Basin boundary. This clipped layer is labelled as 
                    <italic toggle="yes">pre-processed data</italic>.</p>
                <p>Although all layers are cropped to the Amazon basin boundary, their respective cells may not exactly align with each other owing to differences in their source data extent, cell-grid orientation, and spatial resolution. To obtain layers of the same spatial extent, resolution, and orientation, we executed a rigorous two-step disaggregation and resampling procedure to transfer cell values from the 
                    <italic toggle="yes">pre-processed data</italic> to the fixed spatial grid (master template). The spatial grid covered the entire Amazon Basin boundary and had a cell resolution of 500 m. The value of each cell was transferred to this grid for each attribute, and the process was repeated for all attributes, thereby creating a separate spatial grid for each attribute.</p>
                <p>First, layers with resolution coarser than 500 m (ranging up to 37 km) were disaggregated to approximately 500 m. The disaggregation factor varied for each attribute, based on the spatial resolution of the source data. Following this, the terra:resample function in R was used to transfer information from the attribute layers to the fixed spatial grid. To minimize spatial inaccuracy in this environmentally heterogeneous region, the resampling method was tailored to the variable type: the &#x2018;near&#x2019; (nearest neighbor) method was employed for Land Cover, Burnt Area, Soil Moisture, Specific Humidity, Evapotranspiration, Near Surface Wind Speed, and Near Surface Air Temperature to preserve original discrete values and categorical integrity. Conversely, &#x2018;bilinear&#x2019; interpolation was used for Land Surface Temperature and Precipitation to accurately represent the continuous spatial gradients of these atmospheric phenomena.</p>
                <p>The resulting spatial grids corresponding to each attribute constitute the final layers available for analysis, hence called 
                    <italic toggle="yes">working data</italic>. This workflow was followed for each monthly file (i.e., 240 files over 20 years) to achieve a consistent monthly temporal resolution from 2001 to 2020. By maintaining this monthly sensitivity for dynamic climate variables while holding slower-evolving landscape variables such as Land Cover constant within each annual cycle, the dataset remains sensitive to the immediate environmental drivers for fire while providing a stable structural context. While an analytical sensitivity assessment regarding the impact of integrated variable frequencies is a valuable research direction, such an analysis is beyond the scope of this data curation work.</p>
                <p>
                    <xref ref-type="fig" rid="f4">
Figure 4</xref> illustrates an example of Land Surface Temperature in January 2020 for all three categories of 
                    <italic toggle="yes">raw data</italic>, 
                    <italic toggle="yes">pre-processed data</italic> and 
                    <italic toggle="yes">working data.</italic> Similarly, 
                    <xref ref-type="fig" rid="f5">
Figure 5</xref> presents an example of a single monthly instance from January 2020 for all the variables collected.</p>
                <fig fig-type="figure" id="f4" orientation="portrait" position="float">
                    <label>
Figure 4. </label>
                    <caption>
                        <title>Land surface temperature for January 2020.</title>
                        <p>

                            <italic toggle="yes">Top</italic>: Raw data (Global), 
                            <italic toggle="yes">Bottom</italic>: Pre-processed data (cropped) and working data (re-sampled spatial grid).</p>
                    </caption>
                    <graphic id="gr4" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/195424/8e7ed734-d448-496a-b237-210e5b5e148c_figure4.gif"/>
                </fig>
                <fig fig-type="figure" id="f5" orientation="portrait" position="float">
                    <label>
Figure 5. </label>
                    <caption>
                        <title>Plots of the variables related to forest fires for the region of Amazon Rainforest in January 2020.</title>
                    </caption>
                    <graphic id="gr5" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/195424/8e7ed734-d448-496a-b237-210e5b5e148c_figure5.gif"/>
                </fig>
                <p>In terms of implementation, pre-processing work was completed using GIS software, and the processing work was executed in the statistical computing software R.
                    <sup>
                        <xref ref-type="bibr" rid="ref16">16</xref>,
                        <xref ref-type="bibr" rid="ref17">17</xref>
                    </sup> All data layers were managed using the SpatRaster data structure in the terra package,
                    <sup>
                        <xref ref-type="bibr" rid="ref18">18</xref>
                    </sup> ensuring a transparent, scripted workflow. This algorithmic approach serves as a systematic process log, minimizing accumulated errors across the 240 temporal layers and ensuring the reproducibility of the dataset.</p>
            </sec>
            <sec id="sec5">
                <title>Technical validation</title>
                <p>The raster-based dataset of covariates presented in this study is a collection of established datasets that do not include any newly created data records. This work mainly focuses on exhaustive data search and its acquisition process, followed by computer-intensive pre-processing to develop a dataset for the Amazon region. As noted in the Data Collection section, these industry-standard source datasets have undergone rigorous independent validation, as documented in their respective technical documentation; therefore, a secondary validation against field observations is beyond the scope of this curation effort. To ensure process transparency, the dataset follows standardized naming conventions.</p>
                <p>While the primary contribution of this work lies in workflow standardization and the resolution of dataset fragmentation, this technical framework serves as the essential infrastructure required for future algorithmic advancements. By delivering a harmonized, analysis-ready foundation, this study enables high-level environmental research and predictive modeling,
                    <sup>
                        <xref ref-type="bibr" rid="ref11">11</xref>
                    </sup> that were previously hindered by spatial and temporal data incompatibility. This standardized curation ensures that the resulting 
                    <italic toggle="yes">working data</italic> is fit-for-purpose for complex spatiotemporal analyses and provides a reproducible baseline for the wider research community.</p>
            </sec>
        </sec>
        <sec id="sec6">
            <title>License</title>
            <p>The raster-based dataset of covariates presented in this study was published under a Creative Commons Attribution 4.0, International (CC BY 4.0) License (
                <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link>), which permits use, sharing, adaptation, distribution, and reproduction in any medium or format, as long as appropriate credit is given to the authors and the source, a link to the license is provided, and it is indicated if changes were made.</p>
        </sec>
    </body>
    <back>
        <sec id="sec9" sec-type="data-availability">
            <title>Data availability</title>
            <p>The dataset with all the collected variables related to forest fires is available at the Zenodo repository titled &#x2018;Raster-based dataset for spatio-temporal analysis of forest fires in the Amazon rainforest from 2001 to 2020&#x2019; (
                <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.5281/">https://doi.org/10.5281/</ext-link>

                <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.5281/zenodo.7215402">zenodo.7215402</ext-link>

                <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.5281/zenodo.7215402">)</ext-link>.
                <sup>
                    <xref ref-type="bibr" rid="ref35">35</xref>
                </sup> The dataset comprises three folders for each of the ten variables, referring to the data categories of 
                <italic toggle="yes">Raw Data</italic>, 
                <italic toggle="yes">Pre-Processed Data</italic> and 
                <italic toggle="yes">Working Data</italic> with names 
                <italic toggle="yes">01. Raw Data</italic>, 
                <italic toggle="yes">02. Pre-Processed Data</italic> and 
                <italic toggle="yes">03. Working Data</italic>, respectively. An additional 
                <italic toggle="yes">Read Me</italic> document includes details regarding the coordinate system, data extent, and data sources. All files were in GEOTIFF format, which can be accessed using the statistical software R
                <sup>
                    <xref ref-type="bibr" rid="ref16">16</xref>
                </sup> or any of the GIS software, such as Quantum GIS - QGIS (opensource) (
                <ext-link ext-link-type="uri" xlink:href="https://www.qgis.org/en/site/">https://www.qgis.org/en/site/</ext-link>

                <ext-link ext-link-type="uri" xlink:href="https://www.qgis.org/en/site/">)</ext-link>, GRASS GIS (opensource) 
                <ext-link ext-link-type="uri" xlink:href="https://grass.osgeo.org/">(</ext-link>

                <ext-link ext-link-type="uri" xlink:href="https://grass.osgeo.org/">https://grass.osgeo.org/</ext-link>

                <ext-link ext-link-type="uri" xlink:href="https://grass.osgeo.org/">),</ext-link> or ArcGIS (proprietary) (
                <ext-link ext-link-type="uri" xlink:href="https://www.arcgis.com/index.html">https://www.arcgis.com/index.html</ext-link>

                <ext-link ext-link-type="uri" xlink:href="https://www.arcgis.com/index.html">)</ext-link>.</p>
            <p>In the case of Land Cover, which is annual-based data, the filename includes the variable short name (Landcover), the respective data category (raw for raw data, preproc for pre-processed data, or working for working data), and the year (2001&#x2013;2020):

                <preformat orientation="portrait" position="float" preformat-type="computer code" xml:space="preserve">
[Landcover]_[data_category]_[year].tif</preformat>
            </p>
            <p>In the case of elevation, which is only one-time data, the filename includes the variable&#x2019;s short name and the respective data category:

                <preformat orientation="portrait" position="float" preformat-type="computer code" xml:space="preserve">
[Elevation]_[data_category].tif</preformat>
            </p>
            <p>For all other variables, the filename includes the variable short name, the respective data category, and the year and month:

                <preformat orientation="portrait" position="float" preformat-type="computer code" xml:space="preserve">
[variable_short_name]_[data_category]_[year]_[month].tif</preformat>
            </p>
            <p>To load and visualize the data in R, .tif files of any of the three categories can be loaded as a raster by using the raster
                <sup>
                    <xref ref-type="bibr" rid="ref19">19</xref>
                </sup> or terra packages. The plot function of terra can be used to visualize the raster as follows:

                <preformat orientation="portrait" position="float" preformat-type="computer code" xml:space="preserve">r &lt;- terra::rast(&#x2019;&lt;filepath/filename.tif&gt;&#x2019;) plot(r)</preformat>
            </p>
            <p>Similarly, to visualize the data in Quantum GIS (QGIS), .tif file can be loaded to select the raster option in the Data Source Manager:

                <preformat orientation="portrait" position="float" preformat-type="computer code" xml:space="preserve">[Data Source Manager &gt; Raster &gt; (filepath)]</preformat>
            </p>
        </sec>
        <ref-list>
            <title>References</title>
            <ref id="ref1">
                <label>1</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Pimont</surname>
                            <given-names>F</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Prediction of regional wildfire activity in the probabilistic bayesian framework of firelihood.</article-title>
                    <source>

                        <italic toggle="yes">Ecol. Appl.</italic>
</source>
                    <year>2021</year>;<volume>31</volume>:<fpage>e02316</fpage>.
                    <pub-id pub-id-type="pmid">33636026</pub-id>
                    <pub-id pub-id-type="doi">10.1002/eap.2316</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref2">
                <label>2</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Serra</surname>
                            <given-names>L</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Juan</surname>
                            <given-names>P</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Varga</surname>
                            <given-names>D</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Spatial pattern modelling of wildfires in catalonia, spain 2004&#x2013;2008.</article-title>
                    <source>

                        <italic toggle="yes">Environ. Model. Softw.</italic>
</source>
                    <year>2013</year>;<volume>40</volume>:<fpage>235</fpage>&#x2013;<lpage>244</lpage>.
                    <pub-id pub-id-type="doi">10.1016/j.envsoft.2012.09.014</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref3">
                <label>3</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Dos Reis</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Alencastro Gra&#x00e7;a</surname>
                            <given-names>PML</given-names>
                            <prefix>de</prefix>
                        </name>

                        <name name-style="western">
                            <surname>Yanai</surname>
                            <given-names>AM</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Forest fires and deforestation in the central amazon: Effects of landscape and climate on spatial and temporal dynamics.</article-title>
                    <source>

                        <italic toggle="yes">J. Environ. Manag.</italic>
</source>
                    <year>2021</year>;<volume>288</volume>:<fpage>112310</fpage>.
                    <pub-id pub-id-type="pmid">33761331</pub-id>
                    <pub-id pub-id-type="doi">10.1016/j.jenvman.2021.112310</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref4">
                <label>4</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Opitz</surname>
                            <given-names>T</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Bonneu</surname>
                            <given-names>F</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Gabriel</surname>
                            <given-names>E</given-names>
                        </name>
</person-group>:
                    <article-title>Point-process based bayesian modeling of space&#x2013;time structures of forest fire occurrences in mediterranean france.</article-title>
                    <source>

                        <italic toggle="yes">Spatial Stat.</italic>
</source>
                    <year>2020</year>;<volume>40</volume>:<fpage>100429</fpage>.
                    <pub-id pub-id-type="doi">10.1016/j.spasta.2020.100429</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref5">
                <label>5</label>
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Watson</surname>
                            <given-names>G</given-names>
                        </name>
</person-group>:
                    <source>

                        <italic toggle="yes">Amazon Rainforest.</italic>
</source>
                    <publisher-name>Weigl Publishers</publisher-name>;<year>2019</year>.</mixed-citation>
            </ref>
            <ref id="ref6">
                <label>6</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Bonilla-Aldana</surname>
                            <given-names>D</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Brazil burning! what is the potential impact of the amazon wildfires on vector-borne and zoonotic emerging diseases? &#x2013; a statement from an international experts meeting.</article-title>
                    <source>

                        <italic toggle="yes">Travel Med. Infect. Dis.</italic>
</source>
                    <year>2019</year>;<volume>31</volume>:<fpage>101474</fpage>.
                    <pub-id pub-id-type="pmid">31494225</pub-id>
                    <pub-id pub-id-type="doi">10.1016/j.tmaid.2019.101474</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref7">
                <label>7</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Juan</surname>
                            <given-names>P</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Mateu</surname>
                            <given-names>J</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Saez</surname>
                            <given-names>M</given-names>
                        </name>
</person-group>:
                    <article-title>Pinpointing spatio-temporal interactions in wildfire patterns.</article-title>
                    <source>

                        <italic toggle="yes">Stoch. Env. Res. Risk A.</italic>
</source>
                    <year>2012</year>;<volume>26</volume>:<fpage>1131</fpage>&#x2013;<lpage>1150</lpage>.
                    <pub-id pub-id-type="doi">10.1007/s00477-012-0568-y</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref8">
                <label>8</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Arag&#x00f3;</surname>
                            <given-names>P</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Juan</surname>
                            <given-names>P</given-names>
                        </name>

                        <name name-style="western">
                            <surname>D&#x00ed;az-Avalos</surname>
                            <given-names>C</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Spatial point process modeling applied to the assessment of risk factors associated with forest wildfires incidence in castell&#x00f3;n, spain.</article-title>
                    <source>

                        <italic toggle="yes">Eur. J. For. Res.</italic>
</source>
                    <year>2016</year>;<volume>135</volume>:<fpage>451</fpage>&#x2013;<lpage>464</lpage>.
                    <pub-id pub-id-type="doi">10.1007/s10342-016-0945-z</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref9">
                <label>9</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Papakosta</surname>
                            <given-names>P</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Straub</surname>
                            <given-names>D</given-names>
                        </name>
</person-group>:
                    <article-title>Probabilistic prediction of daily fire occurrence in the mediterranean with readily available spatio-temporal data.</article-title>
                    <source>

                        <italic toggle="yes">iForest-Biogeosciences For.</italic>
</source>
                    <year>2016</year>;<volume>10</volume>:<fpage>32</fpage>.</mixed-citation>
            </ref>
            <ref id="ref10">
                <label>10</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Cano-Crespo</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Traxl</surname>
                            <given-names>D</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Thonicke</surname>
                            <given-names>K</given-names>
                        </name>
</person-group>:
                    <article-title>Spatio-temporal patterns of extreme fires in amazonian forests.</article-title>
                    <source>

                        <italic toggle="yes">The Eur. Phys. J. Special Top.</italic>
</source>
                    <year>2021</year>;<volume>230</volume>:<fpage>3033</fpage>&#x2013;<lpage>3044</lpage>.
                    <pub-id pub-id-type="doi">10.1140/epjs/s11734-021-00164-3</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref11">
                <label>11</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Abid</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Gonzalez</surname>
                            <given-names>JA</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Rivera</surname>
                            <given-names>OR</given-names>
                            <prefix>de</prefix>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Mapping the spatio-temporal distribution of burned areas in the amazon from 2001 to 2020: An ensemble modeling approach.</article-title>
                    <source>

                        <italic toggle="yes">Environ. Ecol. Stat.</italic>
</source>
                    <year>2025</year>;<volume>32</volume>:<fpage>707</fpage>&#x2013;<lpage>734</lpage>.
                    <pub-id pub-id-type="doi">10.1007/s10651-025-00661-x</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref12">
                <label>12</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Moraga</surname>
                            <given-names>P</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Baker</surname>
                            <given-names>L</given-names>
                        </name>
</person-group>:
                    <article-title>rspatialdata: a collection of data sources and tutorials on downloading and visualising spatial data using r.</article-title>
                    <source>

                        <italic toggle="yes">F1000Res.</italic>
</source>
                    <year>2022</year>;<volume>11</volume>:<fpage>770</fpage>.
                    <pub-id pub-id-type="pmid">36016994</pub-id>
                    <pub-id pub-id-type="doi">10.12688/f1000research.122764.1</pub-id>
                    <pub-id pub-id-type="pmcid">PMC9363973</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref13">
                <label>13</label>
                <mixed-citation publication-type="book">
                    <collab>GDAL/OGR contributors</collab>:
                    <source>

                        <italic toggle="yes">GDAL/OGR Geospatial Data Abstraction software Library.</italic>
</source>
                    <publisher-name>Open Source Geospatial Foundation</publisher-name>;<year>2020</year>.</mixed-citation>
            </ref>
            <ref id="ref14">
                <label>14</label>
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Danielson</surname>
                            <given-names>JJ</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Gesch</surname>
                            <given-names>DB</given-names>
                        </name>
</person-group>:
                    <source>

                        <italic toggle="yes">Global multi-resolution terrain elevation data 2010 (GMTED2010).</italic>
</source>
                    <publisher-loc>DC, USA</publisher-loc>:
                    <publisher-name>US Department of the Interior, US Geological Survey Washington</publisher-name>;<year>2011</year>.</mixed-citation>
            </ref>
            <ref id="ref15">
                <label>15</label>
                <mixed-citation publication-type="other">
                    <article-title>ESRI. Albers - arcmap.</article-title>
                </mixed-citation>
            </ref>
            <ref id="ref16">
                <label>16</label>
                <mixed-citation publication-type="book">
                    <collab>R Core Team</collab>:
                    <source>

                        <italic toggle="yes">R: A Language and Environment for Statistical Computing.</italic>
</source>
                    <publisher-loc>Vienna, Austria</publisher-loc>:
                    <publisher-name>R Foundation for Statistical Computing</publisher-name>;<year>2020</year>.</mixed-citation>
            </ref>
            <ref id="ref17">
                <label>17</label>
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Moraga</surname>
                            <given-names>P</given-names>
                        </name>
</person-group>:
                    <source>

                        <italic toggle="yes">Spatial Statistics for Data Science: Theory and Practice with R. Data Science series.</italic>
</source>
                    <publisher-loc>Boca Raton, Florida</publisher-loc>:
                    <publisher-name>Chapman &amp; Hall/CRC</publisher-name>;<year>2023</year>.</mixed-citation>
            </ref>
            <ref id="ref18">
                <label>18</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Hijmans</surname>
                            <given-names>RJ</given-names>
                        </name>
</person-group>:
                    <article-title>terra: Spatial Data Analysis.</article-title>
                    <source>

                        <italic toggle="yes">R package version 1.5-21.</italic>
</source>
                    <year>2022</year>.</mixed-citation>
            </ref>
            <ref id="ref19">
                <label>19</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Hijmans</surname>
                            <given-names>RJ</given-names>
                        </name>
</person-group>:
                    <article-title>raster: Geographic Data Analysis and Modeling.</article-title>
                    <source>

                        <italic toggle="yes">R package version 3.5-21.</italic>
</source>
                    <year>2022</year>.</mixed-citation>
            </ref>
            <ref id="ref20">
                <label>20</label>
                <mixed-citation publication-type="other">
                    <collab>Amazon Basin Polygon</collab>:
                    <article-title>ESRI ArcGIS.</article-title>
                    <ext-link ext-link-type="uri" xlink:href="https://www.arcgis.com/home/item.html?id=f2c5f8762d1847fdbcc321716fb79e5a">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref21">
                <label>21</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Reis</surname>
                            <given-names>M</given-names>
                            <prefix>dos</prefix>
                        </name>

                        <name name-style="western">
                            <surname>Alencastro Gra&#x00e7;a</surname>
                            <given-names>PML</given-names>
                            <prefix>de</prefix>
                        </name>

                        <name name-style="western">
                            <surname>Yanai</surname>
                            <given-names>AM</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Forest fires and deforestation in the central amazon: Effects of landscape and climate on spatial and temporal dynamics.</article-title>
                    <source>

                        <italic toggle="yes">J. Environ. Manag.</italic>
</source>
                    <year>2021</year>;<volume>288</volume>:<fpage>112310</fpage>.
                    <pub-id pub-id-type="pmid">33761331</pub-id>
                    <pub-id pub-id-type="doi">10.1016/j.jenvman.2021.112310</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref22">
                <label>22</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Kim</surname>
                            <given-names>SJ</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Multi-temporal analysis of forest fire probability using socio-economic and environmental variables.</article-title>
                    <source>

                        <italic toggle="yes">Remote Sens.</italic>
</source>
                    <year>2019</year>;<volume>11</volume>:<fpage>86</fpage>.
                    <pub-id pub-id-type="doi">10.3390/rs11010086</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref23">
                <label>23</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Trilles</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Juan</surname>
                            <given-names>P</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Diaz</surname>
                            <given-names>L</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Integration of environmental models in spatial data infrastructures: A use case in wildfire risk prediction.</article-title>
                    <source>

                        <italic toggle="yes">IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens.</italic>
</source>
                    <year>2013</year>;<volume>6</volume>:<fpage>128</fpage>&#x2013;<lpage>138</lpage>.
                    <pub-id pub-id-type="doi">10.1109/JSTARS.2012.2236538</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref24">
                <label>24</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Tariq</surname>
                            <given-names>A</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Forest fire monitoring using spatial-statistical and geo-spatial analysis of factors determining forest fire in margalla hills, islamabad, pakistan.</article-title>
                    <source>

                        <italic toggle="yes">Geomat. Nat. Haz. Risk.</italic>
</source>
                    <year>2021</year>;<volume>12</volume>:<fpage>1212</fpage>&#x2013;<lpage>1233</lpage>.
                    <pub-id pub-id-type="doi">10.1080/19475705.2021.1920477</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref25">
                <label>25</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Miller</surname>
                            <given-names>JD</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Safford</surname>
                            <given-names>H</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Crimmins</surname>
                            <given-names>M</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Quantitative evidence for increasing forest fire severity in the sierra nevada and southern cascade mountains, california and nevada, usa.</article-title>
                    <source>

                        <italic toggle="yes">Ecosystems.</italic>
</source>
                    <year>2009</year>;<volume>12</volume>:<fpage>16</fpage>&#x2013;<lpage>32</lpage>.
                    <pub-id pub-id-type="doi">10.1007/s10021-008-9201-9</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref26">
                <label>26</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Serra</surname>
                            <given-names>L</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Spatio-temporal log-gaussian cox processes for modelling wildfire occurrence: the case of catalonia, 1994&#x2013;2008.</article-title>
                    <source>

                        <italic toggle="yes">Environ. Ecol. Stat.</italic>
</source>
                    <year>2014</year>;<volume>21</volume>:<fpage>531</fpage>&#x2013;<lpage>563</lpage>.
                    <pub-id pub-id-type="doi">10.1007/s10651-013-0267-y</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref27">
                <label>27</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>M&#x00f8;ller</surname>
                            <given-names>J</given-names>
                        </name>

                        <name name-style="western">
                            <surname>D&#x00ed;az-Avalos</surname>
                            <given-names>C</given-names>
                        </name>
</person-group>:
                    <article-title>Structured spatio-temporal shot-noise cox point process models, with a view to modelling forest fires.</article-title>
                    <source>

                        <italic toggle="yes">Scand. J. Stat.</italic>
</source>
                    <year>2010</year>;<volume>37</volume>:<fpage>2</fpage>&#x2013;<lpage>25</lpage>.
                    <pub-id pub-id-type="doi">10.1111/j.1467-9469.2009.00670.x</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref28">
                <label>28</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Giglio</surname>
                            <given-names>L</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Justice</surname>
                            <given-names>C</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Boschetti</surname>
                            <given-names>L</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>MODIS/Terra+aqua burned area monthly L3 global 500m SIN grid V061.</article-title>
                    <year>2021</year>.</mixed-citation>
            </ref>
            <ref id="ref29">
                <label>29</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Friedl</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Sulla-Menashe</surname>
                            <given-names>D</given-names>
                        </name>
</person-group>:
                    <article-title>MCD12C1 MODIS/Terra+Aqua land cover type yearly L3 global 0.05deg CMG V006.</article-title>
                    <year>2015</year>.</mixed-citation>
            </ref>
            <ref id="ref30">
                <label>30</label>
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Huffman</surname>
                            <given-names>GJ</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Stocker</surname>
                            <given-names>EF</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Bolvin</surname>
                            <given-names>DT</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <source>

                        <italic toggle="yes">GPM IMERG Final Precipitation L3 1 month 0.1 degree x 0.1 degree V06.</italic>
</source>
                    <publisher-loc>Greenbelt, MD</publisher-loc>:
                    <publisher-name>Goddard Earth Sciences Data</publisher-name>;<year>2019</year>.</mixed-citation>
            </ref>
            <ref id="ref31">
                <label>31</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Fan</surname>
                            <given-names>Y</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Van Den Dool</surname>
                            <given-names>H</given-names>
                        </name>
</person-group>:
                    <article-title>Climate prediction center global monthly soil moisture data set at 0.5 resolution for 1948 to present.</article-title>
                    <source>

                        <italic toggle="yes">J. Geophys. Res.-Atmos.</italic>
</source>
                    <year>2004</year>;<volume>109</volume>.
                    <pub-id pub-id-type="doi">10.1029/2003JD004345</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref32">
                <label>32</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Amatulli</surname>
                            <given-names>G</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>A suite of global, cross-scale topographic variables for environmental and biodiversity modeling.</article-title>
                    <source>

                        <italic toggle="yes">Sci Data.</italic>
</source>
                    <year>2018</year>;<volume>5</volume>:<fpage>1</fpage>&#x2013;<lpage>15</lpage>.
                    <pub-id pub-id-type="doi">10.1038/sdata.2018.40</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref33">
                <label>33</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Wan</surname>
                            <given-names>Z</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Hook</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Hulley</surname>
                            <given-names>G</given-names>
                        </name>
</person-group>:
                    <article-title>MOD11C3 MODIS/Terra land surface Temperature/Emissivity monthly L3 global 0.05deg CMG V006.</article-title>
                    <year>2015</year>.</mixed-citation>
            </ref>
            <ref id="ref34">
                <label>34</label>
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Mcnally</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Hsl</surname>
                            <given-names>N</given-names>
                        </name>
</person-group>:
                    <source>

                        <italic toggle="yes">FLDAS Noah Land Surface Model L4 Global Monthly 0.1 x 0.1 degree (MERRA-2 and CHIRPS).</italic>
</source>
                    <publisher-loc>Greenbelt, MD, USA</publisher-loc>:
                    <publisher-name>Goddard Earth Sciences Data</publisher-name>;<year>2018</year>.</mixed-citation>
            </ref>
            <ref id="ref35">
                <label>35</label>
                <mixed-citation publication-type="data">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Mahmood</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Moraga</surname>
                            <given-names>P</given-names>
                        </name>
</person-group>:
                    <data-title>Raster-based dataset for spatio-temporal analysis of forest fires in the Amazon rainforest from 2001 to 2020 (Version 1.0).</data-title>[Dataset].
                    <source>

                        <italic toggle="yes">Zenodo.</italic>
</source>
                    <year>2022</year>.
                    <pub-id pub-id-type="doi">10.5281/zenodo.7215402</pub-id>
                </mixed-citation>
            </ref>
        </ref-list>
    </back>
    <sub-article article-type="reviewer-report" id="report482336">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.195424.r482336</article-id>
            <title-group>
                <article-title>Reviewer response for version 2</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Babu</surname>
                        <given-names>Suresh Babu KV</given-names>
                    </name>
                    <xref ref-type="aff" rid="r482336a1">1</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0001-9867-6014</uri>
                </contrib>
                <aff id="r482336a1">
                    <label>1</label>University of Cyprus, Nicosia, Cyprus</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>2</day>
                <month>6</month>
                <year>2026</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2026 Babu SBK</copyright-statement>
                <copyright-year>2026</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport482336" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.164537.2"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>This work presents a large-scale, harmonized, analysis-ready dataset designed for forest fire research in the Amazon basin. It integrates 20 years of monthly environmental and fire-related variables at a spatial resolution of 500 meters across the entire Amazon region. The dataset fills a major gap in basin-wide wildfire research by giving researchers a standardized and reproducible way to do multivariate spatiotemporal analysis.</p>
            <p> By converting heterogeneous raw data into a consistent format, including projection, extent, and resolution, the study significantly facilitates researchers&#x2019; investigations into wildfire dynamics in the Amazon. Additionally, it offers a valuable methodological framework for creating similar datasets in other regions or for various environmental applications.</p>
            <p>Are sufficient details of methods and materials provided to allow replication by others?</p>
            <p>Yes</p>
            <p>Is the rationale for creating the dataset(s) clearly described?</p>
            <p>Yes</p>
            <p>Are the datasets clearly presented in a useable and accessible format?</p>
            <p>Yes</p>
            <p>Are the protocols appropriate and is the work technically sound?</p>
            <p>Yes</p>
            <p>Reviewer Expertise:</p>
            <p>Wildfire risk modeling, fire danger prediction, Burned area, Fire forecasting, Fire detection, Burned area mapping etc.</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.</p>
        </body>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report422841">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.181063.r422841</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Kganyago</surname>
                        <given-names>Mahlatse</given-names>
                    </name>
                    <xref ref-type="aff" rid="r422841a1">1</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0001-9553-0378</uri>
                </contrib>
                <aff id="r422841a1">
                    <label>1</label>University of Johannesburg, Johannesburg, South Africa</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>8</day>
                <month>11</month>
                <year>2025</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2025 Kganyago M</copyright-statement>
                <copyright-year>2025</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport422841" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.164537.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve-with-reservations</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>
                <bold>A raster-based dataset for spatio-temporal analysis of forest fires in the Amazon rainforest from 2001 to 2020</bold>
            </p>
            <p> The authors present a curated dataset of the amazon basin including various landscape and climate variables necessary to model long-term trends of forest fires. The paper is generally well-written and methods can be replicated in other regions.</p>
            <p> 
                <bold>Abstract</bold>
            </p>
            <p> &#x201c;and monthly resolution&#x201d; &#x2013; I suggest that authors add temporal after monthly.</p>
            <p> 
                <bold>Introduction </bold>
            </p>
            <p> &#x201c;The South American Amazon is one of the largest&#x2026;&#x201d; &#x2013; should start as new paragraph since it diverts from the idea communicated in this paragraph.</p>
            <p> &#x201c;similar studies do not exist for the Amazon region&#x201d; &#x2013; Please confirm through a comprehensive literature search. I doubt this is entirely accurate.</p>
            <p> &#x201c;this dataset encourages the creation of similar datasets&#x201d; &#x2013; please rephrase to improve clarity.</p>
            <p> 
                <bold>Methods</bold>
            </p>
            <p> &#x201c;[-79.43629, -18.00816: -44.49108, 8.66346]&#x201d; &#x2013; Should not be in brackets.</p>
            <p> &#x201c;MCD64A1&#x201d; , &#x201c;MCD12C1&#x201d;, &#x201c;MOD11C3&#x201d; &#x2013; please provide name of the product in addition to its ID or Acronymn.</p>
            <p> &#x00a0;&#x201c;&#x2026;acquired is (Integrated Multi-satellite Retrievals for GPM (Global Precipitation Measurement (GPM)&#x2026;&#x201d; &#x2013; Should not be in brackets.</p>
            <p> [89.75N&#x2013;89.75S, 0.25E&#x2013;359.75E] - Should not be in brackets.</p>
            <p> </p>
            <p> &#x201c;&#x2026;as &#x2013; &#x00a0;-9999.&#x201d; &#x2013; not clear, please consider removing the dash.</p>
            <p> </p>
            <p> &#x201c;Noah 3.6.1, model&#x2026;&#x201d; &#x2013; the comma here creates fragmentation.</p>
            <p> </p>
            <p> &#x201c;&#x2026;and are termed Working Data&#x201d; &#x2013; The term has been used above, but only defined here. Please consider defining a term when they are first mentioned.</p>
            <p> </p>
            <p> &#x201c;file (240 files over 20 years)&#x201d; &#x2013; please insert &#x201c;i.e.,&#x201d; in brackets.</p>
            <p> </p>
            <p> 
                <bold>Data availability</bold>
            </p>
            <p> There is space on the data link which breaks the hyperlink. Please correct.</p>
            <p> 
                <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.5281/">https://doi.org/10.5281/</ext-link> 
                <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.5281/zenodo.7215402">zenodo.7215402</ext-link>
            </p>
            <p>Are sufficient details of methods and materials provided to allow replication by others?</p>
            <p>Yes</p>
            <p>Is the rationale for creating the dataset(s) clearly described?</p>
            <p>Yes</p>
            <p>Are the datasets clearly presented in a useable and accessible format?</p>
            <p>Yes</p>
            <p>Are the protocols appropriate and is the work technically sound?</p>
            <p>Yes</p>
            <p>Reviewer Expertise:</p>
            <p>Remote sensing of vegetation</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.</p>
        </body>
        <sub-article article-type="response" id="comment15297-422841">
            <front-stub>
                <contrib-group>
                    <contrib contrib-type="author">
                        <name>
                            <surname>Moraga</surname>
                            <given-names>Paula</given-names>
                        </name>
                        <aff>King Abdullah University of Science and Technology Computer Electrical and Mathematical Science and Engineering Division, Thuwal, Makkah Province, Saudi Arabia</aff>
                    </contrib>
                </contrib-group>
                <author-notes>
                    <fn fn-type="conflict">
                        <p>
                            <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                    </fn>
                </author-notes>
                <pub-date pub-type="epub">
                    <day>18</day>
                    <month>1</month>
                    <year>2026</year>
                </pub-date>
            </front-stub>
            <body>
                <p>Thank you for your helpful and insightful comments. Please find our responses below.</p>
                <p> </p>
                <p> &#x201c;and monthly resolution&#x201d; &#x2013; I suggest that authors add temporal after monthly.</p>
                <p> 
                    <underline>Response:</underline> We have added the word &#x201c;temporal&#x201d; to clarify the nature of the resolution.</p>
                <p> </p>
                <p> &#x201c;The South American Amazon is one of the largest&#x2026;&#x201d; &#x2013; should start as new paragraph since it diverts from the idea communicated in this paragraph.</p>
                <p> 
                    <underline>Response:</underline> A new paragraph has been created at this location to ensure a more logical flow of ideas.</p>
                <p> </p>
                <p> &#x201c;similar studies do not exist for the Amazon region&#x201d; &#x2013; Please confirm through a comprehensive literature search. I doubt this is entirely accurate.</p>
                <p> 
                    <underline>Response:</underline> We have refined the text to clarify that while sub-regional studies exist, there is a scarcity of integrated, basin-wide, multivariate studies for the entire Amazon region.</p>
                <p> </p>
                <p> &#x201c;this dataset encourages the creation of similar datasets&#x201d; &#x2013; please rephrase to improve clarity.</p>
                <p> 
                    <underline>Response:</underline> We have rephrased this to emphasize that the dataset provides a framework/standard for the development of future longitudinal fire databases in other regions.</p>
                <p> </p>
                <p> &#x201c;[-79.43629, -18.00816: -44.49108, 8.66346]&#x201d; &#x2013; Should not be in brackets.</p>
                <p> 
                    <underline>Response:</underline> The brackets have been removed from the geographic coordinate strings.</p>
                <p> </p>
                <p> &#x201c;MCD64A1&#x201d; , &#x201c;MCD12C1&#x201d;, &#x201c;MOD11C3&#x201d; &#x2013; please provide name of the product in addition to its ID or Acronymn.</p>
                <p> 
                    <underline>Response:</underline> Full product names have been added alongside their respective IDs.</p>
                <p> </p>
                <p> &#x201c;&#x2026;acquired is (Integrated Multi-satellite Retrievals for GPM (Global Precipitation Measurement (GPM)&#x2026;&#x201d; &#x2013; Should not be in brackets.</p>
                <p> 
                    <underline>Response:</underline> The unnecessary nested parentheses have been removed for better readability.</p>
                <p> </p>
                <p> [89.75N&#x2013;89.75S, 0.25E&#x2013;359.75E] - Should not be in brackets.</p>
                <p> 
                    <underline>Response:</underline> Brackets have been removed from the spatial extent definition.</p>
                <p> </p>
                <p> &#x201c;&#x2026;as &#x2013;&#x00a0; -9999.&#x201d; &#x2013; not clear, please consider removing the dash.</p>
                <p> 
                    <underline>Response:</underline> The dash has been removed to clarify that -9999 is the specific value used for missing data.</p>
                <p> </p>
                <p> &#x201c;Noah 3.6.1, model&#x2026;&#x201d; &#x2013; the comma here creates fragmentation.</p>
                <p> 
                    <underline>Response:</underline> The comma has been removed to improve the sentence flow.</p>
                <p> </p>
                <p> &#x201c;&#x2026;and are termed Working Data&#x201d; &#x2013; The term has been used above, but only defined here. Please consider defining a term when they are first mentioned.</p>
                <p> 
                    <underline>Response:</underline> The first use of 
                    <italic>Working Data</italic> has been corrected to ensure conceptual clarity. This and similar terms are introduced in the Introduction section, and defined and explained in the Data Processing section.</p>
                <p> </p>
                <p> &#x201c;file (240 files over 20 years)&#x201d; &#x2013; please insert &#x201c;i.e.,&#x201d; in brackets.</p>
                <p> 
                    <underline>Response:</underline> The text now has an added i.e., as recommended.</p>
                <p> </p>
                <p> There is space on the data link which breaks the hyperlink. Please correct.</p>
                <p> 
                    <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.5281/">https://doi.org/10.5281/</ext-link> 
                    <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.5281/zenodo.7215402">zenodo.7215402</ext-link>
                </p>
                <p> 
                    <underline>Response:</underline> The space in the DOI URL has been removed.</p>
            </body>
        </sub-article>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report418949">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.181063.r418949</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>JAYA</surname>
                        <given-names>I NENGAH SURATI</given-names>
                    </name>
                    <xref ref-type="aff" rid="r418949a1">1</xref>
                    <role>Referee</role>
                </contrib>
                <aff id="r418949a1">
                    <label>1</label>IPB University, Bogor, Indonesia</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>27</day>
                <month>10</month>
                <year>2025</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2025 JAYA INS</copyright-statement>
                <copyright-year>2025</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport418949" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.164537.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve-with-reservations</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>Overall, this manuscript makes a meaningful contribution to advancing understanding of forest fire dynamics in the tropical Amazon through an integrative, data-driven approach. &#x00a0;This manuscript presents a comprehensive methodological framework for integrating heterogeneous spatial and temporal datasets to develop a high-resolution spatio-temporal database for forest fire analysis in the Amazon region. The approach is technically robust and highly relevant to large-scale environmental monitoring, forest management, and climate research.</p>
            <p> The novelty of the study lies not in producing new observations but in the harmonization of heterogeneous remote sensing and model-based datasets through rigorous preprocessing and geospatial standardization. This contributes a ready-to-use dataset that bridges gaps between data availability, accessibility, and analytical readiness&#x2014;serving as a foundation for advanced environmental and fire research.</p>
            <p> While the study demonstrates methodological rigor and conceptual clarity, several areas require further elaboration to enhance the manuscript&#x2019;s scientific depth, reproducibility, and practical impact. In particular, the data preprocessing procedures, validation strategy, and uncertainty assessment need to be described in greater detail to strengthen the transparency and reliability of the proposed workflow.</p>
            <p> Overall, this manuscript makes a meaningful contribution to advancing understanding of forest fire dynamics in the tropical Amazon through an integrative, data-driven approach. &#x00a0;This manuscript presents a comprehensive methodological framework for integrating heterogeneous spatial and temporal datasets to develop a high-resolution spatio-temporal database for forest fire analysis in the Amazon region. The approach is technically robust and highly relevant to large-scale environmental monitoring, forest management, and climate research.</p>
            <p> The novelty of the study lies not in producing new observations but in the harmonization of heterogeneous remote sensing and model-based datasets through rigorous preprocessing and geospatial standardization. This contributes a ready-to-use dataset that bridges gaps between data availability, accessibility, and analytical readiness&#x2014;serving as a foundation for advanced environmental and fire research.</p>
            <p> While the study demonstrates methodological rigor and conceptual clarity, several areas require further elaboration to enhance the manuscript&#x2019;s scientific depth, reproducibility, and practical impact. In particular, the data preprocessing procedures, validation strategy, and uncertainty assessment need to be described in greater detail to strengthen the transparency and reliability of the proposed workflow.</p>
            <p> </p>
            <p> 
                <bold>Major Comments</bold> 
                <list list-type="order">
                    <list-item>
                        <p>Data Integration and Resolution Consistency</p>
                        <p> The integration of datasets with diverse spatial (250 m&#x2013;37 km) and temporal (annual&#x2013;monthly) resolutions may lead to interoperability inconsistencies. The authors should explicitly describe the 
                            <italic>re-sampling</italic> and 
                            <italic>disaggregation</italic> procedures, including interpolation methods and error assessments, to minimize spatial inaccuracy, particularly in environmentally heterogeneous regions.</p>
                    </list-item>
                    <list-item>
                        <p>Uncertainty of Model-Derived Variables</p>
                        <p> The inclusion of model-based parameters (e.g., soil moisture, evapotranspiration) increases temporal completeness but introduces uncertainty. An 
                            <italic>uncertainty analysis</italic> and comparison with field or independent observational datasets are strongly recommended to enhance the reliability of the results.</p>
                    </list-item>
                    <list-item>
                        <p>Reprojection and Coordinate Transformation</p>
                        <p> The reprojection of datasets to a standard coordinate reference system (e.g., EPSG:102033) must be carefully documented. The authors should describe the methods used to maintain geometric fidelity and correct for potential grid misalignments or boundary distortions.</p>
                    </list-item>
                    <list-item>
                        <p>Temporal Harmonization and Analytical Sensitivity</p>
                        <p> Given the integration of variables with differing temporal frequencies (e.g., annual land cover versus monthly climate data), a clear 
                            <italic>temporal harmonization framework</italic> is needed. This should ensure that the resulting datasets retain temporal sensitivity for dynamic analyses such as fire risk modeling or temporal trend assessment.</p>
                    </list-item>
                    <list-item>
                        <p>Workflow Transparency and Quality Control</p>
                        <p> With more than 240 temporal layers per attribute, reproducibility is a primary concern. The authors are encouraged to implement and document standardized 
                            <italic>quality control procedures</italic>&#x2014;including metadata compliance (e.g., ISO 19115), workflow logs, and version tracking&#x2014;to ensure process transparency and minimize accumulated processing errors.</p>
                    </list-item>
                    <list-item>
                        <p>Validation and Methodological Innovation</p>
                        <p> Although the study effectively tackles dataset fragmentation and incompatibility using open-source solutions, its primary innovation lies in workflow standardization rather than algorithmic advancement. Incorporating validation experiments or developing a predictive model would strengthen the methodological contribution and scientific novelty of the work.</p>
                    </list-item>
                </list>
            </p>
            <p>Are sufficient details of methods and materials provided to allow replication by others?</p>
            <p>Partly</p>
            <p>Is the rationale for creating the dataset(s) clearly described?</p>
            <p>Yes</p>
            <p>Are the datasets clearly presented in a useable and accessible format?</p>
            <p>Yes</p>
            <p>Are the protocols appropriate and is the work technically sound?</p>
            <p>Yes</p>
            <p>Reviewer Expertise:</p>
            <p>Applied remote sensing and quantitative approach (machine learning/deep learning) in forestry and environment, e.g., forest fire, landslide, spatial modelling.</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.</p>
        </body>
        <sub-article article-type="response" id="comment15296-418949">
            <front-stub>
                <contrib-group>
                    <contrib contrib-type="author">
                        <name>
                            <surname>Moraga</surname>
                            <given-names>Paula</given-names>
                        </name>
                        <aff>King Abdullah University of Science and Technology Computer Electrical and Mathematical Science and Engineering Division, Thuwal, Makkah Province, Saudi Arabia</aff>
                    </contrib>
                </contrib-group>
                <author-notes>
                    <fn fn-type="conflict">
                        <p>
                            <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                    </fn>
                </author-notes>
                <pub-date pub-type="epub">
                    <day>18</day>
                    <month>1</month>
                    <year>2026</year>
                </pub-date>
            </front-stub>
            <body>
                <p>Thank you for your thorough review and insightful comments. We address each comment separately below.</p>
                <p> </p>
                <p> 1. Data Integration and Resolution Consistency</p>
                <p> The integration of datasets with diverse spatial (250 m&#x2013;37 km) and temporal (annual&#x2013;monthly) resolutions may lead to interoperability inconsistencies. The authors should explicitly describe the re-sampling and disaggregation procedures, including interpolation methods and error assessments, to minimize spatial inaccuracy, particularly in environmentally heterogeneous regions.</p>
                <p> </p>
                <p> 
                    <underline>Response:</underline> We have detailed our two-step alignment procedure in the Data Processing section (paragraphs 6 and 7). This includes an initial disaggregation of coarse datasets followed by a strategic resampling using the terra package in R. To ensure spatial accuracy across heterogeneous landscapes, we implemented "nearest neighbor" resampling for categorical/discrete variables to preserve data integrity, and "bilinear" interpolation for continuous atmospheric variables to accurately represent spatial gradients.</p>
                <p> </p>
                <p> 2. Uncertainty of Model-Derived Variables</p>
                <p> The inclusion of model-based parameters (e.g., soil moisture, evapotranspiration) increases temporal completeness but introduces uncertainty. An uncertainty analysis and comparison with field or independent observational datasets are strongly recommended to enhance the reliability of the results.</p>
                <p> </p>
                <p> 
                    <underline>Response:</underline> We agree that model-derived products involve inherent uncertainties. We have added a clarifying statement at the end of the Data Collection section explicitly defining the intended use of these parameters. While independent field validation is a valuable endeavor, it remains beyond the scope of this data curation manuscript. Our primary objective is to provide a standardized, analysis-ready infrastructure for the research community. The utility and reliability of this curated framework have already been demonstrated in a published application (Abid et al., 2025, Environ Ecol Stat, 
                    <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1007/s10651-025-00661-x">https://doi.org/10.1007/s10651-025-00661-x</ext-link>).</p>
                <p> </p>
                <p> 3. Reprojection and Coordinate Transformation</p>
                <p> The reprojection of datasets to a standard coordinate reference system (e.g., EPSG:102033) must be carefully documented. The authors should describe the methods used to maintain geometric fidelity and correct for potential grid misalignments or boundary distortions.</p>
                <p> </p>
                <p> 
                    <underline>Response:</underline> We have expanded the description of our coordinate transformation process in Data Processing (paragraph 4). The use of EPSG:102033 (South America Albers Equal Area Conic) is now explicitly justified as a measure to maintain geometric fidelity and minimize area/shape distortions across the basin&#x2019;s longitudinal extent. To resolve grid misalignments, we describe the implementation of a standardized fixed spatial grid as a master template, ensuring identical geographical footprints for every pixel across all variables.</p>
                <p> </p>
                <p> 4. Temporal Harmonization and Analytical Sensitivity</p>
                <p> Given the integration of variables with differing temporal frequencies (e.g., annual land cover versus monthly climate data), a clear temporal harmonization framework is needed. This should ensure that the resulting datasets retain temporal sensitivity for dynamic analyses such as fire risk modeling or temporal trend assessment.</p>
                <p> </p>
                <p> 
                    <underline>Response:</underline> We have updated the Data Processing (paragraph 2) to define our "temporal expansion" framework, where annual/static variables are mapped consistently across corresponding monthly increments to create a synchronized 240-month time series. While we recognize that assessing the impact of data frequency on model sensitivity is an important research direction, such analytical performance testing is outside the scope of this work, which focuses on the workflow standardization and the resolution of dataset fragmentation.</p>
                <p> </p>
                <p> 5. Workflow Transparency and Quality Control</p>
                <p> With more than 240 temporal layers per attribute, reproducibility is a primary concern. The authors are encouraged to implement and document standardized quality control procedures&#x2014;including metadata compliance (e.g., ISO 19115), workflow logs, and version tracking&#x2014;to ensure process transparency and minimize accumulated processing errors.</p>
                <p> </p>
                <p> 
                    <underline>Response:</underline> We have updated the Data Processing (paragraph 8 and 10) and Technical Validation sections to highlight our quality control procedures. By utilizing a scripted algorithmic pipeline (terra package), we ensure that all 240 monthly layers are processed with identical parameters, effectively serving as an automated workflow log to eliminate manual intervention errors. While formal ISO-compliant certification is beyond the scope of this curation effort, our systematic documentation provides a transparent, reproducible framework already validated by its use in published dynamic research (Abid et al., 2025).</p>
                <p> </p>
                <p> 6. Validation and Methodological Innovation</p>
                <p> Although the study effectively tackles dataset fragmentation and incompatibility using open-source solutions, its primary innovation lies in workflow standardization rather than algorithmic advancement. Incorporating validation experiments or developing a predictive model would strengthen the methodological contribution and scientific novelty of the work.</p>
                <p> </p>
                <p> 
                    <underline>Response:</underline> We have added a clarifying statement in the Technical Validation section regarding the scientific novelty of this work. We believe that resolving dataset fragmentation through a standardized, analysis-ready framework is a critical scientific contribution that removes the primary bottleneck for modeling in the Amazon. While developing a new predictive model is beyond the current scope, the utility of this dataset has been proven in recent research (Abid et al., 2025), where the curated variables demonstrated the sensitivity required for complex ensemble modeling of fire dynamics.</p>
            </body>
        </sub-article>
    </sub-article>
</article>
