Analysis of the stationarity and correlation of the global temperature and carbon dioxide time series

Upul Rupassara; Sarah Frantsvog; Ashley Holen; Karen Robinson

doi:10.12688/f1000research.139583.1

Home Browse Analysis of the stationarity and correlation of the global temperature...

ALL Metrics

-

Views

-

Downloads

Get PDF

Get XML

Export

▬

✚

Case Study

Analysis of the stationarity and correlation of the global temperature and carbon dioxide time series

[version 1; peer review: awaiting peer review]

Upul Rupassara ¹, Sarah Frantsvog¹, Ashley Holen¹, Karen Robinson¹

PUBLISHED 31 Aug 2023

Author details Author details

¹ Mathematics and Computer Science, Minot State University, Minot, North Dakota, 58707, USA

Upul Rupassara
Roles: Conceptualization, Data Curation, Formal Analysis, Investigation, Methodology, Project Administration, Resources, Software, Supervision, Validation, Visualization, Writing – Original Draft Preparation, Writing – Review & Editing

Sarah Frantsvog
Roles: Investigation, Resources, Writing – Original Draft Preparation, Writing – Review & Editing

Ashley Holen
Roles: Investigation, Resources, Writing – Original Draft Preparation, Writing – Review & Editing

Karen Robinson
Roles: Investigation, Resources, Writing – Original Draft Preparation, Writing – Review & Editing

OPEN PEER REVIEW

REVIEWER STATUS AWAITING PEER REVIEW

This article is included in the Climate gateway.

Abstract

Background: The rapid and ongoing phenomenon of global warming has negatively
impacted both the Earth’s environment and its inhabitants. Time series and regression analysis techniques play a significant role in weather forecasting and the interpretation of climate data. One of the key characteristics of time series analysis
is stationarity.
Methods: In this article, we explore how detrending and differencing techniques can be used to transform the time series of global temperature and carbon dioxide into stationary series. Regression models and goodness of fit tests were used to examine the relationship between carbon dioxide and data on global temperature. A cross-correlation time series model is also used to assess those time series’ lagging
and leading characteristics.
Results: The study of data on global temperature anomalies indicates that detrending and differencing are helpful in transforming temperature time series into stationary time series. However, the first differencing and detrending methods do
not make the carbon dioxide time series stationary; instead, an alternate transformation is needed. Neither the carbon dioxide time series nor the global temperature time series lag or lead with regard to the cross-correlation function.
Conclusions: In this article, we looked into stationarity and some other topics associated with correlation in terms of data on CO2 and global temperature. Stationarity is one of the important properties to check before conducting a more thorough investigation of the time series. To transform a non-stationary time series into a stationary one, there are numerous techniques available. However, in this article, we
just pay attention to detrending and differencing and how those methods perform with respect to time series data for global temperature and carbon dioxide.

Keywords

Time, Series, Temperature, Global, Stationary

Corresponding author: Upul Rupassara

Competing interests: No competing interests were disclosed.

Grant information: The author(s) declared that no grants were involved in supporting this work.

Copyright: © 2023 Rupassara U et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

How to cite: Rupassara U, Frantsvog S, Holen A and Robinson K. Analysis of the stationarity and correlation of the global temperature and carbon dioxide time series [version 1; peer review: awaiting peer review]. F1000Research 2023, 12:1074 (https://doi.org/10.12688/f1000research.139583.1) First published: 31 Aug 2023, 12:1074 (https://doi.org/10.12688/f1000research.139583.1) Latest published: 31 Aug 2023, 12:1074 (https://doi.org/10.12688/f1000research.139583.1)

Introduction

Since the years 1850–1900, the rapid and relentless phenomenon of global warming has severely and negatively altered Earth’s environment and its inhabitants. NASA describes global warming as “the long-term heating of Earth’s surface observed since the pre-industrial period (between 1850 and 1900) due to human activities, primarily fossil fuel burning, which increases heat-trapping greenhouse gas levels in Earth’s atmosphere” (NASA 2023). Carbon dioxide (CO₂) is naturally present in the atmosphere as part of the Earth’s carbon cycle (the natural circulation of carbon among the atmosphere, oceans, soil, plants, and animals) (EPA 2023). Unlike oxygen or nitrogen (which make up most of our atmosphere), greenhouse gases absorb heat radiating from the Earth’s surface and re-release it in all directions–including back toward the surface (Lindsey 2022). In order to preserve Earth’s future, action must be taken now. The first step is tracking how atmospheric carbon dioxide emissions affect global warming rates.

The correlation between global temperature and carbon dioxide levels has been investigated by many researchers in the literature (Florides and Christodoulides 2009; Macedo and Madaleno 2023; Hereher 2016; Palmer et al. 2007; Woodward and Gray 1993).

In the article, Global warming and carbon dioxide through science, (Florides and Christodoulides 2009) revealed that there is no way to definitively say whether or not CO₂ directly impacts global warming and temperature increases. This study used three independent sets of data (collected from ice cores and chemistry) to perform an analysis. Through a specific regression analysis of their data, they found that the data stating that there is a correlation between CO₂ concentration and temperature relies heavily on specific choices of data. From this conclusion, they found that for both chemistry data and ice core data, “one cannot be positive that indeed such a correlation exists”. Through further research into the topic, there is evidence that CO₂ is not inherently harmful to the environment, in addition to the fact that there is very little evidence linking CO₂ levels and temperature. In the end, they concluded, “It is our view that it is not yet sufficient, let alone rigorous, evidence that anthropogenic CO₂ increase is indeed the main factor contributing towards the global warming of the $20^{th}$ century”.

Evidence from a Maximum Entropy Approach (Macedo and Madaleno 2023) uses a statistical approach based on maximum entropy to conduct a study that supports the results from different techniques that found that CO₂ does, in fact, impact the increase in global temperature. This study, along with a recent review of emerging literature, led them to the conclusion that the impact of CO₂ on our atmosphere is trending toward a detrimental conclusion. They further state that if people continue emitting CO₂ into the atmosphere, it will have a detrimental impact on human, plant, and animal health in the long run.

A time series-related analysis was used in Hereher (2016) to study the global temperature trends of land surface temperatures and climate changes based on the temperature data in Egypt. This study reports serious climate change in Egypt by detecting variability of land surface temperature (LST) over the last decade at selected locations, with varied geomorphological characteristics and human stressors. The time series for the land surface temperatures were acquired from satellite images from 2003 to 2014, totaling 276 images. The analysis suggests that the LST in Egypt increases by 1.54 °C/decade. The variation of LST depends on latitude, geology, topography, and surface albedo. The dataset used in this analysis is named MODIS, from the NASA Land Processes Distributed Active Archive Center website. This analysis found that the time-series MODIS LST data proved sufficient for the short-term monitoring of land surface temperature variations in Egypt. It is noted that geology, topography, and surface albedo have significant impacts on the LST of Egypt. Further, it was revealed that between the years of 2003-2014, the LST of Egypt increased by 0.3–1.06 °C/decade, and for urban areas, the excess LST is higher at 1.54 °C/decade.

According to a new isothermal analysis by Palmer et al. (2007) in order to produce a more accurate depiction of the underlying warming. They proposed a new analysis of millions of ocean temperature profiles intended to filter out local dynamical changes. According to the authors’ comments, it was a difficult analysis since oceans do not warm uniformly across the globe. They present decadal-scale analyses of the ocean’s thermal state relative to a fixed isotherm. This new diagnostic is less prone to the influence of dynamical processes at both high and low frequencies, and the results present a more globally uniform picture of ocean warming. The limitation of the isothermal analysis is that high-latitude oceans were not included.

Using the Autoregressive Moving Average (ARMA) model, a study of time series data on global warming and trends is presented in Woodward and Gray (1993). They concluded that atmospheric greenhouse gases will affect the continued projection of the warming trend of global temperatures. Further, they stated that there is no conclusive evidence that this trend will continue. This is due to the difference between data on long-term trends and data on random trends with the length of the temperature series.

Data sources

Global temperature anomalies and carbon dioxide data were obtained from (NASA 2023) global climate change website. The temperature anomalies (in Celsius) and carbon dioxide (parts per million) were used from 1960 to 2022. The temperature anomalies are used instead of the absolute temperature data, as they accurately represent the temperature variability over larger areas. Further, they give a frame of reference that allows more meaningful comparisons between locations and more accurate calculations of temperature trends (NOAA 2023).

Statistical time series review

This section provides an overview of the definitions and test formulas that we use in this article to examine the time series models. Time series analysis can be used to examine the outcomes of either a planned or unplanned intervention as well as to better understand the underlying naturalistic process and the pattern of change over time (Velicer and Molenaar 2012). In time series analysis, regularity is essential, and if time series data exhibits irregular behavior over time, establishing meaningful conclusions about the underlying causes will be challenging. This regular behavior is explained by the concept of stationarity. In general, to satisfy the conditions of stationarity, the time series must satisfy some strong conditions, and those conditions are too strong for most of the applications. Therefore, most time series seen in real life are analyzed using a weaker version of stationarity.

The weakly stationary time series should satisfy the following conditions (Shumway and Stoffer 2017).

(1) The expected value (mean) of the time series is constant and does not depend on time.
(2) The autocovariance function does not depend on the actual time value but only depends on the time value through the time difference.

If two time series are taken into account simultaneously, such as $x_{t}$ and $y_{t}$ , they are said to be jointly stationary if the individual time series are stationary and the cross-covariance function solely depends on the lag value.

To evaluate the model’s goodness of fit, we utilize Akaike’s Information Criterion (AIC), the bias-corrected Akaike’s Information Criterion (AICc), and Bayesian Information Criterion (BIC). Those criteria measure the goodness of fit of the models by balancing the number of parameters of the model and the error of the fit. If $k$ is the number of parameters of the model, $n$ is the number of observations, and SSE is the sum squared error of the fit, then AIC is given by

(1)

AIC = ln (SSE / n) + \frac{n + 2 k}{n} .

The AICc is given by

(2)

AICc = ln (SSE / n) + \frac{n + k}{n - k - 2} .

With the Bayesian corrected term, the BIC takes the form

(3)

BIC = ln (SSE / n) + \frac{k ln (n)}{n} .

More details about those information criteria can be found in Akaike (1974, 1973, 1969) and Schwarz (1978).

Detrending and differencing are the two fundamental methods for transforming a non-stationary time series into a stationary one. The independence of the residuals can be visually demonstrated by using the residual plots. The autocorrelation plots and cross-correlation plots can be used the explain the nature of the correlation at different lag values. If $μ_{x, t}$ and $μ_{y, t}$ are the means of the time series $x_{t}$ and $y_{t}$ respectively, the autocovariance function of $x_{t}$ is defined by

(4)

cov (x_{t + h}, x_{t}) = E [(x_{t + h} - μ_{x, t + h}) (x_{t} - μ_{x, t})],

where

h

represents the time shift or lag value. The cross-covariance function for

x_{t}

and

y_{t}

is given by

(5)

cov (x_{t + h}, y_{t}) = E [(x_{t + h} - μ_{x, t}) (y_{t} - μ_{y, t})]

The autocorrelation and cross-correlation functions are normalized versions of the above formulas. For a given time series $x_{t}$ , and $y_{t}$ the autocorrelation (ACF) and cross-correlation (CCF) functions are given by

(6)

ACF (x_{s}, x_{t}) = \frac{cov (x_{s}, x_{t})}{\sqrt{cov (x_{s}, x_{s}) cov (x_{t}, x_{t})}},

and

(7)

CCF (x_{s}, y_{t}) = \frac{cov (x_{s}, y_{t})}{\sqrt{cov (x_{s}, x_{s}) cov (y_{t}, y_{t})}},

respectively.

Because they depend on the locations of time points $s$ and $t$ , the autocovariance and cross-covariance functions can change during the course of the series. If the autocovariance function depends on the separation $h = | t - s |$ rather than the points where the time series are situated, we are able to analyze sample time series data when there is only one series available.

Analysis of the global temperature and carbon dioxide time series

We use the global temperature anomalies from 1960 to 2022, obtained from NASA (2023). As demonstrated in Figure 1 most of the planet is warming (yellow, orange, and red). Only a few locations, most of them in the southern hemisphere oceans, cooled over this time period. According to the fact highlighted in Lindsey and Dahlman (2023) earth’s temperature has risen by an average of 0.14° Fahrenheit per decade since 1880. The temperature anomaly variability (in Celsius), including pre-industrial time, is shown in Figure 2. The trend is shown by the red dashed line while smoothing splines with the smoothing parameter value spar is 0.5, is demonstrated by the blue spline.

Figure 1. Recent temperature trend (1993-2022).

Climate.gov Media: https://www.climate.gov/media/15022.

Figure 2. Temperature anomaly from 1960 to 2022.

The red dashed line shows the trend and the blue line shows the temperature variability with smoothing spline (spar = 0.5).

The trend and irregular behavior of the temperature series require a modification to make it stationary before additional analysis can be done to uncover its other properties. The stationary time series are easy to study using established principles created in the time series literature because of their predictable long-term behavior.

We investigate detrending and differencing techniques to transform the original time series into stationary time series.

Stationarity in the temperature time series

The trend stationary model is the form of non-stationary series that is easiest to work with. As seen in Figure 2, the temperature series exhibits stationary behavior around the trend line (trend stationary). The stationary component of the time series might thus be filtered by just removing the trend. This trend stationary model can be written as

(8)

T_{t} = μ_{t} + T_{t}^{s}

where

μ_{t}

is the trend and

T_{t}^{s}

is the stationary component. The equation for the trend line can be used to detrend the data, and it can be estimated by ordinary least squares regression.

(9)

{\hat{μ}}_{t} = - 33.82 + 0.02 t

Using global temperature anomaly data from 1960 to 2022, the estimated trend line equation can be obtained as follows:

Figure 2 shows the superimposed estimated trend line (red). To get the detrended series, we simply subtract ${\hat{μ}}_{t}$ from $T_{t}$ ,

(10)

{\hat{T}}_{t}^{s} = T_{t} + 33.82 - 0.02 t

Because of the error of the estimated model in Eq. 10, ${\hat{T}}_{t}^{s}$ may not be independent and identically distributed. In fact, our main goal is investigating the behavior of this stationary component ${\hat{T}}_{t}^{s}$ . According to the graphical representation in Figure 3a, detrending eliminates the original temperature time series trend. One of the main requirements of a stationary time series is the maintenance of a constant mean across the specified time period, which the elimination of trends helps to achieve. Figure 4a and Figure 4b show the autocorrelation function for the original and detrended global temperature anomaly time series respectively. The majority of the autocorrelation values for the detrended series are inside the 95% confidence band, indicating a significant improvement in the independence of lag-related correlation.

Figure 3. Detrended and differenced global temperature series from 1960 to 2022.

Figure 4. ACF for a) Global temperature anomaly, b) Detrended, and c) Differenced time series.

Differencing is the other technique we are utilizing here and in time series literature this is known as integrating. Differencing simply means subtracting past values from the current value. Figure 3b shows the differenced global temperature anomaly time series. This series also shows a similar pattern to that observed in the detrended series.

However, instead of treating drift as a fixed model, it can be modeled using a stochastic component. We define the stochastic drift model

(11)

μ_{t} = α + μ_{t - 1} + w_{t}

where

w_{t}

is the independent identically distributed random variable (called white noise or Gaussian process), which follows the normal distribution with mean 0 and fixed variance (say)

σ^{2}

. If

w_{t}

is independent of

T_{t}^{s}

, differencing the model in Eq. 8 and substituting Eq. 11, we get

(12)

T_{t} - T_{t - 1} = α + w_{t} + T_{t}^{s} - T_{t - 1}^{s}

Since $T_{t}^{s}$ is the stationary component of the global temperature time series $T_{t}$ , we can show that the difference $T_{t}^{s} - T_{t - 1}^{s}$ is also stationary. For that define, $Z_{t} = T_{t}^{s} - T_{t - 1}^{s}$ . Then $E (Z_{t})$ does not depend on time $t$ since $T_{t}^{s}$ is stationary for $1960 \leq t < 2022$ .

Another key requirement of a stationary time series is the independence of the autocovariance from the time. It can depend on the time difference, but not on the actual time value. If $h$ represents the lag or shift value, the autocorrelation can be found as

(13)

cov (Z_{t + h}, Z_{t}) = 2 cov (T_{t + h}^{s}, T_{t}^{s}) - cov (T_{t + h + 1}^{s}, T_{t}^{s}) - cov (T_{t + h - 1}^{s}, T_{t}^{s})

Therefore, the autocorrelation of the differenced temperature time series does not depend on the actual time value, $t$ rather on the time lag or time difference. In addition to the computation results, this can be further verified by the ACF plots for differenced temperature data. Figure 4a displays the autocorrelation for the initial temperature anomaly. The ACF value decreases as the lag value increases, providing strong evidence for the temporal reliance of the covariance. On the other hand, Figure 4b and Figure 4c show, the autocorrelation for detrended and differenced data respectively. In comparison to the original time series ACF plot, both graphs demonstrate a significant reduction in the time-dependent correlation. The blue horizontal lines represent the 95% confidence level.

Stationarity of the carbon dioxide time series

The atmospheric carbon dioxide level, measured in parts per million (ppm) during the period from 1960 to 2022 is shown in Figure 5. We use detrending and referencing techniques to analyze the stationarity behavior of the CO₂ time series. Figure 6a shows the detrended carbon dioxide series. Unlike in the temperature series, the detrending does not remove the trend of the original CO₂ series. Also, according to Figure 6b the first differenced series either does not show stationary behavior in terms of the trend component or it still shows some sort of trend. But the second differenced series in Figure 6c shows a significant improvement toward the stationarity. Therefore, although the first differencing does not work here, the second differencing transforms the CO₂ series into a stationary time series. There are different transformations available in the time series literature to transform non-stationary time series into stationary form. In this article, we are not going to analyze other transformations, but one can try log or Box-Cox (Sakia 1992) family transformations.

Figure 5. Carbon dioxide level (ppm) from 1960 to 2022.

The red dashed line shows the trend (spar = 1) and the blue line shows the CO₂ variability with smoothing spline (spar = 0.5).

Figure 6. Detrended and Differenced CO₂ time series.

Figure 7a shows the ACF of the original carbon dioxide series. Detrending and differencing (first difference) are unable to eliminate the autocorrelation of the CO₂ time series as they did in the temperature series. Figure 7b-c illustrates this by demonstrating how the height of the vertical lines decreases as the lag increases. It is clear that autocorrelation breaks a stationarity rule by leaning on the actual time value.

Figure 7. ACF vs Lag: a) CO₂ (original) time series, b) CO₂ detrended time series, and c) CO₂ differenced (first differenced) time series.

Higher order differencing

When first-order differencing fails to convert the original data into a stationary form, higher-order differencing may be necessary for the majority of time series applications. If a quadratic trend is present, second-order differencing is the optimal order to apply. In general, if the trend is linear, first-order differencing might be sufficient. If a non-stationary time series needs to be differenced d times in order to become stationary, it is said to be of order d integrated. The generalized form of higher-order differencing can be described by the backshift operator, say $B$ . According to the exposition in Shumway and Stoffer (2017), this operator can be defined as

(14)

{Bx}_{t} = x_{t - 1}

where

x_{t}

is any time series. Extending to the power

k

, we can get the generalized form of the Backshift operator

(15)

B^{k} x_{t} = x_{t - k}

If the difference is defined by

(16)

{Dx}_{t} = x_{t} - x_{t - 1}

Then we can write,

(17)

{Dx}_{t} = (1 - B) x_{t},

and further extending this iteration into the

k^{th}

power

(18)

D^{k} = {(1 - B)}^{k} .

Various differencing orders are used to decrease the time series’ unwanted deviations. These techniques are frequently used in ARIMA time series models.

Although $d = 1$ works for global temperature anomalies, the CO₂ time series required higher-order differencing. The milder quadratic pattern suggests second-order differencing, and the corresponding figure after the second difference is shown in Figure 6c. As demonstrated in the autocorrelation plot for the second differenced time series in Figure 8, progress toward stationarity can be seen as the correlation values at different lags decline in contrast to the first differenced auto-correlation plot.

Figure 8. ACF plot: Carbon dioxide differenced series (d = 2).

Along with the graphical evidence of stationarity, the Augmented DickeyFuller (ADF) test (Dickey and Fuller 1981) can be utilized to determine whether or not the given time series is stationary. The ADF test results are shown in Table 1. The p-value for the detrended and differenced time series of the global temperature is less than 0.01. This allows us to reject the null hypothesis of the ADF test and accept the resulting time series as stationary within the corresponding significance level $α = 0.05$ . The hypothesis tests for detrended and first differenced CO₂ time series are not statistically significant at $α = 0.05$ , but the integrated order 2 series is stationary.

Table 1. Dickey - Fuller test’s p - values for detrended and differenced time series.

Time series/Transformation	Detrended	Differenced (d = 1)	Differenced (d = 2)
Global temperature anomaly	$< 0.01$	$< 0.01$	$< 0.01$
Carbon dioxide	0.99	0.07	$< 0.01$

Models for correlation analysis

We analyze the basic global temperature trend model and temperature and carbon dioxide models. We compare two models using AIC, AICc, BIC, R-squared value, and sum squared error (SSE). The main objective of these models is to further analyze the effect of carbon dioxide on global temperatures. The basic trend model for temperature (model 1):

(19)

T = β_{0} + β_{1} t + w_{t}

where

T

is the global temperature anomaly (°C) and

t

is the time in years (

1960 \leq t \leq 2022

),

w_{t}

is the white noise and is assumed to be normally distributed with mean zero and fixed variance (say)

σ^{2} .

The trend model (model 1) in Eq. 19 can be improved by introducing CO₂ as an explanatory variable.

Figure 9 shows the autocorrelation values for the carbon dioxide series at different lag values. Due to the considerable strength of the lag-related correlation in the second model, we adjust the carbon dioxide data for its mean, $\bar{C} = 358.9675$ (ppm). If $C_{t}$ is the carbon dioxide level at time $t$ , then we have model 2:

(20)

T = β_{0}^{'} + β_{1}^{'} t + β_{2}^{'} C^{'} + w_{t}

where

C^{'} = C_{t} - \bar{C}, 1960 \leq t \leq 2022

, is the adjusted carbon dioxide level (in ppm). If

k

is the number of parameters of the model and df is the degree of freedom, Table 2 shows the summary statistics for the two models.

Figure 9. Scatter-plot matrix relating current CO₂ values ( $C_{t}$ ), to past CO₂ values ( $C_{t - h}$ ) for $h = 1,2,3,4 .$ The values at the upper right corner are the sample autocorrelation at the corresponding lag values.

Table 2. Summary statistics for the global temperature models.

Model	$k$	SSE	$df$	$R^{2}$	AIC	AICc	BIC
Model 1	2	0.632	61	0.905	-3.539	-3.501	-4.471
Model 2	3	0.495	60	0.924	-3.750	-3.707	-4.648

The value $k$ yielding the minimum AIC, AICc, and BIC specifies the best model. We note that the second model is substantially better than the first model. Model 2, which includes CO₂ accounting for 92.4% of the variability ( $R^{2}$ value of model 2), which was 90.5% ( $R^{2}$ value of model 1) without that. Also, it gives the best value for AIC and BIC. In addition, we can notice that AIC and AICc are nearly equal. To calculate those values, we used the formulas given in the equations 1, 2, and 3. Without using the aforementioned formulas, one can utilize the regression model summaries to obtain these values, although there might be a few minor variations from the corresponding values shown in Table 2.

Further, the trend model (model 1) can be compared to the model with carbon dioxide (model 2) with the null hypothesis, $H_{0} : β_{3} = 0$ . The corresponding $F$ statistic:

(21)

F = \frac{({SSE}_{r} - SSE) / (q - r)}{SSE / (n - q - 1)}

where

{SSE}_{r}

is the sum squared error of the reduced model, and

q

and

r

are the numbers of predictor variables in the full and reduced models, respectively. When

q = 2

,

r = 1

, and

n = 63

(22)

F = \frac{(0.632 - 0.495) / 1}{0.495 / 60} = 16.606,

which exceeds

F_{1, 60} (0.001) = 11.973

. Hence, model 2 is a better prediction model compared to model 1.

Further for the purpose of predicting the global temperature the prediction model can be given by,

(23)

{\hat{T}}_{t} = 4.943 - 0.002 t + 0.012 (C_{t} - 358.968)

where

{\hat{T}}_{t}

is the estimated temperature (anomaly) at time

t

. A negative (but very small) weight is present as the time coefficient. But a relatively larger constant (

β_{0}

) value mitigates the effect of the negative weight. The positive weight of the carbon dioxide indicates a positive contribution to the global temperature when

C_{t} > 358.968

(ppm). Because of this, whether carbon dioxide has a positive or negative effect on the rise in global temperatures depends on its relative value to the average annual carbon dioxide level (in this study, the average was computed throughout the years from 1960 to 2022).

Model assumption

This section examines the validity of model 2 that we previously presented. Here, we go into further detail about the regression model’s primary assumptions, such as linearity, residual normalcy, homoscedasticity, and independence of the residual errors.

As shown in Figure 10 “Residual vs. Fitted” subplot, the horizontal line without any obvious patterns is a sign of a solid linear relationship. In Figure 10, the “Normal Q-Q plot” is used to determine whether the residuals are distributed normally. It’s ideal if the residual points fall along the dashed straight line, which is not perfectly satisfied in this case. A “Scale Location” subplot is utilized to verify the homoscedasticity (homogeneity of variance) of the residuals. Homoscedasticity can be detected by a horizontal line with evenly spaced points, which is satisfied except for a few outliers as shown in Figure 10, “Scale-Location” subplot. A few influential observations can be identified according to Figure 10, “Residual vs. Leverage” subplot. According to the model diagnostic analysis, since model 2 satisfies the fundamental assumptions, it can be considered a viable model to explain the relationship between temperature anomaly and CO₂ across time.

Figure 10. Residual vs Fitted, Normal Q-Q plot, Scale Location plot, and Residual vs Leverage plot.

Cross correlation: Global temperature anomaly vs carbon dioxide

Analyzing the potential leading and lagging relationships between the global temperature and carbon dioxide series is another intriguing investigation. Leading and lagging relationships might be helpful when one time series is used to predict another. Let $T_{t}$ and $C_{t}$ represent the global temperature and CO₂ time series respectively. Consider the model of the form

(24)

T_{t} = β_{t}^{l} C_{t - l} + w_{t}

where

β_{t}^{l}

is a real value that depends on

t

or

l

.

C_{t}

lead

T_{t}

if

l > 0

, and lag

T_{t}

if

l < 0

. If

w_{t}

is uncorrelated with

C_{t}

time series, the cross-correlation is given by

(25)

cov (T_{s}, C_{t}) = cov (β_{t}^{l} C_{s - l} + w_{s}, C_{t}) = cov (β_{t}^{l} C_{s - l}, C_{t}), t > 0, s > 0 .

According to the graphical demonstration of the cross-correlation function in Figure 11a for the original data, no time series leads or lags other series since peak shows occur at $l = 0$ . When viewed in relation to the zero-lag value, the cross-correlation is approximately symmetric. However, those original series are not stationary. We take integration order 2 into consideration when we analyze the behavior of the cross-correlation functions of the (weakly) stationary CO₂ and global temperature anomaly series. Although the first integrated temperature series is stationary, we use the differencing order 2 ( $d = 2$ ) to transform both series in order to tackle the dimension issues. According to Figure 11b most dominant cross-correlation for stationary series occurs at $l = 0$ and $l = 1$ . Those integrated series are jointly stationary if the cross-correlation or cross-covariance function is a function only on the lag value.

Figure 11. Cross correlation of global temperature anomaly and carbon dioxide time series from 1960 to 2022: a) Original, b) Stationary (d = 2).

Discussion

According to the summary statistics criterion in Table 2, the impact of CO₂ on the global temperature cannot be negligible. The models we’ve examined can be improved by adding additional greenhouse gases and other factors that could have an impact on the global temperature. Although carbon dioxide is not the only greenhouse gas that can affect the temperature, it occupies a significant amount of space compared to other greenhouse gases.

Data availability

The data used in this analysis is available on the NASA global climate change website [NASA(2023)].

Source data

This project contains the following underlying data:

• Carbon dioxide data; https://climate.nasa.gov/vital-signs/carbon-dioxide
• Temperature data; https://climate.nasa.gov/vital-signs/global-temperature

Software availability

We used R/R Studio as the statistical program for our computations and data visualizations.

• Archived source code available from: DOI: 10.5281/zenodo.8234016

Data are available under the terms of the Creative Commons Attribution 4.0 International license (CC BY 4.0).

References

Akaike H: A new look at the statistical model identification. IEEE Trans. Autom. Control. 1974; 19(6): 716–723. Publisher Full Text
Akaike HInformation Theory and an Extension of the Maximum Likelihood Principle.Petrov BN, Csaki F, editors. Proceedings of the 2nd International Symposium on Information Theory. Budapest: Akademiai Kiado; 1973; pp. 267–281.
Akaike H: Fitting autoregressive models for prediction. Ann. Inst. Stat. Math. 1969; 21: 243–247. Publisher Full Text
Dickey DA, Fuller WA: Likelihood ratio statistics for autoregressive time series with a unit root. Econometrica. 1981; 49: 1057–1072. Publisher Full Text
[EPA] Environmental Protection Agency: Overview of Greenhouse Gases.2023. Last updated on April 13, 2023. Reference Source
Florides GA, Christodoulides P: Global warming and carbon dioxide through sciences. Environ. Int. 2009; 35(2): pp. 390–401. 0160-4120. Publisher Full Text
Hereher ME: Time series trends of land surface temperatures in Egypt: a signal for global warming. Environ. Earth Sci. 2016; 75: 1218. Publisher Full Text
Lindsey R, Dahlman L: Climate Change: Global temperature. Science and information for a climate-smart nation.2023. January 18, 2023. Reference Source Reference Source
Lindsey R: Climate Change: Atmospheric Carbon Dioxide. Science and information for a climate-smart nation.2022. June 23, 2022. Reference Source Reference Source
Macedo P, Madaleno M: Global Temperature and Carbon Dioxide Nexus: Evidence from a Maximum Entropy Approach. Energies. 2023; 16(16): 277. Publisher Full Text
NASA: Global climate change, Vital signs of the planet.2023. last updated on April 20. Reference Source
[NOAA] National centers for environmental information: National oceanic and atmospheric administration.2023. Reference Source
Palmer MD, Haines K, Tett SFB, et al.: Isolating the signal of ocean global warming. Geophys. Res. Lett. 2007; 34: L23610. Publisher Full Text
Sakia RM: The Box-Cox Transformation Technique: A Review. Journal of the Royal Statistical Society. Series D (The Statistician). 1992; 41(2): 169–178. Publisher Full Text
Schwarz G: Estimating the Dimension of a Model. Ann. Stat. 1978; 6: 461–464. Publisher Full Text
Shumway RH, Stoffer DS: Time Series Analysis and Its Applications With R Examples. 4th ed.Springer; 2017. 978-3-319-52451-1. Publisher Full Text
Velicer WF, Molenaar PC: Time Series Analysis for Psychological Research. Handbook of Psychology. 2nd ed.Weiner I, Schinka JA, Velicer WF, editors. 2012. Publisher Full Text
Woodward WA, Gray HL: Global Warming and the Problem of Testing for Trend in Time Series Data. J. Clim. 1993; 6: 953–962. Publisher Full Text

Comments on this article Comments (0)

Version 1

VERSION 1 PUBLISHED 31 Aug 2023

Author details Author details

¹ Mathematics and Computer Science, Minot State University, Minot, North Dakota, 58707, USA

Upul Rupassara
Roles: Conceptualization, Data Curation, Formal Analysis, Investigation, Methodology, Project Administration, Resources, Software, Supervision, Validation, Visualization, Writing – Original Draft Preparation, Writing – Review & Editing

Sarah Frantsvog
Roles: Investigation, Resources, Writing – Original Draft Preparation, Writing – Review & Editing

Ashley Holen
Roles: Investigation, Resources, Writing – Original Draft Preparation, Writing – Review & Editing

Karen Robinson
Roles: Investigation, Resources, Writing – Original Draft Preparation, Writing – Review & Editing

Competing interests

No competing interests were disclosed.

Grant information

The author(s) declared that no grants were involved in supporting this work.

Article Versions (1)

version 1

Published: 31 Aug 2023, 12:1074

https://doi.org/10.12688/f1000research.139583.1

Copyright

© 2023 Rupassara U et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Download

Export To

metrics

	Views	Downloads
F1000Research	-	-
PubMed Central Data from PMC are received and updated monthly.	-	-

Citations

0

SEE MORE DETAILS

CITE

how to cite this article

Rupassara U, Frantsvog S, Holen A and Robinson K. Analysis of the stationarity and correlation of the global temperature and carbon dioxide time series [version 1; peer review: awaiting peer review]. F1000Research 2023, 12:1074 (https://doi.org/10.12688/f1000research.139583.1)

NOTE: If applicable, it is important to ensure the information in square brackets after the title is included in all citations of this article.

track

receive updates on this article

Track an article to receive email alerts on any updates to this article.

Open Peer Review

Comments on this article Comments (0)

Version 1

VERSION 1 PUBLISHED 31 Aug 2023

Open Peer Review

Reviewer Status

AWAITING PEER REVIEW

Comments on this article

All Comments(0)

Add a comment

Sign up for content alerts

Browse by related subjects

[1] Akaike H: A new look at the statistical model identification. IEEE Trans. Autom. Control. 1974; 19(6): 716–723. Publisher Full Text

[2] Akaike HInformation Theory and an Extension of the Maximum Likelihood Principle.Petrov BN, Csaki F, editors. Proceedings of the 2nd International Symposium on Information Theory. Budapest: Akademiai Kiado; 1973; pp. 267–281.

[3] Akaike H: Fitting autoregressive models for prediction. Ann. Inst. Stat. Math. 1969; 21: 243–247. Publisher Full Text

[4] Dickey DA, Fuller WA: Likelihood ratio statistics for autoregressive time series with a unit root. Econometrica. 1981; 49: 1057–1072. Publisher Full Text

[5] [EPA] Environmental Protection Agency: Overview of Greenhouse Gases.2023. Last updated on April 13, 2023. Reference Source

[6] Florides GA, Christodoulides P: Global warming and carbon dioxide through sciences. Environ. Int. 2009; 35(2): pp. 390–401. 0160-4120. Publisher Full Text

[7] Hereher ME: Time series trends of land surface temperatures in Egypt: a signal for global warming. Environ. Earth Sci. 2016; 75: 1218. Publisher Full Text

[8] Lindsey R, Dahlman L: Climate Change: Global temperature. Science and information for a climate-smart nation.2023. January 18, 2023. Reference Source Reference Source

[9] Lindsey R: Climate Change: Atmospheric Carbon Dioxide. Science and information for a climate-smart nation.2022. June 23, 2022. Reference Source Reference Source

[10] Macedo P, Madaleno M: Global Temperature and Carbon Dioxide Nexus: Evidence from a Maximum Entropy Approach. Energies. 2023; 16(16): 277. Publisher Full Text

[11] NASA: Global climate change, Vital signs of the planet.2023. last updated on April 20. Reference Source

[12] [NOAA] National centers for environmental information: National oceanic and atmospheric administration.2023. Reference Source

[13] Palmer MD, Haines K, Tett SFB, et al.: Isolating the signal of ocean global warming. Geophys. Res. Lett. 2007; 34: L23610. Publisher Full Text

[14] Sakia RM: The Box-Cox Transformation Technique: A Review. Journal of the Royal Statistical Society. Series D (The Statistician). 1992; 41(2): 169–178. Publisher Full Text

[15] Schwarz G: Estimating the Dimension of a Model. Ann. Stat. 1978; 6: 461–464. Publisher Full Text

[16] Shumway RH, Stoffer DS: Time Series Analysis and Its Applications With R Examples. 4th ed.Springer; 2017. 978-3-319-52451-1. Publisher Full Text

[17] Velicer WF, Molenaar PC: Time Series Analysis for Psychological Research. Handbook of Psychology. 2nd ed.Weiner I, Schinka JA, Velicer WF, editors. 2012. Publisher Full Text

[18] Woodward WA, Gray HL: Global Warming and the Problem of Testing for Trend in Time Series Data. J. Clim. 1993; 6: 953–962. Publisher Full Text

Analysis of the stationarity and correlation of the global temperature and carbon dioxide time series

Abstract

Keywords

Introduction

Data sources

Statistical time series review

(1)

(2)

(3)

(4)

(5)

(6)

(7)

Analysis of the global temperature and carbon dioxide time series

Figure 1. Recent temperature trend (1993-2022).

Figure 2. Temperature anomaly from 1960 to 2022.

Stationarity in the temperature time series

(8)

(9)

(10)

Figure 3. Detrended and differenced global temperature series from 1960 to 2022.

Figure 4. ACF for a) Global temperature anomaly, b) Detrended, and c) Differenced time series.

(11)

(12)

(13)

Stationarity of the carbon dioxide time series

Figure 5. Carbon dioxide level (ppm) from 1960 to 2022.

Figure 6. Detrended and Differenced CO2 time series.

Figure 7. ACF vs Lag: a) CO2 (original) time series, b) CO2 detrended time series, and c) CO2 differenced (first differenced) time series.

Higher order differencing

(14)

(15)

(16)

(17)

(18)

Figure 8. ACF plot: Carbon dioxide differenced series (d = 2).

Table 1. Dickey - Fuller test’s p - values for detrended and differenced time series.

Models for correlation analysis

(19)

(20)

Figure 9. Scatter-plot matrix relating current CO2 values (Ct), to past CO2 values (Ct−h) for h=1,2,3,4. The values at the upper right corner are the sample autocorrelation at the corresponding lag values.

Table 2. Summary statistics for the global temperature models.

(21)

(22)

(23)

Model assumption

Figure 10. Residual vs Fitted, Normal Q-Q plot, Scale Location plot, and Residual vs Leverage plot.

Cross correlation: Global temperature anomaly vs carbon dioxide

(24)

(25)

Figure 11. Cross correlation of global temperature anomaly and carbon dioxide time series from 1960 to 2022: a) Original, b) Stationary (d = 2).

Discussion

Data availability

Source data

Software availability

References

Comments on this article Comments (0)

Open Peer Review

Comments on this article Comments (0)

Open Peer Review

Reviewer Status

Comments on this article

Browse by related subjects

Competing Interests Policy

Stay Updated

Figure 6. Detrended and Differenced CO₂ time series.

Figure 7. ACF vs Lag: a) CO₂ (original) time series, b) CO₂ detrended time series, and c) CO₂ differenced (first differenced) time series.

Figure 9. Scatter-plot matrix relating current CO₂ values ( $C_{t}$ ), to past CO₂ values ( $C_{t - h}$ ) for $h = 1,2,3,4 .$ The values at the upper right corner are the sample autocorrelation at the corresponding lag values.