1 Introduction

Analysis of spatiotemporal observations is of major importance for scientists due to spatial and also temporal variability of the observable physical quantities. Meteorological observations due to their spatial dependence and temporal variation are characterized as spatiotemporal data. Analysis of data, such as air temperature, precipitation, chlorophyll, salinity etc. is crucial for understanding the underlying processes and improving the prediction of climate changes (Gaur and Simonovic 2015; Meiling et al. 2014; Paruch et al. 2014; Darand and Mansouri Daneshvar 2014; Bagiorgas et al. 2012; Craigmile and Guttorp 2011; Pasini et al. 2012; Sigrist et al. 2012; Drakopoulos et al. 2011; Tastu et al. 2011; Triacca 2005; Nittis et al. 2001a, b; Wilks 1995).

Over the last decades, due to the fact that the distribution and changes of the sea and oceanic variables is a major component in the hydrological cycle, the analysis by scientists using data from buoys focused on the temporal and spatial variability of general circulation structures, and on understanding the basic characteristic mechanism that occur in the examined basin. More specifically, research has been focused on the general circulation of the basin, the water mass formation processes, and the effects of these mechanisms on the biochemical cycles at seasonal time scales in order to design a reliable efficient system that could be useful for predicting atmospheric and oceanographic variables.

Several buoy systems have been developed such as the Poseidon system which is a monitoring, forecasting and information system for the Hellenic Seas and Coastal areas, operated by the Hellenic Center for Marine Research (HCMR) providing real time information and forecasts for a large number of observations (Soukissian et al. 1999). Several numerical investigations have been realized using data series provided by the Poseidon system (Kassis et al. 2013; Velaoras et al. 2013; Bagiorgas et al. 2012; Kopasakis et al. 2012; Korres et al. 2011; Papadopoulos and Katsafados 2009; Papadopoulos et al. 2002; Nittis et al. 2001a, b).

The data used in this work were provided from the Poseidon system mentioned above. The data were collected from sea watch buoys which are equipped with meteorological and oceanographic sensors. In this work, we utilized time series measured from Athos (Aegean Sea) and Pylos (Ionian Sea) stations which have differences in atmospheric and hydrodynamic characteristics. These locations are part of the Eastern Mediterranean Sea where water masses of different basin interact and transform at various temporal and spatial scales.

Since the knowledge of meteorological parameter interactions, can help to better understand the cause-effect mechanisms in climate science, the objective of the present work was to investigate such mechanisms with an alternative view focused on the relationships among physical variables. Additionally, this work examines relationships and patterns at the micro-scale level (locally at each station) as well as at macro-scale level (globally between the stations) using a range of methods from linear to nonlinear measures (Kugiumtzis 2013; Papana et al. 2013; Paluš 2014; Probst et al. 2012; Nelson-Wong et al. 2009). First, the cross correlation analysis among variables was performed to study the correlations between the pairs of variables as a function of the time lag. Since the existence of correlation between two variables does not necessarily demonstrate the direction of causality, we applied Causality analysis in order to test the significance of causal relationships between specific variables. In this way, we applied the Granger Causality method, which has already been successfully applied to a wide range of data from various fields of social science, economics, engineering and neurosciences, to identify the presence of linear causal interactions between variables, based on prediction theory (Papana et al. 2013; Attanasio et al. 2013; Pasini et al. 2012; Attanasio et al. 2012; Bilancia and Vitale 2012; Kodra et al. 2011; Seth 2010).

Also, our analysis includes, for each variable from Athos and Pylos station, the performance of non-linear measures such as mutual information combined with simple descriptive statistic measures. Furthermore, we estimated the Hurst Exponent and the Hjorth parameters which are additional dynamic detectors (Hjorth 1970). The results show that there are correlations and causalities between variables, both from the same buoy and also between different spatial buoys and the performance of these methods may uncover hidden qualitative differences between variables which cannot be determined from the simple statistical analysis.

The remainder of this paper is organized as follows. In Section 2, we portray the site and data description. In Section 3, we present the methodology employed. The results and discussion are presented in Section 4. Finally, the conclusions are presented in Section 5.

2 Site and Data Description

The data are collected from buoys that have been developed by the Hellenic Center for Marine Research (HCMR) in the framework of the Poseidon project (www.poseidon.hcmr.gr) (Soukissian et al. 1999). The Poseidon system is an operational monitoring, forecasting and information system for the Hellenic seas, providing in real time information and forecasts for a large number of observations. The system consists of a network of 10 oceanographic buoys installed in the Aegean and Ionian Sea. The buoys are equipped with meteorological, oceanographic sensors which collect meteorological data, such as atmospheric pressure, air temperature, wind speed and direction, physical data such as salinity, water temperature, and biochemical data such as chlorophyll and dissolved oxygen.

In this work, we employed data from monitoring buoys located at Athos (Aegean Sea) and Pylos (Ionian Sea), and cover a period from January 2011 to November 2012 which corresponds to a duration of 23 months. The choice of the given sites was based on the fact that these basins present different physical characteristics. The major feature of the Aegean Sea is the existence of many straits connecting the basin with the rest Eastern Mediterranean and Black Sea as well as the Cyclades islands that separate the Aegean in two basins. The intrusion through the Dardanelles Straits of water mass interacts with the main water. The Ionian Sea is characterized by complex hydrology where different water masses of Western Mediterranean and Adriatic Sea interact and also connect to the Aegean Sea through the strait lying between the Peloponnese peninsulas (Velaoras and Lascaratos 2010; Kassis et al. 2013).

In Table 1, we summarize the latitude, longitude, the data collection period and the collected observations for each station, and the physical locations are depicted in Fig. 1.

Table 1 Main Parameters and measurement variable of the stations
Fig. 1
figure 1

Locations of Poseidon network buoys. The red cycle indicates Athos station while the green cycle corresponds to Pylos station. Yellow cycles present the location of the remaining boys in the Aegean and Ionian Seas

Atmospheric Pressure (Atm. P.), Air Temperature (Air T) and Wind Speed (Win. S) were recorded at 3 m height above sea level while Water Temperature (Wat. T) at 3 m depth. The sampling interval was 3 h for all variables and every measurement is an average over 10 min, resulting in 8 values per day. Thus, the length of the time series is 5.600 values (in units of Δt = 3 h). We used the Piecewise Cubic Hermite Interpolating Polynomial as an interpolation method (Fritsch and Carlson 1980) to fill the missing values which occurred either as a result of transmission or power problems or due to maintenance of the sensors. Also, extreme values, spikes and obviously wrong values were removed.

3 Methodology

One of the major challenges of the researchers is to identify and analyze patterns in global climate in order to understand the interactions between physical variables. Climate scientists often want to understand mechanisms on a micro (local) level and a macro (global) level scale. The methodology to study the spatiotemporal variation of meteorological observations includes estimation of non-linear measures such as mutual information combined with conventional descriptive statistic measures, as well as linear and non-linear dynamic detectors such as Hurst Exponent and Hjorth parameters. The cross correlation analysis between all possible pairs of variables both at local scale (each station) and global scale (between the two stations) was performed. Furthermore, the cause-effect relationships between the meteorological variables was examined using the Granger causality methodology.

3.1 Descriptive Statistics

We performed conventional descriptive analysis and in order to summarize and describe our data set. More specifically, we estimated the mean, median, variance, standard deviation and kurtosis of each time series.

3.2 Mutual Information

For two random variables Mutual Information I(t) is a quantitative measurement of the amount of information that one variable contains about another (Fraser and Swinney 1986). In the time series analysis it is used to measure any form of linear and nonlinear relations and is the most popular measure used to select the appropriate delay time τ for state space reconstruction. Mutual information between elements of a time series x(t i ) is defined as:

$$ I\left(\tau \right)={\displaystyle \sum_{x\left({t}_i\right),x\left({t}_i+\tau \right)}P\left(x\left({t}_i\right),x\left({t}_i+\tau \right)\right)\ast \log \left[\frac{P\left(x\left({t}_i\right),x\left({t}_i+\tau \right)\right)}{P\left(x\left({t}_i\right)\ast Px\left({t}_i+\tau \right)\right)}\right]}, $$
(1)

where x(t i ) is the i-th data point of time series, P(x(t i )) is the probability density at x(t i ), P(x(t i )x(t i +τ)) is the joint probability density at x(t i ), x(t i +τ) and τ is the delay time.

3.3 Estimation of Hurst Exponent

In the time series analysis, Hurst exponent (H) is a measure of long range correlation which exhibits properties of self similarity. Estimation of the Hurst exponent was firstly reported in hydrology by Hurst (1951) while investigating the discharge time series of the Nile River and subsequently has been used in diverse areas of research such as climate science, geophysics and economics just to mention a few. The oldest method to calculate the Hurst exponent, called Rescaled Range or R/S analysis, was proposed by Mandelbrot and Wallis (1969) and estimates of H are based on the R/S statistics.

Suppose we have a time series of N measurements. We divide the time series into N s shorter subseries of length n = N, N/2, N/4. For each subseries we define the range R n

$$ {R}_n=\underset{1\le k\le s}{ \max}\left[{\displaystyle \sum_i^k\Big({x}_{ns+i}-\overline{x_n}}\right]-\underset{1\le k\le s}{ \min}\left[{\displaystyle \sum_i^k\Big({x}_{ns+i}-\overline{x_n}}\right], $$
(2)

wheren = 0,1,……N s −1 , N s =N/S and

$$ \overline{x_n}=\frac{1}{s}{\displaystyle \sum_{i=1}^s{x}_{ns+i}} $$
(3)

The sample standard deviation is defined as

$$ {S}_n=\sqrt{\frac{1}{s}{\displaystyle \sum_{i=1}^s{\left({x}_{ns+i}-\overline{x_n}\right)}^2}}. $$
(4)

Then, the rescaled range is defined as the ratio R n /S n . The Hurst exponent is estimated by calculating the average rescaled range for all subseries of length n. Ιt can be shown that the R/S statistics follows the relation

$$ \begin{array}{ccc}\hfill {\left(R/S\right)}_n=c\cdot {n}^H\hfill & \hfill as\hfill & \hfill n\to \infty .\hfill \end{array} $$
(5)

Hence, we can plot the diagram log(R/S) n vs log(n) and we calculate the Hurst exponent using linear least squares regression. With this definition, the Hurst exponent of 0.5 indicates a random series, when a value of H lying within the interval 0.5 < H < 1 indicates persistent behaviour (i.e., a large value is followed by a large value, a small value by a small one), while a Hurst exponent value between 0 and 0.5 exists for time series with anti-persistent behaviour (i.e., a large value is followed by a small value, a small value is followed by a large value).

3.4 Hjorth Parameters

The Hjorth descriptors, namely activity, mobility and complexity, were mainly developed for quantification of an Electroencephalogram (EEG) by Hjorth (1970). The Hjorth parameters for a given time series x(t) are called normalized slope descriptors because they can be defined as first and second derivatives and are respectively defined as follows:

$$ Activity={m}_o $$
(6)
$$ Mobility=\sqrt{\frac{m_2}{m_0}} $$
(7)
$$ Complexity=\sqrt{\frac{\raisebox{1ex}{${m}_4$}\!\left/ \!\raisebox{-1ex}{${m}_2$}\right.}{\raisebox{1ex}{${m}_2$}\!\left/ \!\raisebox{-1ex}{${m}_0$}\right.}}, $$
(8)

where m 0 is the variance (square of the standard deviation) of the variable, m 2 is the variance of the first derivative of the variable, and m 4 is the variance of the second derivative of the variable.

Activity is a measure of the squared standard deviation of the amplitude and represents the width of the signal. Mobility is defined as the squared root of the ratio between the variances of the first derivative and the amplitude, and is related to the mean frequency of the time series. Complexity is the ratio between the mobility of the first derivative and the mobility of the non-linear time series. Complexity indicates the deviation of the slope and can be seen as a measure of change in frequency of the signal.

3.5 Cross Correlation

Cross correlation analysis ( Nelson-Wong et al. 2009; Probst et al. 2012) is a statistical tool that measures the simultaneous linear coupling relationship between two time series x(t) and y(t) that are out of the phase with a lag time t. It calculates the correlation between time series, when shifted against one another as a function of number of observations. The shape of the cross correlation coefficient curve versus time represents the variation of the correlation among the two time series depending on the lag time. However, we must mention that the presence of correlation does not imply necessarily causality.

The cross correlation between X t and Y t+k is called the k th order cross correlation of X and Y. The sample estimate of this cross correlation, called r k , is calculated using the formula:

$$ {r}_k=\frac{{\displaystyle \sum_{i=1}^{n-k}\left({X}_i-\overline{X}\right)\left({Y}_{i+k}-\overline{Y}\right)}}{\sqrt{{\displaystyle \sum_{i=1}^n{\left({X}_i-\overline{X}\right)}^2{\displaystyle \sum_{i=1}^n{\left({Y}_i-\overline{Y}\right)}^2}}}} $$
(9)
$$ \overline{X}=\frac{1}{n}{\displaystyle \sum_{i=1}^n{X}_i} $$
(10)
$$ \overline{Y}=\frac{1}{n}{\displaystyle \sum_{i=1}^n{Y}_i} $$
(11)

The time index k, is allowed to be either positive or negative. The large sample standard error of the sample cross correlations is simply \( 1/\sqrt{N} \) and \( 2/\sqrt{N} \) so that large sample confidence limits are ± \( 2/\sqrt{N} \) (at the 5 % level). If the above is computed for all delays d = 0,1,2,…N-1 then it results in a cross correlation series of twice the length as the original series.

3.6 Granger Causality

Identifying causal connectivity between meteorological variables is an important issue. Granger Causality analysis is one of the most popular computational approaches, that has already been applied with great success in a range of fields such as social sciences, financial, computational, engineering, neuroscience and environmental sciences in order to assess causality among variables. First, Wiener (1956) introduced the idea that if the prediction of one time series can be improved by incorporating the knowledge of a second time series, then the latter can be said to have a causal influence on the first. Granger (1969) implemented this idea in the context of linear autoregressive models of stochastic processes.

Let us suppose that we have two stochastic processes X t and Y t with autoregressive representations in the form:

$$ {X}_t={\displaystyle \sum_{j=1}^{\infty }{a}_{1j}{X}_{t-j}+{\displaystyle \sum_{j=1}^{\infty }{b}_{1j}{Y}_{t-j}+{\varepsilon}_t}} $$
(12)
$$ {Y}_t={\displaystyle \sum_{j=1}^{\infty }{c}_{1j}{X}_{t-j}}+{\displaystyle \sum_{j=1}^{\infty }{d}_{1j}{Y}_{t-j}+{\eta}_t} $$
(13)

where a, b, c and d are coefficients describing the linear interactions between the variables, ε and η are residuals (prediction errors) for each time series and j indicating the time lags.

The covariance matrix of the noise terms is:

$$ \Sigma =Cov\left(\begin{array}{c}\hfill {\varepsilon}_t\hfill \\ {}\hfill {\eta}_t\hfill \end{array}\right) $$
(14)

If the variance of ε (or η) is reduced by the inclusion of the X t (or Y t ) terms in the first (or second) equation, then it is said that X t (or Y t ) Granger-(G)-causes X t (or Y t ). In other words, X t G-causes Y t if the coefficients in b 1j are jointly significantly different from zero. We used the Conditional Granger Causality methods described in the ‘Multivariate Granger Causality Toolbox’ (MVGC) by Barnett and Seth (2014).

4 Results and Discussion

The possibilities of extracting useful information about the meteorological characteristics of the Aegean and Ionian Sea through the analysis of observation from Seawatch buoys are examined. In Fig. 2a to d, we plot the time series of Atmospheric Pressure (Atm. P), Air Temperature (Air. T), Wind Speed (Win. S) and Water Temperature (Wat. T) of Athos and Pylos as recorded. The horizontal axis refers to time (t) and the vertical axis refers to each measurement variable. Seasonal variability is prominent in Fig. 2b and d.

Fig. 2
figure 2

Time series of Atmospheric Pressure (a), Air Temperature (b), Wind Speed (c) and Water Temperature (d) of the Athos Station (blue line) and Pylos Station (red line) (time is given in units Δt = 3 h)

As we have mentioned before for each time series, we first calculated the simple statistical measures, i.e., the mean, median, variance, standard deviation, and kurtosis. The objective of this part is to study the performance of simple statistical estimators and check whether it is possible to identify patterns among variables. The results of mean, median variance, standard deviation and kurtosis are summarized in Table 2. As we can clearly observe, the values of variance, standard deviation and kurtosis of Pylos station for all measured variables are lower than those of Athos station, indicating the differences in the hydrodynamic characteristics of the two locations. Specifically, from the results of mean and median of Atmospheric pressure variable, both from Athos and Pylos stations the values are nearly identical, indicating that the Atm. P. variable has no significant difference between the stations. Also, at a global level for Air Temperature and Water Temperature variables, comparing the results of mean and median, we can see that they are higher at Pylos station than at Athos station. This is evidence which indicates the different meteorological and hydrodynamics conditions of the two basins (Aegean and Ionian) (Velaoras and Lascaratos 2010; Kassis et al. 2013).

Table 2 Descriptive statistics measures

Then, we evaluated the mutual information, with the aid of Eq. (1), which detects linear and nonlinear dependencies between time series. In order to compare the estimated mutual information from different variables, in Fig. 3 we present the normalized values of I(τ).

Fig. 3
figure 3

Normalized mutual information of the time series from Athos station (red lines) and Pylos station (blue lines) (time lag is given in units Δt = 3 h)

A first remark from Fig. 3 is that the behavior of the mutual information function is nearly identical for each variable at both the Athos and Pylos station. Also, from Fig. 3 it can be seen that for any value of time delay the mutual information function of the Win. S. of Athos and also Pylos station clearly attains the lowest values. This behavior is expected since wind speed varies more frequently, as seen in Fig. 2a–d than all the other quantities and affects more directly pressure than temperature. Pressure presents a slower decay than wind speed, which means that the values of the time series of the Atm. P. are less independent in all time lags than the values of the time series of Win. S. This is constant with the physical interpretation as we can also see from the Fig. 2a and d the Atm. P values varies less frequently than the values of Win. S. The Air. T time series present higher correlations than Win. S and finally the higher values (for any value of time delay) are observed for the Wat. T time series. Mutual information values of Air T. time series and Wat. T time series seem to have also a kind of periodic behavior with a maximum around 1 day (8 values per day). We have to mention that Air temperature presents lower mutual information values than water temperature since air temperature changes faster than temperature water (we remind that water has a significantly higher specific heat than air).

Next, we estimated the linear and nonlinear dynamic detectors, i.e., the Hurst exponent and Hjorth parameters. The evaluation of Hurst exponent for each time series with R/S method is shown in Table 3. The Hurst exponents of the observations from both Athos and Pylos stations attain almost the same values and are sufficiently greater than 0.5. These values indicate persistent behaviour and a long memory in the time series. We can see that Air and water temperature present the highest persistence while Atmospheric Pressure presents lower persistence followed by wind speed. These results are in qualitative agreement with the results of mutual information.

Table 3 Hurst exponent with R/S method and Hjorth mobility and complexity

The results of Hjorth parameters mobility and complexity of the time series are also presented in Table 3. We can also see that for Win. S. time series Hjoth mobility and complexity present different behavior than the rest time series. Specifically, for Win. S time series, the Hjorth mobility is lower at Pylos than at Athos station and consequently the Hjorth complexity is higher at Pylos than at Athos stations.

4.1 Spatiotemporal Dependencies

In order to examine whether there exist correlations between observations, we employed the cross correlation function (ccf). Since there are many significant correlations at different time lags, we choose as the value of correlation, the first highest significant value either at positive or negative time lags. For instance, in Fig. 4a we present the result of the cross correlation function between Air Temperature and Water Temperature from Athos station. The blue lines indicate the confidence intervals (±\( 2/\sqrt{N} \) = 0.0268). In the upper row, we present the time series and in the bottom the results of the cross correlation function. We can clearly see, that the first higher correlation is positive with a value ccf = 0.92 at a positive time lag of 23. This means that the Air T time series leads Wat. T time series by almost 3 days (since it presents a time lag 23 in units of Δt = 3 h) and has a positive influence on Wat. T variable. Also, the result of the performance of the ccf function among Atmospheric Pressure from Athos station and Water Temperature from Pylos station are shown in Fig. 4b. In this case, we chose the first higher negative correlation with a value ccf = −0.44 at a time lag = 75. This means that the two variables present a negative correlation with positive time lag and Wat. T time series at Pylos station and leads Atm. P data at Athos station by almost 75 days (582/8), since the sampling interval was 3 h with an average over 10 min resulting in 8 values per day and both time series are negatively correlated.

Fig. 4
figure 4

The upper row shows the time series and the lower row shows the cross correlation as a function of time delay for a: a positive relationship and b for a negative relationship (time is in units of Δt = 3 h). The blue lines denote expected 95 % confidence intervals (±0.0268)

Consequently, in Fig. 5a we present the results of cross correlation of the time series at Athos station and Pylos station at micro-scale level and at macro-scale level in order to examine whether Athos variables have any effect on the observed spatiotemporal dependencies on the variables from Pylos station and vice – versa.

Fig. 5
figure 5

Cross correlation of measurement variables both at local scale and at global scale among variables from Athos and Pylos station (time is in units of Δt = 3 h)

In each cell, the upper value refers to the cross correlation value which if it is positive indicates a positive correlation between the variables, while if it is negative indicates a negative correlation. The lower value in the cell represents the correlation time (time lag) where the highest value of cross correlation occurs; if it is positive, indicates that the first variable (the variable that is on the row) leads the second variable (on the column). Figure 5b illustrates the representative constructed graph where the values of cross correlation are also indicated.

In general, from Fig. 5a it can be seen that both Athos and Pylos stations demonstrate almost the same behavior with respect to the values of cross correlation. We observe a negative correlation between Atmospheric Pressure and Air Temperature (ccf = −0.43), Atmospheric Pressure and Water Temperature (ccf = −0.44 Athos / ccf = −0.46 Pylos), Air Temperature and Wind Speed (ccf = −0.36 Athos / ccf = −0.33 Pylos), and Water Temperature with Wind Speed (ccf = −0.31). At a micro-level, both at Athos and Pylos stations, there exist positive correlations with the strongest correlation structures between Air temperature and Water Temperature (ccf = 0.92 Athos / ccf = 0.94 Pylos). The lowest positive cross correlation occurs between Atmospheric Pressure and Wind Speed around the value of 0.22 to 0.24 time series both at local level (Athos or Pylos station).

Moreover, at a global scale we can observe that there are strong positive correlations among variables from Athos and Pylos station. Specifically, larger positive values of cross correlation can be observed between Atmospheric Pressure of Athos and Atmospheric Pressure of Pylos station (ccf = 0.86), Air Temperature of Athos and Atmospheric Pressure of Pylos (ccf = 0.94), Air Temperature of Athos and Water Temperature of Pylos (ccf = 0.92), Water Temperature of Athos and Air Temperature of Pylos (ccf = 0.92), and finally, Water Temperature of Athos and Water Temperature of Pylos (ccf = 0.95). It is noted that also at global level the Air and Water Temperature are negatively correlated with Atmospheric Pressure and Wind Speed.

While the cross correlation values exhibit similar behavior, we do not observe the same behavior for the correlation time lags. In fact, we observe difference in correlation time lags, at the micro-scale (locally at each station) between Air T and Wat. T, where the time lag at Pylos station is three times the time lag of Athos. Another difference occurs between Wat. T and Win. S where at Athos station the Win. S leads Wat. T time series with a time lag of 38 days while at Pylos station the time lag is a half day.

Once the cross correlation among variables has been discussed, we can then proceed with the performance causality measures between the variable for identifying causal connectivity among them. Knowledge about causalities would be of great importance, since it provides information on the spatiotemporal dependencies. First, at a micro-scale level, the normalized results of the performance of Granger Causality measure are summarized in Fig. 6a–b for Athos and in Fig. 6c–d for Pylos station.

Fig. 6
figure 6

Granger causality among variables from Athos and Pylos station

The direction of causalities is from columns to rows. We observe first at local scale that the Granger causality values are different at Athos and Pylos station. The analysis denotes that Wind Speed of Athos causes Air Temperature at Athos (GC = 0.57), while there is no cause effect of Wind Speed of Pylos to Air Temperature of Pylos (GC = 0.00). Moreover, we observe that there is a weak causality between Wind Speed of Pylos to Water Temperature at Pylos (GC = 0.29), which does not occur at Athos station. Additionally, there is a cause from Water Temperature of Athos to Atmospheric Pressure of Athos (GC = 0.16) which is not observed in Pylos (GC = 0.00). However, the other values of causality among variables, are very similar. From Fig. 6e to d, where the Granger causalities are presented, it is of great interest to observe that the Granger causality value (GC) from Atmospheric Pressure of Pylos seems to affect the Atmospheric Pressure of Athos (GC = 1.00) more than the Atmospheric Pressure of Athos the Atmospheric Pressure of Pylos (GG = 0.82).

According to the theory of hypothesis testing to determine whether the result is statistically significant, significance values are calculated (p-values), which define the probability of impact since the null hypothesis is true. The null hypothesis is rejected if the p-value is less than the specified level of significance (a = 0.05). A statistically significant result is that in which the observed p-value is less than 5 %. We assume that there is no causality between the two time series (the null hypothesis Ho) by using the statistical test of Fisher distribution or as known the F-test on the alternative H1 (there is causality).

In the case of calculation of causality between the variables, the p-values by applying the criterion F-statistics (Barnett and Seth 2014) and the price level of significance using the criterion False Discovery Rate (FDR) (Benjamini and Hochberg 1995) shown in Fig. 7a–b for Athos station, Fig. 7c–d for Pylos station and Fig. 7e–h between observation from Athos to Pylos and vice-versa. The value 1 in the matrix of the Statistical significance level means that we can reject the null hypothesis (there is no causality among variables).

Fig. 7
figure 7

P values result, significance level of p-values (at 95 %) of Granger causality among variables, ab from Athos station, bc from Pylos station, e and g from Athos to Pylos station, and f and g from Pylos to Athos station

At Athos station, we can observe (Fig. 7a–b) that all the causalities between the variables are significant at a confidence level greater than 95 %, except the causality between Air T. and Win. S. and the causality between Wat. T. and Win. S. Moreover, in all the other cases at Pylos station we can reject the null hypothesis (no causality) apart from the causality between Wat. T. and Win. S. At a global scale (among variable from different stations) although the value of the causality is not high (Fig. 6e and f), summarizing the results of Fig. 7g–h we can generally observe that the variables from the area of Pylos influence the variables of Athos region more than they are influenced.

5 Conclusions

In this work data from Seawatch buoys at Aegean and Ionian Sea provided by Poseidon monitoring buoy for the period from January 2011 to November 2012 are analyzed with the purpose of understanding the underlying spatiotemporal characteristics of meteorological variables. As the understanding of the mechanisms could help improve the prediction of these systems, we focused on examining relationships and patterns at the micro-scale level (locally at each station) as well as at macro-scale level (globally between the stations). Apart from the performance of simple statistical measures and some linear and nonlinear indicators, we applied the cross correlation method in order to discover if changes at Athos station may outcome changes at Pylos station or vice versa. In general, the results show significant dependencies among variables indicating that the cross correlation analysis can be a powerful method, as there were significant positive and negative correlations between the data. Since the existence of correlation itself does not necessarily demonstrate the cause —effect relationship, we estimated the directionality of interactions among meteorological variables through the analysis of Granger causality based on prediction theory. We have shown that there is evidence of causal link among variables. Additionally, the results of the causality analysis have indicated that at a local scale there are some significant differences.

The proposed analysis could be extended to other spatiotemporal time series and extract more complex spatiotemporal dependencies between other physical variables too. Also, an important extension of this study is to incorporate the methodology of Complex Network analysis to the field of climate science (Ebbert-Uphoff 2012; Siggiridou et al. 2014; Hlinka et al. 2013). In the climate context, Complex Networks, constitute a powerful tool in order to identify hidden patterns of weather relationships and to analyze climate dynamics, through the analysis of networks topological measures, i.e., degree, clustering coefficient, average path length, diameter and modularity (Donges et al. 2009; Costa et al. 2011).