Introduction

The multidisciplinary nature of networks13 has introduced new directions in time-series research that led to the emergence of the complex network analysis of time-series. This newly established research field has shown remarkable development, at a multidisciplinary level4, since scholars conceptualized57 that transforming a time-series into a graph can produce insights that are not visible to current time-series approaches. In general, studying the topology of a graph instead of the structure of a time-series promotes time-series analysis because it enlarges the embedding of the available information, from a first-order tensor (i.e. the time-series vector) into a second-order tensor (i.e. the graph connectivity matrix)8. Within this context, Zhang and Small7 were the first to construct graphs from pseudo-periodic time-series, and Yang and Yang6 applied thresholds to the correlation matrix to convert it into a connectivity matrix. Xu et al.9 proposed a transformation for creating graphs from time-series based on different dynamic systems. Lacasa et al.5 built on the intuition of considering a time-series as a landscape and introduced a connectivity criterion based on visibility from optics. Gao and Jin10 proposed methods (i.e. flow pattern complex network, dynamic complex network, and fluid–structure complex network) to construct complex networks from experimental flow signals, and Donner et al.11 introduced a recurrence method constructing graphs from time-series based on the phase-space of a dynamical system. Amongst the existing methods, the natural visibility graph (NVG) or, synonymously, the visibility graph algorithm (VGA) of Lacasa et al.5 seems to prevail in the literature, whether in terms of citations, number of applications1214, or number of derivative methods, such as the horizontal visibility graph of Luque et al.15 and the visibility expansion algorithm of Tsiotas and Charakopoulos8,16.
The popularity of the VGA can be attributed either to its intuitive conceptualization from optics, which makes comprehension and interpretation of results easier, or to its topological consistency in converting periodic time-series to regular graphs, random time-series to random graphs, and fractal time-series to scale-free graphs. However, this method builds on a binary connectivity criterion, which leads to the development of binary connections and thus to unweighted graphs5. Therefore, the VGA is by definition restricted to generating visibility graphs that are dissociated from the numerical scale of the source (original) time-series.

Aiming to address the demand for a weighted conceptualization in the complex network analysis of time-series, this paper introduces a method for converting a time-series into a weighted graph by using an electrostatics transformation algorithm based on Coulomb’s law. The proposed method is driven by a dual motivation: the first builds on the example of the VGA5, which implies that physics-defined transformations can be more intuitive and easily comprehensible than algebraic (or computational) ones. The second is based on the universality of Coulomb’s inverse-square law17, which grounded the development of essential research in electromagnetism but also inspired multidisciplinary research in economics18, urban and spatial planning and transport engineering19,20, biology21, geophysics22, computational23 and communication sciences24, etc. Within this multidisciplinary context, the proposed method conceptualizes a time-series as a sequence of stationary and electrically charged particles (nodes) and generates an electrostatic graph based on pair-wise calculations of Coulomb’s law across the time-series nodes. The Coulomb-like forces are assigned as weights in the connectivity matrix of the electrostatic graph and can be seen as a measure of relevance between two nodes, in terms of sign, scale, and spatial proximity. This approach allows quantifying the interaction between the time-series nodes and thus conceptualizing the dynamics of a time-series through the effect of the electrostatic forces applied between the nodes.

The remainder of the paper is organized as follows: Sect. 2 (Methods) describes the proposed ESG algorithm and its modeling context, introduces the concept of node-series of network measures in the ESG, and briefly describes the methods used for testing the performance of the proposed algorithm. Section 3 (Results) shows the results of the multilevel analysis testing the performance of the proposed algorithm in comparison with a well-established method of converting a time-series to a graph. Finally, in Sect. 4, conclusions are given.

Methods

The proposed ESG algorithm

Let us consider a time-series X = {x1, x2,…, xn} with n\(\in \mathbb{N}\) number of nodes i\(\in\) X, where each one has a real numeric value X(i) = xi\(\in {\mathbb{R}}\). If we assume that every node i in the time-series can be seen as a static particle of electrical charge q(i)≡qi = xi, we can define an (either attractive or repulsive) electrostatic force Fij applied between any pair of nodes i,j (Fig. 1), according to the inverse-square Coulomb’s law expressed by the relation17:

$$F_{ij} = k_{e} \cdot \frac{{q_{i} \cdot q_{j} }}{{\left( {d_{ij} } \right)^{2} }} = k_{e} \cdot \frac{{x_{i} \cdot x_{j} }}{{\left( {d_{ij} } \right)^{2} }},$$
(1)

where qi and qj are the electrostatic charges of nodes i and j, dij is the intermediate discrete distance between nodes i,j, which expresses steps of separation and is defined by the difference (i − j), and ke is Coulomb’s constant.

This assumption allows considering a time-series X as a series of stationary and electrically charged particles (i.e. time-series nodes), on which we can compute a square matrix with the Coulomb-like forces F(X) = {Fij | i,j = 1, …, n}, according to the relation:

$$(1)\mathop {\mathop \Leftrightarrow \limits_{{k_{e} \equiv 1}} }\limits^{{q_{i} { = }X(i) = x_{i} }} F\left( X \right) = \{ F(X(i),X(j)) \equiv F_{{ij}} = \frac{{x_{i} \cdot x_{j} }}{{\left( {i - j} \right)^{2} }}|i,j = {\text{1}},{\text{ }} \ldots ,n\} ,$$
(2)

where dij = (i − j) and ke is Coulomb’s constant17, which can be considered as a scale factor and in this paper is set to ke = 1.
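As an illustration, the pairwise force matrix F(X) of relations (1)–(2) can be computed with a few lines of NumPy; this is a sketch rather than the authors’ code, and the function name `coulomb_matrix` and the zeroing of the (infinite) diagonal are our own conventions:

```python
import numpy as np

def coulomb_matrix(x, ke=1.0):
    """Pairwise Coulomb-like forces F_ij = ke * x_i * x_j / (i - j)^2, per Eqs. (1)-(2)."""
    x = np.asarray(x, dtype=float)
    n = len(x)
    idx = np.arange(n)
    # squared step-of-separation distances d_ij^2 = (i - j)^2
    d2 = (idx[:, None] - idx[None, :]).astype(float) ** 2
    with np.errstate(divide="ignore", invalid="ignore"):
        F = ke * np.outer(x, x) / d2
    # the diagonal (zero self-distance) is undefined; set it to zero
    np.fill_diagonal(F, 0.0)
    return F
```

For X = {1, 2, 3}, for instance, F12 = 1·2/1² = 2 and F13 = 1·3/2² = 0.75, and the matrix is symmetric by construction.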

The square structure of the F(X) matrix (with the Coulomb-like forces) can be seen as an electrostatic graph-model ESG, where each element Fij\(\in {\mathbb{R}}\) expresses the (attractive or repulsive) electrostatic force applied between any pair of nodes i,j. When it is important to note that the ESG is associated with the time-series X, we symbolize the electrostatic graph as ESG(X). In terms of graph theory25, F(X) is the weighted connectivity matrix of an undirected graph GESG(V,E), where V is the node-set and E is the edge-set. The weights (wij) in the ESG’s weighted connectivity matrix are equal to the Coulomb-like forces (wij = Fij) and can be seen as a measure of similarity between two nodes, in terms of sign, scale, and spatial proximity. In particular, positive weights (wij > 0) indicate that nodes i,j have homogeneous arithmetic signs, whereas negative weights (wij < 0) imply that they have heterogeneous signs. Also, high wij scores may imply either that nodes i,j are close in the time-series line, in terms of spatial proximity, or that they have relatively high arithmetic values, or both. Within the context of the electrostatic conceptualization, the attraction expressed by a negative Coulomb-like force (wij < 0) can be seen as a tendency of the nodes to balance their heterogeneity and converge toward the horizontal axis, whereas the repulsion expressed by a positive force (wij > 0) can be seen as a tendency of the nodes to escape from their homonymous electrostatic balance and thus to evolve (either increasingly or decreasingly) through time.

By definition, Coulomb’s law determines a field of infinite range, in which the electrostatic forces are still present at infinity, although negligible. This property makes the ESG by default a fully connected (complete) graph Kn, namely a graph where every node is linked to all others. Provided that a complete graph Kn has a trivial topology, in terms of complexity (since the average degree is always \(\left\langle k \right\rangle\) = n–1 and most other metrics, such as average path length, network diameter, graph density, and clustering coefficient, are equal to one), we filter the set E of the ESG connections, aiming to generate more complex topologies of electrostatic graphs. In particular, we consider a threshold Fc, defined within the interval \(F_{c} \in \left( {\min \left\{ {F_{ij} } \right\},\max \left\{ {F_{ij} } \right\}} \right)\), so that the weighted connectivity matrix WESG includes those values that are equal to or above Fc, as expressed by the relation:

$$W_{ESG} = \{ F_{ij} \ne 0 \in F(X):F_{ij} \ge F_{c} \} \subseteq F(X),$$
(3)

where F(X) is the Coulomb-like matrix defined in relation (2). This filtering allows considering numerous electrostatic graphs ESG(X), which are expressed as a function WESG = f(Fc) of the threshold-variable Fc. To introduce a reference value for the threshold-variable Fc, we define a typical value fz by the relation:

$$F_{c} (f_{z} ) = f_{z} = \frac{1}{{n - {1}}} \cdot \sum\limits_{n} {x_{n} } = \frac{n}{n - 1} \cdot \left\langle x \right\rangle = n \cdot {\text{sgn}} \left( {\left\langle x \right\rangle } \right) \cdot \frac{{\sqrt {\left| {\left\langle x \right\rangle } \right|} \cdot \sqrt {\left| {\left\langle x \right\rangle } \right|} }}{{\left( {\sqrt {n - {1}} } \right)^{2} }},$$
(4)

where n is the number of time-series nodes, \(\left\langle \cdot \right\rangle\) is the average operator, and sgn(·) is the sign (or signum) function26. In numeric terms, the fz filtering means that the non-zero elements of the weighted ESG connectivity matrix are those with values higher than the adjusted mean-value \(\frac{n}{n - 1} \cdot \left\langle x \right\rangle\) of the time-series X. In physical (electrostatic) terms, fz describes an electrostatic force that is n times greater than that applied to a pair of particles with electrical charges qi, qj = \(\sqrt {\left| {\left\langle x \right\rangle } \right|}\) (i.e. equal to the square root of the absolute mean-value of the time-series X), which are dij = \(\sqrt {n - {1}}\) steps of separation apart.
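A minimal sketch of the filtering in relations (3)–(4), assuming NumPy (the helper names `typical_threshold` and `filter_forces` are ours, not the authors’):

```python
import numpy as np

def typical_threshold(x):
    """Typical value f_z = (1/(n-1)) * sum(x) = n/(n-1) * mean(x), Eq. (4)."""
    x = np.asarray(x, dtype=float)
    return x.sum() / (len(x) - 1)

def filter_forces(F, Fc):
    """W_ESG of Eq. (3): keep forces >= Fc, zero out the rest."""
    return np.where(np.asarray(F, dtype=float) >= Fc, F, 0.0)
```

For X = {1, 2, 3, 4}, for example, fz = 10/3, so only Coulomb-like forces of at least that magnitude survive in WESG.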

Within this context, the proposed ESG algorithm is implemented in four steps, as shown in Fig. 2. First, we compute the matrix F(X) of Coulomb-like forces, according to relations (1) and (2). Second, we apply the connectivity filter to F(X) and compute the weighted connectivity matrix WESG, according to relations (3) and (4). Third, we manage the disconnected data of F(X) (i.e. mainly the diagonal elements, which yield infinite values due to the zero distances in the denominator) by substituting “inf” (infinite) values with zeros. Fourth, we create the graph-layout of the ESG(X) based on the weighted connectivity matrix WESG.
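The four steps can be sketched end-to-end as follows. This is a non-authoritative NumPy sketch (function name ours); the graph layout of step 4 is left to any network-visualization tool, so the function returns the weighted connectivity matrix WESG:

```python
import numpy as np

def esg(x, Fc=None, ke=1.0):
    """End-to-end sketch of the four-step ESG algorithm; returns W_ESG."""
    x = np.asarray(x, dtype=float)
    n = len(x)
    idx = np.arange(n)
    # Step 1: Coulomb-like force matrix F(X), Eqs. (1)-(2)
    with np.errstate(divide="ignore", invalid="ignore"):
        F = ke * np.outer(x, x) / (idx[:, None] - idx[None, :]).astype(float) ** 2
    # Step 3 (done early here): replace the infinite diagonal (zero self-distance) by zeros
    np.fill_diagonal(F, 0.0)
    # Step 2: connectivity filter with threshold Fc (default: typical value f_z, Eq. (4))
    if Fc is None:
        Fc = x.sum() / (n - 1)
    W = np.where(F >= Fc, F, 0.0)
    # Step 4: W is the weighted connectivity matrix passed to a graph-layout tool
    return W
```

For X = {1, 2, 3, 4} with the default fz = 10/3, only the strongest adjacent-pair forces (e.g. F23 = 6, F34 = 12) survive as edge weights.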

Figure 1
figure 1

Example of the electrostatic graph (ESG) algorithm conceptualization. The volume of electrical charge (qi, i = 1,..,8) in each node is shown proportionally to the node size (xi, i = 1,..,8) of the time-series.

Figure 2
figure 2

The methodological framework of the study. Steps #1–#4 describe the ESG algorithm generating an electrostatic graph from a time-series X = {xi | i = 1, …, 8}. Step#5 describes the process of generating secondary time-series from the network measures of ESG(X).

According to the first four steps of the algorithm, we can generate the electrostatic graph ESG(X), which is associated with a time-series X and is an undirected and weighted graph with a non-trivial topology. In this graph model, we can further compute several network measures and metrics and thus reveal the topological properties of the ESG. Therefore, at the fifth and final step of the algorithm, we compute node-series of network measures of the ESG(X) and afterward compare their structural relevance with that of the source time-series X. The procedure is described in more detail in the following paragraphs.

Node-series of network measures

The electrostatic graph ESG(X) is a graph-model GESG(V,E) in which each network node vi\(\in\)V coincides with a time-series node i\(\in\)X, namely vi≡i\(\in\)V,X. Therefore, for every node-measure Y (e.g. node degree, local clustering coefficient, closeness, betweenness, eigenvector centrality, etc.) of the ESG, we can arrange the scores Y(vi) = yi according to the ordering of the time-series X = {x1, x2,…, xn}, and thus configure node-series X(Y) = {y1, y2,…, yn} of the ESG network measures that are associated with the source time-series. This allows comparing the source time-series X with the ESG node-series X(Ys) and detecting possible structural similarities, which can be seen as a measure of relevance between the time-series and the ESG. The available network (node) measures that participate in the construction of the node-series are shown in Table 1.

Table 1 The node measures that are considered in the analysis1,27,29

In terms of notation, for a (source) time-series X = {x1, x2,…, xn}, where n\(\in {\mathbb{N}}\) and xi\(\in {\mathbb{R}}\), we can write its associated node-series for the network measure Y as X(Y) = {Y(x1), Y(x2),…, Y(xn)} = {y1, y2,…, yn}. We read X(Y) as “the node-series of the network measure Y, which is computed for the ESG that is associated with the time-series X” or, in brief, as “the node-series of (the measure) Y for the ESG”. Within this context, we can compute the node-series for the measures of degree X(Y = k) = {k1, k2,…, kn}, strength X(s) = {s1, s2,…, sn}, clustering coefficient X(C) = {C1, C2,…, Cn}, betweenness centrality X(CB) = {CB1, CB2,…, CBn}, closeness centrality X(CC) = {CC1, CC2,…, CCn}, and eigenvector centrality X(CE) = {CE1, CE2,…, CEn}, according to the mathematical formulas shown in Table 1. Provided that we can generate a node-series for any graph G(X) that is associated with a time-series X, we can include a subscript index in the notation XG(Y) when necessary (e.g. XESG(k)) to denote the type of graph with which the time-series is associated.
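For the simplest measures, degree and strength, the node-series can be read directly off the weighted connectivity matrix; a sketch assuming NumPy (centrality measures would additionally require a graph library such as networkx, which we do not reproduce here):

```python
import numpy as np

def node_series(W):
    """Node-series X(k) (degree) and X(s) (strength) from a weighted connectivity matrix W."""
    W = np.asarray(W, dtype=float)
    A = (W != 0).astype(int)      # binary adjacency underlying the weighted graph
    np.fill_diagonal(A, 0)        # no self-loops
    k = A.sum(axis=1)             # degree node-series X(k)
    s = (W * A).sum(axis=1)       # strength node-series X(s): sum of incident weights
    return k, s
```

Both outputs follow the ordering of the source time-series, so they can be compared element-wise with X.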

The effect of the connectivity threshold on the ESG topology

The connectivity threshold Fc that is applied to the Coulomb-like matrix, according to relation (3), determines the configuration of the ESG topology. To illustrate this, let us consider the series X1:100 = {1, 2,…, 100} of the first hundred natural numbers. By sequentially applying to this series the connectivity thresholds Fc = 0, Fc = 1, Fc = 5, Fc = 10, Fc = 25, Fc = 50, Fc = fz, Fc = 75, and Fc = 100, we get various ESGs, as shown in Fig. 3.

Figure 3
figure 3

Sparsity (spy) plots of the ESGs that are associated with the series X1:100 = {1, 2,…, 100} and are computed for the connectivity thresholds Fc = 0, Fc = 1, Fc = 5, Fc = 10, Fc = 25, Fc = 50, Fc = fz, Fc = 75, and Fc = 100.

As can be observed, the ESGs shown in Fig. 3 appear quite different in terms of graph density and node arrangement in the adjacency matrix. In particular, as Fc becomes greater, the connectivity strip along the main diagonal of the adjacency matrix becomes narrower, each time expressing a separate connectivity pattern. To examine whether and how the network topology is affected by changes in Fc, we compute a set of network measures and metrics (average degree \(\left\langle k \right\rangle\), clustering coefficient C, graph density ρ, modularity Q, average path length \(\left\langle l \right\rangle\), network diameter d(G), and the number of components) for a series of ESGs that are generated by applying connectivity thresholds ranging within the interval \(F_{c} \in \left[ {0,n^{2} = \max \left\{ {X_{1:100} } \right\}^{2} = 10^{4} } \right]\). This approach assumes that the network topology is collectively approximated by the set of available network measures, where each measure represents a certain topological aspect. The results of the analysis are shown in Fig. 4, where each network measure is expressed as a function of the connectivity threshold Fc.
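The threshold sweep described above can be reproduced, for graph density at least, with a short NumPy sketch (variable and function names are ours; a full replication would also compute the other measures with a graph library):

```python
import numpy as np

# Coulomb-like matrix for the series X = {1, 2, ..., 100}
x = np.arange(1, 101, dtype=float)
n = len(x)
idx = np.arange(n)
with np.errstate(divide="ignore"):
    F = np.outer(x, x) / (idx[:, None] - idx[None, :]).astype(float) ** 2
np.fill_diagonal(F, 0.0)
off = ~np.eye(n, dtype=bool)  # mask of off-diagonal entries

def graph_density(Fc):
    """Density rho of the ESG obtained with connectivity threshold Fc."""
    return ((F >= Fc) & off).sum() / (n * (n - 1))

# density as a function of an ascending sequence of thresholds
rhos = [graph_density(Fc) for Fc in (0, 1, 5, 10, 25, 50, 75, 100)]
```

By construction the density is non-increasing in Fc, consistent with the declining pattern of ρ reported in Fig. 4c.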

Figure 4
figure 4

Line diagrams showing how the network measures of (a) average node degree \(\left\langle k \right\rangle\), (b) clustering coefficient C, (c) graph density ρ, (d) modularity Q, (e) average path length \(\left\langle l \right\rangle\), (f) network diameter d(G), and (g) number of components change as a function of the connectivity threshold Fc, for a series X1:100 = {1, 2,…, 100}, where Fc ranges within the interval \(F_{c} \in \left[ {0,\max \left\{ {X_{1:100} } \right\}^{2} } \right]\). The bold vertical line represents the typical value fz defined at relation (4), whereas the dashed horizontal line represents a point estimate of the average y values.

Also, it is evident that all network measures fluctuate considerably as the connectivity threshold Fc changes. The cases of average degree \(\left\langle k \right\rangle\) (Fig. 4a), clustering coefficient C (Fig. 4b), and graph density ρ (Fig. 4c) follow a declining pattern as Fc changes, the cases of average path length \(\left\langle l \right\rangle\) (Fig. 4e) and network diameter d(G) (Fig. 4f) follow a bell-shaped pattern of negative skew (asymmetry), whereas the number of components (Fig. 4g) follows an increasing pattern. For the case of modularity Q (Fig. 4d), the performance of this measure appears considerably invariant along the greatest part of the Fc interval. As far as the typical value fz is concerned, we can observe that this value does not correspond to border (i.e. min or max) distribution values, but it can be quite indicative of the average performance of the topological aspects of the ESGs. This indication supports the choice of defining the typical value fz within a physical (Coulomb-like) context, as shown in relation (4).

Overall, this analysis shows that the choice of the connectivity threshold Fc can be determinative for the topological features and, generally, the topology of the resulting ESG. This observation is evident even from the examination of a simple linear series of ascending natural numbers, which can only be considered an indicative approach to the ESG construction. However, even this simple consideration sufficed to highlight the dependence between the connectivity threshold and the ESG’s network topology and thus to introduce a methodological path for optimally defining the Fc value. The examination of the optimum or most representative threshold is a matter of specialized optimization analysis that introduces avenues of further research and falls outside the scope of this paper. However, physically defined approaches, such as the Coulomb-like definition of Fc shown in relation (4), or others utilizing methods from other disciplines, can become insightful toward this optimization direction and are suggested for further research promoting multidisciplinary conceptualization. For instance, further research on this topic can apply to different types of time-series and more thorough optimization analysis in the choice of Fc. For the scope of this paper, the choice of the typical value fz for the connectivity threshold is considered satisfactory in providing a reference value that is representative of the average topological features of the ESGs.

Testing the performance of the ESG algorithm

The analysis examines five different types of time-series, as shown in Fig. 5. The first one (Fig. 5a) was extracted from AirPassengers30 and is a time-series with a linear trend (abbreviated: Xa≡AIR), including the monthly totals of US airline passengers for the period 1949 to 1960 (144 cases). The second one (Fig. 5b) was extracted from LorentzTS31 and is a typical Lorenz chaotic time-series (Xb≡CHAOS) generated from the Lorenz differential equations, with standard parameter values sigma = 10.0, r = 28.0, and b = 8/3. This time-series has a length of 1900 cases. The third one (Fig. 5c) was extracted from DEOK.hourly32 and is a part (the first 5000 cases) of a broader stationary time-series (of 57,739 cases) including estimated energy consumption, in Megawatts (MW), for Duke Energy Ohio/Kentucky (Xc≡DEOK). The fourth one (Fig. 5d) was extracted from Wolfer-sunspot-numbers33 and is a periodic time-series including Wolfer sunspot numbers (Xd≡SUNSPOTS), for the period 1770 to 1771 (280 cases). The fifth one (Fig. 5e) was extracted from Daily-minimum-temperatures-in-me34 and is a cyclical time-series including daily minimum temperatures in Melbourne, Australia (Xe≡TEMP), for the period 1981–1990 (3650 cases). Links to the time-series databases are available in the reference list.

Figure 5
figure 5

The source (reference) time-series considered in the analysis represent distinctively different patterns, where (a) is an air-passengers time-series with a linear trend (Xa: 144 cases, including the monthly totals of US airline passengers for the period 1949 to 1960), (b) is the typical Lorenz chaotic time-series (Xb: 1900 cases, created from the Lorenz equations, with standard values sigma = 10.0, r = 28.0, and b = 8/3), (c) is a part (Xc: 5000 cases) of a broader stationary time-series including estimated energy consumption, in Megawatts (MW), for Duke Energy Ohio/Kentucky, (d) is a periodic time-series (Xd: 280 cases, including Wolfer sunspot numbers for the period 1770 to 1771), and (e) is a cyclical time-series (Xe: 3650 cases, including daily minimum temperatures in Melbourne, Australia, for the period 1981–1990).

To examine the effectiveness of the proposed algorithm, we first compare the structure of the source time-series X with its node-series XESG(Ys) of the ESG node measures (Ys). Such comparisons are driven by the rationale that the ESG is a transformation (conversion) of a time-series to a complex network, and therefore possible similarities detected in the structural properties (e.g. data variability, linear trend, chaotic, stationary, periodic, and cyclical structure) between the original time-series and its associated ESG node-series can be seen as aspects of homeomorphism describing this transformation. In general, this approach is expected to illustrate the level at which the topology of the associated electrostatic graph ESG(X) sufficiently incorporates structural information of the source time-series X. Second, we compare the structure of the XESG(Ys) node-series with that of their concordant node-series XVGA(Ys) of the node measures (Ys) computed on the visibility graphs defined by Lacasa et al.5. The comparisons between the source time-series and its associated node-series (of either the ESG or the VGA conversion) build on a multilevel analysis consisting of five tests: the first detects similarities in data variability (i.e. whether the original time-series and the node-series have the same fluctuation patterns) based on Pearson’s bivariate correlation coefficient35,36; the second, in linear trend, by using Least-Squares Linear Regression (LSLR) fitting36; the third, in chaotic structure, based on the correlation dimension versus embedding dimension diagram37; the fourth, in stationary structure, based on the augmented Dickey-Fuller (ADF) test for a unit root38; and the fifth, in periodic structure, based on the autocorrelation function38. Each test is briefly described in the following paragraphs.

The visibility graph algorithm

The natural visibility graph (NVG) algorithm was proposed by Lacasa et al.5 and builds on the intuition of considering a time-series as a path of successive mountains of different heights, each representing the value of the time-series at a certain time. In this time-series landscape, an “observer” standing on the top of a mountain can see (either forward or backward) as far as possible, provided that no other top obstructs the visibility field (Fig. 6).

Figure 6
figure 6

(left) Example of a pair of visible (shown in blue) and a pair of non-visible (shown in red) time-series nodes (generally shown in green), defined according to the natural visibility graph (NVG) algorithm; (right) the visibility graph generated from the time-series shown on the left.

In mathematical terms, each time-series node (ti, x(ti)) corresponds to a graph node i≡(ti, x(ti))\(\in\)V, and thus two nodes i,j\(\in\) V are connected (i,j)\(\in\)E in the visibility graph when the following inequality (NVG connectivity criterion) is satisfied:

$$X(t_{k} ) < X(t_{i} ) + (X(t_{j} ) - X(t_{i} ))\frac{{t_{k} - t_{i} }}{{t_{j} - t_{i} }}, \, \forall k \in (i,j),$$
(6)

where X(ti) and X(tj) are the numerical values of the time-series nodes (ti, x(ti))≡i and (tj, x(tj))≡j, and ti, tj express their time points. In geometric terms, a visibility line can be drawn between two time-series nodes i,j\(\in\)V if no other intermediating node (tk, x(tk))≡k obstructs their visibility. That is, two time-series nodes are connected in the visibility graph when no intermediary node is high enough to intersect the visibility line defined by this pair of nodes (Fig. 6). Therefore, two time-series nodes enjoy a connection in the associated visibility graph if they are visible to each other through a visibility line. The visibility algorithm conceptualizes the time-series as a landscape and generates a visibility graph associated with this landscape. The associated (to the time-series) visibility graph is a complex network on which complex network analysis can be further applied8,16.
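The NVG connectivity criterion of inequality (6) translates directly into code; a brute-force O(n³) sketch in Python (the function name is ours, and faster divide-and-conquer implementations exist):

```python
import numpy as np

def visibility_edges(x):
    """Edges of the natural visibility graph per the NVG criterion (Eq. (6))."""
    x = np.asarray(x, dtype=float)
    n = len(x)
    edges = []
    for i in range(n):
        for j in range(i + 1, n):
            # every node k strictly between i and j must lie below the line i -> j
            visible = all(
                x[k] < x[i] + (x[j] - x[i]) * (k - i) / (j - i)
                for k in range(i + 1, j)
            )
            if visible:
                edges.append((i, j))
    return edges
```

Adjacent nodes are always mutually visible (the inner check is vacuous for j = i + 1), so the resulting graph is always connected.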

Correlation analysis

In the first step of the analysis, we detect linear correlations between the source time-series X and the available (ESG and VGA) node-series. This approach examines whether the original time-series X and the node-series {Xi(k), Xi(s), Xi(C), Xi(CB), Xi(CC), and Xi(CE) | i = ESG,VGA} have the same fluctuation patterns and thus can be considered relevant in terms of data variability. In this analysis, Pearson’s bivariate correlation coefficient35,36 is used, which ranges within the interval rX,Y\(\in\)[–1,1] and detects linear (either positive or negative) correlations when \(\left| {r_{XY} } \right| \to 1\).
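A minimal sketch of this test, assuming NumPy (`np.corrcoef` returns the correlation matrix, whose off-diagonal entry is the Pearson coefficient rX,Y):

```python
import numpy as np

def pearson_r(x, y):
    """Pearson's bivariate correlation coefficient between a time-series and a node-series."""
    return float(np.corrcoef(x, y)[0, 1])
```

Values of |r| close to 1 indicate that the node-series reproduces the fluctuation pattern of the source time-series.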

Test of the linear trend

To detect a linear trend, we apply linear fittings to the source time-series X and to its associated node-series {Xi(k), Xi(s), Xi(C), Xi(CB), Xi(CC), and Xi(CE) | i = ESG,VGA}. According to this approach36, a linear curve \(\hat{y} = b \cdot f(x) + c\) that best describes the variability of the available data is fitted to them. The curve-fitting algorithm estimates the parameters b, c by minimizing the squared differences \(y_{i} - \hat{y}_{i}\)36, according to the relation:

$$\min \left\{ {e = \mathop \sum \limits_{i = 1}^{n} \left[ {y_{i} - \hat{y}_{i} } \right]^{2} = \mathop \sum \limits_{i = 1}^{n} \left[ {y_{i} - \left( {\mathop \sum \nolimits b_{i} f_{i} (x) + c} \right)} \right]^{2} } \right\},$$
(7)

where yi denote the observed and \(\hat{y}_{i}\) the estimated values. The optimization method used is the Least-Squares Linear Regression (LSLR) method36, which assumes that the residuals \(y_{i} - \hat{y}_{i}\) follow the normal distribution e ~ N(0,\(\sigma_{e}^{2}\)). The goodness of the model fit is measured by the coefficient of determination R2, which is defined by the expression35,36:

$$R^{2} = {{\left( {\mathop \sum \nolimits_{i = 1}^{n} \left( {\hat{y}_{i} - \overline{y}} \right)^{2} } \right)} \mathord{\left/ {\vphantom {{\left( {\mathop \sum \nolimits_{i = 1}^{n} \left( {\hat{y}_{i} - \overline{y}} \right)^{2} } \right)} {\left( {\mathop \sum \nolimits_{i = 1}^{n} \left( {y_{i} - \overline{y}} \right)^{2} } \right)}}} \right. \kern-\nulldelimiterspace} {\left( {\mathop \sum \nolimits_{i = 1}^{n} \left( {y_{i} - \overline{y}} \right)^{2} } \right)}},$$
(8)

where \(\overline{y}\) is the average of the observations and n is the number of cases (i.e. the series length). The coefficient of determination expresses the proportion of the variability of the response variable that is explained by the linear model and ranges within the interval [0,1], indicating perfect linear determination when R2 = 135,36. Within this context, amongst the ESG and VGA node-series, those closer to the source time-series X in determination and model configuration (i.e. in the values of the b and c estimators) are considered more relevant to X in terms of linear trend.
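A sketch of the LSLR fit and the R² of Eq. (8), assuming NumPy (`np.polyfit` performs the least-squares estimation of b and c; the function name is ours):

```python
import numpy as np

def linear_trend(y):
    """Least-squares linear fit y_hat = b*t + c and coefficient of determination R^2 (Eqs. (7)-(8))."""
    y = np.asarray(y, dtype=float)
    t = np.arange(len(y), dtype=float)
    b, c = np.polyfit(t, y, 1)          # least-squares estimates of slope b and intercept c
    y_hat = b * t + c
    # Eq. (8): explained sum of squares over total sum of squares
    r2 = np.sum((y_hat - y.mean()) ** 2) / np.sum((y - y.mean()) ** 2)
    return b, c, r2
```

For an exactly linear series the fit recovers b and c and yields R² = 1, the case of perfect linear determination.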

Detection of chaotic structure

To detect chaotic structure in a time-series, we examine the patterns of the correlation dimension (v) versus embedding dimension (m) scatter plots (v,m). According to chaos theory37, the correlation dimension (v) is a measure of the dimensionality of the space occupied by a set of random points and is thus used to determine the dimension of fractal objects, which is often called the fractal dimension. For a time-series X = {xi | i = 1, …, n}, the correlation integral C(ε) is calculated by the expression39,40:

$$C\left( \varepsilon \right) = \mathop {\lim }\limits_{n \to \infty } \frac{N(\varepsilon )}{{n^{2} }}\sim \varepsilon^{v} ,$$
(9)

where N(ε) is the total number of pairs of time-series points (xi, xj) with a distance smaller than ε, namely d(xi,xj) = dij < ε. As the number of points tends to infinity (n → ∞), and therefore as their corresponding distances tend to zero (dij → 0), the correlation integral tends to the quantity C(ε) ~ εv, where v is the so-called correlation dimension. Intuitively, the correlation dimension expresses the degree to which the points can be close to each other along different dimensions and is expected to rise faster when the embedding space is of a higher dimension. Therefore, the correlation dimension (v) versus embedding dimension (m) diagram (v,m) can provide insights into how close the time-series points are to each other as the dimensionality of the embedding space increases39,40. Within this context, amongst the ESG and VGA node-series, those with a (v,m) diagram closer to that of the source time-series X are considered more relevant to the original time-series, in terms of chaotic structure.
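The correlation sum of Eq. (9) for a finite sample, with a simple delay embedding of dimension m, can be sketched as follows (a Grassberger–Procaccia-style estimate; the function name and the delay parameter tau are our own assumptions):

```python
import numpy as np

def correlation_integral(x, eps, m=1, tau=1):
    """Finite-sample correlation sum C(eps) for a delay embedding of dimension m (sketch of Eq. (9))."""
    x = np.asarray(x, dtype=float)
    N = len(x) - (m - 1) * tau
    # delay-embedded vectors: rows are (x_t, x_{t+tau}, ..., x_{t+(m-1)tau})
    emb = np.column_stack([x[i * tau : i * tau + N] for i in range(m)])
    # pairwise Euclidean distances between embedded points
    d = np.linalg.norm(emb[:, None, :] - emb[None, :, :], axis=-1)
    iu = np.triu_indices(N, k=1)  # distinct pairs only
    return float(np.mean(d[iu] < eps))
```

The correlation dimension v is then estimated as the slope of log C(ε) versus log ε in the scaling region, repeated for increasing m to build the (v,m) diagram.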

Detection of stationarity

To detect stationarity in the available series, we apply the augmented Dickey-Fuller (ADF) test for a unit root38. The ADF algorithm examines the null hypothesis (Ho) that a unit root is present in the model’s time-series data, which is expressed by the relation:

$$y_{t} = c + \delta {\text{t + }}\phi \cdot y_{t - 1} + \beta_{1} \cdot \Delta y_{t - 1} + \cdots + \beta_{p} \cdot \Delta y_{t - p} + \varepsilon_{t} ,$$
(10)

where Δ is the differencing operator (Δyt = yt − yt−1), p is the number of lagged difference terms (specified by the user), c is a drift term, δ is a deterministic trend coefficient, \(\phi\) is an autoregressive coefficient, βi are the regression coefficients of the lag differences, and εt is a mean zero innovation process. According to Eq. (10), the unit-root hypothesis testing is expressed as follows38:

$$H_{o} :\phi = {\text{1 vs}}{. }H_{1} : \phi < {1,}$$
(11)

and the (lag adjusted) test statistic DF is defined by the expression38:

$$DF = \frac{{N(\overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\frown}$}}{\phi } - 1)}}{{(1 - \overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\frown}$}}{\beta }_{1} - ... - \overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\frown}$}}{\beta }_{p} )}},$$
(12)

where the hat symbol ‘^’ denotes an estimator. Within this context, amongst the node-series of the ESG and VGA, first those satisfying the null hypothesis and then those with DF statistics more similar to that of the source time-series X are considered more relevant to the original time-series, in terms of stationarity.
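A simplified, non-augmented (p = 0) version of the test statistic can be sketched with ordinary least squares. For the full ADF test with lagged differences one would normally use a statistics package such as statsmodels; this sketch and its function name are ours:

```python
import numpy as np

def dickey_fuller_stat(y):
    """Non-augmented (p = 0) Dickey-Fuller statistic N*(phi_hat - 1), a sketch of Eqs. (10)-(12)."""
    y = np.asarray(y, dtype=float)
    dy = np.diff(y)      # Delta y_t = y_t - y_{t-1}
    ylag = y[:-1]        # y_{t-1}
    # OLS of dy on [const, ylag]: dy_t = c + (phi - 1) * y_{t-1} + e_t
    Xm = np.column_stack([np.ones_like(ylag), ylag])
    coef, *_ = np.linalg.lstsq(Xm, dy, rcond=None)
    phi_hat = 1.0 + coef[1]
    return len(dy) * (phi_hat - 1.0)
```

A strongly negative statistic argues against the unit-root null (stationarity), whereas a statistic near zero is consistent with a unit root, as for a random walk.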

Detection of periodicity and cyclical structure

To detect periodicity in the available time-series we use the autocorrelation function (ACF), which is defined as:

$$\rho (s,t) = \frac{{\gamma_{x} (s,t)}}{{\sqrt {\gamma_{x} (s,s)\gamma_{x} (t,t)} }},$$
(13)

where (s,t) are time points and γx(s,t) is the autocovariance function of the variable x38. In general, the ACF measures the linear predictability of the series at time t by using only the value xs (at time s), with a time-lag dt = t − s. The ACF lies within the interval − 1 ≤ ρ(s,t) ≤ 1, where positive coefficient values imply a positive linear trend and negative values a negative one. Based on the ACF, we construct a set of ACF-variables, the first of which refers to the source time-series X and the others to the node-series Xi(k), Xi(s), Xi(C), Xi(CB), Xi(CC), and Xi(CE), where the index i is either i = ESG or i = VGA. Each variable includes 30 elements corresponding to ACFs of lag dt = 1,2,…,30, respectively, namely:

$$ACF(X) = \left\{ {\rho (t,t + 1),\rho (t,t + 2),...,\rho (t,t + 30)} \right\}.$$
(14)

By constructing these ACF-variables, we compute the Pearson’s bivariate coefficient of correlation35,36 to detect linear correlations between the ACF(X) variable of the source time-series X and the other node-series variables. Within this context, amongst the available ESG and VGA node-series variables, those more highly correlated with the source time-series X are considered more relevant to the original time-series in terms of periodicity and cyclical (i.e. periodic with a constant oscillation height) structure.
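The comparison can be sketched as follows: `acf_vector` implements the sample estimate of Eqs. (13)-(14) for lags 1,…,30, and `acf_similarity` is the Pearson correlation between two such ACF-variables (both function names are our own):

```python
import numpy as np

def acf_vector(x, lags=30):
    """Sample autocorrelations rho(t, t+k) for k = 1..lags (Eqs. 13-14)."""
    x = np.asarray(x, dtype=float) - np.mean(x)
    var = np.sum(x * x)  # lag-0 autocovariance (up to 1/n), used for normalization
    return np.array([np.sum(x[:-k] * x[k:]) / var for k in range(1, lags + 1)])

def acf_similarity(source, node_series, lags=30):
    """Pearson correlation between the ACF-variables of two series."""
    return np.corrcoef(acf_vector(source, lags), acf_vector(node_series, lags))[0, 1]
```

A node-series whose similarity score is close to 1 reproduces the lag structure, and hence the periodicity, of the source series.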

Results

Spy plots and graph layouts

The spy plots and graph layouts of the ESG(X) and VGA(X) graphs associated with the time-series X are shown in Fig.A1-A5 (in the Appendix). The spy plots are matrix-plots displaying with dots the non-zero elements of the adjacency matrix, and they can thus represent the graph topology within the matrix-space3,41. On the other hand, network visualization is implemented by using the “Force-Atlas” layout, which is available in the open-source software of Bastian et al.42. This layout is generated by a force-directed algorithm, which applies repulsion strengths between network hubs while arranging the hubs’ connections into surrounding clusters. Graph models represented in this layout therefore have their hubs centered and mutually distant (i.e. the distance between hubs is the largest possible), whereas lower-degree nodes are placed as closely as possible to their hubs3.

As can be observed in Fig.A1 (Appendix), the ESG(Xa) spy plot has a connectivity pattern configuring a tie (along the main diagonal) of increasing width (Fig.A1.a,c, Appendix), which appears indicative of the increasing trend of the source time-series (Xa = AIR). An aspect of this trend is also evident in the chain-like ESG(Xa) graph layout (Fig.A1.e, Appendix), where a cluster of hubs appears on the right side that resembles the tie configuration shaped in the spy plot. Also, the saw-like pattern of the source time-series appears smoother in the pattern of the 2d ESG(Xa) spy plot (Fig.A1.a, Appendix), whereas it is more evident in the diagonal arrangement of the 3d ESG(Xa) spy plot (Fig.A1.c, Appendix). On the other hand, the VGA(Xa) spy plot configures a periodic pattern (Fig.A1.b,d, Appendix), where no linear trends are visible. This can also be observed in the VGA(Xa) graph layout (Fig.A1.f, Appendix), which shapes an almost symmetric hub-and-spoke pattern.

In Fig.A2, the ESG(Xb) spy plot configures a fractal-like tiling (Fig.A2.a, Appendix) illustrating a chaotic structure. Although such structure is not that clear in the ESG(Xb) graph layout (Fig.A2.e, Appendix), we can observe two major components composing the electrostatic graph of Xb (Lorenz time-series). This is a result of the positive and negative values in the structure of the source time-series (Xb), illustrating the ability of the electrostatic graph (ESG) algorithm to generate disconnected graphs1,27,41. Although connectivity is generally a desirable property in complex networks, the ability of the ESG algorithm to generate disconnected graphs can be insightful for removing past or unnecessary information (noise) from the time-series, thereby proposing avenues for further research. On the other hand, the VGA(Xb) graph layout (Fig.A2.f, Appendix) illustrates a chaotic structure better than its spy plot (Fig.A2.b,d, Appendix) does, the latter being more suggestive of a periodic than of a chaotic structure.

Next, in Fig.A3 (Appendix), the ESG(Xc) spy plot (Fig.A3.a,c, Appendix) configures a tie (along the main diagonal) with an almost constant width, which complies with the stationary structure of the source time-series (Xc = DEOK). Some evidence of stationarity can also be observed in the concentrated (solid-like) pattern of the ESG(Xc) graph layout (Fig.A3.e, Appendix). On the other hand, neither the VGA(Xc) spy plot (Fig.A3.b,d, Appendix) nor the graph layout (Fig.A3.f, Appendix) is illustrative of the stationary structure describing the original time-series (Xc).

In Fig.A4 (Appendix), the ESG(Xd) spy plot also configures a tie (along the main diagonal) with repeated knot-concentrations (Fig.A4.a,c, Appendix), which complies with the periodic structure of the source time-series (Xd = SUNSPOTS). Some insightful indications of this periodicity can also be observed in the clustered (torus-like) pattern shown in the ESG(Xd) graph layout (Fig.A4.e, Appendix). On the other hand, the VGA(Xd) spy plot (Fig.A4.b,d, Appendix) has an interesting periodic pattern, which is slightly mixed with the square areas of the other connections. However, the VGA(Xd) graph layout (Fig.A4.f, Appendix) does not appear illustrative of the periodic structure describing the source time-series (Xd).

Finally, the ESG(Xe) spy plot configures a tie (along the main diagonal) with repeated slightly thicker segments (Fig.A5.a,c, Appendix), which can relate to the cyclical structure describing the source time-series (Xe = TEMP). However, this cyclical structure is almost hidden in the chain pattern of graph components that have an odd arrangement in the ESG(Xe) graph layout (Fig.A5.e, Appendix). Periodicity could become clearer if the layout were further stretched to achieve a symmetric arrangement similar to that of Fig.A4.e (Appendix). On the other hand, the VGA(Xe) spy plot shapes a clearer periodic pattern (Fig.A5.b,d, Appendix), which (although with difficulty) can also be observed in the graph layout (Fig.A5.f, Appendix). Overall, the proposed ESG algorithm appears at least as capable as the VGA in generating graphs with topologies representative of their source time-series. This observation will also be quantitatively tested in the following sections.

Correlation analysis

To compare patterns in data variability between the source and the ESG and VGA node-series (see Fig. A6-A10, Appendix), we apply a Pearson’s bivariate correlation analysis, the results of which are shown in Table 2. Amongst the available correlation coefficients, we compare concordant pairs (r(X,XESG(z)), r(X,XVGA(z)) | z = k, C, CB, CC, and CE) between ESG and VGA node-series and we denote pairwise maxima (max{r(X,XESG(z)), r(X,XVGA(z))}) in bold font. Cases with the XESG(s) node-series are paired with those of the corresponding degree XVGA(k), due to the similarity of the measures of node degree (k) and node strength (s) for binary and weighted networks, respectively. Within this context, according to Table 2, in the case of the Xa time-series, the variability of the ESG node-series is overall closer to that of the source time-series (Xa) than the variability of the VGA node-series is, because the ESG node-series count 5 out of 6 maxima, whereas the VGA node-series count just one. This observation implies that the ESG transformation generates graphs that better preserve the linearly trending fluctuations of the original time-series than the VGA does. On the contrary, in the case of the chaotic time-series (Xb), the VGA node-series count 5 out of 6 maxima (a double count is given for the k,s pair), whereas the ESG node-series count just one. This observation implies that the VGA transformation better preserves chaotic fluctuations of the original time-series than the ESG does.

Table 2 Results of the Pearson’s bivariate correlation analysis.

In the case of Xc (DEOK), the ESG node-series count 4 out of 6 maxima, whereas the VGA node-series count 2 out of 6, which implies that the ESG transformation better preserves stationary fluctuations of the original time-series than the VGA does. In the case of Xd (SUNSPOTS), the ESG node-series count 5 out of 6 maxima, whereas the VGA node-series count just one, which implies that the ESG transformation better preserves periodical fluctuations of the original time-series than the VGA does. In the case of Xe (TEMP), both the ESG and the VGA node-series count 3 out of 6 maxima, showing a balanced performance. As far as the measure of strength (s) is concerned (see Fig. A11, Appendix), the analysis shows that, for all types of time-series except the chaotic one (Xb, CHAOS), the ESG node-series perform better than the VGA ones. Overall, this pair-wise consideration illustrates that the variability of the ESG node-series is closer to the source time-series (Xi) than that of the VGA ones, since the former count 18 out of 30 maximum cases, whereas the latter count 12 out of 30.

Test of the linear trend

The test of the linear trend was applied to the ESG and VGA node-series associated with the Xa (AIR) time-series, which is a time-series with a known linear trend. The results of the analysis are shown in Table 3, where it can first be observed that the source (Xa: AIR) time-series is well described by a linear regression model (R2 = 0.8536). However, none of the VGA node-series can sufficiently retain this linear structure, as is evident from the low coefficients of determination, ranging from R2 = 0.0002 to R2 = 0.0132.

Table 3 Linear regression fittings for the Xa (Air) time-series.

On the contrary, the ESG node-series of degree XESG(k), strength XESG(s), and eigenvector centrality XESG(CE) have a considerable linear structure, as is denoted by their respective coefficients of determination R2 = 0.6916, R2 = 0.8012, and R2 = 0.7579. It should be noted that, among these cases, the strength node-series XESG(s) has the highest coefficient of determination. Overall, this analysis illustrates that the ESG algorithm appears more capable than the VGA in generating graphs that preserve aspects of the linear trend of the source time-series.
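The comparison in this test amounts to regressing each series on its time index and reading off the coefficient of determination. A compact sketch (the function name is our own):

```python
import numpy as np

def linear_trend_r2(series):
    """R^2 of an ordinary least-squares line fitted against the time index."""
    y = np.asarray(series, dtype=float)
    t = np.arange(len(y))
    slope, intercept = np.polyfit(t, y, 1)   # OLS fit y ~ slope*t + intercept
    residuals = y - (slope * t + intercept)
    return 1.0 - np.sum(residuals ** 2) / np.sum((y - y.mean()) ** 2)
```

Applied to the source time-series and to every node-series, values of R² close to that of the source indicate that the transformation has retained the linear trend.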

Detection of chaotic structure

In this part of the analysis, the correlation dimension versus embedding dimension diagrams (v,m) of the VGA and the ESG node-series are compared with respect to preserving the chaotic structure of the source time-series Xb (CHAOS), which is a known chaotic time-series constructed on the Lorenz equations. The results are shown in Fig. A7 (Appendix), where all (v,m) diagrams of the ESG node-series (except that of the eigenvector centrality Xb,ESG(CE)) illustrate a chaotic structure, but with characteristics different from those of the source chaotic time-series Xb. However, the (v,m) diagrams of the strength Xb,ESG(s) and the original time-series Xb almost coincide, a fact that implies a relevant chaotic structure between these time-series. On the other hand, the degree Xb,VGA(k), and possibly the eigenvector centrality Xb,VGA(CE), VGA node-series illustrate a chaotic structure of high dimensionality, again with characteristics different from those of the original chaotic time-series Xb. Overall, the chaos analysis shows that the ESG is a transformation more capable of incorporating the chaotic structure of the source time-series into the network topology. In particular, the measure of strength shows the most relevant chaotic structure, which almost coincides with that of the source time-series.

Detection of stationarity

The test of stationarity was applied to the Xc (DEOK) time-series, which is part of an already known stationary time-series. The results of the analysis are shown in Table 4, where it can first be observed that it is 7.03% likely for Xc to have a unit root and thus to be a non-stationary time-series. This result implies that the null hypothesis (stating the presence of a unit root) cannot be rejected, and thus that the source (Xc) time-series cannot be considered a stationary one. As can be observed, the results for all VGA node-series imply that all cases can safely be considered stationary series, which opposes the indication of the original time-series.

Table 4 ADF test for stationarity of the Xc (DEOK) time-series.

On the other hand, the ESG results imply that 4 out of 5 ESG node-series cannot be considered stationary and thus resemble the structure of the original time-series. An interesting observation here is that the p-values of the VGA node-series are (although insufficient to retain the null hypothesis) closer to that of the source time-series than those of the ESG node-series are. These results imply that the non-stationary effects, which are immanent in the source time-series, probably appear more intensely in the structure of the ESG node-series than in that of the VGA ones.

Detection of periodicity and cyclical structure

This part of the analysis builds on bivariate correlations, which are applied to the autocorrelation variables ACF(X) defined in relation (14) with lags 1,2,…,30, where X = Xd (SUNSPOTS time-series) or Xe (TEMP time-series), and the node-series refer to k (degree), s (strength), C (clustering coefficient), CB (betweenness centrality), CC (closeness centrality), and CE (eigenvector centrality). The results of the analysis are shown in Table 5, where the correlation coefficients rXY and their significances are provided, with X\(\in\){ACF(Xd), ACF(Xe)} and Y\(\in\){ACFi(k), ACFi(s), ACFi(C), ACFi(CB), ACFi(CC), ACFi(CE) | where i = VGA, ESG}.

Table 5 Correlations of ACFs(*) between the source and the node-series (Sunspots and Temp).

For the case of the Xd (SUNSPOTS) time-series, we can observe that 4 out of 6 VGA node-series (k, k≡s, C, CE) and 3 out of 6 ESG node-series (k, s, CC) are significantly correlated with the original time-series Xd. Amongst these significant results, the VGA node-series have 2 maxima of concordant pairs, and the ESG node-series also have 2. Moreover, the node-series of strength (Xd,ESG(s)) has the highest correlation coefficient amongst all available node-series for the SUNSPOTS (Xd) typology, illustrating a better performance of the ESG algorithm in preserving periodicity, probably due to its capability of generating weighted electrostatic networks. For the case of the Xe (TEMP) time-series, only 1 VGA node-series (closeness centrality) is significantly correlated with the source time-series, whereas all ESG node-series are significantly correlated with the original time-series. In terms of pairwise comparisons, the VGA node-series count 1 (out of 6) maximum case, whereas the ESG node-series count 5 out of 6 maxima. However, although its correlation is high, the strength node-series does not provide the highest of the maxima of the TEMP (Xe) time-series concordant pairs. Overall, this analysis shows that the ESG node-series appear more capable than the VGA ones in preserving periodic and cyclical characteristics of the source time-series.

Conclusions

This paper proposed a new algorithm, the Electrostatic Graph Algorithm (ESG), for converting a time-series into a graph (complex network). The ESG builds on the conceptualization of a time-series as a series of stationary and electrically charged particles, on which Coulomb-like forces can be computed. The proposed algorithm adds value to the complex network analysis of time-series due to its ability to produce weighted graphs, a capability not offered by the existing transformations. This additional property was quantitatively examined in this paper and was found to produce graphs that are more representative of the structure of the source (original) time-series, implying that the proposed algorithm suggests a transformation that is more natural than algebraic, in comparison with the existing methods. In particular, the analysis showed that the ESG node-series can better preserve the linear trend and the stationary structural properties of the source time-series in comparison with the VGA node-series, and that they appear slightly better in preserving the periodical and cyclical structural properties of the original time-series than the VGA node-series can. On the other hand, the VGA node-series appeared slightly better in preserving the chaotic structural properties of the original time-series in comparison with the ESG node-series, which complies with the claim of the VGA authors regarding the added value of their algorithm. However, in almost all parts of the analysis, the ESG node-series of the measure of strength outperformed their concordant VGA node-series. This result highlighted the added value of the proposed algorithm in generating weighted graphs, on which alone the measure of node strength can be computed. Therefore, the ESG algorithm attributes to the generated graphs information that is more representative of the source time-series, due to the weights included in the graph structure.
Another property of the proposed ESG algorithm, namely its ability to generate disconnected graphs, was indirectly examined through the detection of chaotic and periodic structures, where the ESG algorithm produced disconnected graphs, whereas the VGA did not. This analysis showed that insufficient connectivity does not restrict the ESG node-series from preserving the structural characteristics of the source time-series, since the generated electrostatic graphs were representative of the structure of the original time-series. The authors believe that the property of insufficient connectivity introduces avenues of further research in the field of noise reduction in time-series analysis. Other avenues of further research can emerge in the direction of either choosing the optimum or most representative connectivity threshold for producing the ESGs, or examining the applicability of the proposed algorithm to problems where standard methods fail to analyze time-series efficiently, such as the time evolution of stock prices within the framework of the Black-Scholes model. The overall approach also suggests a methodological framework for evaluating the structural relevance between source time-series and their associated graphs produced by any possible transformation.