1 Introduction

In this paper, we consider a fullerene Cn as a convex polytope modeling a closed three-dimensional carbon-cage with n atoms, cf. [1]. Each vertex is connected to exactly three other vertices, such that the facets are pentagons and hexagons only. Using the classical Euler relation and Eberhard’s theorem [19, Section 13.3] for simple three-dimensional convex polytopes one can conclude that the number of pentagonal facets is always equal to 12 and n is even. In this case, the number of hexagons is \(m_6=\frac{n}{2}-10\), cf. [17]. Theoretically, Cn exists for \(n=20\) and all even \(n\ge 24\), see [1]. In the sequel we call such a n feasible. However, up to now nly a few of them (some isomers of Cn with \(n=60,70,76,78,80,82,84\)) are separated (i.e., are chemically stable and can be synthesized in considerable mass quantities), cf. [21, 27, 30, 38]. Another difficult problem is the combinatorial constructive enumeration of all isomers of \(C_n\), see [7, 11, 35] and references in [16, Section 3]. For instance, a set of four operations (including that of Endo-Kroto) enables the construction of all fullerenes for each even n starting with \(n=24\) (barrel), see [12, 16]. Other examples for such operations are the generalized Stone-Wales operation (cf. [2], which is known to be not complete) and the buckygen (introduced 2012 in [8]) which is up to now the fastest algorithm to generate all \(C_n\)-isomers.

Fig. 1
figure 1

IPR-Isomers of C60 (Buckminster fullerene) and C70. Courtesy of Max von Delius

Combinatorial isomer is a class of combinatorially equivalent polytopes. Two polytopes P and \(P'\) are called combinatorially equivalent if there exists a one-to-one mapping between the lattice of all facets of P and \(P'\) that is inclusion-preserving [19, p. 38]. Hence, this equivalence class is determined by the graph of vertices, or equivalently, by the dual graph of facets. Both are uniquely described by their adjacency matrices. With increasing amount of atoms n, the number of isomers \(\hbox {ISO}\left( n\right) \) grows fast as \(O(n^9)\), cf. [35]. For instance, there exist a unique C20–isomer (\(\hbox {ISO}(20)=1\)) which is dodecahedron, a Platonic solid, but C60 has already 1812 combinatorial isomers, see [6]. Among all isomers of C60, only one (the so–called Buckminster fullerene, an Archimedean truncated icosahedron) has all pentagonal facets being not adjacent. Such isomers are called IPR-fullerenes (from Isolated Pentagon Rule), cf. Fig. 1. As illustrated in Fig. 2, the number ISO-IPR\(\left( n\right) \) of IPR-isomers also grows asymptotically as \(O(n^9)\) with increasing number of atoms n [7, 32].

The huge variety of possible fullerene isomers with large n (even in the IPR–class) makes the problem of finding molecules with remarkable chemical and physical attributes (including thermodynamic and kinetic stability, permeability, electric conductivity, light diffraction, etc.) extremely difficult. Hence, a need for fast and computationally cheap classification methods of isomers arises. In the literature, there already exist a number of functionals (called topological indices or chemical structure-descriptors) allowing to find a certain order of isomers, cf. [4, 15, 20]. These descriptors are of topological, geometric or physical nature. For the practical separation of fullerenes, their potential energetic level is of primary significance. The paper [34] computes the relative energies of all 1812 isomers of C60 using the density functional theory (DFT) [31] and testing 26 chemical descriptors for their correlation with the energetic ordering of these isomers. The authors identify 7 rules among 26 which they call good stability criteria. These criteria are defined as those able to identify correctly the first two energetically most stable and the three energetically least stable isomers in the correct energetic order such that the correlation coefficient between the ordering according to these rules and the energetic order is at least 0.6. However, the DFT calculations are based on the approximative numerical solution of electronic Schrödinger (linear partial differential) equations requiring from half an hour up to one day of calculation time per isomer (\(n=60\)) on a usual personal computer. Moreover, the convergence of the DFT numerical method is not guaranteed.

Following the famous question by Mark Kac Can one hear the shape of a drum? [22] we try to “hear” the shape of a fullerene from the spectrum of its adjacency matrix provided that Cn–isomers can be mapped bijectively onto their spectra. We also give an example of two different C44–isomers with the same spectrum of adjacency matrix of hexagonal facets. We propose a new method of clustering and of classification of Cn–isomers for any \(n\ge 24\), \(n\ne 44\), based on combinatorial and graph theoretic structure of dual graphs \(T^6_{n}\) of their hexagonal facets. For \(n=60\) we show in this paper that it yields a good stability criterion with correlation coefficient of 0.9. The spectral analysis of graphs based on adjacency matrices of their vertices has a long standing tradition [3, 9, 10]. However, we show that for the complete classification and ordering of fullerenes it is sufficient to use

  • The dual graph \(T^6_{n}\) of hexagons since the positions of 12 pentagons can be reconstructed out of cycles larger than triangles and degrees of vertices in \(T^6_n\).

  • Newton polynomials of the spectrum of the adjacency matrix \(A^6_{n}\) of hexagons up to a certain degree \(k^*\) since it is well known that they are numerically more stable than the eigenvalues themselves. These polynomials can be computed directly as a trace of \(\left( A^6_{n} \right) ^k\), \(k=2q\), \(q=1,\ldots , k^*/2\), whereas the matrix multiplication is computationally less demanding than finding all eigenvalues of a matrix. Hereby, we use a graph theoretical interpretation of Newton polynomials of adjacency matrices of graphs in terms of their cycle numbers.

Our method is computationally very fast requiring \(O(n^3\log n)\) operations with a total of 1.15s CPU time on a Intel Core i5-8300H (2.3 GHz) (\(n=60\)). The high correlation of the obtained ordering with the DFT energetic order allows to figure out few energetically stable isomers at a low computational cost. For these isomer candidates, the detailed DFT analysis can be further performed.

Fig. 2
figure 2

Logarithm of the numbers of \(C_n\)-isomers ISO(n), IPR-isomers \(IPR-ISO(n)\) and their upper bound for all feasible \(n\in \lbrace 20,\ldots ,400 \rbrace \), on a double logarithmic scale

In order to construct the facet adjacency matrices, a certain enumeration algorithm of all facets is required. Since the spectra of these matrices are invariant with respect to the enumeration of facets, the choice of this algorithm does not matter from the mathematical point of view. For all fullerenes with \(24\le n < 380\), we use the spiral rule first introduced in [26] (where it is called an orange peel scheme) to enumerate all pentagons and hexagons and generate their adjacency matrices \(A_{n}^5\) and \(A_{n}^6\), respectively. The first fullerene not obeying the spiral rule is a C380–isomer, and the second counterexample is one of over 90 billion C384-isomers. All other isomers of Cn with \(n\le 450\) stick to this rule, cf. [25]. For fullerenes without a facet spiral, a generalized spiral [37] can be used for the one-to-one facet enumeration.

Our spectral approach is illustrated on all isomers of C60 which are the best studied fullerenes, especially the above mentioned famous Buckminster (soccer ball-like molecule). Such molecular structures are all allotropic forms of carbon [27].

2 Spectral analysis of C n

For a feasible n a fullerene isomer \(P\in C_n\) is a simple, compact and convex polytope in \(\mathbb {R}^3\) with all \(m:=n/2+2\) facets being either one of 12 pentagons or one of \(n/2-10\) hexagons:

$$\begin{aligned} P := \{ x \in \mathbb {R}^3 \,|\, a_i x+b_i \ge 0\; i=1,\ldots , m\}, \quad a_i\ne 0, \; b_i\in \mathbb {R}, \text{ for } \text{ all } i . \end{aligned}$$

Its ith facet \(f_i\) is given by \( f_i := \{ x\in \mathbb {R}^3 \,|\, a_i x + b_i = 0\}\cap P\) , \(i=1,\ldots , m.\) P can be mapped on a two–dimensional graph in a way that edge crossing is avoided and vertex connectivity information is retained. First, one has to choose a facet and rotate P so that this facet is located parallel to the (xy)–plane at some distance below a fixed projection point q. Next, one draws a line starting in q to each vertex of the polyhedron and extends this line until it crosses the (xy)–plane. The intersections are the vertices of the new two-dimensional graph, also called Schlegel diagram. Although such a projection is not bijective, it yields a full combinatorial invariant of P. Each of \(f_i\) can be chosen to be initially parallel to the \(x-y\)–plane. Depending on this choice, the resulting graphs can be very different, see [17]. In Fig. 3a and b, one can see two possible Schlegel diagrams for the Buckminster fullerene (\(n=60\)). The graphs on these Schlegel diagrams are equivalent in the sense that they have the same vertex connectivity. From the definition of a fullerene it immediately follows that the corresponding planar graph is 3-regular. We denote a planar graph of a fullerene by F.

Fig. 3
figure 3

Two different, but combinatorially equivalent, Schlegel diagrams of Buckminster fullerene

Assume that \(C_n\) has a facet spiral which is defined as an order of facets such that each facet shares an edge with the previous and next one. This spiral can be presented by a facet spiral sequence. It is a sequence of twelve integers, which determines the position of the twelve pentagons in the facet spiral. Another representation is a sequence of fives and sixes such that the k–th number in the sequence indicates whether the k–th facet in the spiral is a pentagon or a hexagon [1]. We use the first approach in our software [5]. For instance, the facet spiral sequence for Buckminster fullerene represented by its Schlegel diagram in Fig. 3a is \(\left( 1,7,9,11,13,15,18,20,22,24,26,32\right) \). By \(C_{n,i}\) we mean the \(i^\text {th}\) \(C_n\)–isomer according to the lexicographical order of facet spiral sequences, see also [17, Chapter 2].

2.1 Dual facet graphs, adjacency matrices and their spectra

Let \(G=(V(G),E(G))=(V,E)\) be a finite undirected graph with vertex set V and edge set E. Let \(|V|=m\) be the number of vertices in V. We denote by \(A_G=A\) the adjacency matrix of G. Important properties of adjacency matrices in context of graph spectra can be found in [14].

Define the spectrum \(\sigma (A)\) of A as a set of its eigenvalues \(\lambda _i(A)=\lambda _i(G)=\lambda _i\), \(i=1,\ldots , m\). An induced subgraph H of G is a graph with vertices set \(V(H)\subseteq V(G)\) and all of the edges of G connecting pairs of vertices in V(H).

Due to the symmetry of A, it holds \(\sigma (A)\subset \mathbb {R}\). For the trace of A it obviously holds \(\text {tr}(A)=\sum _{i=1}^m \lambda _i\). Later the Newton polynomial \(N(A,k):=\text {tr}(A^k)=\sum _{i=1}^m \lambda _i^k\) of degree k with \(k\in \mathbb {N}\) and an adjacency matrix A, will be of interest to us. It is well–known that the spectrum of an \(m\times m\)–matrix A can be uniquely restored from the values N(Ak),  \(k=1,\ldots ,m\), cf. e.g. [18, p. 93].

Lemma 1

Let \(k\le m\) be an integer and A be the adjacency matrix of a graph G with m vertices. Then

  1. (a)

    The Newton polynomials can be calculated recursively as

    $$\begin{aligned} N(A,k)= -k\sum _{|H|=k}(-1)^{e(H)+c(H)}2^{c(H)}-\sum _{j=2}^{k-2}N(A,k-j) \sum _{H: |H|=j} (-1)^{e(H)+c(H)} 2^{c(H)}, \end{aligned}$$
    (1)

    where the inner sum runs over all subgraphs H of G with j nodes and connected components being either edges or cycles, e(H) being the numbers of edges among these components and c(H) being the number of cycles.

  2. (b)

    The Newton polynomial of degree k can be interpreted as the number of all cycles of length k in G.

Here, we call a cycle of length k any closed path (possibly with self–intersections) with k edges from a vertex to itself.

Proof

  1. (a)

    It is known that Newton polynomials can be represented as polynomials of elementary symmetric polynomials of the eigenvalues \(\lambda _i(A)\), \(i=1,\ldots ,m\) with integer coefficients, see [24, Chapter 11, § 53]. Since each elementary symmetric polynomial \(S_k\) of \(\lambda _1(A), \ldots , \lambda _m(A)\) is a sum of principal minors of A of the corresponding degree \(k\in \mathbb {N}\) (cf. [29, p. 495]), and these minors have integer values due to \(a_{i,j}\in \{0,1\}\), we get that the values of N(Ak) are integers. Moreover, \(S_1=N(A,1)=0\). For \(k\ge 2\) we have

    $$\begin{aligned} N(A,k)= -kS_j + \sum _{j=2}^{k-2}(-1)^{j-1}N(A,k-j)S_j, \end{aligned}$$
    (2)

    where we put \(S_j=0\), \(j>m\). By [3, Theorem 3.10], it holds

    $$\begin{aligned} S_j=(-1)^j\sum _{H: |H|=j} (-1)^{e(H)+c(H)} 2^{c(H)}. \end{aligned}$$
    (3)
  2. (b)

    This interpretation follows immediately from [9, Proposition 1.3.1], since the ith diagonal entry of the kth power of A is the number of walks of length k from vertex i to itself.

\(\square \)

For fullerenes, vertex adjacency matrices and their spectra are well-studied, see [1, Section 4.5]. As mentioned above, we generate dual facet graphs \(T_n\) out of the Schlegel diagrams of \(C_n\) and consider the spectra of their adjacency matrices. In \(T_n\) the original facets become vertices, and the original vertices become facets. The edges of the dual graph show adjacency relations between original facets: two nodes of the dual graph are connected by an edge if the corresponding facets of the fullerene are adjacent, i.e. share an edge. In Fig. 4a, one can see the dual graph \(T_{60}\) of all facets of Buckminster fullerene with the Schlegel diagram in Fig. 3a. Notice that (for the sake of legibility) the facet \(f_1\) is displayed five times in Fig. 4a, whereas it should occur just once. In this paper, we use red, green and white nodes in the images of the dual graphs of \(C_n\) for pentagons, hexagons and unspecified facets, respectively.

Consider two important induced subgraphs \(T^5_{n}\) and \(T^6_{n}\) of \(T_n\). The graph \(T^5_{n}\) illustrates the connectivity between pentagonal facets, i.e. it always contains 12 vertices. For instance, the graph \(T^5_{n}\) of every IPR-isomer consists of 12 disconnected vertices. This being said, it is evident that the number of isomers of \(C_n\) with the very same graph \(T^5_{n}\) increases rapidly with increasing n, cf. Fig. 2 for the IPR-case. Hence, considering the graph \(T^5_{n}\) does not yield an invariant for all \(C_n\)-isomers. In order to characterize all isomers we need the graph \(T^6_{n}\) showing the connectivity of all \(m_6\) hexagonal facets of a \( C_n\)-isomer. As an example the graph \(T^6_{60}\) of the Buckminster fullerene is shown in Fig. 4b. It turns out that \(T^6_{n}\) completely characterizes the graph \(T_n\). We denote by \(A_n, A_n^5,A_n^6\) the adjacency matrix of \(T_n, T_n^5\) and \(T_n^6\), respectively.

Remark 1

  1. (a)

    For \(k>m\) a formula similar to (1) can be derived, by which it follows that traces of the \(k^\text {th}\) power of A with \(k>m\) are linear combinations of traces of smaller powers. This can be explained by the fact that subgraphs have at most as much vertices as the whole graph.

  2. (b)

    An alternative approach is to insert formula (2) into itself, which yields a representation of Newton polynomials as a polynom with several unknowns being Newton polynomials of lower degree.

  3. (c)

    The condition that every vertex in a fullerene has valency three corresponds to the fact that the dual graph \(T_n\) consists of triangles only. However, the subgraphs \(T^5_{n}\) and \(T^6_{n}\) may also have larger cycles.

  4. (d)

    No 4-cycles exist neither in \(T_{n}\) nor in \(T^5_{n}\) and \(T^6_{n}\), see [12, Theorem 4.15 (1)].

  5. (e)

    The problem of description of all simple cycles (i.e., closed loops without self–intersections) of pentagonal or hexagonal facets of length k is crucial to combinatorial classification of fullerenes.

Fig. 4
figure 4

Dual facet graphs of Buckminster fullerene

Denote by \( \mathbb {N}_0\) the set of natural numbers and zero. Obviously, it holds \(N(A^6_n,k)\in \mathbb {N}_0\) for all \(k\in \mathbb {N}\).

For the dual graphs \(T^5_{n}\) and \(T^6_{n}\) we construct their adjacency matrices \(A_{n}^5\) and \(A_{n}^6\). Since the number of unit entries in each line does not exceed 5 for \(A^5_{n}\) or 6 for \(A^6_{n}\) it holds by Gershgorin’s theorem that

$$\begin{aligned} \sigma ( A_{n}^5) \subset [-5,5],\,\, \sigma (A_{n}^6) \subset [-6,6]. \end{aligned}$$

In the sequel, let us concentrate on the properties of \(\sigma ( A^6_{n})\). The classical Frobenius–Perron theory applied to graph spectra [9, Proposition 3.1.1] yields a more accurate estimate for the largest eigenvalue of \(A_{n}^6\) which is positive and of multiplicity one if \(T_n^6\) is connected. The following properties of eigenvalues and their connection with valencies in a graph can be found in [9, Proposition 3.1.2], [13, Theorem 1.2], [9, Comment on Proposition 3.1.2], [3, Lemma 3.16].

Lemma 2

Let G be a graph with m vertices and valencies \(\kappa _1,\ldots ,\kappa _m\), and H be an induced subgraph of G. Let \(\lambda _{\max }(G)\) and \(\lambda _{\max }(H)\) be the largest eigenvalues of the adjacency matrix \(A_G\) and \(A_H\).

  1. (a)

    If G is connected, then \(\kappa _{\min }\le \bar{\kappa }\le \lambda _{\max }(G)\le \kappa _{max}\), with \(\bar{\kappa }\) being its mean valency and \(\kappa _{\max }\) the maximum valency of the vertices in G. In particular, \(\lambda _{\max }(G)=\kappa \) holds if G is \(\kappa \)-regular.

  2. (b)

    It holds \(\sqrt{\frac{1}{m}\sum _{i=1}^m \kappa _i^2}\le \lambda _{\max }(G)\le \kappa _{\max }\).

  3. (c)

    It holds \(\lambda _{\max }(H)\le \lambda _{\max }(G)\).

Notice that for all connected irregular graphs \(T_n,T_n^5\) and \(T_n^6\) we have \(\bar{\kappa }>1\) and \(\kappa _{max}\le 6\). For non–connected or irregular graphs we get only \(0\le \bar{\kappa } \le \kappa _{max}\). For instance, the dual graph \(T^6_{60}\) of the Buckminster fullerene (see Fig. 4b) is connected and regular with \(\kappa =3\), thus the largest eigenvalue of \(A_{60,6}\) for it is equal to 3.

In general, the value \(\theta :=\kappa _{max} - \bar{\kappa }\ge 0\) is a measure of the asymmetry of the dual facet graph which we call the asymmetry coefficient.

Remark 2

If \(k\rightarrow \infty \), we have \(N(A_n^6,k)/ \lambda _{max}^{k} \sim a_{max}+(-1)^k \mathbf{1}(- \lambda _{max}\in \sigma (A_{n}^6))\), where \(\mathbf{1}(B)\) is the indicator function of B and \(a_{max}\) is the multiplicity of the eigenvalue \(\lambda _{max}\). Indeed, by [3, Theorem 6.3], it holds \(- \lambda _{max}\in \sigma ( A_{n}^6)\) iff our dual graph \(T_{n}^6\) is bipartite, i.e., it has no cycles of all odd lengths, cf. [3, Corollary 3.12]. In this case, \(- \lambda _{max}\) has necessarily multiplicity one, see [3, Lemma 3.13]. Since for large \(n\ge 40\) the dual graph \(T_{n}^6\) of hexagons of any isomer of \(C_n\) contains either a 3– or a 5–cycle, it holds \(- \lambda _{max}\not \in \sigma (A_n^6)\), and the behavior of the the whole Newton polynomial \(\text {tr}\left( \left( A_{n}^6\right) ^{k}\right) \) for large powers k is dominated by \( \lambda _{max}^{k}\).

Let |S| be the cardinality of a finite set S.

Lemma 3

For all feasible n, consider a \(C_n\)isomer with graphs \(T_{n}\), \(T_{n}^5\) and \(T_{n}^6\). Then it holds

$$\begin{aligned} |E(T^6_{n})|=|E(T^5_{n})|+ \frac{3n}{2}-60. \end{aligned}$$

Proof

Since the graph \(T_n\) of a \(C_n\)–isomer is 3–regular, it has \(\frac{3n}{2}\) edges. An edge \((v,w)\in E(T_n)\) is either an edge between two pentagons, \((v,w)\in E(T_n^5)\), or between two hexagons, \((v,w)\in E(T_n^6)\), or between a pentagon and a hexagon, \((v,w)\in E(T_n\setminus \left( T_n^5 \cup T_n^6\right) )\). Since the number of pentagons is always 12, there are \(60-2|E(T_n^5)|\) edges between pentagons and thus

$$\begin{aligned} |E(T_n^5)|+|E(T_n^6)|+60-2|E(T_n^5)|=\frac{3n}{2}, \end{aligned}$$

which finishes the proof. \(\square \)

3 Cospectral isomers

Definition 1

Let \(A_G\),\(A_H\) be adjacency matrices of graphs G and H with m vertices each. G and H (or \(A_G\) and \(A_H\)) are said to be cospectral if \(\sigma (A_G)=\sigma (A_H)\).

It is easy to prove that isomorphic graphs are cospectral. The inverse statement is in general not true (cf. e.g. a counterexample in [36]). The natural question arises: Which graphs are determined by their spectrum? [36]. Some specific graphs like paths, complete graphs, regular complete bipartite graphs, cycles and their complements yield a positive answer to this question. In what follows, we provide new examples of non–isomorphic cospectral graphs coming from the world of fullerenes.

We derive some theoretical results for sets of non-cospectral graphs and apply them to fullerene isomers. To begin with, recall the following

Lemma 4

For graphs G and H with m vertices and adjacency matrices \(A_G\) and \(A_H\) the following statements are equivalent:

  1. (a)

    G and H are cospectral.

  2. (b)

    \(A_G\) and \(A_H\) have the same characteristic polynomial.

  3. (c)

    \(N(A_G,k)=N(A_H,k)\) for all \(k=1,\ldots ,m\).

Proof

Equivalence of (a) and (b) is obvious.

Now prove the equvalence of (c) and (b). As stated in [24, Chapter 11, Section 53] elementary symmetric polynomials \(S_k\) of degree k of eigenvalues of A can be expressed as

$$\begin{aligned} S_k=(-1)^{k-1}\frac{N(A,k)}{k}-\frac{1}{k}\sum _{i=1}^{k-1}(-1)^i S_{k-i}N(A,i), \,\,\, 1\le k\le n. \end{aligned}$$

Since \(S_1=N(A,0)=0\) and \(S_2=-N(A,2)/2\), every \(S_k\) with \(2\le k\le n\) can be computed just knowing \(N(A,2),\ldots ,N(A,k)\). Hence, the symmetric polynomials are identical for both \(A_G\) and \(A_H\). Finally, the coefficients of the characteristic polynomials can be expressed as \((-1)^kS_k\), so the characteristic polynomials are equal as well. \(\square \)

For all even \(24\le n \le 150\), we checked the existence of cospectral pairs of \(C_n\)-isomers within \(T_n\), \(T^5_{n}\) and \(T^6_{n}\) graphs by using Lemma 4 and computing all Newton polynomials in Python’s int-precision.

For \(n < 30\) no pair of cospectral isomers with respect to \(T_n,T_n^5\) and \(T_n^6\) can be found. Our results for \(30\le n \le 60\) are listed in Table 1. This table can be extended to any \(n>60\) with positive integers in its first and second rows, and zeroes in its third row.

Considering the whole dual graph \(T_n\), one finds a lot of non-isomorphic but cospectral graphs. In fact, it seems likely that infinitely many such pairs exist. One pair of cospectral isomers with \(n=44\) is illustrated in Figs. 5a and b. To explain the difference within this pair, we need the following

Definition 2

Let G and H be two graphs.

  1. (a)

    A fragment F of G is a connected induced subgraph of G. In particular, for fullerenes we call fragments of \(T_n^5\) and \(T_n^6\) pentagon-fragments and hexagon-fragments of \(T_n\), respectively.

  2. (b)

    Two non-isomorphic graphs G and H are called fragment flipped if two isomorphic fragments \(F_G\) in G and \(F_H\) in H exist such that the remaining graphs \(G\setminus F\) and \(H\setminus F\) are isomorphic.

As one can see in Fig. 5a and b, the two cospectral C44-isomers are fragment flipped. However, in general such a flip does not preserve the spectrum of a graph. To illustrate this, Fig. 5c and d contains a non–cospectral fragment flipped pair of C60–isomers.

For \(n\in \lbrace 32, 36,40, 52\rbrace \) a pair of distinct isomers exists with the same spectrum \(\sigma \left( T_{n}^6\right) \), since their graphs \(T_{n}^6\) are isomorphic.

Table 1 Number of non-unique spectra of graphs \(T_n\), \(T^5_{n}\) and \(T^6_{n}\) of Cn–isomers

For a fixed \(40\le n\le 150\) at least three and at most 298 non-unique spectra \(\sigma (T_n^5)\) exist. Since the amount of different arrangements of 12 pentagons is limited and the number of isomers grows rapidly with increasing n, there must exist isomers with same subgraphs \(T_n^5\). For the hexagonal dual graphs we found cospectral pairs only for \(n\in \lbrace 30,32,34,36,38,40,52\rbrace \). We also checked that these cospectral pairs of \(T_n^6\)-graphs are isomorphic.

The above empirical findings lead to the following

Conjecture 1

  1. (a)

    For all feasible n the graph \(T_n^6\) is uniquely determined by its spectrum.

  2. (b)

    For all feasible \(n \ge 60\) at least two \(C_n\)isomers exist having the same spectrum with respect to \(T_n\).

  3. (c)

    For all feasible \(n\ge 54\) two \(C_n\)isomers are isomorphic iff they are cospectral with respect to \(T_n^6\).

  4. (d)

    For any feasible n at least one of the spectra \(\sigma (T_n)\), \(\sigma (T_n^5)\), \(\sigma (T_n^6)\) is unique for all \(C_n\)isomers.

For a graph G with m vertices we denote by \(|\sigma (G)|\) the set of absolute values of eigenvalues \(\lambda _i\in \sigma (G)\) for \(i=1,\ldots ,m\) sorted in descending order, i.e. \(|\sigma (G)|:=\lbrace |\lambda _i|\mid \lambda _i \in \sigma (G),\, |\lambda _i|\ge |\lambda _{i+1}|\,\forall i=1,\ldots ,m-1 \rbrace \), and call it the absolute spectrum of G. Denote by \(|\sigma _{[0,1)}(G)|\) and \(|\sigma _{[1,\infty )}(G)|\) the part of \(|\sigma (G)|\) with \(0\le |\lambda |<1\) and \(1\le |\lambda |\). It holds \(\lambda _1(G)=\lambda _{\max }(G)\)

Theorem 1

Let \(\Gamma \) be a set of graphs with m vertices and with distinct absolute spectra such that \(\lambda _{\max }(G)>1\) for all \(G\in \Gamma \). Then all graphs \(G\in \Gamma \) can be uniquely characterized by at most two Newton polynomials of even degrees \(k_1^*\), \(k_2^*\) with \(m\ge k_1^*\ge k_2^*.\)

Proof

Ordering all absolute spectra \(|\sigma (G)|\) lexicographically yields a unique order on the set \(\Gamma \). Then all graphs G from \(\Gamma \) can be distinguished either by \(|\sigma _{[1,\infty )}(G)|\) or by \(|\sigma _{[0,1)}(G)|\).

In the first case, let us consider all \(G\in \Gamma \) with distinct \(|\sigma _{[1,\infty )}(G)|\). The sum \(\sum _{\lambda \in |\sigma _{[0,1)}(G)|}|\lambda |^k\) converges to 0 for \(k\rightarrow \infty \), so its influence on the Newton polynomials \(N(A_G,k)\) for k large enough can be neglected. Since \(|\sigma _{[1,\infty )}(G)|\) is unique for all considered graphs G there must exist a degree \(k_1^*\) such that the values of \(N(A_G,k)\) for all even \(k\ge k_1^*\) are distinct.

Now if some graphs \(G\in \Gamma \) have the identical part of absolute spectrum \(|\sigma _{[1,\infty )}(G)|\), they must differ within the part \(|\sigma _{[0,1)}(G)|\). Hence, there must always exist a degree \(k_2^*\le k_1^*\) such that the sum \(\sum _{\lambda \in |\sigma _{[0,1)}(G)|}|\lambda |^{k_2^*}\) distinguishes these graphs.

Lemma 4 c) and Remark 1 a) yield the number of vertices in the graph G as an upper bound for \(k_1^*\) and \(k_2^*\). \(\square \)

Corollary 1

Assuming Conjecture 1to be true, all \(C_n\)-isomers can be uniquely characterized by at most two Newton polynomials \(N(A_G,k_1^*)\) and \(N(A_G,k_2^*)\), \(k_1^*\le k_2^*\) even, with respect to at least one of the graphs \(G\in \lbrace T_n, T_n^5,T_n^6\rbrace \).

Proof

This follows from Theorem 1 and Lemma 2, since \(\lambda _{\max }(G)>1\) for at least one of the graphs \(G\in \lbrace T_n T_n^5,T_n^6\rbrace \). \(\square \)

Remark 3

  1. (i)

    Note that the set of \(C_n\)–isomers with distinct largest eigenvalues \(\lambda _{max}(G)\) for some \(G\in \lbrace T_n T_n^5,T_n^6\rbrace \) has distinct sets \(|\sigma _{[1,6]}(G)|\).

  2. (ii)

    It is important to stress that we consider even degrees \(k\le m\) only. In general, values of Newton polynomials \(N(A_G,k)\) with odd degrees k do not distinguish between graphs G. To illustrate this point, consider the hexagon graphs of two specific isomers of C28. The graph \(T_{28,1}^6\) of the first isomer consists of four isolated hexagons and the other graph \(T_{28,2}^6\) of two pairs of two adjacent hexagons. One gets the following spectra:

    $$\begin{aligned} \sigma (T_{28,1}^6)=\lbrace 0, 0 ,0,0\rbrace , \,\,\sigma (T_{28,2}^6)=\lbrace -1,-1,1,1\rbrace . \end{aligned}$$

    It follows directly that Newton polynomials of any odd degree do not distinguish between C28, 1 and C28, 2, but \(N(A_{28,1}^6,k)\not =N(A_{28,2}^6,k)\) for every even \(k\ge 2\).

Fig. 5
figure 5

Two examples of pentagon-fragment (marked with an X) flipped isomers. C44, 37 and C44, 38 are cospectral, but C60, 4 and C60, 5 are not

4 Spectral classification and stability prediction

In this section, we decompose the family of all \(C_n\)–isomers into subsets, which we call clusters, using the Newton polynomials \(N(A_G,k)\), \(k=2,4,6,\ldots ,k^*\) with adjacency matrix \(A_{G}\). This approach can be applied to any graph \(G\in \lbrace T_n,T_n^5,T_n^6\rbrace \) with no cospectral isomers.

Definition 3

  1. (a)

    For a feasible n and a given even integer k, we call a set of isomers with same value \(N(A_G,k)\) of Newton polynomial of degree k a cluster of \(C_n\). For a fixed n, the family of all these clusters is called a clusterization of \(C_n\). A clusterization such that its every cluster has exactly one element is called a complete clusterization.

  2. (b)

    We define \(k_{single}^*\) as the minimal degree k of Newton polynomials which is needed for a complete clusterization.

As we have seen in Lemma 1 (b), the Newton polynomial of degree two and three is equal to twice the number of edges and six times the number of triangles in the considered graph. The interpretation of \(N(A_n,4)\) is a bit more complex. We have to count all possible cycles of length four. In Fig. 6, the idea of calculation of \(N(A_{60},4)\) is illustrated on Buckminster fullerene C60, 1812. There, all five possible cycles of length four together with their frequencies are listed for one pentagon in C60, 1812.

Fig. 6
figure 6

Five bf different kinds of cycles of length four beginning and starting in the central vertex (X) in (a)

For the graph \(T_n\), one can show that the sum of these frequencies over all vertices in \(T_{n}\) only depends on n. So, \(N(A_n,k)\) for \(k\le 4\) can be neglected for the clusterization of \(C_n\) on the basis of \(T_n\), since these Newton polynomials have the same value for all \(C_n\)-isomers.

Nevertheless, for \(T_n^6\) we have to consider every Newton polynomial of even degree, since in \(T_n^6\) the number of vertices is fixed, but neither the number of edges between them nor the number of triangles in \(T_n^6\) is determined by n.

In the following section we use \(T_{60}^6\) in order to get a complete clusterization of C60 as an example.

4.1 Clusterization of C 60 using Newton polynomials

Recall that by Euler’s formula each C60-isomer has 90 edges and 32 facets with 12 pentagons and 20 hexagons among them.

Fig. 7
figure 7

Schlegel diagrams and dual graphs of hexagons of five C60-isomers

The Newton polynomial \(N(A^6_{60},2)\) can take on 18 distinct values \(60= t_1< \ldots < t_{18}=100\). For values \(t_1=60, t_2=64, t_{16}=92, t_{17}=96\) and \(t_{18}=100\), there exists exactly one C60-isomer with \(N(A^6_{60},2)=t_i\), \(i\in \lbrace 1,2, 16,17,18 \rbrace \). These isomers are \(C_{60,1812}, C_{60,1809}, C_{60,2}, C_{60,3},C_{60,1}\), respectively. Their Schlegel diagrams and dual hexagonal graphs \(T_{60}^6\) are shown in Fig. 7. Moreover, our numerical results show that for any degree \(k\ge 2\) with respect to \(T_{60}^6\) these isomers form a cluster with one single element. Ordering these five isomers according to \(N(A^6_{60},k)\) does not change with increasing degree \(k\ge 2\). In addition, Newton polynomials of all other C60-isomers are bounded by \(N(A_{60,1809}^6,k)\) and \(N(A_{60,2}^6,k)\), i.e. \(N(A_{60,i}^6,k)\in \left( N(A_{60,1809}^6,k), N(A_{60,2}^6,k) \right) \) for all \(i\notin \lbrace 1,2,3,1809,1812 \rbrace \) and all \(k\ge 2\).

Table 2 Number of clusters with respect to Newton polynomial \(N(A_{60}^6,k)\) for even \(2\le k\le 100\)

We checked that for any pair of two C60-isomers with distinct Newton polynomials of degree k the Newton polynomials \(N(A_{60}^6,\tilde{k})\) with \(k\le \tilde{k}\le 100\) are distinct as well. So, it holds \(k_1^*=k_2^*=k_{single}^*=12\), where \(k_1^*\) and \(k_2^*\) are from Theorem 1. One can observe that the number of distinct Newton polynomials and so the number of clusters with one element is monotone growing with k. Numbers of clusters and clusters with one element for all even \(2\le k\le 100\) are listed in Table 2.

Fig. 8
figure 8

Minimal degree \(k_{single}^*\) (a) and \(k_{pair}^*\) (b) needed for the complete clusterization of \(C_n\)-isomers with \(28\le n \le 150\)

We applied the above clusterization scheme to \(C_n\), \(28\le n\le 150\) and plotted n against \(k^*_{single}\) in Fig. 8a. Recall that a (pessimistic) upper bound for \(k^*_{single}\) is the number of vertices in \(T_n^6\), i.e. \(k_{single}^*\le m_6=\frac{n}{2}-10\) due to Lemma 4. However, the good news is that the actual growth rate of \(k^*_{single}\) is logarithmic with n. Using MATLAB curve fitting toolbox [28] we get

$$\begin{aligned} k_{single}^*(n)\approx -15.13+7.801\log \left( 0.7614n-12 \right) , \end{aligned}$$

with a coefficient of determination \(R^2=0.9499\).

Next, we use pairs of Newton polynomials \(\left( N(A_n^6,k_1),N(A_n^6,k_2)\right) \) with \(k_1<k_2\le k_{single}^*\) in order to cluster all C60-isomers. By considering additionally a second Newton polynomial of lower degree, we hope to decrease the needed degree to get a complete clusterization. We define \(k_{pair}^*\) as the minimal \(k_2\) such that a complete clusterization is given. Indeed, this approach decreases the needed degree significantly (compare both plots in Fig. 8), and therefore, reduces computational costs. For C60 and \(T_{60}^6\) the following four tuples with \(k_2 <k_{single}^*\) of degrees of Newton polynomials lead to a full classification:

$$\begin{aligned} \varvec{k}=(k_1,k_2)\in \left\{ (6,8), (4,10), (6,10), (8,10) \right\} . \end{aligned}$$

We get \(k_{pair}^*=8\). Next we plotted all values for \(k_{pair}^*\) against n and assumed a logarithmic function as for \(k_{single}^*\). Using MATLAB curve fitting toolbox we get the following approximation

$$\begin{aligned} k_{pair}^*(n)&\approx -24.83+10.29\log \left( 0.334n+9.989 \right) \end{aligned}$$

with a coefficient of determination of \(R^2=0.8277\).

A third hierarchical approach in order to decrease the needed degree uses a vector with all Newton polynomials up to degree \(k\le k_{single}^*\). Analogously to the first two approaches, we define \(k_{hierarchical}^*\) as the minimal k which yields a complete clusterization. This approach decreases e.g. the degree for \(n=78\) from \(k_{pair}^*=12\) to \(k_{hierarchical}^*=10\). So, this approach does not change the needed degree significantly. Nevertheless, we performed the same interpolation using MATLAB curve fitting Toolbox and got

$$\begin{aligned} k_{hierarchical}^*(n)&\approx -97.05+19.83\log \left( 1.466n+125.5 \right) . \end{aligned}$$

with \(R^2=0.8818\).

4.2 Relative energy

A fullerene isomer, which can be chemically separated with a significant mass quantity and uniquely characterized, is called stable. In order to decide which C60-isomer can be stable the relative energy of all of them was calculated with high-accuracy quantum chemistry methods and discussed in [34]. Here, relative means compared with the Buckminster fullerene \(C_{60,1812}\) which has the lowest DFT-energy at the \(\text {PW6B95-D3}^{\text {ATM}}/\text {def2-QZVP}\) level (cf. [34]), i.e. Buckminster fullerene has a relative energy of 0. In the sequel, we say that an isomer \(C_{n,i}\) is energetically more stable than \(C_{n,j}\), \(i\not = j\), if \(C_{n,i}\) has a smaller energy than \(C_{n,j}\).

Table 3 Coefficients of linear regression Independent variable \(\sim \) relative energy for all C60-isomers

According to [34] the most stable isomer is \(C_{60,1812}\) and the second stable one is \(C_{60,1809}\). At the other end of the ranking the three least stable ones are \(C_{60,2}\), \(C_{60,3}\) and \(C_{60,1}\). It has been assumed for a long time that an isomer is the more stable the less adjacent pentagon it has. Indeed, calculations of [34] allow the conclusion that each pair of two adjacent pentagons leads to a increase in the relative energy of an isomer of about 20 to 25 kcal \(\text {mol}^{-1}\). The amount of such pentagon pairs can be described with Fowler-Manolopoulos pentagon indices \(p_i:=\# \{\text{ pentagons } \text{ which } \text{ are } \text{ adjacent } \text{ to } i \text{ other } \text{ pentagons } \}\), such that the sum of \(p_1\) up to \(p_5\) is equal to 12 for every fullerene, cf. [17]. Based on these values, the pentagon signature \(P_1=1/2 \sum _{i=1}^5ip_i\) can be calculated, which quantifies the amount of connected pentagons. Clustering all \(C_{60}\)-isomers according to the pentagon signature, five isomers stand out, namely \(C_{60,1812} (P_1=0)\), \(C_{60,1809} (P_1=2)\), \(C_{60,2} (P_1=16)\), \(C_{60,3} (P_1=18)\) and \(C_{60,1} (P_1=20)\). The signature can be easily read from Fig. 7a–e. Pentagons signatures of the remaining isomers lie between 2 and 16. For each of these values, at least two isomers exist with the same pentagon signature. By Lemma 1 b) and 3 we get the following

Proposition 1

For any \(C_n\)-isomer it holds

$$\begin{aligned} P_1=\frac{N(A_n^6,2)}{2}-\frac{3n}{2}+60. \end{aligned}$$

In Table 4, C60-isomers are listed in the same order given by their relative energy, by their pentagon signature and their Newton polynomial of degree 2.

Table 4 Five C60-isomers with unique Newton polynomial \(N(A^6_{60},2)\) and Pentagon signature \(P_1\) sorted by their relative energy in ascending order

Indeed, one gets more information about a fullerene structure looking on hexagons than on pentagons. This becomes clear looking at fullerenes with large n. For example, C80 has 7 IPR-isomers, so their pentagon structure and pentagon signature are the same. Nevertheless, only two of them have been produced in pure form, although DFT calculations have been done for all of them, see [23]. As a result of [23], only two IPR–isomers can be claimed stable. Hence, all descriptors based on the pentagon structure do not properly predict stability.

In [34] a good stability criterion is defined as the one which can identify \(C_{60,1812}\) and \(C_{60,1809}\) as the most stable and \(C_{60,2}, C_{60,3}\) and \(C_{60,1}\) as the least stable isomers in the correct energetic order. Additionally, the Pearson coefficient \(\rho \) of linear correlation between the relative energies of all C60–isomers and their criterion values should be larger than 0.6. Finally, the slope and the Pearson correlation coefficient in the linear regression of relative energy vs. the criterion for C60–isomers with \(P_1\in \lbrace 4,\ldots , 14 \rbrace \) should have the same sign.

As we have seen in Table 4, Newton polynomials yield the correct order of the most and least stable C60-isomers. Next, we perform a linear regression (using MATLAB curve fitting Toolbox) of \(N(A_{60}^6, k)\) vs. relative energies of all C60 isomers for all even \(4\le k\le 12\). For the case \(k=2\) Newton polynomial is equivalent to the 1st moment hexagon Signature \(H_1\), which is listed in [34, Table 3] as a good stability criterion. Hence, \(N(A_{60}^6,2)\) is a good stability criterion as well and can be neglected in further considerations. Table 6 shows that Pearson correlation coefficient \(\rho \) is much higher than 0.6 for all considered k. For degrees \(k=8,10,12\), Newton polynomials \(N(A_{60}^6, k)\) get very large, and therefore we took a logarithmic scale. But even with linear scale, one gets correlation coefficients larger than 0.6 in these three cases, compare Table 3.

Table 5 Linear regression of an independent variable \(N(A_{60}^6,k)\), \(4\le k\le 12\) even, versus relative energy over the subsets \(G_i\) of C60–isomers

Next we divided all isomers of C60 into 18 subsets \(G_i=\lbrace P\in C_{60} \mid P_1(P)=i \rbrace \) according to their pentagon signature \(i\in \lbrace 0,2,3,\ldots ,16,18,20\rbrace \). Then we performed a linear regression of Newton polynomials of different degrees vs. relative energies of isomers in \(G_i\) for every \(i\notin \lbrace 0,2,3,15,16, 18, 20 \rbrace \) as it is required in [34]. Our results are listed in Appendix, Table 5. For \(k\in \lbrace 4,6\rbrace \) and \(i\in \lbrace 4,5\rbrace \) we get Pearson correlation coefficients and slopes with a negative sign, unlike for all other combinations of k and i. This can be explained by the fact that \(G_4\) and \(G_5\) do not contain many isomers. More precisely, \(|G_4|=17,\) and \(|G_5|=86\) holds. So, neglecting these two cases would yield that \(N(A_{60}^6,k)\) with \(k=4,6\) is a good stability criterion.

For \(k=10,12\) one gets positive slopes and Pearson correlation coefficients in all cases, and therefore Newton polynomials of degree 10 and 12, in particular of degree \(k_{single}^*\), entirely fulfil all conditions of a good stability criterion (Table 6).

Table 6 Pearson correlation coefficient \(\rho \) and the slope of linear regression between Newton polynomials and the relative energy of all C60–isomers given in [34]

To check whether Newton polynomials can distinguish between IPR-isomers, i.e. yield their energetically correct order, we computed Newton polynomials of all 31924 C80-isomers. Within the whole set of C80 the seven IPR-isomers have the smallest Newton polynomials. But ordering the set of IPR-isomers according to Newton polynomials leads to the observation that the most stable IPR-isomers have the greatest Newton polynomials. These seven isomers are listed in Table 7. So, it seems that with increasing n Newton polynomials \(N(A_{n}^6,k)\) for even \(k\ge 2\) remain a good stability criterion.

Table 7 Seven IPR-isomers of C80, their relative energy in \(\hbox {kcal/mol}^{-1}\) and Newton polynomials. Only the isomers 31918 and 31919 can be chemically separated so far, cf. [23]

4.3 Asymmetry coefficients of isomers of C 60

The Fowler asymmetry parameter is claimed to be a good stability criterion [34]. Check whether the asymmetry coefficient \(\theta \) defined in Sect. 2 is a good stability criterion as well. The asymmetry coefficients of isomers shown in Fig. 7 are \(\theta _1=1, \,\theta _2=1.4,\,\theta _3=1.2,\,\theta _{1809}=0.8\) and \(\theta _{1812}=0\). The histogram of \(\theta \) of all C60-isomers is shown in Fig. 9.

Fig. 9
figure 9

Histogram of 23 different asymmetry coefficients of C60-isomers

It turns out that the asymmetry coefficient \(\theta \) is not a good stability criterion since it does not preserve the energetic order required in [34]. Thus, the asymmetry coefficient of eight isomers is equal 2.4, which is the largest value. These isomers are \(C_{60,1334},C_{60,1554},C_{60,1676},\) \(C_{60,1740},C_{60,1741},C_{60,1742},C_{60,1761}\) and \(C_{60,1784}\). Figure 10 shows \(C_{60,1784}\), which has six hexagons with valency two, twelve hexagons with valency four and two hexagons with valency six. So, the mean valency is 3.6, the maximal valency is 6 and the resulting asymmetry coefficient equals 2.4.

Fig. 10
figure 10

Schlegel diagram of one of the eight isomers with largest asymmetry coefficient \(\theta =2.4\)

In Table 8, asymmetry coefficients and energetic order numbers (1 = most stable, 1812 = least stable) [34] are listed for some outstanding isomers of C60, i.e. the ones with smallest and largest \(\theta \) as well as those shown in Fig. 7. One can see that the three least stable as well as the two most stable isomers have the asymmetry coefficient less than 2.4. In addition, the relative stability of the eight isomers with \(\theta =2.4\) varies from 313 to 1122. This leads to the conclusion that \(\theta \) is not a good stability predictor.

Table 8 Comparison of the asymmetry coefficient \(\theta \) and energetic stability of some C60–isomers

5 Conclusion

We present an easy to compute functional of spectra of the graphs \(T_{n}\) and \(T_n^6\), whereas one gets a complete classification of fullerenes only using the hexagonal dual graph. It turned out that the spectrum of \(T_n\) does not determine the fullerene uniquely. On the other hand, for any pair of non-isomorphic hexagonal subgraphs, the spectrum always differs. For seven feasible n, we found isomers with the same hexagonal dual graphs. Apart from that, the whole fullerene graph \(T_n\) seems to be uniquely determined by its subgraph \(T_n^6\) and, therefore, by the spectrum \(\sigma \left( T_n^6\right) \).

Moreover, it becomes apparent that the Newton polynomial of degree \(2,10,12(=k_{single}^*)\) of \(T_{n}^6\) appears to be a good stability criterion. So, Newton polynomials of \(T_{60}^6\) can be added to the list presented in [34, Table 3] as indices, which, depending on the degree, fulfil the criteria partly or entirely. We show that Newton polynomials generalize the Pentagon signature and better describe the fullerene structure. The interpretation of these Newton polynomials is very easy for \(k\le 3\), but gets demanding with increasing k.