1 Introduction

In the last half-century the electron population analyses (EPA) turned out to be very useful and commonly used tools in probing the electronic structure and chemical reactivity of molecules. In brief, EPA-procedures give rise to partition the electron density of the whole molecule between atoms, chemical bonds, molecular fragments, etc. Such partition can be performed within either physical or Hilbert space of molecular orbitals (MO) and there is a multitude of diverse EPA-procedures among which the most commonly used are those proposed by: Mulliken [1, 2], Löwdin [3, 4], Weinhold [5], Bader [6], Hirshfeld [7] and Merz-Kollman [8].

Although this is a field of research that seems to be regarded by some scientists as exhausted, in this short paper we would like to address some of the general issues respecting stationarity and uniqueness of electron probability distributions within the framework of MO theory and resolution of atomic orbitals (AO). Our considerations will be confined only to closed-shell ground states at two levels of theory, Hartree-Fock (HF) and configuration interaction (CI) [9]. The main purpose of this article is to briefly introduce a new method of determining stationary electron probability distributions, based on natural orbitals (NO) and involving some concepts from the Orbital Communication Theory (OCT) of the chemical bond [1012] and the theory of Markov Processes [13].

2 Stationarity from idempotency

For simplicity and transparency of definitions we assume without loss of generality the wavefunctions to be real. First, let us consider the one-determinant wavefunction of the ground-state molecular system, \(|\Psi _{HF}\rangle \), with \(N\) electrons doubly occupying the \(n^o\) lowest of MOs, \(|\varvec{\varPhi }^o\rangle \), from the Hilbert space of the all \(n\) orthonormal (spinless) molecular orbitals,

$$\begin{aligned} |\varvec{\varPhi }\rangle = \left\{ |\varPhi _1^o\rangle ,\cdots ,|\varPhi _{n^o}^o\rangle , |\varPhi _{n^o+1}^v\rangle ,\cdots ,|\varPhi _{n}^v\rangle \right\} , \end{aligned}$$
(1)

generated as the linear combinations of basis functions \(|\varvec{\chi }\rangle \), representing orthogonal atomic orbitals,

$$\begin{aligned} |\varvec{\varPhi }\rangle =|\varvec{\chi }\rangle \mathbf{C}, \end{aligned}$$
(2)

where square matrix C groups the relevant LCAO MO expansion coefficients,

$$\begin{aligned} \mathbf{C}=\left(\mathbf{C}^o\mid \mathbf{C}^v\right)=\{ C_{\mu ,i}=\langle \chi _{\mu }|\varPhi _i\rangle \}. \end{aligned}$$
(3)

Obviously, \(n=n^o+n^v\), and superscripts \(o\) and \(v\) refer to occupied and virtual MO-subspaces, respectively.

A set of such MOs uniquely defines the system one-electron density,

$$\begin{aligned} \rho (\mathbf{r}) = 2\sum _{i=1}^{n^o}\left|\varvec{\varPhi }(\mathbf{r})\right|^2 = \varvec{\chi }(\mathbf{r})\varvec{\gamma }\varvec{\chi }^{\dagger }(\mathbf{r}), \end{aligned}$$
(4)

where \({\gamma }\) is the so called charge-and-bond-order (CBO) matrix,

$$\begin{aligned} {\gamma }=2\mathbf{C}^o\mathbf{C}^{o\dagger }, \qquad \gamma _{\mu ,\nu }=2\sum _{i=1}^{n^o}C_{\mu ,i}C_{\nu ,i}, \end{aligned}$$
(5)

satisfying the following idempotency relation appropriate for the closed-shell systems (duodempotency relation):

$$\begin{aligned} \varvec{\gamma }^2 = 2\varvec{\gamma }, \qquad \text{ or}\qquad \sum _{\nu }\gamma _{\mu ,\nu }^2=2\gamma _{\mu ,\mu }. \end{aligned}$$
(6)

Diagonal elements of symmetric matrix CBO, \(\{\gamma _{\mu ,\mu }\}\), can be regarded as effective electron occupations of AOs, \(N_{\mu }\), determining the corresponding electron probability in AO resolution:

$$\begin{aligned} \mathbf{p} = \left\{ p_{\mu } = N^{-1}N_{\mu } = 2N^{-1}\sum _{i=1}^{n^o}C_{\mu ,i}^2 \right\} , \end{aligned}$$
(7)

with the appropriate normalization condition

$$\begin{aligned} \sum _{\mu }p_{\mu } = N^{-1}\sum _{\mu }\gamma _{\mu ,\mu } = N^{-1}N = 1. \end{aligned}$$
(8)

The second equality in (6) allows one to define the so called conditional probability matrix P

$$\begin{aligned} \mathbf{P} = \left\{ P_{\nu |\mu } = (2\gamma _{\mu ,\mu })^{-1}\gamma _{\mu ,\nu }^2\right\} , \end{aligned}$$
(9)

and the following normalization condition

$$\begin{aligned} \sum _{\nu }P_{\nu |\mu } = (2\gamma _{\mu ,\mu })^{-1}\sum _{\nu }\gamma _{\mu ,\nu }^2 = (2\gamma _{\mu ,\mu })^{-1}(2\gamma _{\mu ,\mu }) = 1. \end{aligned}$$
(10)

According to OCT, the element of CBO-matrix, \(\gamma _{\nu ,\mu }\), represents the appropriately renormalized amplitude of the corresponding conditional probability \(P_{\nu |\mu }\) of ”observing” in atomic orbital \(|\chi _\nu \rangle \) the electron originally assigned to \(|\chi _\mu \rangle \). Consequently, \(P_{\nu |\mu }>0\) gives rise to direct communication (through-space interaction [14, 15]) between AOs, \(\mu \rightarrow \nu \), while a sequence of \(r\) serially-connected communications determined by conditional probabilities \(P_{\nu |\mu }^r\),

$$\begin{aligned} \mu \rightarrow \pi ^{1}\rightarrow \cdots \rightarrow \pi ^{r-1}\rightarrow \nu ,\qquad P_{\nu |\mu }^r=P_{\pi ^1|\mu }\cdots P_{\nu |\pi ^{r-1}}>0, \end{aligned}$$
(11)

refers to indirect AO-communication (through-bridge interaction [1416]). It has to be noticed that such information-propagation cascade of AO-channels one can recognize as an analogue to the time-homogeneous Markov chain with a finite state space [13]; then \(P_{\nu |\mu }^r\) represents the relevant \(r\)-step transition probability.

Due to idempotency relation (6) the electron distribution p (row vector) is said to be stationary in the sense of Markov if:

$$\begin{aligned} \mathbf{p}=\mathbf{p}\mathbf{P} \qquad \text{ or} \text{ else}\qquad \mathbf{P}^T\mathbf{p}^T=1\mathbf{p}^T. \end{aligned}$$
(12)

As stated by the Perron-Frobenius theorem [17], for any arbitrary matrix with non-negative elements of each column normalized to 1 there is always at least one stationary probability distribution. In fact, as follows from the second equality of the last equation, the stationary distribution can be ragarded as a normalized left eigenvector of the conditional probability matrix associated with the eigenvalue of 1. Therefore, for given matrix P one can find the relevant vector p by solving the eigenproblem (12) but, in general, the resulting probability distribution is not guaranteed to be unique, i.e. there can be more that one eigenvector of P associated with the eigenvalue 1. It follows from the Perron-Frobenius theorem that the uniqueness of the stationary electron distribution is always assured if the \(x\)th power of the conditional probability matrix converges to a rank-one matrix in which each row is the stationary distribution in question,

$$\begin{aligned} \lim _{x\rightarrow \infty }\mathbf{P}^x=\mathbf{1}\mathbf{p}, \end{aligned}$$
(13)

where 1 stands for the column vector with all entries equal 1. Essentially, for a given conditional probability matrix its stationary distribution p is non-degenerated provided that:

  • matrix P represents an aperiodic Markov chain which virtually means that for all atomic orbitals \(\{|\chi _\mu \rangle \}\) the direct self-communication \(\mu \rightarrow \mu \) is always possible, \(P(\mu |\mu )>0\). In practice, aperiodicity can be always attained by appropriate reduction of the AO-space size, i.e. by removing atomic orbitals for which \(\gamma _{\mu ,\mu }<\epsilon \), where \(\epsilon \) represents the fixed threshold value;

  • matrix P represents an irreducible Markov chain. i.e. if all atomic orbitals are directly and/or indirectly communicated with each other. Also this requirement is straightforward to achieve by eliminating (if present) those AOs for which \(\gamma _{\mu \mu }>2-\epsilon \) (ussually core orbitals). The subspace od mutually communicated atomic orbitals defines a single communicating class.

Conditional probability (9) and relation (12) are crucial for the electron population analysis. Indeed, each row of matrix P allows one to decompose effective electron population \(N_{\mu }\) into terms related to localized (diagonal) and emphdelocalized (off-diagonal) populations, i.e. \(N_{\mu ,\mu }\) and \(N_{\mu ,\nu }\), respectively:

$$\begin{aligned} N_\mu&= N\left(p_{\mu }P_{\mu |\mu }+\sum _{\nu \ne \mu }p_{\mu }P_{\nu |\mu }\right) = N_\mu \left(P_{\mu |\mu }+\sum _{\nu \ne \mu }P_{\nu |\mu }\right) \nonumber \\&= N_{\mu ,\mu } + \sum _{\nu \ne \mu }N_{\mu ,\nu }. \end{aligned}$$
(14)

In atomic resolution the electron population on atom X can be decomposed as follows:

$$\begin{aligned} N_X = N_{X,X} + V_X = \sum _{\mu \in {X}}\sum _{\nu \in {X}}N_{\mu ,\nu } + \sum _{\mu \in {X}}\sum _{\nu \notin {X}}N_{\mu ,\nu }. \end{aligned}$$
(15)

Here, \(N_{X,X}\) stands for the number of electrons well localized on atom X (non-interacting “chemically” with electrons from other atoms, e.g. lone pairs, core electrons) and \(V_X\) is the atomic-covalency index counting up the population of electrons delocalized over chemical bonds with all other atoms,

$$\begin{aligned} V_X = \sum _{Y\ne X}\left(\sum _{\mu \in {X}}\sum _{\nu \in {Y}}N_{\mu ,\nu }\right) = \sum _{Y\ne X} W_{X,Y}. \end{aligned}$$
(16)

\(W_{X,Y}\) stands for the standard (quadratic) bond-covalency index originally proposed by Wiberg [18]. However, one should realize that electron populations and covalencies from Eqs. (15)–(16) calculated within representation of orthogonal AOs assume reliable and chemically meaningful values only when minimal set of conventionally atom-assigned atomic orbitals is used (this problem has been recently reexamined within the framework of OCT-approach in [19]). Such minimal basis set of orthogonal atomic orbitals one can always obtain using various methods, e.g. [20, 21].

3 Stationarity without idempotency

Unfortunately, Definitions (5)–(10) are correct only for one-determinant wavefunctions of closed-shell systems. In typical CI-type calculations the electron density matrix \(\mathbf{\Gamma }\) of the multi-determinant wavefunction represented by \(|\varPsi _{CI}\rangle \) is commonly expressed by means of its eigenvectors—natural orbitals \(|\varvec{\varTheta }\rangle \), and the corresponding eigenvalues—occupation numbers \(\varvec{\uplambda },\)

$$\begin{aligned} \hat{\varGamma }|\varTheta _i\rangle = \uplambda _i|\varTheta _i\rangle , \qquad \quad i=1,\cdots ,n, \end{aligned}$$
(17)

where the electron density operator \(\hat{\varGamma }\) is defined as follows:

$$\begin{aligned} \hat{\varGamma } = \sum _{i=1}^{n}\uplambda _i\hat{P}_i = \sum _{i=1}^{n}\uplambda _i|\varTheta _i\rangle \langle \varTheta _i|. \end{aligned}$$
(18)

Then

$$\begin{aligned} \varvec{\varGamma } = \langle \varvec{\chi }|\hat{\varGamma }|\varvec{\chi }\rangle = \langle \varvec{\chi }|\varvec{\varTheta }\rangle \varvec{\uplambda }\langle \varvec{\varTheta }|\varvec{\chi }\rangle = \varvec{\mathcal{C }\uplambda }\varvec{\mathcal{C }}^{\dagger } \end{aligned}$$
(19)

with normalization \(\text{ Tr}\varvec{\varGamma }=N\) and the electron probability distribution p is determined by diagonal elements \(\{\varGamma _{\mu ,\mu }\}\)

$$\begin{aligned} \mathbf{p}=\left\{ p_\mu = N^{-1}\varGamma _{\mu ,\mu } = N^{-1}\sum _{i=1}^{n}\uplambda _i\mathcal{C }_{\mu ,i}^2\right\} . \end{aligned}$$
(20)

However, since the electron density operator \(\hat{\varGamma }\) is generally not idempotent,

$$\begin{aligned} \hat{\varGamma }^2 = \sum _{i=1}^n\uplambda _i^2\hat{P}_i^2 = \sum _{i=1}^n\uplambda _i^2\hat{P}_i \ne \sum _{i=1}^n\uplambda _i\hat{P}_i = \hat{\varGamma }, \end{aligned}$$
(21)

normalization condition (10) cannot be fulfilled and hence off-diagonal elements of the relevant density matrix, \(\{ \varGamma _{\mu ,\nu } \}\) cannot be further regarded as amplitudes of corresponding conditional probabilities. In other words, for the hypothetical conditional probability matrix \(\mathbf{P}^\prime \) obtained from appropriate renormalization of each row of matrix of elements \(\{\varGamma _{\mu ,\nu }^2\}\) its stationary distribution \(\mathbf{p}^\prime \) (left eigenvector associated with the eigenvalue 1) differs from p and

$$\begin{aligned} \lim _{x\rightarrow \infty }\mathbf{pP}^\prime = \mathbf{p}^\prime \ne \mathbf{p}. \end{aligned}$$
(22)

To solve this problem we propose in this paper the following alternative definition of conditional probability \(P_{\nu |\mu }\), by means of natural orbitals and their occupation numbers:

$$\begin{aligned} \mathbf{P} = \left\{ P_{\nu |\mu } = \varGamma _{\mu ,\mu }^{-1}\sum _{i=1}^n\uplambda _iB^i_{\mu ,\nu } \right\} , \end{aligned}$$
(23)

where

$$\begin{aligned} B^i_{\mu ,\nu } = \mathcal{C }_{\mu ,i}\mathcal{C }_{\nu ,i}\left(\mathcal{C }_{\mu ,i} \mathcal{C }_{\nu ,i}+2\sum _{j=1}^{i-1}\mathcal{C }_{\mu ,j} \mathcal{C }_{\nu ,j}\right), \end{aligned}$$
(24)

Therefore, one can interpret \(B^i_{\mu ,\nu }\) as the contribution of \(i\)th NO to the corresponding electron population \(N_{\mu ,\nu }\) provided that all orbitals \(\{|\varTheta _j\rangle , j=1,\cdots ,i-1 \}\) are fully occupied.

In the one-determinant case of closed-shell molecular system all \(n^o\) occupied natural orbitals correspond to degenerated eigenvalue \(\uplambda _i=2\) and therefore can be easily separated from the (usually larger) subspace of \(n^v\) unoccupied orbitals (\(\uplambda _i=0\)). Due to this separation and degeneracy of \(\{\uplambda _i\}\) it can be proved that both definitions of conditional probability \(P_{\nu |\mu }\), (9) and (23), are fully equivalent, regardless of orbital order:

$$\begin{aligned}&P_{\nu |\mu } = \varGamma _{\mu ,\mu }^{-1}\sum _{i=1}^n\uplambda _iB^i_{\mu ,\nu } = 2\varGamma _{\mu ,\mu }^{-1}\sum _{i=1}^{n^o}B^i_{\mu ,\nu } \nonumber \\&=2\varGamma _{\mu ,\mu }^{-1}\left(\sum _{i=1}^{n^o} \mathcal{C }_{\mu ,i}^2\mathcal{C }_{\nu ,i}^2 + 2\sum _{i=1}^{n^o}\sum _{j=1}^{i-1}\mathcal{C }_{\mu ,i} \mathcal{C }_{\nu ,i}\mathcal{C }_{\mu ,j}\mathcal{C }_{\nu ,j} \right)\nonumber \\&=2\varGamma _{\mu ,\mu }^{-1}\left(\sum _{i=1}^{n^o} \mathcal{C }_{\mu ,i}\mathcal{C }_{\nu ,i}\right)\left(\sum _{j=1} ^{n^o}\mathcal{C }_{\mu ,j}\mathcal{C }_{\nu ,j}\right)\nonumber \\&=2\varGamma _{\mu ,\mu }^{-1}\left(2^{-1}\varGamma _{\mu ,\nu } \right)\left(2^{-1}\varGamma _{\mu ,\nu }\right) = (2\varGamma _{\mu ,\mu })^{-1}\varGamma _{\mu ,\nu }^2 \end{aligned}$$
(25)

For one-determinant wavefuntions the atomic charges and Wiberg-type bond covalencies (14)–(16) based on the density matrix (5) are commonly known to be invariant with respect to unitary transformation of molecular orbitals. Thus, from the equivalence of definitions (9) and (23) one can deduce the same transformational properties of conditional probabilities (23).

It is also straightforward to prove that, for the newly proposed definition (23), stationarity of probability distribution p is preserved for any arbitrary permutation of eigenvectors and eigenvalues of density matrix. First, let us sum up elements \(B_{\mu ,\nu }^i\) over all atomic orbitals,

$$\begin{aligned}&\sum _{\nu }B^i_{\mu ,\nu } = \sum _{\nu }\mathcal{C }_{\mu ,i}^2\mathcal{C }_{\nu ,i}^2 + 2\sum _{\nu }\sum _{j=1}^{i-1}\mathcal{C }_{\mu ,i}\mathcal{C }_{\nu ,i} \mathcal{C }_{\mu ,j}\mathcal{C }_{\nu ,j} \nonumber \\&=\mathcal{C }_{\mu ,i}^2\left(\sum _{\nu }\mathcal{C }_{\nu ,i}^2\right) + 2\sum _{j=1}^{i-1}\mathcal{C }_{\mu ,i}\mathcal{C }_{\mu ,j} \left(\sum _{\nu }\mathcal{C }_{\nu ,i}\mathcal{C }_{\nu ,j}\right) \nonumber \\&=\mathcal{C }_{\mu ,i}^2 + 2\sum _{j=1}^{i-1}\mathcal{C }_{\mu ,i}\mathcal{C }_{\mu ,j}\delta _{i,j} = \mathcal{C }_{\mu ,i}^2. \end{aligned}$$
(26)

The last equality in the preceding equation results from orthonormality of natural orbitals, \(\langle \varTheta _i|\varTheta _j\rangle =\delta _{i,j}\). Now, using the result from (26) we can easily prove the appropriate normalization requirement,

$$\begin{aligned}&\sum _{\nu }P_{\nu |\mu } = \varGamma _{\mu ,\mu }^{-1}\sum _{i=1}^{n}\uplambda _i\left(\sum _{\nu }B^i_{\mu ,\nu }\right) = \nonumber \\&\varGamma _{\mu ,\mu }^{-1}\sum _{i=1}^{n}\uplambda _i\mathcal{C }_{\mu ,i}^2 = \varGamma _{\mu ,\mu }^{-1}\varGamma _{\mu ,\mu }=1. \end{aligned}$$
(27)

Since electron probabilities (and the corresponding populations) are invariant with respect to interchange of any pair of natural orbitals there are formally \(r_P=n!\) different matrices P satisfying (12) for non-degenerated \(\varvec{\uplambda }\) (i.e. \(g_s=1\) for \(s=1,\cdots ,n\)). However, if degeneracy occurs (or certain eigenvalues below a fixed threshold can be neglected) and there are only \(n^\prime \) different eigenvalues (i.e. \(g_s\ge 1\) for \(s=1,\cdots ,n^\prime \)) then the number of possible conditional probability matrices reduces to

$$\begin{aligned} r_P = \left( \prod _{s}^{n^\prime }g_s!\right)^{-1}{n!}, \end{aligned}$$
(28)

where \(g_s\) stands for the degeneracy of \(s\)th possible eigenvalue.

In the case of one-determinant calculations subspaces of occupied and virtual orbitals are strictly separated and there are exactly \(n^o!\) permutations of \(\varvec{\uplambda }\) that lead to the same conditional probability matrix. Furthermore, due to orthonormality of natural orbitals and basis functions, the maximal separation of these subspaces ensures that all \(P_{\nu |\mu }\ge 0\). All permutations of “mixed” occupied and unoccupied NOs can lead to negative values of \(P_{\nu |\mu }\); this is mainly because of predominatingly anti-bonding character of virtual subspace with relation to the occupied one. This facts allow one to draw a conclusion that for the multi-determinant calculations the natural generalization of the maximal separation requirement is the following condition:

$$\begin{aligned} \sum _{i=1}^{n-1} \left( \uplambda _i - \uplambda _{i+1} \right)^2=\text{ minimum}. \end{aligned}$$
(29)

Thus, for ground-state systems the conditional probability matrix P obtained from permutations of NOs for which \(\uplambda _1\le \cdots \le \uplambda _{i}\le \uplambda _{i+1}\le \cdots \le \uplambda _n\), reveals the minimal dependence on order of NOs within the particular (conventionally separated) subspace, high occupied (\(\uplambda _i\approx 2\)) or low occupied orbitals (\(\uplambda _i\approx 0\)).

4 Summary

It was the main goal of this short paper to demonstrate that idempotency property of the density operator is not an indispensable condition for the stationarity of electron probability distribution in molecular systems. We have introduced a simple method of evaluation of conditional probabilities for the multi-determinant wavefunction. Moreover, the newly proposed definition based on natural orbitals successfully displaces hitherto prevailing one for the one-determinant wavefunctions. Since presented methodology introduces some arbitrariness due to orbital order, it has been argued to use the criterion of maximal separation of nearly-degenerated orbital subspaces in ground-state calculations.

The proposed method can be also extended to cover open-shell molecular systems as well as the excited states. However, this requires a more insightful investigation and will be the subject of the seperate article.