Introduction

Active distribution network (ADN) is directly connected with users, which is the key link to ensure the quality of power supply and user service and improve the operation efficiency of power system. With the access of a large number of distributed energy resources such as photovoltaic solar, wind, and battery storage, the complexity of traditional fault location technology of single power system in active distribution network is also increasing. Therefore, it is urgent to propose an effective and fast fault location method for active distribution network.

The fault location methods of active distribution network are mainly based on electrical quantity, impedance, traveling wave method and intelligent algorithm. The method based on electrical quantities usually uses the positive sequence1,2 or zero sequence3,4,5 quantities during the fault to estimate the fault distance. However, too high transition resistance will make the change of electrical quantity not obvious, which will lead to the failure of location method. The impedance-based method mainly locates the fault by constructing the impedance matrix6,7,8. But the main flaw in the suggested techniques is that the impedance matrix construction cannot be done offline since the network structure changes due to sub-network formation and network reconfiguration. Thus, creating an impedance matrix does not significantly benefit large systems but increases complexity. The traveling wave method mainly uses the arrival time of the incident wave and the reflected wave to estimate the fault distance9,10. However, it must maintain the timing of measurements. What’s more, it requires high sampling frequencies and can be adversely affected by noise-corrupted measurements. The intelligent algorithm mainly uses the related optimization algorithm to realize the fault location. Refs.11,12,13 use Bayesian compressed sensing (BCS) to pinpoint all fault types in distribution networks. However, this method makes it easy to locate the adjacent regions of the fault region.

Phasor Measurement Units (PMUs) are being used increasingly often due to the ongoing advancements in communication technology, and they have grown to be a crucial component of the dynamic power system monitoring process. It obtains voltage and current phase data mainly by the discrete Fourier variation method. Based on the measurement data obtained by the PMU, it offers significant data support for the location of distribution network faults and effectively satisfies the demands for quick and precise fault location. Using the voltage threshold value determined by a thorough investigation of the test system, Reference14 uses the data from the optimal PMU locations to provide a two-stage fault detection method. However, does not specifically address how to get the optimal PMU location. Reference15 investigates a new field of study for defective region detection with little data from sensors in the power distribution system to address the issue of a large variety of PMU setups. In addition, literatures16,17,18 classify and locate the defects using deep learning algorithms based on measured data. The solutions, however, will not operate in a distribution network system with DGs and are only helpful for traditional distribution networks.

This paper analyzed the fault characteristics in ADN. Based on the characteristics of current variation before and after the fault occurs, the spectral norm ratio coefficient (SNRC) and current phase difference (CPD) are defined. The proposed method only needs current amplitude and phase data of any period before and after the fault occurs, so it can avoid the impact of time synchronization on fault region identification. Moreover, the method is not affected by fault type, neutral point grounding type, and transition resistance.

The remaining of this article is as follows: “Fault features of active distribution network” section analyzes current features in ADN. How to implement fault region identification using the SNRC and CPD is described in “Fault region identification method” section. “Simulation results” section uses PSCAD and MATLAB to verify the effectiveness of the proposed method from different fault position, fault types, neutral point grounding methods, and transition resistances. “Experiment results” section verifies the practical significance of the method. The conclusion is presented in “Conclusion” section.

Fault features of active distribution network

Fault features of distribution network without DG

Figure 1 illustrates this by using the A-phase ground fault at fault point F as an example (for research, L1 and L2 are set as the upstream and downstream lines of point F, respectively). The present relationship between lines L1, L2, and L3 is satisfied after the fault occurs as follows:

$$\left\{ \begin{aligned} & I_{{1{\text{AF}}}} {\kern 1pt} = I_{{f{\text{A}}}} + V_{{f{\text{A}}}} /Z_{{\text{d}}} \hfill \\ & I_{{{\text{1BF}}}} {\kern 1pt} = I_{{f{\text{B}}}} + V_{{f{\text{B}}}} /Z_{{\text{d}}} \hfill \\ & I_{{1C{\text{F}}}} = V_{{f{\text{C}}}} /Z_{{\text{d}}} \hfill \\ \end{aligned} \right.$$
(1)

where I1AF, I1BF, and I1CF are the A, B, and C three-phase currents of line L1, respectively. IfA, IfB, and IfC are the fault currents of the A, B, and C phases at the fault point F, respectively. VfA, VfB, and VfC are the fault voltages of A, B, and C phases at the fault point F, respectively. Zd is the sum of Z2 and Z3.

Figure 1
figure 1

Distribution network without DG.

When an A-phase grounding fault occurs, there are:

$$\left\{ \begin{aligned} & I_{{f{\text{A}}}} = \frac{{3V_{f}^{(0)} }}{{Z^{(1)} + Z^{(2)} + Z^{(0)} }} \hfill \\ & I_{{f{\text{B}}}} = I_{{f{\text{C}}}} = 0 \hfill \\ & V_{{f{\text{B}}}} = \frac{{(a^{2} - a)Z^{(2)} + (a^{2} - 1)Z^{(0)} }}{{Z^{(1)} + Z^{(2)} + Z^{(0)} }}V_{f}^{(0)} \hfill \\ & V_{{f{\text{C}}}} = \frac{{(a - a^{2} )Z^{(2)} + (a - 1)Z^{(0)} }}{{Z^{(1)} + Z^{(2)} + Z^{(0)} }}V_{f}^{(0)} \hfill \\ & V_{{f{\text{A}}}} = 0 \hfill \\ \end{aligned} \right.$$
(2)

where Vf(0) is the voltage before fault at fault point F and a is equal to ej120. Z(1) is the sum of equivalent positive sequence impedance in line, Z(2) is the sum of equal negative sequence impedance in line, and Z(0) is the sum of equivalent zero sequence impedance in line. In this paper, Z(1) equals Z(2).

Combined with Eqs. (1)-(2), the currents of L1A, L1B, and L1C after fault are:

$$\left\{ \begin{aligned} & I_{{1{\text{AF}}}} {\kern 1pt} = \frac{{3V_{f}^{(0)} }}{{Z^{(1)} + Z^{(2)} + Z^{(0)} }} \hfill \\ & I_{{1{\text{BF}}}} {\kern 1pt} = \frac{{(a^{2} - a)Z^{(2)} + (a^{2} - 1)Z^{(0)} }}{{Z_{{\text{d}}} (Z^{(1)} + Z^{(2)} + Z^{(0)} )}}V_{f}^{(0)} \hfill \\ & I_{{1{\text{CF}}}} {\kern 1pt} = \frac{{(a - a^{2} )Z^{(2)} + (a - 1)Z^{(0)} }}{{Z_{{\text{d}}} (Z^{(1)} + Z^{(2)} + Z^{(0)} )}}V_{f}^{(0)} \hfill \\ \end{aligned} \right.$$
(3)

The currents of line L2 and L3 after the A-phase fault are:

$$\left\{ \begin{aligned} I_{{2{\text{BF}}}} {\kern 1pt} = I_{{{\text{3BF}}}} = \frac{{(a^{2} - a)Z^{(2)} + (a^{2} - 1)Z^{(0)} }}{{Z_{{\text{d}}} (Z^{(1)} + Z^{(2)} + Z^{(0)} )}}V_{f}^{(0)} \hfill \\ I_{{2{\text{CF}}}} {\kern 1pt} = I_{{{\text{3CF}}}} = \frac{{(a - a^{2} )Z^{(2)} + (a - 1)Z^{(0)} }}{{Z_{{\text{d}}} (Z^{(1)} + Z^{(2)} + Z^{(0)} )}}V_{f}^{(0)} \hfill \\ \end{aligned} \right.$$
(4)

Thus, the current variation of L1B, L1C, L2, and L3 are:

$$\left\{ \begin{aligned} & \Delta I_{1B} {\kern 1pt} = \Delta I_{2B} = \Delta I_{2B} = \frac{{(a^{2} - a)Z^{(2)} + (a^{2} - 1)Z^{(0)} }}{{Z_{{\text{d}}} (Z^{(1)} + Z^{(2)} + Z^{(0)} )}}V_{f}^{(0)} - \frac{{V_{{f{\text{B}}}}^{(0)} }}{{Z_{{\text{d}}} }} \hfill \\ & \Delta I_{1C} {\kern 1pt} = \Delta I_{2C} = \Delta I_{2C} = \frac{{(a - a^{2} )Z^{(2)} + (a - 1)Z^{(0)} }}{{Z_{{\text{d}}} (Z^{(1)} + Z^{(2)} + Z^{(0)} )}}V_{f}^{(0)} - \frac{{V_{{f{\text{C}}}}^{(0)} }}{{Z_{{\text{d}}} }} \hfill \\ & \Delta I_{{2{\text{A}}}} {\kern 1pt} = \Delta I_{{3{\text{A}}}} = - \frac{{V_{f}^{(0)} }}{{Z_{{\text{d}}} }} \hfill \\ \end{aligned} \right.$$
(5)

In the formula, VfB(0) and VfC(0) are the voltages of the B and C phases before fault at fault point F.

The current variation of lines L1A is:

$$\Delta I_{{1{\text{A}}}} = \frac{{3V_{f}^{(0)} }}{{Z^{(1)} + Z^{(2)} + Z^{(0)} }} - \frac{{V_{f}^{(0)} }}{{Z_{{\text{d}}} }}$$
(6)

According to Eqs. (5), (6), the sum of the fault current and the phase current downstream of the fault site is the fault phase current upstream of the fault point.

When a B and C two-phase ground short circuit occurs at point F, the current of line L1 may be determined as follows:

$$\left\{ \begin{aligned} & I_{{{\text{1BF}}}} {\kern 1pt} = I_{{f{\text{B}}}} + \frac{{V_{{f{\text{B}}}} }}{{Z_{{\text{d}}} }} \hfill \\ & I_{{{\text{1CF}}}} {\kern 1pt} = I_{{f{\text{C}}}} + \frac{{V_{{f{\text{C}}}} }}{{Z_{{\text{d}}} }} \hfill \\ & I_{{{\text{1AF}}}} = \frac{{V_{{f{\text{A}}}} }}{{Z_{{\text{d}}} }} \hfill \\ \end{aligned} \right.$$
(7)

where, I1AF, I 1BF, and I1CF are the A, B, and C three-phase currents of line L1, respectively. IfB and IfC are the fault currents of the B and C phase at the fault point F, respectively. VfA, VfB, and VfC are the fault voltages of A, B, and C phases at the fault point F, respectively. Zd is the equivalent impedance from fault point F to the end of the line.

When a BC two-phase ground short circuit occurs in the distribution network, there are:

$$\left\{ \begin{aligned} & I_{{f{\text{B}}}} = (a^{2} - \frac{{Z^{(2)} + aZ^{(0)} }}{{Z^{(2)} + Z^{(0)} }})I_{{f{\text{A(1)}}}} \hfill \\ & I_{{f{\text{C}}}} = (a - \frac{{Z^{(2)} + a^{2} Z^{(0)} }}{{Z^{(2)} + Z^{(0)} }})I_{{f{\text{A(1)}}}} \hfill \\ & V_{{f{\text{B}}}} = V_{{f{\text{C}}}} = 0 \hfill \\ & I_{{f{\text{A(1)}}}} = \frac{{V_{f}^{(0)} }}{{Z^{(1)} + Z_{{\text{p}}} }} \hfill \\ & V_{{f{\text{A}}}} = \frac{{3Z^{(2)} Z^{(0)} }}{{Z^{(2)} + Z^{(0)} }}I_{{f{\text{A(1)}}}} \hfill \\ \end{aligned} \right.$$
(8)

where IfB and IfC are the fault currents of B and C phases at fault point F. VfA, VfB, and VC are the fault voltages of the A, B, and C phases at fault point F. IfA(1) is the positive sequence current of the A phase in line L1 and a is equal to ej120, Vf(0) is the voltage before fault at fault point F. Z(1) is the sum of equivalent positive sequence impedance in line, Z(2) is the sum of equal negative sequence impedance in line, and Z(0) is the sum of equivalent zero sequence impedance in line. ZP is the parallel equivalent resistance of Z(2) and Z(0). In this paper, Z(1) equals Z(2).

Combined with Eqs. (7)-(8), the currents of L1A, L1B, and L1C after fault are:

$$\left\{ \begin{aligned} & I_{{{\text{1BF}}}} {\kern 1pt} = V_{f}^{(0)} \frac{{(a^{2} - 1)Z^{(1)} + (a^{2} - a)Z^{(0)} }}{{(Z^{(1)} )^{2} + 2Z^{(1)} Z^{(0)} }} \hfill \\ & I_{{{\text{1CF}}}} {\kern 1pt} = V_{f}^{(0)} \frac{{(a - 1)Z^{(1)} + (a - a^{2} )Z^{(0)} }}{{(Z^{(1)} )^{2} + 2Z^{(1)} Z^{(0)} }} \hfill \\ & I_{{{\text{1AF}}}} = V_{f}^{(0)} \frac{{3Z^{(1)} Z^{(0)} }}{{Z_{d} ((Z^{(1)} )^{2} + 2Z^{(1)} Z^{(0)} )}} \hfill \\ \end{aligned} \right.$$
(9)

The currents of line L2 and L3 after the B, and C two-phase ground fault are:

$$\left\{ \begin{aligned} & I_{{{\text{2BF}}}} {\kern 1pt} = I_{{{\text{2CF}}}} = I_{{3{\text{BF}}}} {\kern 1pt} = I_{{3{\text{CF}}}} = 0 \hfill \\& I_{{{\text{2AF}}}} = I_{{3{\text{AF}}}} = V_{f}^{(0)} \frac{{3Z^{(1)} Z^{(0)} }}{{Z_{{\text{d}}} ((Z^{(1)} )^{2} + 2Z^{(1)} Z^{(0)} )}} \hfill \\ \end{aligned} \right.$$
(10)

Therefore, the current variation of L1A, L2, and L3 is:

$$\left\{ \begin{aligned} & \Delta I_{{{\text{1A}}}} {\kern 1pt} = \Delta I_{{{\text{2A}}}} = \Delta I_{{3{\text{A}}}} = V_{f}^{(0)} \frac{{Z^{(1)} Z^{(0)} - (Z^{(1)} )^{2} }}{{Z_{{\text{d}}} ((Z^{(1)} )^{2} + 2Z^{(1)} Z^{(0)} )}} \hfill \\ & \Delta I_{{2{\text{B}}}} {\kern 1pt} = \Delta I_{{{\text{3B}}}} {\kern 1pt} = - \frac{{V_{{f{\text{B}}}}^{(0)} }}{{Z_{{\text{d}}} }} \hfill \\ & \Delta I_{{2{\text{C}}}} {\kern 1pt} = \Delta I_{{{\text{3C}}}} {\kern 1pt} = - \frac{{V_{{f{\text{C}}}}^{(0)} }}{{Z_{{\text{d}}} }} \hfill \\ \end{aligned} \right.$$
(11)

where, VfB(0) and VfC(0) are the voltages of the B and C phases before fault at fault point F.

The current variation of lines L1B and L1C is:

$$\left\{ \begin{aligned} & \Delta I_{{2{\text{B}}}} {\kern 1pt} {\kern 1pt} = V_{f}^{(0)} \frac{{(a^{2} - 1)Z^{(1)} + (a^{2} - a)Z^{(0)} }}{{(Z^{(1)} )^{2} + 2Z^{(1)} Z^{(0)} }} - \frac{{V_{{f{\text{B}}}}^{(0)} }}{{Z_{{\text{d}}} }} \hfill \\ & \Delta I_{{2{\text{C}}}} {\kern 1pt} {\kern 1pt} = V_{f}^{(0)} \frac{{(a - 1)Z^{(1)} + (a - a^{2} )Z^{(0)} }}{{(Z^{(1)} )^{2} + 2Z^{(1)} Z^{(0)} }} - \frac{{V_{{f{\text{C}}}}^{(0)} }}{{Z_{{\text{d}}} }} \hfill \\ \end{aligned} \right.$$
(12)

Equations (10)-(12) demonstrate that when a two-phase fault occurs, the current of the fault phase downstream of the fault point is zero. The sum of the fault current and the phase current downstream of the fault point is the fault phase current upstream of the fault point. Because the voltage at the fault point is zero when a three-phase short-circuit fault occurs, the current downstream of the fault point is also zero. The current of the fault phase downstream of the fault point decreases when a phase-to-phase short-circuit fault occurs, and the mechanism is identical to a two-phase short-circuit fault. This paper will not be duplicated due to space limitations. In summary, when a fault occurs, the current of the fault phase upstream of the fault point is greater than the current of the fault phase downstream of the fault point.

Fault features of distribution network with DG

For the fault features of the downstream region without DG (i.e., the Region 2 in the Fig. 2), as shown in Fig. 2, when a fault occurs, the system power supply and DG will both provide a short-circuit current to the fault point.

Figure 2
figure 2

Distribution network with DG.

Assuming that the positive direction is the direction of current flow out of the bus. At this time, the current variations of nodes Q and J before and after faults are:

$$\Delta i_{Q} = \Delta i_{s} + \Delta i_{dg}$$
(13)
$$\Delta i_{J} = - i_{J}$$
(14)

where, Δis is the current variation the system power supply provides before and after the fault. Δidg is the variation provided by DG, and iJ is the current at point J before the fault.

When a fault occurs in the Region 2, i.e., fault f3, the current variation downstream of the fault point equals the current before the fault, and the current variation upstream of the fault point equals the sum of the system power supply current variation and the current variation supplied by the DG. So, by contrasting the current amplitude difference before and after the fault, the fault region may be identified.

For the fault features of the downstream region with DG (i.e., the Region 1 in the Fig. 2), as shown in Fig. 2, when a fault occurs, the fault current from DG will flow to the fault point if DG is connected to the fault region. Point M of the Region 1 in Fig. 2 is arbitrarily chosen to study the current attributes of the upstream and downstream fault. The direction of the current flowing out of the bus is the positive direction.

When no fault occurs, the current IMpre of the point M is:

$$I_{{{\text{Mpre}}}} = \frac{{V_{{\text{S}}} - V_{{\text{M}}} }}{{Z_{{\text{T}}} + Z_{1} + Z_{2} }}$$
(15)

VS and VM are the system source and M-point voltage during the regular line operation. ZT is the sum of the internal impedance of the transformer and the system source. Z1 and Z2 are the line impedances.

  1. 1.

    Single-phase fault feature in Region 1 of Fig. 2.

    Suppose that an A-phase fault occurs upstream of the detection point M, as shown in f1 of Fig. 2. The current IMF1 of the point M is:

    $$I_{{{\text{MF1}}}} = \frac{{V_{f1} - V_{Q} }}{{Z_{{2}} + Z_{3} + Z_{4} }}$$
    (16)

    where Vf1 and VQ are the fault point’s and Q point’s voltage, respectively. Z2, Z3, and Z4 are the line impedances.

    If the transition resistance is 0, it is obvious to find:

    $$\left\{ \begin{aligned} & V_{f1} = 0 \hfill \\ & I_{{{\text{MF1}}}} = \frac{{ - V_{Q} }}{{Z_{{2}} + Z_{3} + Z_{4} }} \hfill \\ \end{aligned} \right.$$
    (17)

    Hence, IMF1 is negative. the length of the line is generally not more than a few kilometers when the distribution network is operating normally, and the length of the line less impacts the impedance angle. It can be considered that the line impedance angle is roughly equal, ranging from 70° to 80°19. As a result, in Eq. (15), the amplitude of VS-VM is lower, and the phase is earlier than VS. Transformer impedance and system impedance are both contained in ZT. Its maximum impedance angle is not more than 90°, so the phase of IMF1 lies behind the phase of IMpre. If there is a transition resistance since the amplitude of the DG access point voltage is greater than Vf1, Vf1-VQ is negative, and the phase of IMF1 still lags behind the phase of IMpre.

    Assume that an A-phase fault develops downstream of point M, as shown in f2 of Fig. 2. The current IMF2 of the point M is:

    $$I_{{{\text{MF2}}}} = \frac{{V_{{\text{S}}} - V_{f2} }}{{Z_{{\text{T}}} + Z_{1} + Z_{2} + Z{}_{3}}}$$
    (18)

    where Vf2 and VS are the fault point and system source voltage, respectively. Z1, Z2, and Z3 are the line impedances. ZT is the sum of the internal impedance of the system source and the impedance of the transformer.

    According to the study above, the phase of IMF2 is ahead of the phase of IMpre if the transition resistance is 0. On the other hand, the phase of Vf2 is behind the phase of the f2-point voltage when the line is operating normally. As a result, the phase of Vf2 is later than the phases of VM and VS. It can be determined that the phase of IMF2 is ahead of the phase of IMpre.

  2. 2.

    Two-phase short fault and three-phase short fault feature in Region 1 of Fig. 2.

    When a two-phase short-circuit fault occurs, the voltage phase of the access point will mutate. As a result, it is appropriate to use positive sequence quantity for analysis. Suppose that a BC two-phase short-circuit fault occurs upstream of point M, as illustrated in f1 of Fig. 2, and that the phase of the A-phase voltage at fault time is 0 degree as a reference. Figure 3 depicts the equivalent circuit at f1 fault.

    Figure 3
    figure 3

    Equivalent circuit at f1 fault.

    The positive sequence current variation ΔIM1 of M point is:

    $$\Delta I_{{{\text{M1}}}} = - \Delta I_{dg}$$
    (19)

    where, ΔIdg is the variation of the output current of DG. When the inverter-type DG output current is equal to 1.2 p.u., the amplitude range of the positive sequence fault component before and after the M-point fault is obtained as follows:

    $$|\Delta I_{{{\text{dg}}}} | \in 0.22p.u.\sim 0.61p.u.$$
    (20)

    ΔIM1 is negative, so the current of M point phase lags behind IMpre phase after a fault. Assume that a BC two-phase short-circuit fault occurs downstream of point M, as depicted in f2 of Fig. 2. The equivalent circuit at f2 fault is shown in Fig. 4.

    Figure 4
    figure 4

    Equivalent circuit at f2 fault.

    According to the superposition theorem, the current positive sequence variation ΔIM2 of M can be obtained as:

    $$\begin{aligned} \Delta I_{{{\text{M2}}}} & = \frac{{V_{f2} }}{{\Delta Z + (Z_{T} + Z_{1} + Z_{2} + Z_{3} )||(Z_{4} + (Z_{{{\text{Leq}}}} ||Z_{dg} )}} \cdot \frac{{(Z_{4} + (Z_{{{\text{Leq}}}} ||Z_{dg} )}}{{Z_{T} + Z_{1} + Z_{2} + Z_{3} + Z_{4} + Z_{{{\text{Leq}}}} + Z_{dg} }} \hfill \\ & \quad - \Delta I_{dg} \cdot \frac{\Delta Z}{{\Delta Z + (Z_{T} + Z_{1} + Z_{2} + Z_{3} )}} \hfill \\ \end{aligned}$$
    (21)

    where Zdg is the internal impedance of DG and ZLeq is the sum of the impedance downstream of the Region 1 and the load. ΔZ is the transition resistance. “||” is the parallel operation of impedance. For the virtual power branch, because the internal impedance of the inverter-type DG is infinite, it can be simplified as follows:

    $$\begin{aligned} \Delta I_{{{\text{M2}}}} & = \frac{{V_{f2} }}{{\Delta Z + (Z_{T} + Z_{1} + Z_{2} + Z_{3} )}} - \Delta I_{dg} \cdot \frac{\Delta Z}{{\Delta Z + (Z_{T} + Z_{1} + Z_{2} + Z_{3} )}} \hfill \\ & = \frac{{V_{f} - \Delta I_{dg} \Delta Z}}{{\Delta Z + (Z_{T} + Z_{1} + Z_{2} + Z_{3} )}} \hfill \\ \end{aligned}$$
    (22)

    When a fault occurs, the amplitude of the positive-sequence voltage at the fault point is half what it was before the fault, and the voltage phase stays unaltered. The amplitude of the positive-sequence fault component of the virtual power supply's current is 3 to 12 times that of the DG's current. When the inverter-like DG’s output current is set at 1.2 p.u., according to reference19, there is:

    $$\left\{ \begin{aligned} & |\Delta I_{{{\text{M2}}}} | \in 2.86p.u.\sim 11.94p.u. \hfill \\ & |\Delta I_{{{\text{dg}}}} | \in 0.22p.u.\sim 0.61p.u. \hfill \\ \end{aligned} \right.$$
    (23)

    Therefore, the direction of ΔIM2 is positive, implying that the current phase of M point is ahead of the phase of IMpre after the fault. Taking the impedance angle of the overhead line as 72°19, the phase angle variation of the positive sequence fault component at both ends of the Region 1 may be calculated roughly as:

    $$\arg (\Delta I_{PQ} ) \in (0^{ \circ } ,87.3^{ \circ } )$$
    (24)

    If a three-phase short-circuit fault occurs upstream and downstream of point M, it is seemingly possible to find:

    $$\left\{ \begin{aligned} & |\Delta I_{{{\text{M1}}}} | = |\Delta I_{{{\text{dg}}}} | \in 0.22pu\sim 2.0pu \hfill \\ & |\Delta I_{{{\text{M2}}}} | \in 5.58pu\sim 12.44pu \hfill \\ \end{aligned} \right.$$
    (25)

    where |ΔIM1| is M point upstream fault current amplitude variation. |ΔIM2| is M point downstream fault current amplitude variation.

    Combining Eqs. (19), (22), and (25), it can be obtained that the phase of ΔIM1 is behind the phase of IMpre, whereas the phase of ΔIM2 is ahead of the phase of IMpre. Therefore, the phase angle variation of the positive sequence fault component at both ends of the Region 1 can be approximated as follows:

    $$\arg (\Delta I_{PQ} ) \in (0^{ \circ } ,162.9^{ \circ } )$$
    (26)

    From above, when a fault occurs in a downstream region with DG access, the positive sequence fault current phase upstream of the fault point will be ahead of the normal operation phase. In contrast, the positive sequence fault current phase downstream of the fault point will lag. If we define φp as the phase before the fault and φF as the phase after the fault, then the phase variation ∆φ is:

    $$\Delta \varphi = \varphi_{{\text{p}}} - \varphi_{{\text{F}}}$$
    (27)

    If ∆φ is positive, the phase after the fault is ahead of the phase before the fault. If ∆φ is negative, the phase after the fault lags behind the phase before the fault.

    When a fault occurs in the Region 2 in Fig. 2, there is IPF = IQF. At both ends of the Region 1, the CPD of the positive fault sequence current is:

    $$\Delta \varphi_{PQ} = |\Delta \varphi_{P} - \Delta \varphi_{Q} | = 0$$
    (28)

    The above analysis shows that the fault positive sequence current phase at upstream of the fault point is ahead of the phase before the fault. In contrast, the fault positive sequence current phase at downstream of the fault point lags behind the phase before the fault, resulting in the maximum amount of phase difference in faulty region. Equations (24) and (26) illustrate the range.

Fault region identification method

Spectral norm ratio coefficient

The matrix 2-norm of the factual matrix Am×n is defined as follows:

$$ \left\| {\bf {A}} \right\|_{2} = \sqrt {\max ({eig}({\bf {A}}^{{\text{T}}} \times {\bf {A}}))}$$
(29)

where A is an m × n matrix, AT is a transposed matrix of A. eig(.) is the eigenvalue calculation function of a square matrix. eig (AT × A) represents the eigenvector [λ1, …, λi, …]T of matrix A. λi represents the ith eigenvalue.

In this research, the length of the data acquisition time window is set to 10 ms. A fault is considered to have occurred when the current data acquired by the PMU exceeds the threshold. Data from each cycle before the fault is collected and discretized. The phase current matrix of node i before the fault is:

$$\left\{ \begin{aligned} & I_{{i\text{A}}} = [\begin{array}{*{20}c} {I_{{i\text{A}}} (1)} & {I_{{i\text{A}}} (2)} & {\begin{array}{*{20}c} \cdots & {I_{{i\text{A}}} (M)} \\ \end{array} } \\ \end{array} ]^{{\text{T}}} \\ & I_{{i{\text{B}}}} = [\begin{array}{*{20}c} {I_{{i{\text{B}}}} (1)} & {I_{{i{\text{B}}}} (2)} & {\begin{array}{*{20}c} \cdots & {I_{{i{\text{B}}}} (M)} \\ \end{array} } \\ \end{array} ]^{{\text{T}}} \\ & I_{{i{\text{C}}}} = [\begin{array}{*{20}c} {I_{{i{\text{C}}}} (1)} & {I_{{i{\text{C}}}} (2)} & {\begin{array}{*{20}c} \cdots & {I_{{i{\text{C}}}} (M)} \\ \end{array} } \\ \end{array} ]^{{\text{T}}} \\ & I_{i} = [\begin{array}{*{20}c} {I_{{i\text{A}}} } & {I_{{i{\text{B}}}} } & {\begin{array}{*{20}c} {I_{{i{\text{C}}}} } \\ \end{array} } \\ \end{array} ]^{{\text{T}}} \\ \end{aligned} \right.$$
(30)

where i = 1, 2, 3, …, N; k = 1, 2, 3, …, M. IiA(k), IiB(k), IiC(k) are the current amplitudes of the kth sampling point of the node I before the fault occurs, and M is the total number of data obtained by discretization.

After a length of time, the current IiF of each node is derived using discretization:

$$\left\{ \begin{aligned} & I_{{i\text{FA}}} = [\begin{array}{*{20}c} {I_{{i\text{FA}}} (1)} & {I_{{i\text{FA}}} (2)} & \cdots & {I_{{i\text{FA}}} (M)} \\ \end{array} ]^{{\text{T}}} \hfill \\ & I_{{i\text{FB}}} = [\begin{array}{*{20}c} {I_{{i\text{FB}}} (1)} & {I_{{i\text{FB}}} (2)} & \cdots & {I_{{i\text{FB}}} (M)} \\ \end{array} ]^{{\text{T}}} \hfill \\ & I_{{i\text{FC}}} = [\begin{array}{*{20}c} {I_{{i\text{FC}}} (1)} & {I_{{i\text{FC}}} (2)} & \cdots & {I_{{i\text{FC}}} (M)} \\ \end{array} ]^{{\text{T}}} \hfill \\ & I_{{i{\text{F}}}} = [\begin{array}{*{20}c} {I_{{i{\text{FA}}}} } & {I_{{i{\text{FB}}}} } & {\begin{array}{*{20}c} {I_{{i{\text{FC}}}} } \\ \end{array} } \\ \end{array} ]^{{\text{T}}} \hfill \\ \end{aligned} \right.$$
(31)

where i = 1, 2, 3, …, N; k = 1, 2, 3, …, M. I iFA(k), IiFB(k), IiFC(k) are the current amplitudes of the kth sampling point of the node i after the fault.

To realize fault region identification under limited PMU, the SNRC is defined to measure the difference between the upstream and downstream currents of the fault point. The expression of SNRC is as follows:

$$K = \frac{{||I_{i} ||_{2} }}{{||I_{{i{\text{F}}}} ||_{2} }}$$
(32)

where, ||Ii||2 represents the 2-norm of the current before the fault of node i, and ||IiF||2 represents the 2-norm of the current after the fault of node i. Then the SNRC variation ΔKm of the region m (the first node and end node of region m are nodes i and j, respectively) is:

$$\Delta K_{m} = |K_{i} - K_{j} |$$
(33)

where m = 1, 2, 3, …, J; i, j = 1, 2, 3, …, N, and i ≠ j.

When the line is in normal operation, the 2-norm of the three-phase current calculated for the entire line for any period is not significantly different. Hence, the ΔK of each region is close to 0. When the line faults, the phase current is greater than the threshold Iset, and the ΔK of each region increases. Moreover, the ΔK of the fault region is significantly greater than the ΔK of the normal region. As a result, the fault region may be identified by comparing the variations in ΔK of each location.

Current phase angle difference

Because there is no current flowing downstream of the fault point when a fault occurs in the downstream region without DG integration, the fault region may still be recognized using the SNRC. When a fault occurs in the downstream region with DG access, the DG injects current into the fault point. Current flows downstream of the fault point. Relying exclusively on the SNRC may lead to errors. Equations (26)–(28) shows that the CPD of the normal region is 0. However, the CPD of the fault region will be much more significant than that in the normal region. To identify the fault region, this study combines the CPD stated in Eq. (34) with the SNRC.

$$\left\{ \begin{aligned} & \Delta \varphi_{M} = \varphi_{M{\text{F}}} - \varphi_{M{\text{P}}} \hfill \\ & \Delta \varphi_{N} = \varphi_{N{\text{F}}} - \varphi_{N{\text{P}}} \hfill \\ & \Delta \varphi_{MN} = |\Delta \varphi_{M} - \Delta \varphi_{N} | \hfill \\ \end{aligned} \right.$$
(34)

where φMF and φNF are the current phases after node M and N fault, respectively. φM and φNP are the current phases before node M and N fault, respectively. ΔφM is the current phase variation of node M. ΔφN is the current phase variation of node N. ΔφMN is the CPD of the region (node M and node N are the first node and end node of the region, respectively.) If the region MN is normal, then its ΔφMN approaches 0. Inversely, its ΔφMN will be the maximum one.

Fault region identification start criterion

In this paper, PMUs are utilized to monitor the phase current in real-time. Meanwhile, the exists of faults is determined based on whether the phase current exceeds the limit. The particular requirement is as follows:

$$I \ge I_{{\text{set}}}$$
(35)

where Iset is the fault protection threshold. The neutral point ungrounded system threshold Iset-U and the neutral point through the arc suppression coil grounding system threshold Iset-A respectively, meet:

$$\left\{ \begin{aligned} & I_{{\text{set - U}}} = 0.5K_{{\text{rel}}} (I_{C\Sigma } - I_{L} ) \hfill \\ & I_{{\text{set - A}}} = 0.5WK_{{\text{rel}}} I_{C\Sigma } \hfill \\ \end{aligned} \right.$$
(36)

where, Krel is the reliability coefficient, generally 0.8; I is the total ground capacitive current; IL is the maximum line-to-ground capacitance current; and W is the over-compensation degree.

When the above starting criterion is satisfied at time t, PMU will take the point at time t as the fault starting point and record the phase current data before the fault starting point and the phase current data of a cycle after the fault starting point. According to the data, the PMUs at both ends of the region use Eqs. (32)–(34) to identify the fault region.

To eliminate the impact of other interference signals on the beginning fault region identification, the PMU is utilized to capture phase current data one cycle after the fault starting point for verification after finding the fault starting point. If Eq. (37) is fulfilled, the identification and the next step are determined.

$$\left\{ \begin{aligned}{}& I_{km} \ge KI_{n} \hfill \\ & I_{km} = \sqrt {\frac{1}{N}\sum\limits_{k = 1}^{N} {i^{2}_{n} (k)} } \hfill \\ \end{aligned} \right.$$
(37)

where, Ikm is the practical phase current value in one cycle, and K is the reliability coefficient of phase current fault, generally K = 2–3. In is the reasonable value of phase current when the line is normal, and in(k) is the instantaneous value of phase current at the kth identification point after the fault.

PMU configuration and region division strategies

The PMU configuration and region division strategies are proposed in this paper, as shown in Fig. 5. Specifically, the PMU configuration strategy is as follows:

  1. 1.

    The first node of main feeder and the end node of main feeder in ADN are arranged with PMU, respectively;

  2. 2.

    The first node of branch and the end node of branch are arranged with PMU, respectively.

Figure 5
figure 5

Diagram of PMU configuration and region division.

In addition, the region division strategy is:

  1. 1.

    The first node of main feeder and the first node of the nearest brunch in distribution network is the regional boundaries;

  2. 2.

    The first node of branch and the end node of the branch is the regional boundary;

  3. 3.

    The end node of main feeder and the first node of the nearest branch is the regional boundaries.

Fault region identification method steps

The framework and flowchart of the proposed method shown in Figs. 6-7. The implementation steps are as follows:

Figure 6
figure 6

The framework of the proposed method.

Figure 7
figure 7

The flowchart of the proposed method.

Step 1::

arrange the PMUs according to the proposed method and divide the region.

Step 2::

use the three-phase current data collected by PMUs. Judge the fault according to Eqs. (35)–(37).

Step 3::

calculate and compare ΔKj of each region based on Eqs. (29)–(33) to gain the region corresponding to the ΔKmax.

Step 4::

calculate and compare Δφj of each region based on Eq. (34).

Step 5::

if the region corresponding to ΔKmax is the same as that corresponding to ∆φmax, output the region name. Adversely, output the region corresponding to ∆φmax.

Simulation results

The IEEE 33-node system is built in PSCAD/EMTDC, using a simulation sampling frequency of 4 kHz. The parameters of the system are shown in the Supplementary Information. The capacities of DG1 and DG2 are 0.544 MVA and 1.4 MVA, respectively. MatlabR2016b is utilized for data analysis and computation to verify the effectiveness of the proposed fault region identification method. According to the method of PMU layout and region division, the nodes and region division of PMU in an IEEE 33-node distribution network are shown in Fig. 8.

Figure 8
figure 8

IEEE 33-node distribution network.

Firstly, set an A-phase grounding fault in Region 7 and combine it with the phase current data collected by PMU. The curve of ΔK variation with time in each region is obtained, as depicted in Fig. 9(a). Secondly, set up a phase A grounding fault in Region 2 and use the data collected by PMU. The ΔK variation and CPD with time in each region is obtained, as shown in Fig. 9(b)-(c).

Figure 9
figure 9

ΔK and Δφ in each region in case of fault in Region 7 and Region 5 respectively. (a) ΔK for each region in case of fault in Region 7; (b) ΔK for each region in case of fault in Region 5; (c) Δφ for each region in case of fault in Region 5.

Figure 9(a) shows that the ΔKmax occurs when a fault occurs in a region without DG access downstream. Additionally, Fig. 9(b)-(c) show that when a fault occurs in a region with DG access downstream, its ΔK is second only to ΔKmax, but the CPD is at its maximum. To accurately identify the fault region, the SNRC and CPD of the region can be combined.

To verify the reliability and robustness of the method, the fault type, neutral point grounding mode, and fault transition resistance of the line are changed, respectively. The results are illustrated in Figs. 10-11 and Tables 1-2. From the results in Fig. 10, the SNRC variation and CPD of Region 2 reach their maximum when two-phase faults and three-phase faults occur in Region 2. This is because the short-circuit current delivered by DG1 and DG2 is slight when a two-phase or three-phase fault occurs.

Figure 10
figure 10

Identification results under different fault types in Region 2. (a) Single-phase fault; (b) Two-phase fault; (c) Three-phase fault.

Figure 11
figure 11

Identification results of Region 2 fault under different neutral grounding modes. (a) Neutral point ungrounded mode; (b) Neutral point through low resistance grounding mode; (c) Neutral point grounding through arc suppression coil.

Table 1 Identification results of Region 5 fault under different transition resistance.
Table 2 Identification results of Region 2 fault under different transition resistance.

Figure 11 shows the identification results under various neutral point grounding modes, indicating that the grounding mode of the neutral point has little effect on the proposed method and that the fault region can still be accurately identified under various neutral grounding modes.

The results and analyses in Tables 1-2 reveal that the method has a decent identification impact in any location. The smaller the ΔK and Δφ of each region, the more excellent the transition resistance. Furthermore, the SNRC variation of the fault region will reach their maximum values. This is because the short-circuit current of the distributed power supply is reduced when the transition resistance increases. As a result, in the case of a high impedance fault, the method proposed can still identify the fault region.

Table 3 demonstrates the identification result with the PMU installed in node 2 takes 10 ms longer to gather data than other PMUs. It can be obtained that when a single-phase ground fault occurs in Region 7, the ΔK and Δφ of Region 7 are greater than those in other regions. While a single-phase ground fault occurs in Region 2, the Δφ of Region 2 reaches the maximum. And the ΔK of Region 2 is the second largest compared with other regions. This is because the estimated current spectrum norm measures the current's variation in a data window, whereas the current phase utilizes the steady-state value. As a result, the method suggested in this research has low communication needs between PMUs.

Table 3 Fault region identification results of node 2 with a PMU delay of 10 ms.

In order to verify the effectiveness of the proposed method in large-scale distribution networks, this paper builds an IEEE 69-node distribution network system as shown in Fig. 12. The parameters of the system are shown in the Supplementary Information. The capacities of DG1 and DG2 are 0.544 MVA and 1.4 MVA, respectively. Set an A-phase grounded fault in R7(node 12-node 27) and R8(node 3-node 69), respectively. Collect the current data of each cycle before and after the fault to calculate the ΔK of each region. The result is shown in Table 4. It can be obtained that when a fault occurs in the region downstream of the fault point accessing the DG (e.g., R7), the Δφ of the fault region are significantly larger than those of the other regions. And the ΔK will reach the maximum or the second largest. When a fault occurs in the region downstream of the fault point without access to the DG (e.g., R8), the ΔK and Δφ in the fault region are both significantly larger than those in normal regions. Therefore, the method proposed in this paper is still able to effectively and accurately identify fault region under large-scale distribution networks.

Figure 12
figure 12

IEEE 69-node distribution network.

Table 4 Identification results of R7 and R8 fault respectively.

To further verify the superiority of the methods proposed, this paper compared it with other four methods. The four methods use the characteristics of the upstream and downstream currents at the fault point to realize fault identification. Reference20 realizes fault region identification with the help of zero sequence conductance. Although it is independent of neutral grounding method, transition resistance and communication desynchronization, it can only identify single-phase faults and can only be applied to single-supply systems. Literature21, compared to literature20, is independent of fault type, but it requires communication synchronization. The methods proposed in literatures22,23 are applicable to DG-integrated distribution networks, but do not consider the effects of neutral grounding method, transition resistance and communication desynchronization. The specific method comparison results are shown in Table 5. Therefore, method proposed in this paper is more comprehensively considered and superior to other methods.

Table 5 The main characteristics of other methods and the method proposed in this paper.

Experiment results

To verify the efficacy and dependability of the proposed method, a real 6-node test platform is used to validate the method. Figure 13 depicts the test platform structure, PMU configuration, and region division. The power quality analyzer replaces the PMU in the actual test, and three power quality analyzers acquire the platform’s current data before and after the short-circuit defect at a 4 kHz acquisition frequency. Simultaneously, a sampling time of 10 ms is set between the three power quality analyzers mentioned above to examine the proposed method's performance change when the PMU clock is not synced.

Figure 13
figure 13

Test platform (a) The topology of the test platform; (b) The actual test platform.

In the actual experiment, the fault is set in Region 1 and Region 2, respectively. Figure 14 depicts the measured waveforms of the currents at both ends of the region when Region 1 and Region 2 fault respectively. It can be found that when Region 1 faults, the current at node 1 increases and the currents at node 3 and node 6 remain essentially unchanged. Whereas, when Region 2 faults, the currents at both node 1 and node 2 increase and the current at node 6 remains essentially unchanged. The fault region identification results obtained by the proposed method are shown in Table 6. It can be obtained that the ΔK and Δφ of the faulty region are significantly larger than those of the normal region when either Region 1 or Region 2 fault. Thus, the method proposed in this paper can still accurately identify the fault region when the communication is not synchronized, proving its practical potential.

Figure 14
figure 14

Measured current waveform of each node (a) Region 1 faults; (b) Region 2 faults.

Table 6 Fault region identification results of Region 1 and Region 2 fault respectively.

Conclusion

The paper analyzes the fault characteristics of the ADN in detail. A fault region identification approach based on the SNRC and CPD is proposed for the ADN. It only needs to extract the current amplitude and phase data of any period before and after the fault occurs, avoiding the impact of time synchronization on fault region identification. Case studies of faults under different conditions are used to evaluate the performance of the proposed method. The results demonstrate that the method is less affected by fault type, neutral grounding mode, fault position, transition resistance, and communication. Moreover, the actual test platform shows that the proposed method can effectively identify the fault region and has practical significance.

With the further development of synchronous measuring technology, PMU will become more commonly employed in distribution networks. Understanding how to use restricted PMU to accomplish accurate fault location in the ADN is critical.