1 Introduction

With the depletion of fossil energy, the worsening environmental pollution, and the occurrence of extreme climate events, large-scale development of renewable energy has become a significant trend around the world [1]. However, wind, solar and other renewable energy are intermittent and random. Meanwhile, the local absorption capability of renewable energy is limited, making the phenomenon of abandoning wind and solar energy is very common [2]. Because of its advantages of flexible active and reactive power control and low transmission loss, the flexible DC grid has become one of the most important parts to solve the above problems. The first DC grid demonstration project has been in construction in Zhangbei, China.

Compared with traditional DC technology using line commutated converters (LCC), flexible DC technology with voltage source converters (VSC) is immune to commutation failure, which has decoupled active and reactive power control with no need for filtering and reactive compensation, and can also supply power to isolated islands [3, 4]. In brief, such a flexible DC grid has numerous additional advantages including: (1) Multiple power supplies and receivers improve the overall stability of the power grid. (2) Network operation can reduce the number of converter stations and investment costs of transmission corridors. (3) It provides a more secure and flexible operation mode, which can guarantee power transmission under an n-1 condition.

The protection and fault handling of conventional LCC HVDC transmission systems and two-terminal flexible HVDC transmission systems are relatively mature. For the former, when a fault occurs, the thyristors in the LCC have high endurance to overcurrent, and the firing angle can also be controlled to limit the fault current. For the latter, although IGBTs are vulnerable, the converters can block the submodules quickly to protect the component-level equipment with no high requirement for selectivity. However, the flexible DC grid has different fault characteristics and thus brings high requirements on both protection speed and selectivity. Therefore, there are some important challenges in protection and fault handling in the flexible DC grid as follows:

  1. (1)

    The transient process after a fault is complicated and difficult to calculate accurately. However, analysis of fault characteristics by stages is the basis of component design and protection research.

  2. (2)

    The conventional protection principle in simple and special scenarios is insufficient to meet the requirement in complex network for the speed and selectivity of protection. Therefore, a practical analysis of the classification of the existing protection principle is important.

  3. (3)

    Fast and effective fault isolation in a flexible DC grid is challenging, and the ideal situation is to use DC circuit breakers to isolate the fault section. Therefore, the development of highly efficient and economical DCCBs can bring a major breakthrough to DC grid development.

  4. (4)

    Rapid system recovery after a fault is an important measure to ensure continuous transmission of electrical energy. Studying the reclosing strategy applicable to a flexible DC grid can significantly improve the reliability of system operation.

This paper starts with the development background of HVDC transmission systems, then reviews the protection and fault handling from the aspects of fault analysis, protection principle, fault isolation and fault recovery, and finally illustrates future development directions.

2 Overview of flexible DC technology

HVDC transmission technology based on LCC has the disadvantages of being susceptible to commutation failure, absorbing a large amount of reactive power and having no ability to supply power to passive networks. In order to solve these problems, a flexible DC transmission technology based on VSC was proposed. Flexible DC projects have undergone 2 generations of technological development. In the first generation, two-level and three-level VSC converters were used. The converter valve is composed of IGBTs in series, resulting in many problems including difficulty in voltage sharing, high switching loss, and high harmonic content. A modular multi-level converter (MMC) topology [5] was proposed by R. Marquart in 2001. This has the advantages of low manufacturing difficulty and low loss, leading to the second generation of flexible DC technology. A number of MMC topologies with DC fault current suppression capability have also been proposed.

By 2019, 44 flexible DC transmission projects have been put into operation all over the world with a total transmission capacity of 35,084.2 MW, and there are also 7 projects under construction. Table 1 shows some of the projects.

Table 1 Typical VSC-HVDC transmission projects

The flexible DC grid is expected to become the backbone of building a new generation of smart grid [2, 4]. The U.S. has proposed the plan of building a national power backbone network connecting the East and the West, from Mexico in the south to Canada in the north. Europe has also proposed the building of an interconnected flexible DC grid from the North Sea to the northern part of the European continent to achieve the mutual complementarity of multiple renewable energy sources across Europe. In China, five flexible DC projects have been completed, and the world’s first demonstration project with a voltage level of ±500 kV will be put into operation in 2020. The rest of the paper is based on this Zhangbei 4-terminal HVDC system as shown in Fig. 1. With the development of technology and increasing demands, flexible DC projects are moving towards high voltage, high power configuration and multi-terminal networked structures.

Fig. 1
figure 1

Schematic diagram of Zhangbei 4-terminal HVDC system

3 Requirements of DC grid protection and fault handling research

The rapid development of flexible DC power transmission, especially the increasing number of overhead line projects, brings high requirements for protection and fault handling technology. It not only needs to know the fault characteristics, but also needs to reveal the fault mechanism. In addition to being able to identify the faults, it is also necessary to isolate the faults and realize the recovery of the DC grid in an efficient way.

3.1 Analysis of fault characteristics

Analysis of fault characteristics is the basis for the development of a flexible DC grid. From the grid construction plan to accident analysis, from parameter design to equipment selection, from protection principles to parameter setting, all need to be supported by fault mechanisms. In AC systems, research on the fault mechanism is quite mature, but in the field of the flexible DC grid, the knowledge is far from being sufficient. Electromagnetic transient simulation can provide information on the fault characteristics but not the reason behind them. Therefore, for the long-term development of the flexible DC grid, understanding of fault mechanisms is a top priority.

3.2 Protection principle

The protection principle is the essential safeguard for the development of a flexible DC grid. If there is no adequate protection scheme, a single fault can lead to the shutdown of the entire DC grid. The first step of protection should ensure that the power electronic devices are protected from damage. Secondly, the protection needs to be selective, with high reliability and sensibility to endure disturbances and varying fault resistance.

3.3 Fault clearing

Fault isolation is a key requirement of protection, and it is also a challenging task in DC technology. Unlike the fault current in an AC system, which has a natural zero crossing, breaking the DC fault current is challenging. The realization of efficient and economical fault isolation is the basis for the expansion of the DC grid.

3.4 Fault recovery

Fault recovery of the flexible DC grid needs to be intelligent and flexible, so as to minimize secondary impact for the operating grid. How to identify the permanent fault of a DC line and to reduce the influence of reclosing will be the key points.

Therefore, this paper will review the relay protection from the aspects of fault analysis, protection principle, fault isolation and fault recovery in the following sections.

4 DC fault analysis

DC fault analysis is the basis of component design and protection research. Existing researches of DC grid fault analysis mainly includes two aspects: 1) analysis of the DC fault development process, 2) analysis of the time-domain and frequency-domain characteristics of fault electrical quantities. Among them, the fault development process illustrates the dominant contributor of the fault current during each fault stage, while the time-domain and frequency-domain characteristics of the fault electrical quantities show the difference features between internal faults and external faults that can be used to construct protection criteria.

4.1 Analysis of the DC fault development process

4.1.1 Fault development process of the converter

Existing research shows that the development process of a pole-to-pole fault of the two-level VSC [6] and MMC [7] can be divided into three stages: capacitor discharge, diode freewheeling and AC feeding, as depicted in Fig. 2. Based on the division of the fault stages, scholars have further studied the converter’s fault equivalent circuit and fault current analytical expressions in each stage [8,9,10]. In the capacitor discharge stage and the diode freewheeling stage the converter can be equivalent to a second-order circuit and a first-order circuit [7, 11,12,13,14,15,16,17], respectively. During the AC feeding stage, the IGBT is blocked due to self-protection and the converter can be equivalent to an uncontrolled diode rectifier. References [18, 19] carried out a theoretical analysis of the fault mechanisms of two-level VSC and MMC during the AC feeding stage, dividing them into transient and steady-state phases. References [20,21,22,23,24,25] estimated the steady-state DC fault current based on the IEEE61660 standard, while [26, 27] used the external VI-characteristic of the converter to obtain a more accurate value of the steady-state DC fault current. However, due to the non-linearity of the converter, the analytical expression of the transient fault current in the AC feeding stage has not yet been obtained [22,23,24, 28].

Fig. 2
figure 2

VSC and MMC converter fault equivalent circuits in the three stages of the pole-to-pole fault

4.1.2 Fault development process on the DC line

When a DC line fault occurs, the transient voltage and current traveling waves generated at the fault point propagate to both sides of the line, as shown in Fig. 3. When these traveling waves reach the converter, they will be superimposed with the fault current generated by the converter, resulting in complex fault characteristics [29].

Fig. 3
figure 3

Schematic diagram of the fault traveling waves

Efforts have been made to reveal the fault developments on DC lines. Reference [30] analyzed the influence of different line models on the transient fault current and determined the initial transient current for different network topologies. Reference [29] studied the effect of line distributed capacitance on fault currents. Considering boundary conditions, a single line fault current expression considering the line distribution parameters was obtained in [31], while reference [32] studied the influence of adjacent lines and obtained the expression for the transient fault current in multi-terminal DC girds. These methods are effective in determining the initial transient fault current, but as the number of traveling wave reflections increases, their accuracy decreases.

4.2 Time-domain and frequency-domain characteristics

4.2.1 Time-domain characteristics

The time-domain characteristics of fault voltage and current have three main types: amplitude characteristics, rate of change characteristics, and polarity characteristics.

Because of the discharging of the capacitor, the DC voltage drops and DC current rises in the first stage of the fault. For the DC voltage, as shown in Fig. 3, ignoring the attenuation effect of the DC line and considering the internal fault F1, the traveling wave generated by the fault point will directly propagate to both ends of the line, and the DC voltage Vm and Vn will have a vertical jump. However, when the external fault F2 occurs, the traveling wave will first be attenuated by the reactor installed at the end of the line, resulting in lower rates of change of Vm and Vn. Therefore, the rates of change of DC voltages can be used to distinguish internal faults and external faults. By contrast, for DC fault current, its amplitude and change rate are not only influenced by the fault location but also the fault resistance, and thus cannot be used to construct the primary protection criteria. Nevertheless, the polarities of the fault current at different fault locations differ. In Fig. 3, for fault F1, the direction of the fault current is from the bus to the DC line, which is opposite to that of the external fault F2. Thus, this feature can be used to determine the fault direction.

4.2.2 Frequency-domain characteristics

In this section, the frequency-domain characteristics of the DC voltage, the DC fault current, and the differential current will be analyzed.

The multiple reflection process of traveling waves has brought complex fault transients to the DC voltage. In [28], the voltage of the DC bus during a metallic fault is analyzed by Fourier Transform, and the results show that the DC voltage contains significant high-frequency components. These high-frequency components can be easily blocked by the current limiting reactors, which makes the voltage of the internal faults and external faults present different frequency-domain characteristics. In [33, 34], the expression of the ratio of the DC voltages across the current limiting reactor is obtained. The magnitude-frequency curve of the expression shows that the high-frequency component of the voltage is attenuated while the low-frequency component is strengthened, and this feature can be used to distinguish the fault line.

Reference [30] analyzed the frequency-domain characteristics of the fault current at each fault stage, and presented the converter’s fault current waveforms and its frequency domain response, as shown in Fig. 3. It pointed out that the fault current before converter blocking is mainly the attenuated period component, whose frequency as given in (1) depends on the capacitor discharge circuit parameters. Also, because of the commutation of diodes, there will be a significant sixth harmonic component in the fault current after converter blocking. This can provide a theoretical basis for the construction of distance protection after blocking [31]

$$ f=\frac{\sqrt{\left(1/ LC\right)-{\left(R/2L\right)}^2}}{2\pi } $$
(1)

where, R, L, and C are the resistance, inductance, and the capacitance of the equivalent fault circuit, respectively.

The fault differential current of line MN in Fig. 1 can be expressed as:

$$ {i}_{diff}={i}_m-{i}_n $$
(2)

It is seen that for the external fault F2, the main component of the differential current idiff is the distributed capacitor discharge current, whose frequency is related to the natural frequency of the traveling wave and is usually higher than 1 kHz. Whereas for the internal fault F1, idiff is mainly the fault current, and its frequency depends on the capacitor discharging circuit and is generally below 1 kHz. Therefore, it is possible to use the frequency characteristics of the differential current to distinguish internal and external faults [35].

Research on the time-frequency characteristics of faults further reveal the differences between internal and external fault characteristics, which lays a good theoretical foundation for the establishment of fault identification criteria. However, these studies are mostly qualitative analysis and lack of quantitative calculation of fault time and frequency quantities, thereby brings great difficulty in protection setting.

5 Protection principle

The protection principle of flexible DC grid is a key point of the research. At present, the research on fault detection and identification of AC system is mature, but the fault characteristics of flexible DC grid are different. Therefore, the traditional AC protection principle is difficult to be applied in flexible DC grid. On the one hand, the energy storage element in VSC makes the DC fault current rise rapidly; on the other hand, the network topology expands the influence range of DC faults. In addition, the vulnerability of power electronics equipment also makes the requirements for protection more stringent. The requirement of speed and selectivity of protection principle have become two key issues that need to be addressed in flexible DC grid. In the existing research, the protection methods of flexible DC grids can be roughly divided into several schemes of fault detection and identification such as traveling wave protection, voltage and current protection, boundary protection, pilot protection, and protection-control coordination protection.

5.1 Traveling wave protection

Traveling wave protection is commonly adopted in DC grids. When a DC line fault occurs, voltage and current traveling waves will be generated at the fault point and propagate to both sides of the line at approximately the speed of light. Due to its high speed, it has good application in practical engineering.

In engineering application, travelling wave protection is usually based on abundant characteristics, such as amplitude, polarity, propagation time. These electrical quantities can be extracted and analyzed by corresponding mathematical methods to detect or identify the fault. However, most corresponding protection schemes have no immunity to large transition resistance and noise disturbance.

In [36, 37], a traveling wave detection method based on two terminal electrical quantities was proposed. By calculating the time interval of traveling wave propagation between two terminals, the fault type and location can be accurately determined. However, it will be limited by communication delay, while compared with a traditional HVDC system, the flexible HVDC grids require a higher protection speed. A method based on rate of current change to identify fault section was proposed in [38, 39], but extreme cases like bus-bar fault can cause mal-operation and the reliability needs to be raised. Reference [40] used the rate of voltage change to identify the fault, though the method has the problem of poor immunity to large transition resistance.

Reference [41] uses the Manhattan distance matrix to find the accurate measurement combination, and then verifies the accuracy and robustness of the results by the fault traveling wave front arrival time and network topology. Reference [42] finds the symmetry point in the network topology according to the position of the traveling wave recorder and uses the linear least square method to solve the fault equation. This method does not need to consider the complexity of the DC network topology, that is, it does not need to consider the number of looped networks and radiation lines. In reference [43], according to the sequence results of the DC line topology and the moment when the fault traveling wave arrives at each travel-wave recorder, the lines that do not meet the sequence order are excluded from all the lines. The methods mentioned above enhanced reliability, but principle and calculation are complex

Traveling wave protection is widely used, mainly because it has abundant fault information. In addition, the fault transient traveling wave propagates to the two ends of the line at the speed of light, which can enable high speed fault identification. However, the ability of immunity to transition resistance, noise and lightning disturbance is poor. Reliability and sensibility are still not fully met in the flexible DC grid.

5.2 Voltage and current protection

When a short-circuit fault occurs on a DC line, it will cause some characteristic changes such as voltage drop and sharp current rise with oscillation. The voltage and current on both sides contain rich fault information, so the amplitude, change rate and polarity can be used as the basis for fault identification.

The rate of voltage change of the DC line is used in reference [44] for fault identification and location. However, the threshold setting needs to consider all faults in the protection area, and it is difficult to ensure its selectivity. Reference [45] proposed a protection scheme based on the overcurrent, which is only suitable for special DC systems such as low-voltage and small-scale micro-grids. Reference [46] proposed a DC line protection method that measures both the voltage differential value and the voltage amplitude level, which can be used as backup protection in a practical flexible DC project. Reference [47] also proposed a backup protection algorithm based on quick-change detection (QCD). The protection based on local information measurement is applicable to various scenarios. It can realize a fast and accurate backup protection function and ensure high system reliability.

The advantage of the detection method based on voltage and current changes is that the fault characteristics can be better captured in the time domain or the frequency domain, and the operation speed is fast based on single terminal. Meanwhile, the scheme based on multi-point measurement of voltage and current can also raise the reliability. Therefore, it can be a suitable choice as the main or backup protection for a flexible DC grid. However, in a multi-terminal system, it is sometimes difficult to ensure selectivity by only depending on the single-terminal electrical quantity, and it can be easily affected by some factors such as transition resistance.

5.3 Boundary protection

A large capacitor is configured in parallel at the DC side of two-level VSC and current limiting reactors are configured at the end of DC lines in MMC system, as shown in Fig. 4. The nature of boundary characteristic is when the fault occurs at one side of boundary, the fault characteristics measured has the significant difference compared with another side. Usually, high-frequency component is impeded by the boundary. According to the results of spectrum analysis in Fig. 5, the DC voltage contains a large high-frequency component.

Fig. 4
figure 4

The diagram of current limiting reactor boundary

Fig. 5
figure 5

Spectrum of DC voltage

It is more common to use the characteristics of the boundary conditions themselves, such as voltage information of inductor and current information of capacitor, to extract fault characteristic quantities. The protection scheme constituted by the above ideas is called a transient protection scheme based on boundary conditions. Reference [48] proposed an event-based multi-terminal protection scheme, in which each protection unit uses the transient current fault identification method and artificially constructs line reactor to autonomously identify the event type. Compared with traditional data-based protection schemes, it does not need communication and clock synchronization, but it does depend on the designed structure of the boundary. Reference [49] proposed a fault detection method based on the first-order and second-order differentials of the transient DC fault current, but it can’t effectively identify high-impedance faults and has only been tested in a small scale DC grid. Reference [50] proposed a protection scheme based on the rate of the voltage change of the current-limiting reactor. Since the rate of the inductor voltage change is the second derivative of the current, the protection method has a better ability to endure the transition resistance. Meanwhile, compared with the scheme based on rate of current change, the bus voltage is relatively unchanged at the beginning of the fault, so it can provide a greater reliability of fault identification. It can quickly identify faults without the need for two-terminal communication, but the setting scheme is relatively complex. Reference [51] proposed a protection scheme using transient voltage on the DC transmission line. As the boundary of the DC line, the inductor provides a high-impedance path for high-frequency component. This will cause a significant difference in the ratio of the transient voltage on both sides of the inductor in situations of internal fault and external fault. Compared with the methods based on rate of voltage change and current change, it has superiority in sensibility and ability to endure transition resistance.

The boundary protection has stronger capability of immunity to the disturbance of noise and large transition resistance. Meanwhile, this will not require additional equipment configured in the system. However, it also increases the complexity when the algorithm like wavelet is used to extract the high frequency component. Moreover, transient quantities are susceptible to nonlinearity. In brief, single-ended boundary protection can be usually used as one of the primary protection.

5.4 Pilot protection

Pilot protection can be divided into current differential protection and directional pilot protection. The current differential protection needs to detect the difference of current between the two ends of the line to identify the fault. The directional pilot protection uses the electrical quantities at both ends of the line as the basis for fault identification. The signal propagation will affect the speed of protection. In order to reduce the influence of communication delay on the speed of protection, reference [52] shortens the signal propagation time by setting multiple photocurrent sensors on the DC line and adopts distributed detection of differential current to achieve high speed fault detection. It can effectively identify internal and external faults and provide backup for each other in the event of element failure to increase reliability, but the economic cost is high. Reference [53] uses the energy of the difference of current to decrease the influence of distributed capacitors, but the problem of protection delay still exists. Reference [54] takes the DC reactors at both ends of the line in the DC grid as the boundary, analyzing the difference in the transient characteristics of the DC voltage on both sides of the line between internal and external faults. But the high frequency component is extracted by wavelet algorithm, which increases the computation of protection. Reference [55] uses a linear process after the moment of the fault to identify the internal and external faults based on the direction of the fault component. The protection has the advantage in speed, but it will be affected by nonlinear interference such as converter blocking. On this basis, reference [56] proposed a pilot protection scheme based on full current, which is no longer affected by non-linear processes and increases the reliability of protection.

To sum up, pilot protection in flexible DC grid will depend on the electrical quantities at both ends of the line, and communication delay can’t be avoided, so the protection speed has no superiority compared with single-ended protection. However, it has advantages on selectivity and sensibility. Current differential protection and directional pilot protection can be backup protection, but directional pilot protection is also taken as primary protection under some situations.

5.5 Protection-control coordination

In a multi-terminal flexible DC grid, complex control can interfere with protection. However, the controllable characteristics of the converters can also be used to provide a stable and reliable characteristic signal for protection, thereby improving the reliability and accuracy of protection discrimination, as shown in Fig. 6. It can also provide technical support for high speed and selective identification of faults.

Fig. 6
figure 6

The protection of active signal injection

Many studies have mentioned the use of highly controllable converters such as DC-DC and full-bridge MMC to detect and locate faults by injecting characteristic signals into the fault point after fault. Reference [57, 58] proposed controlling the switching mode by using the DC/DC boost converter to change the modulation frequency of the converter in a photovoltaic grid-connected DC system after fault. The converter at each port is transformed into an injection source with characteristic signal for fault calculation and detection. However, it is only applicable to similar multiport DC aggregation systems and is difficult to apply to long-distance transmission systems. Reference [59] combined a half-bridge MMC and hybrid circuit breaker, and used the hybrid circuit breaker to actively inject a voltage pulse signal to the fault line to determine the fault property and achieve high speed restart. In [60], the MMC submodule control is changed briefly to inject voltage pulses into the DC line, and the fault location is achieved by measuring the pulse emission and arrival time. This method can be applied by both half-bridge and full-bridge MMC.

The research the above on fault detection and location methods by changing the control mode of converter is still in the preliminary stage, which can only be carried out in the steady state process after the fault, and it is difficult to meet the requirement of speed of protection. For multi-terminal systems, the port where the signal is generated and the type of signal should also be considered.

6 DC fault clearing

After identifying the faulty line accurately, it is also necessary to quickly interrupt the fault current to avoid further damages. In AC systems, the fault current can be cleared by opening the mechanical switch at the zero-crossing point of the fault current. However, the DC system fault current does not have a zero-crossing point, which makes the DC fault clearing a major problem in DC system protection.

By now, scholars have carried out many studies on how to clear the DC fault current, and three types of DC fault interruption methods have been proposed: 1) using AC circuit breaker, 2) using DC circuit breaker, 3) using converters with fault current blocking capabilities.

Once a fault occurs, the converter can be regarded as a huge fault power source (usource) and is continuing discharging to the fault point, as shown in Fig. 7a. According to Ohm’s law, the fault current (ifault) can be reduced by reducing the fault voltage ufault or by increasing the resistance rfault. Correspondingly, there are two approaches to realize DC fault clearing. One is the step-down of the fault source scheme (where the fault source is directly set to zero or negative voltage), and the other is the reverse blocking EMF (Electromotive Force, EMF) scheme (in which the fault sources still exists but a reverse blocking EMF with an amplitude higher than that of the fault source will be generated to interrupt the fault current). The three existing fault clearing methods are basically realized through these two approaches.

Fig. 7
figure 7

Schematic diagram of DC fault clearing method

6.1 Approach 1: change the fault source

6.1.1 Fault clearing scheme based on AC circuit breaker

In many commissioned DC projects, the DC fault clearing is realized using the AC circuit breaker [61]. After fault inception, the converters at both ends of the line are first blocked, and then the AC circuit breaker is opened at the zero-crossing point of the AC current, which is equivalent to setting the fault source ufault to zero, as shown in Fig. 7b. Finally, the remaining fault energy will be dissipated by the resistance of the fault circuit, and the fault current ifault will naturally drop to zero.

The well-known handshake fault clearing method in [62] realized the selective isolation of faulty lines by combining the above scheme and the DC fast mechanical switches. In order to prevent the overcurrent injected by the AC side from damaging the anti-parallel diodes of the converter, a single thyristor switch (SSTS) is parallel to the diodes, helping to transfer the fault current to the thyristor [61, 63]. References [64, 65] further optimized the above scheme, and proposed a double-thyristor switch (DTSS)-based scheme and grid double-thyristor switch (GTSS)-based scheme. Both the DTSS scheme and the GTSS scheme can separate the AC grid and the DC side apart, which has not only set the fault power supply to zero but also protected the inverter from the AC power supply at the same time.

Using the AC circuit breaker has the advantages of economy and maturity. However, reducing the fault source (usource) by cutting off the AC side will cause power outages of the system. Also, the residual fault energy on the DC side is only dissipated by the small resistance of the fault current loop, leading to a quite long fault clearing time.

6.1.2 Fault clearing scheme based on converter self-clearing

The fault clearing scheme based on the converter self-clearing interrupts DC fault currents by converting the fault source to a negative voltage with an optimized submodule/converter topology.

So far, scholars have proposed a variety of sub-module structures with fault clearing capabilities. For example, the well-known full-bridge sub-module [66], and clamped double-sub-module [67], as shown in Fig. 8. After the sub-module is blocked, the fault current will flow through the anti-parallel diode to charge the sub-module, and the inverter outputs a negative voltage to the outside, forcing the fault current to decrease. The schematic diagram of the fault circuit is shown in Fig. 7c. Based on this basic principle, there are multiple sub-modules with the same functions. Such as the tandem-twin module in [68], the enhanced self-resistance submodule in [69], the cross-connected submodule in [70], the hybrid submodules in [71], the diode-clamped submodule in [72], the diode clamped twin module in [73], etc.

Fig. 8
figure 8

Submodules with fault blocking capabilities

Modification of the submodules to be equipped with fault blocking capability usually also brings higher cost and operating loss to the converter [74, 75]. After blocking, the energy of the fault circuit will be used to charge the capacitor of the submodule, which can easily cause the capacitor voltage of the submodule to exceed the voltage limit. Besides, if the reverse EMF provided by the converter is lower than the AC power EMF, it may cause the AC power to feed into the fault point again [76].

Recently, some optimized MMC topologies with fault clearing capabilities have also been proposed. In order to achieve a balance between the DC fault clearing capabilities and the economic performance, reference [77] proposes a half-bridge submodule and full-bridge submodule cascade hybrid scheme, which can generate reverse EMF without blocking. In reference [78], the branch with current interruption ability is connected across the upper and lower bridge arms of the converter so that the converter has the fault current blocking ability.

Compared with the fault clearing scheme based on the AC circuit breaker, the fault clearing scheme based on converter self-clearing has significantly shortened the fault clearing and recovery time. However, it requires setting the DC side voltage to zero or negative voltage through blocking or modulation, which will affect the normal operation of other healthy lines connected by the converter.

6.2 Approach 2: generating a reverse blocking EMF

The fault-clearing scheme based on the DC circuit breaker uses the DC circuit breaker to generate an EMF with the polarity opposite to that of the fault source to clear the fault current. This generated reverse EMF may be connected in series with the fault source or in parallel, as seen in Fig. 7d and e.

6.2.1 Fault clearing scheme based on DC circuit breakers

For DC circuit breakers installed at both ends of the line, the reverse EMF (uinterrupt) is established across the breaker after receiving the tripping command, as seen in Fig. 7d. At the same time, the surge arrester or the non-linear resistance (rinterrupt) in the circuit breaker is inserted into the fault circuit, which makes the fault current ifault drop rapidly. During the fault isolation, the DC voltage will not drop to zero, so it will not affect the operation of the healthy part of the system. Therefore, this method has been considered as the most promising fault clearing scheme.

At present, three kinds of DC circuit breakers have been proposed: 1) mechanical circuit breakers (MCBs), 2) pure solid-state circuit breakers (SSCBs), 3) hybrid circuit breakers (HCBs) [49, 50]). Among them, the HCB is preferable because of its low on-state loss comparable to the MCB and its fast-effective fault current interruption capability comparable to the SSCB [79, 80]. In 2012, ABB company launched its first hybrid DC circuit breaker, which can breaks DC currents up to 16kA in 2.25 ms [81]. Plenty of industrial and academic researches related to the HCB have been carried out since then [82,83,84]. At present, the HCB has been initially applied in the actual flexible DC project.

Nevertheless, the high cost of the HCB greatly limits its large-scale application. Researchers have made many efforts to reduce the costs of a single HCB in recent years. Fault current limiters [85] and fault current limiting strategies [86] were studied to reduce the size and the capacity required for a single HCB. Topologies using fewer and cheaper semiconductors are proposed [87]. Optimized topologies of converters [88] and DC switch yards [58] are also investigated to help to improve the economic performances of the HCB.

Considering that the main cost of HCB lies in its main breaker branch, scholars has proposed multiport DC circuit breakers, which use a shared main breaker to interrupt DC currents of multiple lines [22,23,24,25,26,27,28,29,30]. The typical topology of the multiport hybrid DC circuit breaker is shown in Fig. 9. The branch selector-n can be used to block the fault current of the port-n. Based on this idea, reference [89] uses a string of unidirectional series thyristor as the selector, and then significantly reduces the costs. In reference [90, 91], a unidirectional bypass branch is used as the selector. However, they cannot interrupt the DC bus fault current. To ensure bidirectional current interruption, reference [92] uses a circuit consisting of a bidirectional MB cell in series with a thyristor string with anti-parallel diodes as the selector while reference [93,94,95] uses half the number of MB cells and bidirectional bypass branches [96] as the selectors. However, the cost of the multiport HCB in [93, 94] is still high, and the breaker in reference [95] is not extendable.

Fig. 9
figure 9

Topology of multiport hybrid DC circuit breaker

With the expansion of the DC network scale, the cost advantages of employing the multiport HCB will become more obvious. However, the major problem remains to be solved in developing a multiport HCB is to achieve lower costs and higher reliability without losing any current interruption capability. Aiming at this goal, reference [97] proposed a ring-connected multiport HCB, which has the same fault interruption performance as the two-port hybrid DC circuit breaker, whereas the cost is lower than 1/3 of the two-port DC circuit breaker.

Additionally, the reliability of the DC circuit breaker should also be concerned. Due to the complicated structure, the failure rate of existing DC circuit breakers is much higher than that of traditional AC circuit breakers [98]. Among the three types of existing DC circuit breakers, the full-solid DC breaker and the hybrid DC circuit breaker with more semiconductor switches have lower reliability than the mechanical DC breaker. In [99], the reliability of the AC distribution networks and DC distribution networks were evaluated and compared. The results show that the low reliability of the DC circuit breaker is one of the main reasons for the low reliability of the DC distribution network [98, 100], and it should be further studied.

6.2.2 Fault clearing scheme of the separation converter and line

To coordinate the converter and DC circuit breaker, and reduce the overall cost of the fault clearing equipment, reference [101, 102] proposed a converter and line separated fault clearing method. The basic idea is to separate the converter and the fault line apart by a bypass branch and then create an energy dissipation branch for the converter and the fault line, respectively. The energy dissipation branch can generate parallel reverse EMF both at the outlet of the converter and the line, respectively, so as to reduce the fault current, as shown in Fig. 7e.

However, the above fault clearing scheme also needs to redesign the converter, which may affect the performance of the converter. In addition, when the parallel reverse EMF at the outlet of the converter will also affect the normal operation of other healthy lines.

7 Fault recovery

The development trend of multi-terminal flexible DC systems using overhead lines puts forward higher requirements for DC fault isolation. For system recovery after faults in a flexible DC system, CIGRE and many research institutions have proposed a variety of solutions, which can be divided into two technical routes [103]: one is based on the protection technology of converters without fault self-clearing ability and DC circuit breakers, while the other is based on converters with fault self-clearing ability and an ultrafast disconnector switch (UFD). The former realizes fast isolation, removal and system recovery of faulty lines through the quick tripping and closing of DC circuit breakers. In contrast, the latter mainly prevents the continuous discharge of capacitors inside the converter and blocks the short circuit current feed of the AC system by blocking the converter or changing its control mode, and realizes the isolation of faulty lines and restoration of the system through the mechanical switch in the DC lines. In terms of ensuring continuous and reliable transmission of active power in the DC grid, the advantages of adopting the protection technology of converters without fault self-clearing capabilities and DC circuit breakers are obvious.

Whether the DC system can restore operation successfully after a fault mainly depends on the identification of fault properties and the isolation of faulty lines. For temporary faults, appropriate recovery methods should be adopted to quickly restore normal operation after the faults are cleared. For permanent faults, the converter should be blocked for line maintenance. At present, researches for fault recovery of flexible DC systems can be classified into four categories: fault identification schemes based on AC circuit breakers [104, 105], converters [106,107,108,109,110,111], DC circuit breakers [112,113,114], and electrical quantities after a fault [115, 116].

The system recovery strategy based on the “handshake method” and the coordination between an AC circuit breaker and a DC switch was proposed in [104, 105]. This strategy can isolate the fault area without relying on communication, which can reduce the system cost. However, this method may interrupt the power of a non-faulty bus during the fault clearing phase, leading to low power supply reliability. What’s more, the performance is poor because the AC circuit breaker operating time is so long that the fault current cannot be interrupted in time.

The converter-based recovery strategy is also called restart strategy. In [106,107,108], a converter with fault clearing capability was operated in a specific mode after fault isolation, then the characteristics of the fault can be judged by DC voltage, DC current and bridge arm current. Reference [109,110,111] put damping modules in series inside the converter. Through coordinating the mechanical switch, the arm current limiter, the DC circuit breaker and the control of the converter station, the impact of the secondary inrush current can be limited. However, such a fault recovery scheme has limited applications for its converter structure, and when used in a DC power grid, the blocking of the converter valves will disable the whole converter station, expanding the scale of power failure and resulting in low reliability.

A system recovery scheme based on a hybrid DC circuit breaker identifies the fault properties by detecting the existence of the fault current bypassing the arrester energy-absorption branch [112,113,114]. The DC circuit breaker can directly break the fault current through disconnecting the faulty line at high operation speed. Providing the power electronic devices are in their safe operating region, it can ensure the normal operation of non-faulty lines without blocking the converter, solving the problem of DC fault clearing and isolation in flexible DC systems and improving system availability. However, there is a problem of blind recovery in the process as the DC circuit breaker automatically recloses after waiting for a fixed deionizing time to restore operation.

The fault property identification scheme based on electrical quantity information recognizes the fault property by constructing the relationship between the characteristic quantity and the fault state. This method largely reflects the inherent mechanism of fault development. A fault property identification scheme based on the residual voltage characteristic of the faulty pole was proposed in [115]. The fault property recognition was realized by detecting the induced voltage of the healthy pole on the faulty pole after the fault. In [116], characteristic signals were injected into healthy lines and the identification of fault property is realized by detecting the coupling characteristic signals of the faulty poles. However, the above-mentioned fault identification methods based on electrical quantities are only applicable to unipolar short-circuit faults in symmetrical bipolar DC systems.

After a DC line fault occurs, according to the operating status of the circuit breaker, the subsequent discharge process can be divided into the following three stages:

  1. (1)

    The stage before MOAs break down.

When the fault has just occurred, the circuit breaker has not yet acted, and the fault current is mainly composed of superposition of the valve-side feed current and the distributed capacitor discharging current, as shown in Fig. 10a, where Zequal represents the equivalent internal impedance of the MMC and Lline represents the inductance of the current limiting reactors.

  1. (2)

    The stage after MOAs break down.

Fig. 10
figure 10

DC line fault discharge process

The circuit breaker trips and as the current change rate (di/dt < 0) sharply reverses on the current limiting reactor (Lline), the circuit breaker and the arrester withstand a large transient segmented voltage, leading to the arrester being quickly broken down. The discharge circuit is shown in Fig. 10b.

  1. (3)

    The stage after the MOAs restore insulation.

With the release of energy at the valve side, the arrester terminal voltage drops rapidly and the insulation is restored. After that, the fault current is mainly the capacitor discharge current distributed to the ground by the line, as shown as Fig. 10c.

In fact, during the 100-200 ms after the circuit breaker trips, the distributed capacitance of the DC line to the ground will discharge continuously. This stage contains a large amount of valuable information. If the electrical characteristics quantities representing the moment when the fault disappears can be extracted, it becomes possible to build an adaptive reclosing strategy.

8 Protection technology prospects

In the aspect of DC fault analysis, although scholars have done a lot of research on DC fault development process and the fault characteristics, the influence of the multiple factors such as the converter blocking on the fault transients and the quantitive determination of the fault quantities have not been paid much attention. To complete the researches on DC fault analysis, future studies may focus on the following aspects: 1) fault mechanisms across the whole fault development process still need to be improved and optimized, among which the fault analysis after converter blocking is a significant supplement. 2) Study on the quantitative calculation method of fault time-frequency electric quantities so as to describe the internal and external fault characteristics more accurately and to provide theoretical basis for protection setting.

In terms of protection principle, the exiting protection schemes have their own adaptive scenarios. In the future, multiple schemes should cooperate to form the primary and backup protection. Voltage and current protection, traveling wave protection, and protection based on boundaries can be used as primary protection because they can ensure the speed of protection; the current differential protection and direction pilot protection can be an important supplement due to the good selectivity and sensibility. The goal is to realize the scientific cooperation of main and backup protection. The protection scheme based on the protection-control cooperation can maximize the use of controllable equipment such as converters for fault detection, which provides a new direction for the following research.

In terms of DC fault clearing, the scheme based on DC circuit breaker is still the first choice. Several companies have developed prototypes of DC circuit breakers that meet the requirements of DC systems, and have been successfully used in actual DC projects. However, the cost and the reliability of the DC circuit breaker may be the main obstacles to its large-scale commercial application. The future research can be carried out from two aspects: focusing on reducing the cost of the DC circuit breaker and increasing its reliability.

In terms of fault recovery, the future research should pay more attention to the pre-judgment of fault nature and adaptive reclosing strategy. When the fault is permanent, HCBs should not reclose to avoid a secondary impact on the system. By analyzing the discharge process of the fault point before and after the trip of HCBs, the viewpoint of identifying the fault nature by using the residual voltage characteristics along the line after HCBs’ energy consumption path restores insulation is proposed, which is instructive for the subsequent research on adaptive reclosing of flexible DC systems.

9 Conclusion

With many advantages, the flexible DC grid is of great significance to the development of smart grid. However, the protection and fault handling are the keys to the promotion and application of flexible DC grid. In order to promote future research of DC grid protection, this paper reviews the recent literatures about fault analysis, protection principle, fault isolation and recovery. Their advantages and disadvantages are presented. The development directions of DC grid protection research are recommended:

  1. (1)

    Analytical expressions of the transient electrical quantities across the whole fault development process can be further investigated.

  2. (2)

    The combination of various protection schemes constitutes the primary protection, which has the strong immunity to fault resistance and disturbance. And the coordination with backup protection should be further strengthen, realizing the integrity of protection for DC grid.

  3. (3)

    More attention should be paid to improving the economics and reliability of the existing fault clearing methods.

  4. (4)

    Fault recovery should focus on identifying the fault nature and detecting the temporary fault disappearing moment. Making full use of the transient electrical quantity after HCB trips can realize the adaptive reclosing more efficiently.