1 Introduction

One of the key operational activities in the power system is to schedule power production according to the predicted load demands. Dynamic economic and environmental dispatches (DEED) are both crucial objectives in this scheduling task because a small percentage improvement may potentially bring significant cost savings and operational improvements [1, 2]. The goal of economic dispatch is to minimize the cost by determining the power production of thermal power plant units, while managing system constraints, balancing power production and load demand, and meeting plant operational requirements, e.g. ramp rates. Similarly, the objective of environmental dispatch is to minimize emissions under the same system constraints. Both dispatch problems are difficult to solve due to the non-smooth non-convex DEED formulations in cost functions and constraints, especially for large power systems [3].

Fast development of renewable power sources and changes in load demand bring more planning and operational uncertainties to the grid [4]. Plug-in electric vehicles (PEVs) with high penetrations are potentially important participants in the power system due to the additional large load requirements [5, 6]. The stochastic charging of PEVs may also significantly affect the distribution grid as well as increase the generation costs and pollutants emissions if not managed efficiently. It is therefore of importance to measure the impact of different PEV charging scenarios on the power system and intelligently schedule and dispatch power generation using an optimized DEED system approach.

In this paper, four different PEV charging scenarios are modeled using charging time probability distribution, based on PEV charging data from Electric Power Research Institute (EPRI), a full peak charging scenario and a full off-peak charging scenario and a stochastic charging scenario. These four charging time probability distributions are measured with a certain number of charging PEVs and integrated in the power demand of a 5-unit system and a 15-unit system respectively. Both the economic and environmental impacts are evaluated by solving the dynamic dispatch problems.

The objective functions and constraints of DEED problems are significantly non-smooth and non-convex due to factors like the valve point effects and exponential nature of emission output curve. Meta-heuristic approaches have consequently been employed to solve these problems. Numerous meta-heuristic algorithms have been used to solve the economic dispatch problem such as particle swarm optimization (PSO) [7], differential evolution (DE) [8], harmony search (HS) [9], biogeography-based optimization (BBO) [10], and imperialist competitive algorithm (ICA) [11], etc.

Based on our previous work [12], a new variant of teaching learning based optimization (TLBO), namely self-learning teaching-learning based optimization (SL-TLBO) is proposed and adopted to solve DEED problems. Some state-of-the-art variants of TLBO have been numerically compared to 10 well-known benchmarks and DEED problem for 5-unit and 15-unit power systems respectively. The dispatch results of four charging scenarios are also comparatively studied. The results show the significance of the new SL-TLBO scheduling method when applied to both benchmarks test and two scales of DEED test systems in terms of both the convergence speed and accuracy. For the four charging profiles, the comparative studies show that the off-peak charging scenario is the most economical and an environmental friendly choice.

2 Problem formulations

The DEED problem is a multi-objective problem combining the economic dispatch objective denoted as F1 and environmental dispatch objective denoted as F2. This two-objective problem can also be transformed into a single objective problem as

$$ {\rm min} \;F = \omega_{ee} F_{1} + (1 - \omega_{ee} )F_{2} $$
(1)

where ω ee is a weighting factor, being a constant between 0 and 1. The overall cost F denotes the single objective to be minimized. In terms of the formulations of F 1 and F 2 , both non-linear models for economic and environmental dispatch are considered.

2.1 Dynamic economic load dispatch model

The dynamic economic load dispatch problem is to minimize the total economic cost of the fossil fuel in a whole day time. The decision variable is the dispatched power P i,t in each time interval t. The problem is formulated as

$$ \begin{aligned} F_{1} =& \sum\limits_{t = 1}^{T} {\sum\limits_{i = 1}^{{N_{u} }} {F_{i} (P_{i,t} )} } = \sum\limits_{t = 1}^{T} {\sum\limits_{i = 1}^{{N_{u} }} {[(a_{i} + b_{i}P_{i,t} + c_{i}P_{i,t}^{2} )]} } \\ &+ \left| {e_{i} \sin (f_{i} (P_{imin } - P_{i,t} ))} \right| \\ \end{aligned} $$
(2)

where a i , b i and c i are the fuel cost coefficients of the ith generator; e i and f i the fuel cost coefficients for evaluating ripples in the cost curve caused by the valve-point effect; and F1 accumulates the cost of N u generators in totally T intervals. In addition to the cost functions, there are several system constraints associating with the objective functions as follows.

  1. 1)

    Power output limits:

$$ P_{i\hbox{min} } \le P_{i,t} \le P_{i\hbox{max} } (i = 1,2, \ldots ,N_{u} ) $$
(3)

where the power output should be within the capacity of each specific power generator P imax and P imin .

  1. 2)

    Power balance limits:

$$ \sum\limits_{i = 1}^{{N_{u} }} {P_{i,t} = P_{D,t} + P_{L,t + } + L_{ev,t} } \;\;\;t = 1,2, \ldots ,T $$
(4)

The total power generated in each time interval should meet the power load demand in the corresponding time period. In this paper, the power demand constraint considers the original load demand P D,t in the time interval t, associated with the transmission losses P L,t and the PEV charging load L ev,t . This PEV charging load is a new load type and will be further addressed in Section 3. The transmission losses are also considered and approximated with the widely used B-coefficients method [13] denoted as

$$ \begin{gathered} \hfill P_{L,t} = \sum\limits_{i = 1}^{{N_{u} }} {\sum\limits_{j = 1}^{{N_{u} }} {P_{i,t} } B_{i,j} P_{j,t} + \sum\limits_{i = 1}^{{N_{u} }} {B_{0,i} } P_{i,t} + B_{00} } \\ \hfill t = 1,2, \ldots ,T \\ \end{gathered} $$
(5)

where B i,j , B0,i and B00 are loss coefficients. The handling approach is implemented by the method proposed in [14].

  1. 3)

    Ramp rate limits:

$$ \left\{ \begin{gathered} P_{i,t} - P_{i,t - 1} \le UR_{i} \hfill \\ P_{i,t - 1} - P_{i,t} \ge DR_{i} \hfill \\ \end{gathered} \right. $$
(6)

Thermal generators are subject to the power ramp rate limitation that the power outputs cannot dramatically change between two adjacent intervals. The D Ri and U Ri are the ramp-up and ramp-down rate limits of the ith generator respectively. The dispatched power of a generator in the tth time interval P i,t should be limited by the previously dispatched power P i,t−1 at time interval t − 1 within ramp-up and ramp-down rate limits D Ri and U Ri .

2.2 Dynamic environmental load dispatch model

The environmental load dispatch problem minimizes the emissions of environment pollutants including sulphur dioxide (SO2) and nitrogen oxides (NOx) [15]. A quadratic polynomial formulation is associated with an exponential term to model the emissions as

$$ \begin{aligned} F_{2} &= \sum\limits_{t = 1}^{T} {\sum\limits_{i = 1}^{{N_{u} }} {F_{i} (P_{i,t} )} } = \sum\limits_{t = 1}^{T} {\sum\limits_{i = 1}^{{N_{u} }} {[(\alpha_{i} + \beta_{i}P_{i,t} + \gamma_{i}P_{i,t}^{2} )]} } \\ \; &\quad + \eta_{i} \exp (\delta_{i} P_{i,t} ) \\ \end{aligned} $$
(7)

where α i , β i , γ i , η i , and δ i are the emission coefficients of the ith generation unit. The power output limit, balance limit as well as the ramp rate limit is also taken into consideration in forming the constraints.

3 Plug-in electric vehicle load profiles

Plug-in electric vehicles are an unusual power demands and unlike traditional household and industry loads, the simultaneous charging of the 20 kW household chargers and 120 kW superchargers will possibly form huge ripples or even spikes on the daily power demand curve. Such effects could be avoided by the coordination and scheduling of charging. With the development of smart grid technology the introduction of smart PEV chargers to coordinate and control PEV charging looks highly alike. In this section, four different charging scenarios, including an EPRI predicted profile based on the assuming behaviors of drivers, an off-peak charging profile and a peak charging profile and a stochastic charging profile are modeled to compare and evaluate the impact on both economic and environmental aspects for the power system operation.

3.1 EPRI profile

The Electric Power Research Institute (EPRI) is a US funded non-profit organization founded in 1973. It is one of the leading organisations in the world, which produces publications and reports on the electric power industry. In an environmental assessment of PEVs [16], EPRI proposes an aggregate distribution of charging profiles to assess GHG emissions, where a probability distribution of charging profile is proposed as shown in Table 1.

Table 1 EPRI charging probability distribution

In this charging scenario, over 60% power is delivered in 7 hours in the evening during 22:00 to 4:00. The other time slots see low charging rates and cover the rest of energy delivery.

3.2 Off-peak profile

In [17], two charging scenarios, peak and off-peak charging by assuming flat load demand for PEV charging during peak and off-peak time in the whole Ireland are proposed as a case study. The probability distribution in each hour for the off-peak case is illustrated in Table 2.

Table 2 Off-peak charging probability distribution

This profile assumes three charging levels with 18.5% of power is delivered every hour during 23:00 to 02:00, 9% of power is delivered in each time interval from 02:00 to 04:00 and the rest of charging is completed in 06:00. It is apparently an ideal case that besides the 8 hours charging, other time slots are forbidden for PEVs to get charged.

3.3 Peak profile

Similar to off-peak charging, peak charging assumes a flat load curve with three levels of charging power to describe a certain number of EVs getting charged during the peak load time on the wholesale electricity market. The probability distribution of each hour for the peak case is listed in Table 3.

Table 3 Peak charging probability distribution

This is another extreme case where all the PEV charging power requests are provided during peak time of electricity consumption during daytime from 13:00 to 20:00. The charging period and power in this paper are based on [16], where the peak time for charging is slightly shifted due to the profile of the load fleet.

3.4 Stochastic profile

Considering the uncertainties of drivers’ charging behavior, a stochastic charging profile is proposed in this paper. The stochastic load profile simulates some urgent group charging or distributed fast charging at random time throughout the whole day. The random probability follows the normal distribution with the mean value as 5%. The probability distribution of each hour for the stochastic case is presented in Table 4.

Table 4 Stochastic charging probability distribution

The probability of stochastic charging profile in each hour ranges from 1.1% to 9.7%. It changes randomly regardless of the peak or off-peak load time.

The four different PEV load charging distributions are illustrated in Fig. 1 respectively. These four profiles will impose extra load L ev,t in the power demand constraints (3). The new dispatch problems are tightly constrained and show strong non-convex, calling for more powerful computational tools to solve.

Fig. 1
figure 1

Four different PEV load distributions

4 Self-learning teaching-learning based optimization

Teaching-learning based optimization (TLBO) is a new meta-heuristic algorithm proposed in 2011 and has been utilized in solving some engineering problems [18], [19], [20], [21]. Two phases are designed for each evolution iteration in the original TLBO, named teaching phase and learning phase respectively. Both the convergence speed and exploitation ability of TLBO have been tested on well-known benchmarks and its effectiveness have been confirmed [18, 19]. Though TLBO is powerful in solving many optimization problems, it can be further improved for specific problems. In this paper, a new self-learning phase is incorporated into original TLBO, aiming to continuously improve the exploitation ability.

4.1 Teaching phase

The teaching phase mimics a class teaching process that a teacher shares his/her knowledge to the students. A teacher will be first selected from the whole population by sorting the fitness function value. The deviation between the teacher and the mean of students will be calculated as:

$$ DM_{i} = rand_{1} (T_{i} - T_{F} Mean_{i} ) $$
(8)

where DM i is the value difference in the ith iteration; Mean i the mean value; T i the selected teacher; and T F a teaching factor. According to the original paper of TLBO, the T F can either be 1 or 2 denoted as

$$ T_{F} = round(1 + rand_{2} (0,1)) $$
(9)

Each learner in the class will gain knowledge from the value difference and update themselves as

$$ X_{ij}^{new} = X_{ij}^{old} + DM_{i} $$
(10)

where X new ij and X old ij are the jth old and new learners of ith iteration. The new learners will compete with his/her predecessor and replace them if a better fitness value is achieved.

4.2 Learning phase

Followed the teaching phase, a learning phase provides a chance for each student to learn from a classmate. In this phase, each solution would randomly select another solution to compare the fitness, and update the knowledge storage according to the interaction. The phase is denoted as

$$ X_{ij}^{new} = \left\{ \begin{gathered} X_{ij}^{old} + rand_{3} (X_{ik} - X_{ij} ) \hfill \\ X_{ij}^{old} + rand_{3} (X_{ij} - X_{ik} ) \hfill \\ \end{gathered} \right. \,\begin{array}{c} {f(X_{ik} ) < f(X_{ij} )} \\ {f(X_{ij} ) < f(X_{ik} )} \\ \end{array} $$
(11)

where the jth learner X ij and kth learner X ik are randomly selected from the population in the ith iteration. Through a competition, the initial learner X ij will refresh his/her knowledge based on the deviation of the two learners. Similar with teaching phase, the refreshed student will have to compete with his/her predecessor and the better one will remain in the class.

4.3 Self-learning phase

The original TLBO performs well on weakly constrained or completely unconstrained problems. For strong constrained ill-conditioned optimization problems, the global optimal solutions are sensitive to slight changes. It is likely that the global optimum would be missed out due to the low exploitation ability. A self-learning phase is designed to further exploit the space near the particle position for the promising global optimum. Each student will get a chance to learn from his/her self-surrounding spaces. The self-learning phase is illustrated as

$$ X_{ij}^{new} = X_{ij}^{old} (1 + (rand_{4} - 0.5)\omega ) $$
(12)

where a self-learning weight ω is designed to determine the self-learning range of each particle. The parameter tuning of this factor will be discuss in Section 5.

5 Numerical experiments

In order to evaluate the performance of the new SL-TLBO algorithm, 10 well-known benchmark functions with 30 dimensions from [22] were tested. The SL-TLBO was compared with some TLBO variants including original TLBO, elite TLBO [23], and a modified TLBO [24]. In addition, some other commonly used algorithms which include weighted PSO [25], PSO-CF [26] and classical DE/rand/1/bin [27] are selected for the comparative study.

5.1 Benchmark functions

The 10 benchmark functions are defined in [22] and listed with the dimensions and boundaries respectively as follows.

  • Sphere function (f1): dimension = 30, [−100, 100];

  • Schwefel’s problem 1.2 (f2): dimension = 30, [−100, 100];

  • Rosenbrock function (f3): dimension = 30, [−30, 30];

  • Ackley’s function (f4): dimension = 30, [−32, 32];

  • Griewank function (f5): dimension = 30, [−600, 600];

  • Rastrigin function (f6): dimension = 30, [−5.12, 5.12];

  • Step function (f7): dimension = 30, [−100, 100];

  • Schwefel’s problem 2.21 (f8): dimension = 30, [−100, 100];

  • Schwefel’s problem 2.26 (f9): dimension = 30, [−500, 500];

  • Quartic function (f10): dimension = 30, [−1.28, 1.28].

5.2 Determination of control parameters in SL-TLBO

Though there are no algorithm specific parameters to be tuned in original TLBO, the new SL-TLBO method has introduced a self-learning weighting factor ω to adjust the learning range. It is therefore important to find proper settings of ω. Three benchmarks have been tested with the ω ranging from 0.1 to 10. The maximum generation is set as 100 and the particle number is 30. To eliminate the experimental incidents, 30 different run were employed. The initialization values were randomly generated within the boundary and were taken as the same input for all the algorithms. The searching results are showed in Table 5 with mean values and standard deviations for each parameter setting respectively.

Table 5 Benchmark tests result with different weighting factor

It could be observed that the results get better with the increase of ω in both on f1 and f4. The best results are achieved when ω is 3 in the f1 test and from 1 to 10 in the f4 test. On the other hand, the final results change stochastically with the increase of ω in the f9 test where the best result was achieved when ω is 2. These results show that the performance of the parameter settings for the new SL-TLBO algorithm is problem specific. We choose ω = 3 for the further comparative study on algorithms performance in sub-section 5.3.

5.3 Simulation results and discussions

In this sub-section, the new SL-TLBO algorithm were tested and comparatively studied with some counterparts on the ten benchmark functions tests mentioned in 5.1. To fairly compare the algorithm performance, the number of the function evaluation (FES) is introduced as the iteration criteria. Each generation of PSO and DE method accounts for one FES. The original TLBO has two phases and accounts for double times of function evaluation, whereas the SL-TLBO has one more self-learning phase in each generation and triple times are used to calculate the FES.

In terms of the parameters settings for the algorithms, the population number is 30 while the FES is set as 12,000. The weighted PSO uses c1 = 1, c2 = 3, wmax = 0.9, wmin = 0.4; for PSO-CF, c1 = c2 = 2.05, K = 0.729; for classical DE, F = 0.7, CR = 0.5; for elite TLBO, the elite number is set as 5; the self-learning weighting factor ω is 3; no specific parameters are required to be set for original TLBO and mTLBO. Similarly, 30 different runs were carried out for each benchmark function by each algorithm. The mean values and standard deviations for each test are illustrated in Table 6.

Table 6 Benchmark tests results for different algorithms

From the results, it is shown that the new SL-TLBO performs best in the limited FES among 7 of totally 10 tests. In the tests on f1, f4, f7, and f8, the SL-TLBO significantly outperforms other counterparts. Comparable performance for the new algorithm is displayed on f2, f5, f6. In the cases of f3, f9, and f10, SL-TLBO is outperformed by the original TLBO, mTLBO, and eTLBO, respectively. However, in these three cases, the performances of the four TLBO variants are relatively the same, which indicates that the general problem solving capability of SL-TLBO appears to be satisfactory. Therefore, this new method is utilized in solving the DEED problem with PEV loads.

6 Simulation results on DEED problem and discussions

6.1 Case 1: 5-unit economic dispatch without PEV

In order to show the significance of the new SL-TLBO algorithm, the original 5-unit considering valve-point effect and transmission loss is tested. The FES is set as 60000, and the population number for the SL-TLBO method is 50. The 5-unit economic dispatch benchmark data is taken from [28]. According to the experimental tests, ω is set as 0.05, which is much smaller than the previous setting in benchmark functions tests. This is because the DEED problem is highly constrained due to which a big learning rate will easily cause the violation of constraints. On the other hand, a small learning factor would increase the searching ability in nearby solution space, which is more adaptable for DEED problem to search better solutions in a limited available space.

It can be seen from the Table 7 that the new SL-TLBO can obtain the best results comparing with some previous methods. This method is then utilized in PEV integration analysis.

Table 7 Comparison of total fuel cost over 30 runs (Case 1: 5-unit without PEV)

6.2 Case 2: 5-unit economic dispatch with PEV

In this case, a 5-unit system combined with the four PEV load profiles considering valve-point effect and transmission losses is investigated to dynamically dispatch the generation production from the economic perspective.

Considering the local population and load situation of the benchmark system, an extra load of 30000 different types of PEVs is integrated, where 45% of these PEVs are low hybrid vehicles equipped with 15 kWh batteries. Besides, 25% of PEVs are medium hybrid vehicles using 25 kWh batteries, and 30% PEVs are pure battery vehicles of which the total power are provided by 40 kWh batteries as in Table 8. It is also assumed that 50% SOC are the energy necessity for PEVs in a 24-hour period [32]. The total PEV load for one day is calculated as L ev  = 30000 × (15 × 45 % + 25 × 25% + 40 × 30 %) × 0.5 = 375 MW.

Table 8 Multiple types of PEV

Fig. 2 illustrates the four different load demand profiles with corresponding situations of PEV charging probability in a 24-hour time period. Differences lie on the noon time between 12:00-17:00 and midnight between 22:00-06:00. The peak load has increased significantly in the peak charging situation shown as the purple line in Fig. 2 lasts till the next load valley. On the contrary, the valley load is increased by both off-peak charging and EPRI charging (see green and red line in Fig. 2). The stochastic charging profile generates a small new sub-peak during 6:00 to 9:00 in the morning showed in the blue line again in Fig. 2.

Fig. 2
figure 2

Power load demands with four PEV profiles

The four circumstances are tested by seven algorithms respectively. The particle number is set as 50 and the FES is 20000. The algorithms parameter settings are almost the same with the tests in 5.3, while the only difference lies on the setting of the self-learning weighting factor ω in SL-TLBO as mentioned in case 1.

Table 9 shows the dispatched results of four PEV charging profiles solved by seven algorithms. The SL-TLBO outperforms all its counterparts and achieves the best results on all the four charging profiles. Moreover, comparing the four charging patterns, the off-peak charging costs 46508.86 $/day which is the lowest among all the situations. In contrast, the peak charging costs reach to 47367.17 $/day and becomes the highest cost. The EPRI charging profile ranks the second lowest in terms of the cost and outperforms the stochastic charging behavior which ranks the third place with the costs of 46770.71 $/day and 47158.86 $/day respectively. The off-peak charging profile costs 858.31 $/day lower than the peak charging profile, which implies that under the same charging demand, charging preferentially at off-peak time could save 1.85% in terms of the economic cost.

Table 9 Economic dispatching result for case 1 ($/day)

6.3 Case 3: 5-unit environmental dispatch with PEV

In this case, the environmental emission is considered rather than the economic cost. Similarly, the 5-unit system is employed and the data is taken from [33]. The objective function is presented as (7) and the constraints are the same with economic dispatch in the previous subsection. The PEV number and the power necessity are the same with the aforementioned economic case. All the four charging profiles are also comparatively tested by seven algorithms including wPSO, PSO-CF, classical DE, original TLBO, elite TLBO, modified TLBO, and SL-TLBO. The parameter initializations for these algorithms are the same with case 2.

Table 10 shows the environmental dispatch results for the four different charging profiles solved by seven algorithms. The lowest environmental emission results are still produced by the proposed SL-TLBO method on all the four PEV charging scenarios. Among these charging profiles, the off-peak charging profile again emits the least air pollutants with the amount of 18659.24 lb/day achieved by SL-TLBO, while the peak charging profile produces most emissions with 19227.18 lb/day. Therefore, a reduction of 3.0% emission production with 567.94 lb/day from the generation side would be achieved by shifting the PEV charging time. It is the same situation as in case 2, the EPRI charging profile and the stochastic charging profile rank the 2nd and the 3rd with the emissions of 18820.78 lb/day and 18963.69 lb/day respectively.

Table 10 Environmental dispatching result for case 2 (lb/day)

6.4 Case 4: 5-unit economic and environmental dispatch with PEV

Since the economic aspect and the environmental aspect are both vital for power system dispatch, a combination of these two objectives is also investigated. The ωee in the objective function (1) is defined as 0.5 in this case to comprehensively consider the economic and environmental profit and trade-off the both situations. The 5-unit test system is again utilized of which the load demand and four PEV charging situations remain the same with previous cases. Dealing with the objective function with significant high non-linear characteristics with both sinusoidal and exponential terms, the case would be more complex for algorithms to solve. In order to evaluate the algorithms performances and compare the four charging situations, the seven algorithms with same parameter initializations are again implemented.

Table 11 shows all the dispatching results considering both economic and environmental aspects. Not incidentally, SL-TLBO again achieved the best fitness values. The fitness value of off-peak charging profile is 33924.62 while it is 34731.92 for the peak charging scenario. There is a 2.4% deviation between these two values. If the performances of all the algorithms are compared in all the cases, SL-TLBO always gives the best performance.

Table 11 Economic/environmental dispatching result for case 3 (0.5 · $/day + 0.5 · lb/day)

6.5 Case 5: 15-unit economic dispatch with PEV

The 5-unit system is only a small scale system with 120 variables for a dynamic dispatch in 24-hour period of time. However, dispatching tasks for larger system widely exist. In this case, a 15-unit system is implemented of which the total load demand is 60981 MW [34]. The number of variables is tripled and reaches to 360 in one-day time. To simplify the situation, the valve point-effect is neglected in the objective function as

$$ F = \sum\limits_{t = 1}^{T} {\sum\limits_{i = 1}^{{N_{u} }} {F_{i} (P_{i,t} )} } = \sum\limits_{t = 1}^{T} {\sum\limits_{i = 1}^{{N_{u} }} {[(a_{i} + b_{i}P_{i,t} + c_{i}P_{i,t}^{2} )]} } $$
(13)

where the transmission loss and ramp rate constraints are also considered. The system data is taken from [31]. To evaluate the economic impact of the four PEV charging profiles in this test system, the total PEV number is proportionally increased to 90000. While the PEV types remain the same as in Table 8, resulting in that the total PEV charging power necessity is tripled to 1,125 MW in a single day. This extra PEV charging load is distributed within the four charging scenarios and integrated with the original load demand. The seven algorithms are again employed to solve the large scale problem for comparative study where the specific parameter settings for these algorithms are the same with those in case 2.

The results provided in Table 12 illustrate that the SL-TLBO method again achieved the optimal solutions, outperforming with all the other six algorithms in all the four load scenarios. The lowest cost appears in the off-peak charging scenario with the cost of 780862.82 $/day, while the peak charging scenario costs 1099.09 $/day more than the off-peak one, reaching to 781961.91 $/day.

Table 12 Economic dispatching result for Case 4 ($/day)

7 Conclusion

Dynamic economic dispatch has long been an intractable problem for power system operators and the complexity is ever increasing with new participants such as PEV entering in the equation. In this paper, the non-convex dynamic economic and environmental dispatch has been comparatively investigated with the integrations of various PEV charging scenarios. A new self-learning teaching learning based optimization method is proposed to solve the economic and environmental dispatch problems. A small scale 5-unit system and a large scale 15-unit system are tested in 24-hour time period. Four different PEV charging scenarios including EPRI predicted charging, off-peak charging, peak charging and stochastic charging profiles with different number of PEV have been integrated in the load demands of both systems. The numerical results show that the new SL-TLBO algorithm is a viable alternative approach for solving both small and large scale dynamic dispatch problems and outperforms other popular heuristic methods and state-of-the-art TLBO variants in the tests on well-known benchmarks and DEED problem with proper parameter tuning. In terms of the four PEV charging scenarios, the off-peak charging scenario, as expected, has the advantage in reducing the economic cost and environmental pollutant emissions.

In future studies, renewable energy sources such as photovoltaic panels and wind power as well as the vehicle to grid (V2G) will be introduced in the system dispatch to comprehensively analyze the interaction between the PEV and power systems.