Machine-Learning-Based Real-Time Economic Dispatch in Islanding Microgrids in A Cloud-Edge Computing Environment
Machine-Learning-Based Real-Time Economic Dispatch in Islanding Microgrids in A Cloud-Edge Computing Environment
Abstract—The paradigm of the Internet of Things (IoT) and construction cost [2]. The power generation intermittency of
cloud-edge computing plays a significant role in future smart massively penetrated distributed renewable power sources and
grids. The data-driven solution integrating the artificial intel- demand-side load randomness brings about technical chal-
ligence functionalities brings novel methods to address the
nontrivial task of economic dispatch in microgrids in the presence lenges to the stable and efficient operation for the energy
of uncertainties of renewable generations and loads. This arti- management system (EMS) of microgrids.
cle proposes a learning-based decision-making framework for the Islanding microgrids are often deployed to provide power
economic energy dispatch of an islanding microgrid based on the supply in remote locations (e.g., offshore islands and geo-
cloud-edge computing architecture. Cloud resources are utilized graphically remote areas) that lack electricity supply from the
to solve the optimal dispatch decision sequences over historical
operating patterns. It can be considered as a sample labeling pro- power utilities. The islanding operation mode of microgrids
cess for the supervised training that can implement the complex imposes a stringent requirement to maintain the dynamic
mapping of input–output space through an advanced machine balance of power generation and demand to ensure system
learning model. Then, the well-trained model can be adopted stability in the presence of various operational uncertainties.
locally at edge computing devices keeping the long-term param- The real-time energy dispatch aims to meet this require-
eters unchanged for implement the real-time microgrid energy
dispatch. The key benefit of the proposed solution is that it effec- ment through appropriate management of the energy storage
tively avoids the prediction of multiple stochastic variables and system (ESS) and controllable generators, e.g., diesel sources,
the design of sophisticated regulation strategies or reward pol- considering the uncertainties introduced by the intermittent
icy functions for real-time dispatch. The solution is extensively renewable sources and demands. Due to the limited avail-
assessed through simulation experiments by the use of real data ability of energy resources in the islanding microgrid, the
measurements for a set of operational scenarios and the numer-
ical results validate the effectiveness and benefit of the proposed economic dispatch is of paramount importance to minimize
algorithmic solution. the operational cost by improving the global energy utilization
efficiency while meeting the system operational constraints.
Index Terms—Data-driven control, economic dispatch,
machine learning, optimal dispatch. However, the optimal/economic energy dispatch in an
islanding microgrid is considered a nontrivial task due to
the system operational uncertainties. The power generations
I. I NTRODUCTION of renewable sources, e.g., WTs and PVs, are stochastic and
nondispatchable. Also, the power demand in such small-scale
ICROGRID is a small-scale power system consisting of
M a collection of power generation sources, storage units
and different loads that can operate in either grid-connected
microgrid often exhibits a higher degree of uncertainty com-
pared with that of the utility grid. The optimal energy dispatch
is often formulated as a multistage stochastic optimization
mode or islanding mode [1]. It enables efficient integration
problem [1]. Two key issues need to be addressed for an
of different forms of renewable distributed generators, e.g.,
economic dispatch solution: the computational complexity
micro wind turbines (WTs) and solar photovoltaic units (PVs),
of the real-time dispatch and the coordinated scheduling in
to reduce carbon emission, power loss, and infrastructure
an optimal way over multiple time periods. To address the
Manuscript received November 13, 2020; revised January 27, 2021 challenge of energy economic dispatch with operational uncer-
and February 28, 2021; accepted March 17, 2021. Date of publication tainties (e.g., intermittent generation and fluctuant demand),
March 22, 2021; date of current version August 24, 2021. This work was the existing solutions are mainly proposed based on two frame-
supported in part by the Natural Science Foundation of China under Grant
51777183. (Corresponding author: Qiang Yang.) works: 1) the model predictive control (MPC)-based energy
Wei Dong and Qiang Yang are with the College of Electrical Engineering, dispatch and 2) the Markov decision process (MDP)-based
Zhejiang Lab, Zhejiang University, Hangzhou 310027, China (e-mail: energy dispatch.
dong_w@zju.edu.cn; qyang@zju.edu.cn).
Wei Li and Albert Y. Zomaya are with the Australia–China Joint Research Based on stochastic optimization approaches, the MPC-
Centre for Energy Informatics and Demand Response Technologies, Centre for based framework can be adopted to implement the rolling
Distributed and High Performance Computing, School of Computer Science, optimization in the forthcoming transient horizon [3]. Also,
The University of Sydney, Camperdown, NSW 2008, Australia (e-mail:
weiwilson.li@sydney.edu.au; albert.zomaya@sydney.edu.au). considering the advantages of both long-term horizon of
Digital Object Identifier 10.1109/JIOT.2021.3067951 optimization and low uncertainty of real-time dispatch, a
2327-4662
c 2021 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.
See https://www.ieee.org/publications/rights/index.html for more information.
Authorized licensed use limited to: INTERNATIONAL ISLAMIC UNIVERSITY. Downloaded on May 17,2022 at 11:39:21 UTC from IEEE Xplore. Restrictions apply.
13704 IEEE INTERNET OF THINGS JOURNAL, VOL. 8, NO. 17, SEPTEMBER 1, 2021
TABLE I
A NALYSIS OF THE R ELATED MPC AND MDP-BASED S OLUTIONS
two-stage model consisting of the day-ahead optimal schedul- Table I briefly summarizes the pros and cons of MPC and
ing using the prediction information and real-time operation MDP-based solutions.
with elaborate regulation strategies has been exploited and In recent years, along with the advances of smart grid
developed. In [4], a two-layer coordinated energy management development, the adoption of Internet-of-Things (IoT) devices
approach was proposed. The solution carried out an economic has brought tremendous data during the system operation.
operation scheduling in the day-head optimization layer and Due to the limited capacity of computation and storage, the
reduced the power regulation cost in the real-time layer to remote cloud computing facilities are often used to process
resolve the prediction errors. The study in [5] proposed a the data introduced from the field IoT devices. However, such
double-layer coordination approach in which the commitment a centralized paradigm results in significant communication
of generations and the prediction of reserve boundaries were delay and overhead. To overcome this issue, edge comput-
obtained in the day-ahead stage. Then these results can be ing brings the cloud in proximity to the IoT devices [12].
regulated through the successive approximation model in real- Unlike the conventional centralized cloud computing archi-
time. In [6], the upper layer optimized the look-ahead dispatch tecture, the cloud-edge paradigm enables the edge computing
for individual generators, and a Lagrangian formulation was devices to carry out the decision-making tasks for timely
implemented as the corrective strategy considering the oppor- energy management with significantly reduced communication
tunity cost of discharging and the marginal cost of charging. delay and congestion [13] [14]. Thus, the cloud-edge orches-
However, it should be noted that the MPC-based solutions tration/cooperation is considered a promising and efficient
often require accurate prediction of renewable generation and architecture to improve the energy management performance
power demand as well as the sophisticated designs of real-time in microgrids.
regulation strategies. The rolling optimization approach also Fig. 1 illustrates the proposed cloud-edge computing archi-
requires higher computing capacity for the local controller. tecture for the energy management of islanding microgrids.
To address the randomness and uncertainties induced by the Such an EMS architecture with ubiquitous sensing and com-
multiple stochastic variables, economic dispatch in microgrids puting devices effectively drives the exploitation and deploy-
is often considered and modeled in the MDP framework [7]. ment of more advanced operation and control functionali-
In most recent years, reinforcement learning (RL) has been ties for microgrids. The edge computing units can obtain
widely adopted as an effective solution for solving MDP the system operational states through device-to-edge com-
without any a priori knowledge or effort of stochastic munication with field devices, and make available to the
modeling [1]. In [8], a Q-learning-based algorithmic solu- cloud when necessary through edge-to-cloud communication.
tion was developed to implement the economic dispatch that Artificial intelligence (AI)-based solutions can be adopted in
relaxed the global information requirement and balanced the the intelligent orchestration in such a cloud-edge computing
exploration and exploitation in a distributed way. The study architecture [15]. Sufficient computation and storage resources
in [9] proposed a consensus Q-learning for decentralized gen- are available on cloud service to support the exhaustive anal-
eration commitment by exchanging the Q-value matrices and ysis of the massive historical data through a model training
exploiting the prior knowledge to accelerate the algorithm con- process. The well-trained model can be further made avail-
vergence rate. In [10], an RL-based economic dispatch solution able to the edge computing units in the field to fulfill the local
was proposed that combined the value-function approximation management functionalities in the microgrids.
using the distributed optimization based on multiplier splitting. The machine learning techniques are widely considered
However, the RL-based solutions that use the value function efficient for identifying the underlying patterns and knowl-
or policy function from the Bellman equation to achieve long- edge from the large volume of historical data. Our previous
term optimization can often fail into the local optimum. Also, work [16] comprehensively reviewed the state-of-the-art
a set of sophisticated learning policies and useful features are machine learning techniques and applications in smart grid,
generally needed to avoid high-dimensional state space [11]. e.g., forecasting of renewable generation forecasting [17] and
Authorized licensed use limited to: INTERNATIONAL ISLAMIC UNIVERSITY. Downloaded on May 17,2022 at 11:39:21 UTC from IEEE Xplore. Restrictions apply.
DONG et al.: MACHINE-LEARNING-BASED REAL-TIME ECONOMIC DISPATCH IN ISLANDING MICROGRIDS 13705
Authorized licensed use limited to: INTERNATIONAL ISLAMIC UNIVERSITY. Downloaded on May 17,2022 at 11:39:21 UTC from IEEE Xplore. Restrictions apply.
13706 IEEE INTERNET OF THINGS JOURNAL, VOL. 8, NO. 17, SEPTEMBER 1, 2021
where pd (t) and pb (t) are the dispatched power of the diesel where pmin
b and pmax
b are the lower and upper bounds of the
generator and the output power of the battery in time slot power output of the ESS, respectively.
t, respectively. When it is discharging, pb (t) is positive, and The dispatch constraint represents the dispatch ranges of
when it is charging, pb (t) is negative. SOH(t) is the state of decision variables. For the diesel generators, the generation
health of the battery in time slot t. Set T is the set of time limit constraint and minimum up/down time constraints are
slots for the long-term objective in global optimization. These given, respectively, as follows:
costs are, respectively, given as
pmin < pd (t) < pmax (9)
d d
Cd (pd (t)) = ad · (pd (t)) + bd · pd (t) + cd + · · ·
2
(rrt−1 − Ton )(I(pd (t − 1)) · I(pd (t))) ≥ 0
(10)
+ sd · I(pd (t)) · (1 − I(pd (t − 1))) (2) (zzt−1 − Toff )(I(pd (t)) · I(pd (t − 1))) ≥ 0
Cb (pb (t), SOH(t)) = ρb · (SOH(t) − SOH(t + 1)) (3) where pmin and pmax are the lower and upper bounds of the
d d
where ad , and cd are the generation cost coefficients of the output generation of the TUs, respectively. rrt is continuously
diesel generator [22]. sd is the unit startup cost coefficient. working time of unit at time t, zzt is continuously shutdown
I(·) is the indicator function to indicate if the diesel unit is time of unit at time t; Ton is the minimum up time of the unit,
turned on in each time slot. ρb is the degradation cost coeffi- Toff is the minimum down time of the unit.
cient for ESS. Furthermore, ESS life degradation due to battery
charging and discharging cycles can be calculated using (4) as III. P ROPOSED DATA - DRIVEN D ISPATCH S OLUTION
given in [23]: In this work, a data-driven machine-learning-based solu-
tion in a cloud-edge computing architecture is developed for
SOH(t + 1) = SOH(t) − h(t) · SOH(t) (4)
real-time economic dispatch in an islanding microgrid. The
where h(t) is the degradation factor, which is related to the main idea behind it is as follows. First, a large number of
change of battery state of charge (SOC), given as historical data measurements are adopted to simulate real-
−1 word operation patterns. A collection of data samples are
h(t) = αh · (SOC(t))βh + ηh (5) selected to construct an offline optimization problem accord-
where αh , βh , and ηh are the degradation parameters related to ing to the system model presented in Section II. Through the
the battery specifications and the values can be obtained from optimization solver tool, the optimal decision sequence with
the empirical tests. SOC(t) is calculated by the charging or the long-term optimization horizon (e.g., 1 day) is obtained,
discharging power to label the learning targets of corresponding training sam-
ples. The selected features containing operating information
Eb , pb (t) ≤ 0
−cb pb (t) · t (e.g., power generations of renewable energy sources, SOC of
SOC(t) = (6)
−pb (t) · t db Eb , pb (t) > 0 ESS, and demand of loads) are used as the input variables.
where cb and db are the charging and discharging coefficients A random forest regression model is applied to fit the labeled
determined by battery charging/discharging efficiency charac- samples in a supervised learning fashion to realize the complex
teristics. t is the time interval. Eb is the battery capacity of mapping between operating pattern space and dispatch deci-
the ESS. In this work, the values of SOC and SOH are both sion space. Finally, the well-trained machine learning model
normalized to the interval [0, 1], and SOC is limited to the can address the real-time decision-making inference of the
range of [0.2, 0.8] in operation to avoid the deterioration effect economic dispatch in a new operating pattern. The proposed
due to deep charging and discharging. data-driven solution is designed to be self-adaptive through
carrying out rolling training to update the inference model
B. Constraints once new operational patterns are available. The generaliza-
tion ability and effectiveness of the model can be enhanced
The operation constraints of optimal energy management through such a rolling learning process at the cloud during
include power balance constraints, capacity constraints, and the long-term microgrid operation. The proposed machine-
dispatch constraints, as given in (7)–(10). The power balance learning-based solution in cloud-edge computing architecture
constraint requires that the generated power by all kinds of and the rolling learning process is illustrated in Fig. 3.
source generations minus the load demands equal the power
absorbed or released by the ESS, i.e.,
A. PSO-Based Optimization Process
pw (t) + pp (t) + pd (t) − pl (t) = −pb (t) (7) The data-driven optimal decision sequence labeling pro-
cess can be reduced to the typical optimal dispatch of a unit
where pw (t), pp (t), and pl (t) are the power of WTs, PVs, and
commitment (UC) problem under certain conditions. Many
loads, respectively, in time slot t which are nondispatchable
optimization solvers are available in the literature to effectively
variables. For the economic dispatch of islanding microgrid,
address the optimization problem, e.g., dynamic programming
the power dispatch of the diesel generator pd (t) is an active
(DP) [24], mixed-integer linear programming [25], Lagrangian
decision variable, and the power dispatch of the battery pb (t)
relaxation [26], and some other commercial solvers such
is passively determined by (7).
as CPLEX. In addition to them, the meta-heuristic algo-
The capacity constraint of the batteries are satisfied to
rithms are considered effective tools to solve such noncon-
b < pb (t) < pb
pmin max
(8) vex optimization problems (e.g., [27]). The particle swarm
Authorized licensed use limited to: INTERNATIONAL ISLAMIC UNIVERSITY. Downloaded on May 17,2022 at 11:39:21 UTC from IEEE Xplore. Restrictions apply.
DONG et al.: MACHINE-LEARNING-BASED REAL-TIME ECONOMIC DISPATCH IN ISLANDING MICROGRIDS 13707
Fig. 4. Renewable generation and load profiles of the training data set.
Authorized licensed use limited to: INTERNATIONAL ISLAMIC UNIVERSITY. Downloaded on May 17,2022 at 11:39:21 UTC from IEEE Xplore. Restrictions apply.
13708 IEEE INTERNET OF THINGS JOURNAL, VOL. 8, NO. 17, SEPTEMBER 1, 2021
Authorized licensed use limited to: INTERNATIONAL ISLAMIC UNIVERSITY. Downloaded on May 17,2022 at 11:39:21 UTC from IEEE Xplore. Restrictions apply.
DONG et al.: MACHINE-LEARNING-BASED REAL-TIME ECONOMIC DISPATCH IN ISLANDING MICROGRIDS 13709
Fig. 8. Comparison of the operational and cumulative costs for 15 test days.
TABLE II
C OMPARISON OF THE AVERAGE C OSTS ($) OF D IFFERENT
S OLUTIONS OVER T EST DAYS
Fig. 7. Energy dispatch of diesel generator and battery SOC variation over
five consecutive days. (a) Proposed solution. (b) Ideal dispatch solution.
slightly lower cost. For the benchmark solutions, i.e., day-
ahead scheduling with two levels of prediction errors, the
the machine learning model obtained in this work has excel- efficient coordination between the battery-based ESS and dis-
lent generalization ability and robustness that can make the patchable diesel generators can be hardly achieved, and the
inference close to the optimal energy dispatch decision for the prediction errors can significantly degrade the energy dispatch
test days. performance, as confirmed by the results given in Fig. 8 and
To validate the machine-learning-based solution, the com- Table II.
parison of the operational costs in each day and their cumu- The average costs of different dispatch solutions for 15 test
lative curves obtained from the day-ahead scheduling with days are compared and presented in Table II. It shows that
different prediction errors, ideal dispatch without prediction the proposed solution is lower than the benchmark solutions
errors, and our proposed solution are shown in Fig. 8. It can be in each cost, and performance close to the ideal solution.
observed that all the evaluated solutions present a similar ten- The machine-learning-based solution can achieve the eco-
dency on operational and cumulative costs. It implies that for nomic improvement of 5.8% and 11.0% on average over the
the same operating patterns with the same state information, day-ahead scheduling with small and large prediction errors,
these solutions achieve similar basic goals for many situations. respectively. It is demonstrated that the proposed solution can
For instance, they are capable of absorbing renewable energy achieve efficient and real-time economic in the multienergy
generation and ensure the power supply reliability for local microgrid in the presence of different forms of uncertainties.
demand. Their similar tendencies indicate that these solutions
all attempt to realize the optimal strategy.
The results show that the proposed solution can achieve V. C ONCLUSION
a cost close to the ideal dispatch, and even slightly lower In this article, a learning-based framework in the cloud-edge
than that optimized without uncertainties in some days. This is computing environment is proposed to address the challenge
because, for nonconvex optimization problems, it is still diffi- of economic dispatch in islanding microgrids by considering
cult to achieve an absolute global optimal solution even under the stochastic nature of renewable generations and local loads.
certain conditions. The proposed machine-learning-based solu- The proposed data-driven solution leverages an external cloud
tion makes appropriate decisions in the real-time stage based service to label and generates the optimal decision samples
on the knowledge and experience learned in the histori- of a dispatch sequence through the heuristic algorithm. These
cal pattern, to realize other locally optimal solutions with a samples containing the different operating patterns are used in
Authorized licensed use limited to: INTERNATIONAL ISLAMIC UNIVERSITY. Downloaded on May 17,2022 at 11:39:21 UTC from IEEE Xplore. Restrictions apply.
13710 IEEE INTERNET OF THINGS JOURNAL, VOL. 8, NO. 17, SEPTEMBER 1, 2021
the supervised training process of a machine learning model. [16] M. S. Ibrahim, W. Dong, and Q. Yang, “Machine learning driven smart
The well-trained model can be deployed in the edge devices electric power systems: Current trends and new perspectives,” Appl.
Energy, vol. 272, Aug. 2020, Art. no. 115237.
to carry out the real-time dispatch task locally in the island- [17] W. Dong, Q. Yang, and X. Fang, “Multi-step ahead wind power genera-
ing microgrid. The numerical results obtained from extensive tion prediction based on hybrid machine learning techniques,” Energies,
simulation experiments confirm that the proposed solution out- vol. 11, no. 8, p. 1975, 2018.
performs the existing stochastic optimization methods with a [18] M. Rafiei, T. Niknam, J. Aghaei, M. Shafie-Khah, and J. P. S. Catalao,
“Probabilistic load forecasting using an improved wavelet neural
similar cost to the ideal dispatch scenario. network trained by generalized extreme learning machine,” IEEE Trans.
For future work, two research directions are considered Smart Grid, vol. 9, no. 6, pp. 6961–6971, Nov. 2018.
worth further research effort. The proposed learning-based [19] I. Dusparic, A. Taylor, A. Marinescu, F. Golpayegani, and S. Clarke,
“Residential demand response: Experimental evaluation and compari-
framework needs to be further assessed in the multi- son of self-organizing techniques,” Renew. Sustain. Energy Rev., vol. 80,
energy microgrids consisting of additional energy sources and pp. 1528–1536, Dec. 2017.
demands (e.g., cooling and heating) through a trail study. [20] Y. Zhao, T. Li, X. Zhang, and C. Zhang, “Artificial intelligence-based
Additional effort is needed to develop more advanced and fault detection and diagnosis methods for building energy systems:
Advantages, challenges and the future,” Renew. Sustain. Energy Rev.,
interpretable machine learning models, e.g., graph neural vol. 109, pp. 85–101, Jul. 2019.
network, through combining the domain knowledge. [21] W. Dong, Q. Yang, X. Fang, and W. Rua, “Adaptive optimal fuzzy
logic based energy management in multi-energy microgrid consider-
ing operational uncertainties,” Appl. Soft Comput., vol. 98, Jan. 2021,
Art. no. 106882.
R EFERENCES [22] T. A. Nguyen and M. L. Crow, “Stochastic optimization of renewable
based microgrid operation incorporating battery operating cost,” IEEE
[1] W. Liu, P. Zhuang, H. Liang, J. Peng, and Z. Huang, “Distributed Trans. Power Syst., vol. 31, no. 3, pp. 2289–2296, May 2016.
economic dispatch in microgrids based on cooperative reinforcement [23] A. Hoke, A. Brissette, K. Smith, A. Pratt, and D. Maksimovic,
learning,” IEEE Trans. Neural Netw. Learn. Syst., vol. 29, no. 6, “Accounting for lithium-ion battery degradation in electric vehicle charg-
pp. 2192–2203, Jun. 2018. ing optimization,” IEEE J. Emerg. Sel. Topics Power Electron., vol. 2,
[2] L. Mariam, M. Basu, and M. F. Conlon, “Microgrid: Architecture, policy no. 3, pp. 691–700, Sep. 2014.
and future trends,” Renew. Sustain. Energy Rev., vol. 64, pp. 477–489, [24] A. Rong, H. Hakonen, and R. Lahdelma, “A dynamic regrouping based
Oct. 2016. sequential dynamic programming algorithm for unit commitment of
[3] G. Hug, S. Kar, and C. Wu, “Consensus+innovations approach for dis- combined heat and power systems,” Energy Convers. Manag., vol. 50,
tributed multiagent coordination in a microgrid,” IEEE Trans. Smart no. 4, pp. 1108–1115, 2009.
Grid, vol. 6, no. 4, pp. 1893–1903, Jul. 2015. [25] L. Yang, J. Jian, and Y. Wang, “Projected mixed integer programming
[4] Q. Jiang, M. Xue, and G. Geng, “Energy management of microgrid formulations for unit commitment problem,” Int. J. Elect. Power Energy
in grid-connected and stand-alone modes,” IEEE Trans. Power Syst., Syst., vol. 68, no. 68, pp. 195–202, 2015.
vol. 28, no. 3, pp. 3380–3389, Aug. 2013. [26] Q. Jiang, B. Zhou, and M. Zhang, “Parallel augment Lagrangian relax-
[5] X. Lei, T. Huang, Y. Yang, Y. Fang, and P. Wang, “A bi-layer multi- ation method for transient stability constrained unit commitment,” IEEE
time coordination method for optimal generation and reserve schedule Trans. Power Syst., vol. 28, no. 2, pp. 1140–1148, May 2013.
and dispatch of a grid-connected microgrid,” IEEE Access, vol. 7, [27] J. Zhao, S. Liu, M. Zhou, X. Guo, and L. Qi, “Modified cuckoo search
pp. 44010–44020, 2019. algorithm to solve economic power dispatch optimization problems,”
[6] Z. Zhang, J. Wang, T. Ding, and X. Wang, “A two-layer model for IEEE/CAA J. Automatica Sinca, vol. 5, no. 4, pp. 794–806, Jul. 2018.
microgrid real-time dispatch based on energy storage system charg- [28] J. G. Vlachogiannis and K. Y. Lee, “A comparative study on parti-
ing/discharging hidden costs,” IEEE Trans. Sustain. Energy, vol. 8, cle swarm optimization for optimal steady-state performance of power
no. 1, pp. 33–42, Jan. 2017. systems,” IEEE Trans. Power Syst., vol. 21, no. 4, pp. 1718–1728,
[7] J. Wu and X. Guan, “Coordinated multi-microgrids optimal control algo- Nov. 2006.
rithm for smart distribution management system,” IEEE Trans. Smart [29] W. Dong and M. Zhou, “A supervised learning and control method to
Grid, vol. 4, no. 4, pp. 2174–2181, Dec. 2013. improve particle swarm optimization algorithms,” IEEE Trans. Syst.,
[8] F. Li, J. Qin, and W. X. Zheng, “Distributed Q-learning-based Man, Cybern., Syst., vol. 47, no. 7, pp. 1135–1148, Jul. 2017.
online optimization algorithm for unit commitment and dispatch in
[30] W. Dong and Q. Yang, “Data-driven solution for optimal pumping units
smart grid,” IEEE Trans. Cybern., vol. 50, no. 9, pp. 4146–4156,
scheduling of smart water conservancy,” IEEE Internet Things J., vol. 7,
Sep. 2020.
no. 3, pp. 1919–1926, Mar. 2020.
[9] X. S. Zhang, Q. Li, T. Yu, and B. Yang, “Consensus transfer Q-learning
[31] J. Kennedy and R. Eberhart, “Particle swarm optimization,” in Proc.
for decentralized generation command dispatch based on virtual gen-
IEEE Int. Conf. Neural Netw., vol. 4. Perth, WA, Australia, 1995,
eration tribe,” IEEE Trans. Smart Grid, vol. 9, no. 3, pp. 2152–2165,
pp. 1942–1948.
May 2018.
[10] P. Dai, W. Yu, G. Wen, and S. Baldi, “Distributed reinforcement learning [32] L. Breiman, J. Friedman, C. J. Stone, and R. A. Olshen, Classification
algorithm for dynamic economic dispatch with unknown generation cost and Regression Trees. Hoboken, NJ USA: CRC Press, 1984.
functions,” IEEE Trans. Ind. Informat., vol. 16, no. 4, pp. 2258–2267, [33] L. Breiman, “Random forest,” Mach. Learn., vol. 45, pp. 5–32,
Apr. 2020. Jan. 2001.
[11] V. Mnih et al., “Human-level control through deep reinforcement [34] A. Millner, “Modeling lithium ion battery degradation in electric
learning,” Nature, vol. 518, pp. 529–533, Feb. 2015. vehicles,” in Proc. IEEE CITRES, Sep. 2010, pp. 349–356.
[12] C. Gong, F. Lin, X. Gong, and Y. Lu, “Intelligent cooperative edge [35] P. Li, D. Xu, Z. Zhou, W.-J. Lee, and B. Zhao, “Stochastic optimal oper-
computing in Internet of Things,” IEEE Internet Things J., vol. 7, no. 10, ation of microgrid based on chaotic binary particle swarm optimization,”
pp. 9372–9382, Oct. 2020. IEEE Trans. Smart Grid, vol. 7, no. 1, pp. 66–73, Jan. 2016.
[13] F. Liang, W. Yu, X. Liu, D. Griffith, and N. Golmie, “Toward edge-based [36] M. Gao, J. Li, F. Hong, and D. Long, “Day-ahead power forecasting
deep learning in industrial Internet of Things,” IEEE Internet Things J., in a large-scale photovoltaic plant based on weather classification using
vol. 7, no. 5, pp. 4329–4341, May 2020. LSTM,” Energy, vol. 187, Nov. 2019, Art. no. 115838.
[14] S. Kulkarni et al., “Enabling a decentralized smart grid using [37] R. Doherty and M. O’Malley, “A newapproach to quantify reserve
autonomous edge control devices,” IEEE Internet Things J., vol. 6, demand in systems with significant installed wind capacity,” IEEE Trans.
no. 5, pp. 7406–7419, Oct. 2019. Power Syst., vol. 20, no. 2, pp. 587–595, May 2005.
[15] Y. Wu, “Cloud-edge orchestration for the Internet-of-Things: [38] S. H. Karaki, R. B. Chedid, and R. Ramadan, “Probabilistic performance
Architecture and AI-powered data processing,” IEEE Internet Things assessment of autonomous solar-wind energy conversion systems,” IEEE
J., early access, Aug. 7, 2020, doi: 10.1109/JIOT.2020.3014845. Trans. Energy Convers., vol. 14, no. 3, pp. 766–772, Sep. 1999.
Authorized licensed use limited to: INTERNATIONAL ISLAMIC UNIVERSITY. Downloaded on May 17,2022 at 11:39:21 UTC from IEEE Xplore. Restrictions apply.
DONG et al.: MACHINE-LEARNING-BASED REAL-TIME ECONOMIC DISPATCH IN ISLANDING MICROGRIDS 13711
Wei Dong (Student Member, IEEE) received the Wei Li (Senior Member, IEEE) received the Ph.D.
B.S. degree from the Huazhong University of degree from the School of Information Technologies,
Science and Technology, Wuhan, China, in 2017. The University of Sydney, Camperdown, NSW,
He is currently pursuing the Ph.D. degree with Australia, in 2012.
the College of Electrical Engineering, Zhejiang He is currently a Research Fellow with the Centre
University, Hangzhou, China. for Distributed and High Performance Computing,
His research interests include complex system School of Computer Science, The University of
modeling, machine learning and optimization, and Sydney. His research interests include Internet of
data-driven-based control. Things, edge computing, sustainable computing, task
scheduling, and energy efficiency and optimization.
Dr. Li is the recipient of four IEEE or ACM con-
ference best paper awards. He received the IEEE TCSC Award for Excellence
in Scalable Computing for Early Career Researchers in 2018 and the IEEE
Outstanding Leadership Award in 2018. He is a Senior Member of the IEEE
Computer Society and a member of ACM.
Authorized licensed use limited to: INTERNATIONAL ISLAMIC UNIVERSITY. Downloaded on May 17,2022 at 11:39:21 UTC from IEEE Xplore. Restrictions apply.