Electronics 09 01277 v2
Electronics 09 01277 v2
Article
Battery Energy Management of Autonomous Electric
Vehicles Using Computationally Inexpensive Model
Predictive Control
Kyoungseok Han 1 , Tam W. Nguyen 2 and Kanghyun Nam 3, *
1 School of Mechanical Engineering, Kyungpook National University, Daegu 41566, Korea;
[email protected]
2 Department of Aerospace Engineering, University of Michigan, Ann Arbor, MI 48109, USA;
[email protected]
3 Department of Mechanical Engineering, Yeungnam University, Gyeongsan 38541, Korea
* Correspondence: [email protected]; Tel.: +82-53-810-2455
Received: 14 July 2020; Accepted: 5 August 2020; Published: 9 August 2020
Keywords: self-driving car; model predictive control; dynamic programming; prediction horizon;
move-blocking; warmstarting
1. Introduction
For the last few years, many experts in the field of automotive research have claimed that
self-driving cars will become commonplace in the near future. Although there are still technical
difficulties for commercializing self-driving cars, at least a certain degree of automation in the vehicles
is expected to be available soon [1,2].
With the advent of new features in self-driving cars, novel approaches to optimize the
energy efficiency of vehicles have become available. In general, human-driver behaviors are
based on their own driving habits on the roads, which are not consistent with the traffic flow [3].
Self-driving cars, in contrast, can plan optimally their trajectories by exploiting real-time traffic
information [4–6]. For example, many short and long-range traffic information using vehicle-to-vehicle
(V2V), vehicle-to-infrastructure (V2I), and, more generally, vehicle-to-everything (V2X) communication
(see Figure 1) provide useful traffic information, which the self-driving cars can effectively use in real
time. For example, the geometry information of the route is available through the pre-loaded maps
and the future traffic density is also available through the V2X technologies. Therefore, unlike human
drivers, self-driving cars can make suitable decisions based on real-time traffic flow so as to optimize
their energy efficiency.
However, since the traffic dynamics are complex and nonlinear, there is a non-negligible
computational burden associated with long-term real-time traffic prediction. Moreover, since the
traffic predicted by the self-driving car in the long run become uncertain (e.g., traffic disturbances,
unexpected changes in human-driver behavior), it is required to choose an appropriate length of the
prediction horizon to effectively exploit the current traffic information.
Although there have been numerous researches that show the benefits of vehicle speed- and
position-trajectory planning in terms of energy efficiency, to the best of our knowledge, practical
aspects, such as the length of the prediction horizon and its relative computational burden, have not
been discussed in the existing literature on self-driving cars. In order to achieve high performance
while ensuring computational feasibility, this paper deals with the above-mentioned practical concerns
for real production vehicles.
Also, in this paper, the battery-electric vehicle (BEV) is assumed to be ego vehicle that needs to be
controlled, so, using connectivity technologies (i.e., V2X), the optimal battery energy management is
our ultimate goal.
optimization and control, the procedure determining the horizon length is omitted, but our approach
starts with this important aspect considering the implementation in real hardware. As compared to the
control results of nominal MPC and DP, the proposed method can reduce the computational burden
significantly while minimizing the performance degradation in terms of the battery state-of-charge
(SOC) reduction.
ṡ = v, (1)
ρAf Cd v2
Tm + Tb
v̇ = i0 − − g sin θ − γg cos θ, (2)
mr 2m
where s ∈ R is the vehicle position, v ∈ R is the vehicle speed, m > 0 is the vehicle mass, Tm ∈ R and
Tb ∈ R are the motor and friction brake torques, respectively, r > 0 is the wheel radius, i0 > 0 is the final
gear ratio, ρ > 0 is the air density, Af > 0 is the frontal area of the vehicle, Cd ≥ 0 is the aerodynamic drag
coefficient, θ ∈ (−π/2, π/2) is the road inclination that influences the energy consumption significantly
but assumed to be zero in this paper, g > 0 is the gravitational constant, and γ ≥ 0 is the coefficient of
rolling resistance. Note that θ is a function of the position, that is, θ (s), and that the rolling friction is
kinetic, that is, γ = 0 for v = 0. In the remainder of this paper, we consider Tb = 0 without a significant
loss of generality.
Electronics 2020, 9, 1277 5 of 19
In actual fact, several parameters that are assumed to be constant such as m and θ influence the
vehicle’s energy efficiency significantly. Also, the other constant parameters vary depending on driving
conditions, so the parameter estimations are needed for the accurate energy-efficiency calculation.
However, for simplicity, this paper does not consider the varying-parameters, but it should be treated
in the future.
To apply digital control methods, the continuous-time mode (Equations (1) and (2)) is discretized
using the Euler-forward method
xkv+1 = xkv + gv ( xkv , uvk ) Ts , (3)
where xkv , [sk vk ]T , uvk , Tm,k , gv : R2 × R → R2 is the vehicle dynamics function vector that
aggregates in a column the right-hand side of Equations (1) and (2), and Ts > 0 is the sampling time.
˙ = − Ib ,
SOC (4)
Cb
where SOC ∈ [0, 1] is the state of charge of the battery, Ib ∈ R is the battery current, and Cb > 0 is the
battery capacity and is assumed to be constant.
The battery current Ib is a function of the open-circuit voltage Voc ∈ R, the battery resistance
Rb > 0, and the battery-power consumption Pb ∈ R as,
p
Voc − 2 − 4R P
Voc b b
Ib = , (5)
2 Rb
where
Pm
+ for Pm ≥ 0,
ηb
Pb = (6)
Pm for Pm < 0,
ηb−
ηb+ ∈ (0, 1) is the battery-depletion efficiency, ηb− > 1 is the battery-recharge efficiency, and Pm is the
motor-output power.
The SOC dynamics is captured more accurately by accounting for the variations of Voc and Rb ,
which are functions of SOC, as illustrated in Figure 3.
13.5 0.25
0.2
13
0.15
12.5
0.1
12
0.05
11.5 0
0 0.2 0.4 0.6 0.8 1
Figure 3. Variation of the open-circuit voltage Voc and internal resistance Rb as functions of
state-of-charge (SOC).
Note that, in practice, the variations of Voc and Rb are relatively small compared to Pb during
driving time. Therefore, the battery-output power Pb mainly contributes to the battery SOC changes
Electronics 2020, 9, 1277 6 of 19
in Equation (4). In general, Pb is used as a control input in the battery dynamics. Note that Pb depends
also on the motor-output power Pm , whose dynamics are described hereafter.
200 0.7
0.75 0.7
0.75
0.7 0.850.8 0.850.8
0.75
0.
56
0.8
150 0.85 0.
85
100 0.9
Motor Torque [Nm]
0.9
0.7
50
0.9
0.85
0.75
-100 0.9
0.7
0.85
-150 0.75 0.8
0.
85
0.65
0.85
0.750.8 0.750.8
-200 0.7 0.7
where gs : [0, 1] × R → [0, 1] is the SOC dynamics function vector, which aggregates as a column the
right-hand side of Equations (4) and (5).
xk+1 = xk + f ( xk , uk ) Ts , (9)
where xk , [sk vk SOCk ]T , uk , Tm , and f , [( gv )T ( gs )T ]T . In this paper, we assume that, for all
k ≥ 0, the state xk is available, and thus, an observer needs not to be built. Based on Equation (9),
the optimization control problem and several optimization methods will be described in Section 3.
Electronics 2020, 9, 1277 7 of 19
N −1
min Jk (Ûk ) =
Ûk
∑ Pb,i (ûi|k , x̂i|k ), (10a)
i =0
s.t.
x̂i+1|k = x̂i|k + f ( x̂i|k , ûi|k ) Ts , (10b)
T
x̂0|k = [sk vk SOCk ] , (10c)
vmin ≤ v̂i+1|k ≤ vmax , (10d)
τmin (v̂i+1|k + δ) ≤ sil+1 − ŝi+1|k ≤ τmax (v̂i+1|k + δ) (10e)
ˆ i+1|k ≤ SOCmax
SOCmin ≤ SOC (10f)
min max
Tm,i (ω̂m,i|k ) ≤ ûi|k ≤ Tm,i (ω̂m,i|k ) (10g)
i = 0, · · · , N − 1, (10h)
where sl is the deterministic position of leader car provided by driving cycle, 0 ≤ vmin < vmax ,
0 < τmin < τmax are the minimum/maximum time headways, δ > 0 is the velocity bound,
SOCmin < SOCmax ∈ [0, 1], and Tm,i min : R → R and T max : R → R are given constraint functions
m,i
at step j that depend on ω̂m,i|k .
Note that the problem formulation (10a)–(10h) is a receding-horizon problem formulation if the
problem must be solved for all k ≥ 0. The terms (10d)–(10f) are the state constraints, and (10g) is the
control constraint. The constraint bounds are reasonable values according to the driving conditions on
US roads [26]. In particular, depending on the ego car’s speed, (10e) indicates the reasonable varying
position constraint between the ego car and leader car.
All simulations presented in this paper are carried out using MATLAB R2020a, Windows 10,
Intel Core i7-9700 CPU @ 3.00 Ghz, and 16 GB RAM. Furthermore, the continuous-time dynamics are
integrated by ode45 and the control is constant between samples.
where j ∈ { N − 1, . . . , 0}. Accordingly, the optimal control policy π for each node x̃i is mapped by
where µi : R3 → R is the mapping function and i = 0, . . . , Ndp − 1. Accordingly, Equation (13) can be
used to find the optimal control input in a forward simulation.
In this paper, we use the generic DP function in Matlab, that is, dpm [30]. An appropriate grid
number for the state and control variables must be specified to avoid the curse of dimensionality.
Doing so, the computation is completed approximately in five minutes while satisfying all constraints,
as illustrated in Figure 5. The entire vehicle speed trajectories for two drive cycles, i.e., WLTC, US06,
are optimized so as to minimize the cost and enforce the constraints. One distinguished feature is
that the speed profile when applying DP is flattened as much as possible to prevent spikes in the
battery current.
The battery SOC consumption for each drive cycle is compared in Table 1, which shows that DP
can significantly improve the energy efficiency of BEV around 14 to 20%. These control performances
are considered ideal, so we will use these as our benchmark. However, as mentioned, DP is not
implementable in real time due to the computational complexity and strong assumptions on command
and disturbance preview.
Electronics 2020, 9, 1277 9 of 19
30
Baseline
DP
20
10
0
0 200 400 600 800 1000 1200 1400 1600 1800
30
20
10
0
0 100 200 300 400 500 600
Figure 5. Comparison of the speed trajectories obtained by dynamic programming (DP) and
human-driver maneuvers with (a) WLTC and (b) US06.
Table 1. Comparison of the battery state-of-charge (SOC) consumption (%) with the speed profiles
obtained by dynamic programming (DP) and nominal speed.
Since the cost function is nonconvex and the dynamics are nonlinear, several local minima may
be presented. However, as DP computes all costs for every node by backward induction, it can be said
that the obtained performance ensures global optimality.
Tm v 2
Pb ≈ Pm ≈ i0 + αTm (14)
r
where α > 0 is a tunable motor parameter. Note that, even though the cost is greatly simplified
compared to the original Pb , Equation (14) is still nonlinear and depends on the value of α, which
modifies the overall control performance significantly.
In this paper, the cost function is further simplified as follows. Since the objective of this study
is to minimize the battery SOC reduction, the fluctuation of Ib should be minimized according to
Equation (4). It also can be interpreted that the sudden changes in the control input u, which cause Ib
Electronics 2020, 9, 1277 10 of 19
fluctuations, should be avoided as much as possible. Therefore, for all k ≥ 0, another minimization of
a quadratic cost in place of Equation (10a) is used by eliminating the first term in Equation (14), that is,
N −1 N −1
min ∑
ûi|k ∈Ui|k i =0
Pb,i ( x̂i|k , ûi|k ) ≈ min ∑
ûi|k ∈Ui|k i =0
û2i|k , (15)
0.8
0.79
0.78
0.77
0.76
0 20 40 60 80 100 120 140 160 180 200
300
200
100
-100
-200
-300
0 20 40 60 80 100 120 140 160 180 200
15
10
0
0 20 40 60 80 100 120 140 160 180 200
Figure 6. Comparison of the simulation results using the different cost functions with N = 10 and
Ts = 1 s. The top plot shows the battery SOC trajectories, the middle plot shows the control input
evolutions, and the bottom plot shows the computation times tc,k at each step k.
In contrast, by exploiting the quadratic form (15), the computation time at each step is significantly
reduced compared to the original form, as shown in the bottom plot of Figure 6.
Electronics 2020, 9, 1277 11 of 19
Therefore, it can be concluded that the nonlinear cost (10a) can be replaced with the quadratic
cost (15) without sacrificing too much performance in terms of the battery SOC reduction.
(a)
3.85
3.8
3.75
5 6 7 8 9 10 11 12 13 14 15
(b)
100
50
0
5 6 7 8 9 10 11 12 13 14 15
Figure 7. (a) Battery SOC reductions with respect to N, and (b) computational burdens with respect to
N. The computational-time constraint violation represents the ratio between the number of occurrences
where tc > Ts and the number of samples of the entire experiment.
It can be concluded that N = 10 is sufficient to optimize the energy efficiency of our BEV, but the
computational burden should be reduced for implementation. A collection of methods to deal with
the computational complexity of MPC is proposed in the next section.
(a)
60
40
20
0
0 50 100 150 200
(b)
60
40
20
0
0 50 100 150 200
(c)
60
40
20
0
0 50 100 150 200
Figure 8. Distance difference between ego car and leader car with (a) Ts = 1 s, (b) Ts = 2 s, and (c)
Ts = 3 s.
5.2. Warmstarting
Another remedy for computational burden reduction is warmstarting [32]. Warmstarting is a
method that supplies an initial guess of the sequence of controls to the solver rather than using a
constant initial control sequence, typically, a vector of zeros. In general, supplying an initial guess is
useful when dealing with nonconvex problems and nonlinear solvers because the solvers can easily
get stuck to a local minimum point. In addition, this scheme also prevents cases with singularities by
avoiding regions close to the initial condition.
Electronics 2020, 9, 1277 13 of 19
Specifically, instead of starting the optimizer with the initial control sequence Ûk,0 = [0 · · · 0]T ,
we use the previous optimal solution, which is computed at k − 1, at the current step k as
(
[0 · · · 0]T , k = 0,
Ûk,0 = (16)
[Ûk∗−1 (2 : end) Ûk∗−1 (end)]T , k > 0,
where Ûk∗−1 (2 : end) , [u1∗|k−1 · · · u∗N −1|k−1 ]T ∈ R N −1 is the optimal control sequence computed at
step k − 1, which is truncated from the second component to the N-th component. Note that the last
element Ûk∗−1 (end) is copied twice in Ûk,0 so as to match the number of components in the array to
N components.
The effectiveness of adopting warmstarting strategy is summarized in Table 2. Although the
improvement is not very significant, it can be said that adopting warmstarting is usually helpful.
Table 2. Comparison of the average computation time with and without warmstarting.
where In is the n-by-n identity matrix, and 1m×n is the m-by-n matrix of ones.
A variation of the control-horizon strategy is adopted in this paper, which is the
move-blocking strategy. The motivation of this variation is that, since the reference position of
the leader car can vary significantly in the future, blocking the control over a large interval yields
constraint violation and, in some cases, suboptimality or infeasibility. Therefore, we divide the
prediction horizon into several blocking intervals. Accordingly, the control-horizon matrix is modified
to the move-blocking matrix
where 0 < kb < N is the number of control moves blocked at each interval (except the first interval), kr
is the remainder of the division N/kb (if zero, then this term disappears), and Nb , ceil( N/kb ) − 1 + kb
is the length of Ûk0 using move-blocking strategy. With this move-blocking matrix, the control is free to
move for the first kb steps, and, afterwards, blocked for kb steps consecutively within Nb − kb intervals.
An example of move-blocking sequence using the proposed method is shown in Figure 9 with kb = 3
and N = 10.
Electronics 2020, 9, 1277 14 of 19
120
100
80
60
40
20
0
0 1 2 3 4 5 6 7 8 9
Figure 9. Control sequence using move-blocking strategy, where kb = 3 and N = 10. The first three
moves are free, then blocked in consecutive blocks of three controls. Note that Nb = 6, kr = 1, and the
last control is within a separate block from k = 9 to k = 10.
6. Simulation Results
So far, several methods to reduce the computational burden of MPC have been discussed. In this
section, the effectiveness of the proposed strategies is verified for two driving cycles, namely, WLTC
and US06. The parameters of the simulations are listed in Table 3. The model parameters are extracted
from the Ford Focus BEV model in ADVISOR [34], which is a high-fidelity simulator with the aim of
analyzing the vehicle-energy efficiency.
The following simulations compare control performance between the proposed method and
other strategies, i.e., nominal MPC and DP. As mentioned earlier, the performance using DP can be
considered as ideal. Therefore, if the performance using our approach closely matches the one of DP
with a computationally manageable complexity, it can be said that the main objective of this study is
achieved. Also, compared to the nominal MPC, the computational complexity should be reduced with
the almost same control performance.
Simulation 1: WLTC driving cycle. The results in urban-driving scenario obtained from the proposed
method and DP are compared in Figure 10. Since the state and control trajectories using the nominal
MPC are very similar to those of our approach, we omit the plots with the nominal MPC.
Electronics 2020, 9, 1277 15 of 19
(a)
100
80
60
40
20
0
0 200 400 600 800 1000 1200 1400 1600 1800
(b)
40
30
20
10
0
0 200 400 600 800 1000 1200 1400 1600 1800
(c)
0.8
0.75
0.7
0.65
0.6
0 200 400 600 800 1000 1200 1400 1600 1800
(d)
300
200
100
-100
-200
-300
0 200 400 600 800 1000 1200 1400 1600 1800
Figure 10. Comparison between model prediction control (MPC) and DP for the WLTC driving cycle,
(a) distance difference between ego car and leader car, (b) speed profiles, (c) battery SOC trajectories,
and (d) control trajectories.
Note that the appropriate distances between the self-driving car and leader car are maintained
for both methods, as shown in Figure 10a. Depending on the self-driving car speed, the time-varying
upper- and lower- bounds (10e) are specified in the predictions. No constraint violation occurs for MPC
and DP. The entire speed trajectories are very similar to each other, but slightly different in Figure 10b.
The battery SOC trajectories are depicted in Figure 10c and the SOC consumption using the proposed
method is very similar to that of DP, which verifies the effectiveness of our approach. Since DP
explores all possible combinations of control and state pairs, the control input in Figure 10d sometimes
changes rapidly because DP is just designed to minimize the cost function without considering the
physical relationship between the actuator and vehicle. Therefore, the control performance using DP is
exaggerated to some extent, but it can serve as a benchmark that only considers the minimization of
the cost.
The trajectories of s, v, and SOC are shown in Figure 11a–c. From the battery SOC trajectory
in Figure 11c, the proposed MPC, which is computationally tractable, can give a solution that is
very close to the globally optimum solution obtained by DP. Moreover, the trajectories obtained by
DP sometimes physically do not make sense and should be corrected for hardware implementation.
Since the nonlinear model used in this paper only captures the primary dynamics of the vehicle and
powertrain, the control input using DP in Figure 11d cannot ensure a globally optimum solution in
Electronics 2020, 9, 1277 16 of 19
real production vehicle. Therefore, in reality, we expect that the performance gap between DP and our
approach is much closer to each other, but the computational burden is significantly reduced with our
approach, which is the central argument of this paper.
Simulation 2: US06 drive cycle. The US06 drive cycle represents the case of a highway-driving scenario.
The results are shown in Figure 11.
(a)
80
60
40
20
0
0 100 200 300 400 500 600
(b)
40
30
20
10
0
0 100 200 300 400 500 600
(c)
0.8
0.78
0.76
0.74
0.72
0.7
(d)
300
200
100
-100
-200
-300
0 100 200 300 400 500 600
Figure 11. Comparison between MPC and DP in US06 driving cycle, (a) distance difference between
ego-car and leader-car, (b) speed profiles, (c) battery SOC trajectories, and (d) control trajectories.
Discussion. Table 4 and Figure 12 support the above claims. As expected, the speed modification that
minimizes Tm 2 is effective in terms of battery SOC reduction. As expected, the proposed MPC with
move-blocking method does not outperform the nominal MPC but is very close to that of nominal MPC
in terms of battery SOC reduction performance (see Table 4). Unfortunately, the control performances
of our approach and nominal MPC shows the significant difference by comparing with DP result,
but DP is not implementable in the real-time and trajectories from the DP do not consider the physical
limitation of the actuator.
Electronics 2020, 9, 1277 17 of 19
Table 4. ∆SOC obtained by different optimized speed trajectories (the values in parentheses describe
the improvements compared to the baseline).
WLTC US06
Baseline 17.55 13.41
Proposed 15.64 (10.88%) 11.42 (14.83%)
Nominal MPC 15.42 (12.14%) 11.30 (15.73%)
DP 14.96 (14.76%) 10.74 (19.90%)
Nominal MPC
40
Proposed
30
20
10
0
US06 WLTC
Figure 12. Comparison of the computational-time constraint violation rates obtained by nominal MPC
and proposed method.
Note that the computational-time constraint violation rate is significantly reduced with the
proposed method (see Figure 12). Even though our approach sometimes exceeds the sampling
period Ts , some remedies to handle this issue are possible. For example, we can further reduce the
computation time by replacing the language from MATLAB to C, which is essentially needed for
hardware implementation. Also, constraint relaxation and the introduction of faster solvers instead of
fmincon can be alternative solutions, which are left to our future works.
7. Conclusions
There have been extensive approaches in the literature to apply MPC in automotive control
applications, but its computational feasibility was not clearly reviewed. This paper investigated several
methods to reduce the computational complexity of a nominal MPC implemented in a battery-electric
vehicle (BEV). First, an appropriate prediction horizon for our optimization control problem was
determined with a nominal MPC, and we concluded that the control performance of nominal MPC
closely matches the one of DP, which is assumed to be globally optimal. Next, to ensure real-time
feasibility, adopting move-blocking strategy, with warmstarting and large sampling times, significantly
reduces the computational burden compared to a nominal MPC without sacrificing too much of the
control performance in terms of battery state-of-charge (SOC) reduction. Methods that provide further
reduction of the computation load for real-time implementation of MPC in hardware will be our future
work. In addition, a lot of constraint violations are expected to occur with the proposed approach
when the reference value varies significantly. To overcome such a shortcoming, the relaxation of the
constraints with the proposed approach is also left to our future work.
Author Contributions: Conceptualization, K.H., T.W.N., and K.N.; Data curation, K.H. and T.W.N.; Formal
analysis, K.H.; Funding acquisition, K.N; Investigation, K.H. and T.W.N.; Methodology, K.H., and T.W.N.;
Software, K.H. and T.W.N.; Supervision, K.N.; Visualization, T.W.N.; Writing—original draft, K.H. and T.W.N.;
Writing—review & editing, K.N. All authors have read and agreed to the published version of the manuscript.
Electronics 2020, 9, 1277 18 of 19
Funding: This paper was supported by Korea Institute for Advancement of Technology(KIAT) grant funded by
the Korea Government(MOTIE)(P0008473, HRD Program for Industrial Innovation).
Conflicts of Interest: The authors declare no conflict of interest.
References
1. Wadud, Z.; MacKenzie, D.; Leiby, P. Help or hindrance? The travel, energy and carbon impacts of highly
automated vehicles. Transp. Res. Part A Policy Pract. 2016, 86, 1–18. [CrossRef]
2. Masood, K.; Molfino, R.; Zoppi, M. Simulated Sensor Based Strategies for Obstacle Avoidance Using Velocity
Profiling for Autonomous Vehicle FURBOT. Electronics 2020, 9, 883. [CrossRef]
3. Li, N.; Oyler, D.W.; Zhang, M.; Yildiz, Y.; Kolmanovsky, I.; Girard, A.R. Game theoretic modeling of driver
and vehicle interactions for verification and validation of autonomous vehicle control systems. IEEE Trans.
Control Syst. Technol. 2017, 26, 1782–1797. [CrossRef]
4. Han, K.; Li, N.; Kolmanovsky, I.; Girard, A.; Wang, Y.; Filev, D.; Dai, E. Hierarchical Optimization of Speed
and Gearshift Control for Battery Electric Vehicles Using Preview Information. In Proceedings of the IEEE
American Control Conference (ACC), Denver, CO, USA, 1–3 July 2020; pp. 4913–4919.
5. Han, J.; Sciarretta, A.; Ojeda, L.L.; De Nunzio, G.; Thibault, L. Safe-and eco-driving control for connected
and automated electric vehicles using analytical state-constrained optimal solution. IEEE Trans. Intell. Veh.
2018, 3, 163–172. [CrossRef]
6. Ersal, T.; Kolmanovsky, I.; Masoud, N.; Ozay, N.; Scruggs, J.; Vasudevan, R.; Orosz, G. Connected and
automated road vehicles: State of the art and future challenges. Veh. Syst. Dyn. 2020, 58, 672–704. [CrossRef]
7. Tate, L.; Hochgreb, S.; Hall, J.; Bassett, M. Energy Efficiency of Autonomous Car Powertrain; Technical Report;
SAE Technical Paper; SAE International: Warrendale, PA, USA, 2018.
8. Chen, T.D.; Kockelman, K.M.; Hanna, J.P. Operations of a shared, autonomous, electric vehicle fleet:
Implications of vehicle & charging infrastructure decisions. Transp. Res. Part A Policy Pract. 2016, 94, 243–254.
9. Zeng, X.; Wang, J. Globally energy-optimal speed planning for road vehicles on a given route. Transp. Res.
Part C Emerg. Technol. 2018, 93, 148–160. [CrossRef]
10. Chen, D.; Kim, Y.; Stefanopoulou, A.G. State of charge node planning with segmented traffic information.
In Proceedings of the IEEE Annual American Control Conference (ACC), Milwaukee, WI, USA, 27–29 June 2018;
pp. 4969–4974.
11. Wan, N.; Vahidi, A.; Luckow, A. Optimal speed advisory for connected vehicles in arterial roads and the
impact on mixed traffic. Transp. Res. Part C Emerg. Technol. 2016, 69, 548–563. [CrossRef]
12. McDonough, K.; Kolmanovsky, I.; Filev, D.; Szwabowski, S.; Yanakiev, D.; Michelini, J. Stochastic fuel
efficient optimal control of vehicle speed. In Optimization and Optimal Control in Automotive Systems; Springer:
Berlin, Germany, 2014; pp. 147–162.
13. Mehta, P.; Meyn, S. Q-learning and Pontryagin’s minimum principle. In Proceedings of the 48h IEEE
Conference on Decision and Control (CDC) held jointly with 2009 28th Chinese Control Conference, Shanghai,
China, 15–19 December 2009; pp. 3598–3605.
14. Lee, H.; Song, C.; Kim, N.; Cha, S.W. Comparative Analysis of Energy Management Strategies for HEV:
Dynamic Programming and Reinforcement Learning. IEEE Access 2020, 8, 67112–67123. [CrossRef]
15. Lee, H.; Kang, C.; Park, Y.I.; Kim, N.; Cha, S.W. Online Data-Driven Energy Management of a Hybrid Electric
Vehicle Using Model-Based Q-Learning. IEEE Access 2020, 8, 84444–84454. [CrossRef]
16. Li, S.; Li, N.; Girard, A.; Kolmanovsky, I. Decision making in dynamic and interactive environments based
on cognitive hierarchy theory: Formulation, solution, and application to autonomous driving. arXiv 2019,
arXiv:1908.04005.
17. Kouvaritakis, B.; Cannon, M. Model Predictive Control; Springer: Charm, Switzerland, 2016.
18. Nguyen, T.W.; Islam, S.A.U.; Bruce, A.L.; Goel, A.; Bernstein, D.S.; Kolmanovsky, I.V. Output-Feedback
RLS-Based Model Predictive Control. In Proceedings of the American Control Conference (ACC), Denver,
CO, USA, 1–3 July 2020.
19. Han, J.; Vahidi, A.; Sciarretta, A. Fundamentals of energy efficient driving for combustion engine and electric
vehicles: An optimal control perspective. Automatica 2019, 103, 558–572. [CrossRef]
Electronics 2020, 9, 1277 19 of 19
20. Prakash, N.; Cimini, G.; Stefanopoulou, A.G.; Brusstar, M.J. Assessing fuel economy from automated driving:
Influence of preview and velocity constraints. In Proceedings of the ASME 2016 Dynamic Systems and
Control Conference, Boston, MA, USA, 6–8 July 2016.
21. Seok, J.; Wang, Y.; Filev, D.; Kolmanovsky, I.; Girard, A. Energy-Efficient Control Approach for Automated
HEV and BEV With Short-Horizon Preview Information. In Proceedings of the ASME 2018 Dynamic Systems
and Control Conference, Atlanta, GA, USA, 30 September–3 October 2018.
22. HomChaudhuri, B.; Vahidi, A.; Pisu, P. Fast model predictive control-based fuel efficient control strategy for
a group of connected vehicles in urban road conditions. IEEE Trans. Control Syst. Technol. 2016, 25, 760–767.
[CrossRef]
23. Jia, Y.; Jibrin, R.; Itoh, Y.; Görges, D. Energy-optimal adaptive cruise control for electric vehicles in both time
and space domain based on model predictive control. IFAC-PapersOnLine 2019, 52, 13–20. [CrossRef]
24. Kim, Y.; Figueroa-Santos, M.; Prakash, N.; Baek, S.; Siegel, J.B.; Rizzo, D.M. Co-optimization of speed
trajectory and power management for a fuel-cell/battery electric vehicle. Appl. Energy 2020, 260, 114254.
[CrossRef]
25. Brackstone, M.; McDonald, M. Car-following: A historical review. Transp. Res. Part F Traffic Psychol. Behav.
1999, 2, 181–196. [CrossRef]
26. National Highway Traffic Safety Administration. Summary of State Speed Laws Ninth Edition: Current as of
January 1, 2006; National Committee on Uniform Traffic Laws and Ordinances: Washington, DC, USA, 2006.
27. Angel, E. Dynamic programming for noncausal problems. IEEE Trans. Autom. Control 1981, 26, 1041–1047.
[CrossRef]
28. Kalia, A.V.; Fabien, B.C. On Implementing Optimal Energy Management for EREV using Distance Constrained
Adaptive Real-Time Dynamic Programming. Electronics 2020, 9, 228. [CrossRef]
29. Bellman, R. Dynamic programming. Science 1966, 153, 34–37. [CrossRef]
30. Sundstrom, O.; Guzzella, L. A generic dynamic programming Matlab function. In Proceedings of the IEEE
Control Applications, (CCA) & Intelligent Control, (ISIC), St. Petersburg, Russia, 8–10 July 2009; pp. 1625–1630.
31. Walker, K.; Samadi, B.; Huang, M.; Gerhard, J.; Butts, K.; Kolmanovsky, I. Design Environment for Nonlinear
Model Predictive Control; Technical Report; SAE Technical Paper; SAE International: Warrendale, PA, USA, 2016.
32. Liao-McPherson, D.; Huang, M.; Kolmanovsky, I. A regularized and smoothed fischer–burmeister method
for quadratic programming with applications to model predictive control. IEEE Trans. Autom. Control
2018, 64, 2937–2944. [CrossRef]
33. Cagienard, R.; Grieder, P.; Kerrigan, E.C.; Morari, M. Move blocking strategies in receding horizon control.
J. Process Control 2007, 17, 563–570. [CrossRef]
34. Wipke, K.B.; Cuddy, M.R.; Burch, S.D. ADVISOR 2.1: A user-friendly advanced powertrain simulation using
a combined backward/forward approach. IEEE Trans. Veh. Technol. 1999, 48, 1751–1761. [CrossRef]
c 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access
article distributed under the terms and conditions of the Creative Commons Attribution
(CC BY) license (http://creativecommons.org/licenses/by/4.0/).