A Learning Algorithm For Optimal Internal Combustion Engine Calibration in Real Time
A Learning Algorithm For Optimal Internal Combustion Engine Calibration in Real Time
net/publication/254578979
CITATIONS READS
28 1,429
3 authors, including:
All content following this page was uploaded by Dennis N. Assanis on 18 July 2016.
DETC2007-34718
Dennis N. Assanis
Department of Mechanical Engineering
ABSTRACT 1. INTRODUCTION
Advanced internal combustion engine technologies have The growing requests for better performance and fuel
increased the number of accessible variables of an engine and economy, and reduced emissions, have motivated continued
our ability to control them. The optimal values of these research in advanced internal combustion engine technologies.
variables are designated during engine calibration by means of These technologies, such as fuel injections systems, variable
a static correlation between the controllable variables and the geometry turbocharging, variable valve actuation, and exhaust
corresponding steady-state engine operating points. While the gas recirculation, have increased the number of accessible
engine is running, these correlations are being interpolated to engine controllable variables, and our ability to optimize engine
provide values of the controllable variables for each operating operation. In particular, the determination of the optimal values
point. These values are controlled by the electronic control unit of these variables, referred to as engine calibration, have been
to achieve desirable engine performance, for example in fuel shown to be especially critical for achieving high engine
economy, pollutant emissions, and engine acceleration. The performance and fuel economy while meeting emission
state-of-the-art engine calibration cannot guarantee standards. Consequently, engine calibration is defined as a
continuously optimal engine operation for the entire operating procedure that optimizes one or more engine performance
domain, especially in transient cases encountered in driving indices, e.g., fuel economy, emissions, or engine performance
styles of different drivers. This paper presents the theoretical with respect to the engine controllable variables. Engine
basis and algorithmic implementation for allowing the engine calibration generates a static correlation between the optimal
to learn the optimal set values of accessible variables in real
values of the controllable variables and the corresponding
time while running a vehicle. Through this new approach, the
steady-state engine operating points to coordinate optimal
engine progressively perceives the driver’s driving style and
performance of the specified indices. This correlation is
eventually learns to operate in a manner that optimizes
incorporated into the electronic control unit (ECU) of the
specified performance indices. The effectiveness of the
engine to control engine operation, so that optimal values of the
approach is demonstrated through simulation of a spark
ignition engine, which learns to optimize fuel economy with specified indices are maintained.
respect to spark ignition timing, while it is running a vehicle. Despite the advanced engine technologies, however,
continuously optimal engine operation has not yet been
Keywords: Markov Decision Process (MDP), learning possible. State-of-the-art engine calibration methods rely on
algorithms, sequential decision-making under uncertainty, dynamometer static correlations for steady-state operating
simulation-based optimization, reinforcement learning, internal points accompanied by transient vehicle testing. However, the
combustion engine calibration, fuel economy calibration process, its duration, and its cost grow exponentially
with the number of controllable variables and optimal
*
Author of correspondence, Phone: (734) 647-1409, Fax: (734) 764-4256, Email: [email protected]
Engine Torque
cases encountered in driving styles of different drivers [2]. Trajectory A of engine
operating points
To pre-specify the huge number of transient operations is
impractical and, thus, calibration cannot generate optimal static
correlations for all transient cases a priori. Transient operation Terminal engine operating
point of A and B
constitutes the largest segment of engine operation over a
driving cycle compared to the steady-state one [3, 4]. Trajectory B of engine
Emissions during transient operation are extremely complicated operating points
[4], vary significantly with each particular driving cycle [5, 6],
and are highly dependent upon the calibration [6, 7]. Engine Engine Speed
operating points, during the transient period before their Figure 1. Two trajectories A, and B, of engine operating
steady-state value is reached, are associated with different points ending at the same operating point.
Brake-Specific Fuel Consumption (BSFC) values, depending
on the directions from which they have been arrived, as
BSFC value at the
illustrated qualitatively in Figures 1 and 2. Pollutant emissions terminal operating point BSFC value at the
such as NOx, and particulate matters, demonstrate the same reached from trajectory A terminal operating point
qualitative behavior, as shown by Hagena et al. [8]. reached from trajectory B
The main objective of calibration methods is to expedite
dynamometer tests significantly using a smaller subset of tests. BSFC
This subset is utilized either in implementing engine calibration
experimentally or in developing mathematical models for
evaluating engine output. Using these models, optimization
methods can determine the engine calibration static correlations
BSFC value of the
between steady-state operating points and the controllable Transient Period terminal operating point
engine variables [9]. Design of Experiments (DoE) [10-12] has at steady-
steady-state operation
been widely used as the baseline method. Major applications
include catalyst system optimization [13], optimization of Time
variable valve trains for performance and emissions [14-17], Figure 2. BSFC value of the terminal engine operating point
implementation of dynamic model-based engine calibrations as reached from trajectories A, and B.
[18, 19], and optimization of fuel consumption in a spark
ignition engine with dual-continuously controlled camshaft generate optimized correlations for a V6 engine equipped with
phasing with respect to valve timing [20]. two-step variable valve actuation and intake cam phasing.
DoE-based calibration systems are typically used to reduce Guerrier et al. [18] employed DoE and advanced statistical
the scope of the experiments required to derive the optimal modeling to develop empirical models to advance the
engine calibration correlation under steady-state operating powertrain control module calibration tables. Stuhler et al. [19]
conditions. Dynamic model-based calibration, however, utilizes implemented a standardized and automated calibration
high-fidelity dynamic or transient engine modeling. The data environment, supporting the complexity of gasoline direct
required to develop the engine model are obtained by operating injection engines, for an efficient calibration process using an
the engine through a set of transient dynamometer tests while online DoE to decrease the calibration cost. These engine
the engine calibration is perturbed in real time by a models can predict engine output over transient operation.
reconfigurable rapid prototyping control system. The predictive However, not all the correlations of optimal values of the
engine model produced in this fashion utilizes a combination of controllable engine variables associated with the transient
equation-based and neural network methods. DoE-experimental operating points can be quantified explicitly; to pre-specify the
calibration is well suited only for steady-state engine operation entire transient engine operation is impractical, and thus,
over some driving cycle. In contrast, dynamic modeling engine calibration correlations cannot be optimized for these
produces a transient or dynamic engine model capable of cases a priori.
predicting engine operating cycle. The steady-state optimal Various approaches have been proposed for using artificial
engine calibration can be produced from the transient engine neural networks (ANN) to promote modeling and calibration of
model as a sub-set of the transient engine operation. Rask et al. engines [21-26]. However, ANNs are application-specific and
[1] developed a simulation-based calibration method to rapidly exhibit unpredictable behavior when previously unfamiliar data
This relation is extremely valuable for various MDP problems, 2.2 The Learning Process of the Engine
where the terminal stage is unknown; Eq. (7) holds under The learning process transpires while the engine is running
certain conditions [29]. the vehicle and interacting with the driver. Taken in
A large class of sequential decision-making problems under conjunction with assigning values of the controllable variables
uncertainty is solved by using classical dynamic programming from the feasible action space, A, this interaction portrays the
algorithms, originally proposed by Bellman [30]. Algorithms, progressive enhancement of the engine’s “knowledge” of the
such as value iteration, policy iteration, and linear driver’s driving style with respect to the controllable variables.
programming are employed to find optimal solution of MDPs. More precisely, at each of a sequence of
However, the computational complexity of these algorithms in stages κ = 0,1, 2,...M , as M → ∞ , the driver introduces a state
some occasions may be prohibitive and can grow intractably siκ ∈ S to the engine, and on that basis the engine selects an
with the size of the problem and its related data. In addition,
dynamic programming algorithms require the realization of the action, α κ ∈ A( si ) . This state arises as a result of the driver’s
transition probability matrix, P(⋅,⋅), and the transition reward driving style corresponding to particular engine operating
matrix, R(⋅,⋅). For complex systems with large state space, the points. One stage later, as a consequence of its action, the
type of problem along with the decision-maker’s “lifetime” the magnitude, T ( s j ) , defined as the maximum average future
problem [40]: The longer the lifetime, the worse the reward. Suppose that the current state is si and the following
consequences of prematurely converging on a sub-optimal state given an action α ∈ A( si ) is sj with
solution. This could result from not fully exploring the entire
feasible action space for each state. In our case, the objective is probability p( s j | si , a) . The average future overall reward will
to make the engine learn its optimal calibration for the driver’s be
driving style in the infinite-horizon context. Consequently, the ⎛ N ⎞
engine has to explore the entire action space for any state being ~ ⎜ ∑ p ( sl | s j , a) ⋅ R( sl | s j , a ) ⎟
visited by the particular driving style. In particular, it is T ( s j ) = max ⎜ l =1 ⎟,
a∈A ⎜ N ⎟ (9)
assumed that for any state si ∈ S corresponding to the driving ⎜ ⎟
⎝ ⎠
style, all actions of the feasible action set α ∈ A( si ) are selected
∀s j ∈ S .
at least once. This may result in sacrificing the engine
performance indices in the short run; however, the ultimate The immediate expected reward by transiting from state si to
state s j given an action α ∈ A( si ) is
~ where m f is the fuel mass flow rate per unit time and P is
t ( s j | si , a) = p ( s j | si , a) ⋅ R( s j | si , a). (10) engine’s power output. Continuous engine operation at MBT
ensures optimum fuel economy with respect to the spark
For the problem of optimal control of uncertain systems, which ignition timing.
is treated in a stochastic framework, all uncertain quantities are For a successful real-time, self-learning optimization of
described by probability distributions and the expected value of engine calibration in terms of spark ignition timing, the engine
the overall reward is maximized. In this context, the optimal should realize the MBT timing for each engine operating point
policy π* realized by POSCA is based on the maxmin control (steady-state and transient) dictated by the driving style of a
approach, whereby the worst possible values of the uncertain driver. Consequently, by achieving MBT timing for all steady-
quantities within the given set are assumed to occur. This is a state and transient operating points an overall improvement of
pessimistic point of view which essentially assures that the the BSFC is expected. Aspects of preventing knocking are not
optimal policy will result in at least a minimum overall reward considered in this example; however, they can be easily
value. Consequently, being at state si , POSCA predicts the incorporated by defining the spark ignition space to include the
maximum allowable values.
optimal policy π* in terms of the values of the controllable
The software package enDYNA by TESIS [42] suitable for
variables as
real-time simulation of internal combustion engines is
employed. The software utilizes thermodynamic models of the
⎧⎪ ⎫⎪
π * ( si ) = arg max ⎨min ⎛⎜ t ( s j | si , a ) + T ( s j ) ⎞⎟ ⎬ ,
~ ~
gas path and is well suited for testing and development of
⎪⎩ a∈j A ⎝ ⎠⎪
s ∈S (11) electronic engine controllers. In the example, a four-cylinder
⎭ gasoline engine is used from the enDYNA model database. The
∀si , s j ∈ S. software’s static correlation involving spark ignition timing and
engine operating points is bypassed to incorporate the POSCA
3. EXAMPLE algorithm. This correlation is designated by the baseline
An example of real-time, self-learning optimization of the calibration that enDYNA model is accompanied by, and is
calibration with respect to spark ignition timing in a spark included in, the engine’s ECU. In the context of the MDP
ignition engine is presented. In spark ignition engines the fuel problem, the states represent the pair of gas-pedal position and
and air mixture is prepared in advance before it is ignited by engine speed, and the actions denote the spark ignition timing;
the spark discharge. The major objectives for the spark ignition the rewards that the decision-maker (engine) receives
are to initiate a stable combustion and to ignite the air-fuel correspond to the engine brake torque.
mixture at the crank angle resulting in maximum efficiency, The engine model is run repeatedly over the same driving
while fulfilling emissions standards and preventing the engine style represented by the pedal position. Every run over this
from knocking. Simultaneous achievement of the driving style constitutes one complete simulation. To evaluate
aforementioned objectives is sometimes inconsistent; for the efficiency of our approach in both steady-state and transient
instance, at high engine loads the ignition timing for maximum engine operation, the pedal position rate is chosen to represent
efficiency has to be abandoned in favor of prevention of engine an aggressive acceleration, as illustrated in Figure 6.
destruction by way of engine knock. Two essential parameters
are controlled with the spark ignition: ignition energy and
ignition timing. Control of ignition energy is important for
assuring combustion initiation, but the focus here is on the Maximum Brake Torque
spark timing that maximizes engine efficiency. Ignition timing (MBT)
influences nearly all engine outputs and is essential for
Engine Torque
4. RESULTS 15
After completing the fourth simulation, POSCA specified
the optimal policy in terms of the spark ignition timing, as 10
shown in Figure 7, and compared with the spark ignition timing
designated by the baseline calibration of the enDYNA model. 5 Baseline Engine Calibration
Real-Time Engine Calibration through Learning
The optimal policy resulted in higher engine brake torque
0
compared to the baseline calibration as shown in Figures 8 and 2 2.5 3 3.5 4
Time [sec]
9. This improvement indicates that the engine with self-
learning calibration was able to operate closer to MBT timing. Figure 7. Spark ignition timing over the
Having the engine operate at MBT timing resulted in an overall driving style
minimization of the BSFC, illustrated in Figure 10. Figure 11
compares the velocity of the two vehicles, one carrying the 140
engine with the baseline calibration and the other with the self-
calibrated one. 120
90
80
135
70
Gas-Pedal Position [deg]
60 130
50
125
40
30 120
20 115
10
Baseline Engine Calibration
110 Real-Time Engine Calibration through Learning
0
2 2.5 3 3.5 4
Time [sec] 2.4 2.6 2.8 3 3.2 3.4 3.6 3.8
Time [sec]
Figure 6. Gas-pedal position rate representing
Figure 9. Engine brake torque (zoom-in)
a driver’s driving style
660
preventing knocking were not considered in this example;
650
however, a potential extension is possible such as defining the
640
spark ignition space to include the maximum allowable values
630 ensuring engine operation without knocking. POSCA predicted
620 efficiently the optimal control policy (spark ignition timing) for
610
Baseline Engine Calibration each state (engine operating point). It is left for future research
Real-Time Engine Calibration through Learning
to explore the impact of traffic patterns, and terrain, on the
600
2 2.5 3 3.5 4 general applicability of having the engine learn its optimal
Time [sec]
calibration for an individual driving style. Future research
Figure 10. BSFC comparison between the baseline should also investigate the potential of advancing POSCA in
and self-learning calibration predicting the optimal policy of a number of controllable
variables associated with different states and, thus, avoiding the
18
Baseline Engine Calibration enhancement of the problem’s dimensionality. Increased
Real-Time Engine Calibration through Learning
dimensionality is a major challenge for learning algorithms.
17
Velocity of the Vehicle [mph]
16
ACKNOWLEDGMENTS
This research was partially supported by the Automotive
15 Research Center (ARC), a U.S. Army Center of Excellence in
14
Modeling and Simulation of Ground Vehicles at the University
of Michigan. The engine simulation package enDYNA was
13 provided by TESIS DYNAware GmbH. This support is
12 gratefully acknowledged.
11 REFERENCES
2.8 3 3.2 3.4 3.6 3.8
Time [sec]
[1] Rask, E. and Sellnau, M., "Simulation-Based Engine
Figure 11. Velocity of the two vehicles carrying the
Calibration: Tools, Techniques, and Applications," SAE
engine with baseline and self-learning calibration
Transactions-Journal of Engines, v. 113, 2004, SAE 2004-
01-1264.
5. CONCLUDING REMARKS
[2] Atkinson, C. and Mott, G., "Dynamic Model-Based
The POSCA algorithm allows an internal combustion
Calibration Optimization: An Introduction and
engine to act as an autonomous system that can learn its
Application to Diesel Engines," SAE World Congress,
optimal calibration for both steady-state and transient operating
Detroit, Michigan, April 11-14, 2005, SAE 2005-01-0026.
points in real time while running a vehicle. The engine
[3] Rakopoulos, C. D., Giakoumis, E. G., Hountalas, D. T.,
progressively perceives the driver’s driving style and,
and Rakopoulos, D. C., "The Effect of Various Dynamic,
eventually, learns to coordinate optimal performance of several
Thermodynamic and Design Parameters on the
specified indices, e.g., fuel economy, pollutant emissions,
Performance of a Turbocharged Diesel Engine Operating
engine performance, for this particular driving style. The longer
under Transient Load Conditions," SAE 2004 World
the engine runs with a particular driving style, the better the
Congress and Exhibition, Detroit, Michigan, April 8-11,
specified indices will be. The engine’s ability to learn its
2004, SAE 2004-01-0926.
optimum calibration is not limited, however, to a particular
[4] Wijetunge, R. S., Brace, C. J., Hawley, J. G., Vaughan, N.
driving style. The engine can learn to operate optimally for
D., Horroocks, R. W., and Bird, G. L., "Dynamic Behavior
different drivers by assigning the transition probability P(⋅,⋅),
of a High-Speed, Direct-Injection Diesel Engine," SAE
and reward matrices R(⋅,⋅) for each driver. The drivers would Transactions-Journal of Engines, v. 108, 1999, SAE 1999-
indicate their identities before starting the vehicle to denote the 01-0829.
pair of these matrices that the engine should employ. The [5] Clark, N. N., Gautam, M., Rapp, B. L., Lyons, D. W.,
engine can then adjust its operation to be optimal for a Graboski, M. S., McCormick, R. L., Alleman, T. L., and
particular driver based on what it has learned in the past National, P. N., "Diesel and CNG Transit Bus Emissions
regarding his/her driving style. Characterization by Two Chassis Dynamometer