Complete Download Learning Control Zhang PDF All Chapters
Complete Download Learning Control Zhang PDF All Chapters
com
https://ebookmass.com/product/learning-control-zhang/
OR CLICK HERE
DOWLOAD NOW
https://ebookmass.com/product/learning-control-dan-zhang/
ebookmass.com
https://ebookmass.com/product/motor-learning-and-control-concepts-and-
applications-richard-a-magill/
ebookmass.com
https://ebookmass.com/product/craving-control-faust/
ebookmass.com
https://ebookmass.com/product/communicating-across-cultures-and-
languages-in-the-health-care-setting-voices-of-care-1st-edition-
claire-penn/
ebookmass.com
PMP Project Management Professional All-in-One Exam Guide
Joseph Phillips
https://ebookmass.com/product/pmp-project-management-professional-all-
in-one-exam-guide-joseph-phillips/
ebookmass.com
https://ebookmass.com/product/climate-economics-a-call-for-more-
pluralism-and-responsibility-michael-roos/
ebookmass.com
https://ebookmass.com/product/code-name-omega-jameson-force-security-
book-10-sawyer-bennett/
ebookmass.com
https://ebookmass.com/product/oxford-studies-in-political-philosophy-
volume-4-david-sobel/
ebookmass.com
https://ebookmass.com/product/an-introduction-to-policing-9th-edition-
ebook-pdf/
ebookmass.com
Narratives of Difference in an Age of Austerity 1st
Edition Irene Gedalof (Auth.)
https://ebookmass.com/product/narratives-of-difference-in-an-age-of-
austerity-1st-edition-irene-gedalof-auth/
ebookmass.com
Learning Control
FIRST EDITION
Dan Zhang
York University, Toronto, ON, Canada
Bin Wei
Algoma University, Sault Ste Marie, ON, Canada
Table of Contents
Cover image
Title page
Copyright
List of contributors
Abstract
1.1. Introduction
1.2. Background
1.4. Results
1.5. Conclusions
Appendix 1.A.
References
Chapter 2: Cognitive load estimation for adaptive human–machine
system automation
Abstract
2.1. Introduction
References
Abstract
3.1. Introduction
3.8. Remarks
References
Abstract
4.3. Summary
References
Abstract
5.1. Introduction
Abstract
6.1. Introduction
6.6. Conclusion
References
Abstract
7.1. Introduction
7.5. Conclusion
Appendix 7.A.
Appendix 7.B.
References
Abstract
8.1. Introduction
8.5. Conclusion
References
Abstract
9.1. Introduction
9.7. Conclusions
References
Abstract
10.1. Introduction
10.3. Experiments
10.4. Conclusions
References
Index
Copyright
Elsevier
Radarweg 29, PO Box 211, 1000 AE Amsterdam, Netherlands
The Boulevard, Langford Lane, Kidlington, Oxford OX5 1GB, United
Kingdom
50 Hampshire Street, 5th Floor, Cambridge, MA 02139, United States
Notices
Knowledge and best practice in this field are constantly changing.
As new research and experience broaden our understanding,
changes in research methods, professional practices, or medical
treatment may become necessary.
To the fullest extent of the law, neither the Publisher nor the
authors, contributors, or editors, assume any liability for any injury
and/or damage to persons or property as a matter of products
liability, negligence or otherwise, or from any use or operation of
any methods, products, instructions, or ideas contained in the
material herein.
ISBN: 978-0-12-822314-7
Typeset by VTeX
List of contributors
Simge Akay Computer Engineering Department, Bahcesehir
University, Istanbul, Turkey
Basim Alghabashi Concordia Institute for Information Systems
Engineering (CIISE), Concordia University, Montreal, QC, Canada
Mohamed Al Mashrgy Electrical and Computer Engineering
(ECE), Al-Mergib University, Alkhums, Libya
Nafiz Arica Computer Engineering Department, Bahcesehir
University, Istanbul, Turkey
Zeinab Arjmandiasl Concordia Institute for Information Systems
Engineering (CIISE), Concordia University, Montreal, QC, Canada
Muhammad Azam Electrical and Computer Engineering (ECE),
Concordia University, Montreal, QC, Canada
B. Balasingam Department of Electrical and Computer
Engineering, University of Windsor, Windsor, ON, Canada
Jamal Bentahar Concordia Institute for Information Systems
Engineering (CIISE), Concordia University, Montreal, QC, Canada
F. Biondi Faculty of Human Kinetics, University of Windsor,
Windsor, ON, Canada
Aaron Boda Department of Earth and Space Science and
Engineering, Lassonde School of Engineering, York University,
Toronto, ON, Canada
Nizar Bouguila Concordia Institute for Information Systems
Engineering (CIISE), Concordia University, Montreal, QC, Canada
Duygu Cakir Computer Engineering Department, Bahcesehir
University, Istanbul, Turkey
Mark Green Faculty of Science, Ontario Tech University, Oshawa,
ON, Canada
Sorin Grigorescu
Robotics, Vision and Control (ROVIS), Transilvania University of
Brasov, Brasov, Romania
Artificial Intelligence, Elektrobit Automotive, Brasov, Romania
Baoxin Hu Department of Earth and Space Science and
Engineering, Lassonde School of Engineering, York University,
Toronto, ON, Canada
Xishi Huang RS Opto Tech Ltd., Suzhou, Jiangsu, China
M. Khalghollah Schulich School of Engineering, University of
Calgary, Calgary, AB, Canada
Howard Li Department of Electrical and Computer Engineering,
University of New Brunswick, Fredericton, NB, Canada
C.J.B. Macnab Schulich School of Engineering, University of
Calgary, Calgary, AB, Canada
Narges Manouchehri Concordia Institute for Information Systems
Engineering (CIISE), Concordia University, Montreal, QC, Canada
Jun Meng Zhejiang University Robotics Institute, Hangzhou,
China
Afshin Rahimi Department of Mechanical, Automotive and
Materials Engineering, University of Windsor, Windsor, ON, Canada
P. Ramakrishnan Department of Electrical and Computer
Engineering, University of Windsor, Windsor, ON, Canada
Jing Ren Department of Electrical and Computer Engineering,
Ontario Tech University, Oshawa, ON, Canada
Jianguo Wang Department of Earth and Space Science and
Engineering, Lassonde School of Engineering, York University,
Toronto, ON, Canada
Jin Wang Zhejiang University Robotics Institute, Hangzhou,
China
Chapter 1: A high-level design
process for neural-network
controls through a framework of
human personalities
M. Khalghollah; C.J.B. Macnab Schulich School of Engineering,
University of Calgary, Calgary, AB, Canada
Abstract
Current learning systems in the field of feedback control deal
with both unbiased nonlinearities and biased nonlinearities
(where bias is measured at the origin) quite differently.
Unbiased nonlinearities lend themselves to direct adaptive
control methods. Biased systems, on the other hand, typically
require actual learning of the bias term in order to achieve
acceptable error and effort. This paper attempts to unify these
approaches, and to learn to compensate for both types of
nonlinearities simultaneously. To do so we utilize a graphical,
quantitative theory of human personalities, which assumes that
their personalities indicate how people interact with the world
around them using feedback. This biologically-inspired
approach allows us to develop a formal design framework for
tackling this problem. Simulations with a two-link robotic
manipulator demonstrate the utility of the learning design
method, where gravity provides the main biased nonlinearities,
while friction, centripetal, and Coriolis forces are treated as
unbiased nonlinearities; our neural-network update laws learn
all these robot nonlinearities at the same time. Lyapunov
methods result in stability guaranties for the proposed method.
Keywords
learning control; direct adaptive control; linear quadratic regulator;
gravity compensation; trajectory tracking; robotic manipulators;
personality theory
Chapter Outline
1.1 Introduction
1.2 Background
1.2.1 The CMAC associative-memory neural
network
1.2.2 Unbiased nonlinearities
1.2.3 Direct adaptive control in the presence
of bias
1.2.4 A graphical model of personalities
1.2.5 A computer model of personalities
1.3 Proposed methods
1.3.1 Proposed learning law
1.3.2 Cost functional for optimization
1.3.3 Stability analysis
1.4 Results
1.4.1 Developing a design procedure
1.4.2 Two-link robotic manipulator
1.5 Conclusions
References
1.1 Introduction
This work addresses the difficulty in designing a learning control for
system dynamics that contain both a large bias and significant
nonlinearities, using the example of a robotic manipulator to
develop and test the ideas. The vast majority of learning-control
techniques in the literature addressing tracking control of the robot's
end-effector try to cancel the force of gravity, or compensate for
velocity-dependent nonlinearities that tend to pull the tip off-track,
i.e. friction, Coriolis, and centripetal (FCC) terms. The ad hoc
methods found in the literature that proposed learning both gravity
and FCC terms end up doing so sequentially, not simultaneously, to
the best of our knowledge. In this paper, we suggest the basic
difficulty in trying to achieve both at the same time stems from the
need to approach these two problems differently, i.e. they require
two different types of learning.
Gravity acting on a robot qualifies as a nonlinear term with
significant bias; for a simpler example of a biased nonlinearity,
consider a cosine term near the origin. Control systems generally
endeavor to cancel biased nonlinearities with a feedforward term, i.e.
an open-loop, bias-compensation, or set-point control. A sine
function, on the other hand, provides a simple example of a
nonlinear function with a value of zero at the origin; compensating
for this type of unbiased nonlinearity requires nonlinear feedback. A
learning control for this problem essentially ends up achieving a
memorized feedback term. Since the FCC terms in a robot manipulator
go to zero when the velocity goes to zero, and the need for high-
speed precision remains relatively rare in industrial applications, as
a practical matter the trajectory-tracking control problem without
gravity ends up more like learning sine than cosine. One important
difference from the designer's point of view lies in the choice of
robust weight update method, i.e. designing updates for the weights
(adaptive parameters) that remain robust to disturbances in the
sense of limiting weight drift (overlearning) and preventing bursting
(a sudden increase in error after a period of convergence). For
unbiased nonlinearities, a leakage term forgetting factor that tries to
drive the weights toward zero works well in practice. However, such
a term would directly result in a steady-state error for biased
nonlinearities; for biased nonlinearities the field of adaptive control
offers deadzone, parameter projection, and supervised leakage as
choices for the designer, all of which require some a priori
knowledge of the system's parameters and/or expected disturbances;
as a result many do not classify such methods as true learning
systems.
Here we use our own previously-proposed graphical model of
human personalities to examine the problem [1]. A resulting
computational model, based on feedback theory, allows a prediction
of the probability distribution of Myers–Briggs behavioral technique
types in the human population, for the J/P and T/F pairs [2] as well
as for the I/E and N/S pairs [3]. This biologically-inspired perspective
allows more intuitive design methods for the high-level thinking
required in today's advanced control-system applications [4]. The
proposed framework provides insight into the nature of the two
types of learning problems outlined above, and it suggests how to
achieve them simultaneously on a robot arm, i.e. like humans do.
Building on the idea of LQR control, the approach results in an
optimization method for the design of all the control parameters
(feedback gains and adaptation rates) in a learning controller—the
first such formal design method appearing in the literature that
extends the LQR approach to nonlinear systems, to our knowledge.
Robotic-manipulator dynamics contain significant nonlinearities;
proposed control methods based on linear-system theory must, at
the very least, assume implementation of a gravity-compensation
term [5]. Actually learning the force of gravity should provide many
advantages, including decreasing engineering-design costs and
achieving real-time payload adaptation. An iterative learning control
[6] has some advantages, but does not address robustness to
disturbances and/or payloads. Radial basis function networks
(RBFNs) can learn to estimate the gravity term in some robot
manipulators [7,8] and some biped robots [9], but the method does
not extend well to multi-link robots due to the curse of dimensionality
when trying to add more inputs into an RBFN network.
The method in [10] learns both gravity and other nonlinearities,
but the very-small leakage term used in order to avoid sag-due-to-
gravity appears insufficient to deal with realistic disturbances. The
authors of [11] proposed an RBFN method that adapts to both
gravity (biased nonlinearity) and FCC terms (unbiased
nonlinearities), but requires knowledge of the inertia matrix—which
implies the designer would, in fact, know the gravity term. In [12],
only gravity compensation gets proposed in the first step of
y g y p g p p p
backstepping for a flexible-joint robot, and not FCC terms. The same
authors tackle learning all nonlinearities for a Baxter robot in [13],
but the tracking performance requires a set of weights identified
during a learning stage—in practice it would seem the learning stage
would find the large biased nonlinearity and the tracking stage
would fine-tune the performance by compensating for the unbiased
nonlinearities.
In previous work, our research group presented a near-optimal
control [14], which developed an approach to achieve a near-optimal
control signal in the presence of gravity. A cerebellar model
articulation controller (CMAC) [15], with advantages of a fast
adaptation rate and real-time computational ability, was found to
have unique properties for tackling this problem; freezing a set of
supervisory network's local weights when the bias becomes
identified can compensate for the gravity bias term, and further
learning (using leakage) could then fine-tune for the FCC terms. The
disadvantage was the procedure was ad hoc, based on intuitive
insights into the workings of the CMAC.
Developing a formal design method, on the other hand, may have
some far-reaching implications for the field of control. Some have
playfully accused the field of control-systems of having a dirty secret:
designers often choose gains and/or parameters by trial-and-error,
rules-of-thumb, and/or experience. Such methods can be inadequate
when facing contemporary problems of interest, such as robots
interacting with unknown, unstructured environments.
Optimization methods promise a path to solve this problem, but
standard higher-level cognitive frameworks for the design of cost-
functionals remain an open problem to our knowledge. Even in the
case of the linear quadratic regulator (LQR), current theory does not
provide the designer with a method for choosing the Q and R
weighting matrices. For more advanced nonlinear systems that
interact with an environment, researchers struggle with even
creating a suitable cost-functional at the moment. This work
provides a biologically-inspired method for designing cost-
functionals and the value of the weightings. We use a model of
human personalities; choosing a personality directly results in a
p g p y y
choice of numerical weightings. Thus, a control system designer can
use their intuitive understanding of human personalities at the high-
level design stage. Not only can this avoid trial and error in picking
parameters, but it may also significantly reduce the total number of
parameters needed compared to manual designs. Consider an
analogy to how fuzzy logic proved quite a time-saving invention for
control design, as a result of allowing human intuition to guide the
design of computational decision-making and reducing the number
of parameters that the designer needs to choose; fuzzy control ended
up significantly broadening the field of computer automation, since
many more problems would lend themselves to a cost-effective
and/or time-efficient design solution.
This chapter first gives a background on CMAC neural network
and direct adaptive control methods, as applied to both biased and
unbiased nonlinearities. The Background section ends with a short
introduction to our personality theory, and describes how the
framework enables design of PID+bias controls using a nonlinear
quadratic regulator. In the Proposed Methods section, we show how
to extend the approach to designing an adaptive learning control.
The Results section illustrates a formal engineering design
procedure, based on constraints and objectives, for a simple mass-
spring-mass and then a two-link robotic manipulator.
1.2 Background
1.2.1 The CMAC associative-memory neural
network
This work assumes an associative memory will serve as the
nonlinear approximator without loss of generality, where a weighted
sum of basis functions gives the estimate of nonlinear function
as
(1.1)
(1.2)
where denotes the normalized position within the
cell on the kth input; normalization produces
(1.3)
An example of a spline CMAC with normalized basis functions in
one dimension appears in Fig. 1.1: for two dimensions see Fig. 1.2.
Rather than an impractical allocation of memory for n-dimensional
arrays, the CMAC uses hash-coding to map virtual cells to a one-
dimensional physical memory, since relatively few cells ever get
activated in high-dimensional space during trajectory tracking of a
robot [16] (hash collisions become possible, however unlikely).
(1.4)
(1.5)
(1.6)
(1.7)
(1.8)
where indicates the ideal-weight estimates, positive parameter ν
determines the influence of the robust leakage modification, which
limits weight drift by trying to force the weights to zero. As long as
remains relatively small, the leakage term does not significantly
reduce performance. The definition of z implies positive constant
equates to a derivative gain from PID control—giving an effective
proportional gain . We denote the positive-constant
adaptation gain as because in subsequent sections will we point
out similarities of the neural network to an integral term, i.e. this
work treats a neural network trained with adaptive-control update
laws as just a memorized integral.
Note that the system control actually occurs as a multi-rate signal,
with the feedback running fast enough to approximate a continuous
time signal and the neural network updated at a discrete rate, i.e. a
zero-order-hold discretized signal describes its output characteristics.
Designers often choose leakage for a robust update, and in discrete
time with sampling period Δt the delta-rule update becomes
(1.9)
(1.11)
(1.12)
(1.13)
(1.15)
where the words indicate overall norm measurements e.g.
norms; the first two terms equate to the terms multiplied by the Q
matrix in an LQR control (assuming an SISO system with position
and velocity states), while the third term penalizes the PID control
effort for the first half of the step response, but for the second half
acts like traditional LQR only penalizing a measure of PD feedback
effort, e.g. . The term stems from understanding the
integral term as trying to compensate for the bias (or inaccuracies in
bias compensation). Thus, an integral qualifies as a simple learning
term, and its error measures the distance from its ideal value (the
bias that would result in as the error reaches zero at steady
state). We point out the following similarities to the qualities in the
graphical theory of personalities:
which allows a high-level control design e.g. choosing a desired
personality results in the weighting parameters in a nonlinear
quadratic regulator (NQR). This eliminates the need to pursue trial-
and-error, rule-of-thumb, or experiential choices of Q's and R's,
allowing an intuitive understanding of human personalities to guide
the design process at the highest level (Fig. 1.7).
In previous work we pointed out similarities in PID+bias control
to the Myers–Briggs personality behavioral techniques for humans:
(1.16)
(1.17)
(1.19)
The first three terms look familiar, known from the LQR cost
functional, while the feedback effort only includes
(1.20)
(1.22)
(1.23)
Our website is not just a platform for buying books, but a bridge
connecting readers to the timeless values of culture and wisdom. With
an elegant, user-friendly interface and an intelligent search system,
we are committed to providing a quick and convenient shopping
experience. Additionally, our special promotions and home delivery
services ensure that you save time and fully enjoy the joy of reading.
ebookmass.com