Lecture Notes in Electrical Engineering
Volume 68

Sio-Iong Ao · Burghard B. Rieger · Mahyar A. Amouzegar
Chapter 1
Multimodal Human Spacecraft Interaction
in Remote Environments
A New Concept for Free Flyer Control
E. Stoll (*)
Space Systems Laboratory, Massachusetts Institute of Technology, 77 Massachusetts Avenue,
Cambridge, MA 02139-4307, USA
e-mail: [email protected]

1 Introduction

On-Orbit Servicing (OOS) has been an active research area in recent times. Two
approaches have been studied: teleoperation by humans and autonomous systems.
Autonomous systems use machine pattern recognition, object tracking, and
acquisition algorithms, as for example DART [1] or Orbital Express [2]. The
research is still in early stages and the algorithms have to be realized in complex
systems.
In contrast, the human eye-brain combination is highly evolved and trainable.
Procedures can be executed by a trained user from the ground. Unforeseen
incidents can be solved with greater flexibility and robustness. Arbitrary
spacecraft could be approached, i.e. spacecraft which were not explicitly designed
for rendezvous and docking maneuvers. Analogously, inspections and fly-arounds
can be controlled by the human operator. Based on the acquired information, the
human operator on the ground can decide how to proceed and which servicing
measures to take. Another element in the decision queue is the path planning for
the approach to the target satellite, the capture object.
Multimodal telepresence, which combines autonomous operations with human
oversight of the mission (with the ability to control the satellites), provides the
benefits of autonomous free-flyers with the evolved human experience. In case
autonomous operations cause the work area to exhibit an unknown and unforeseen
state (e.g. when robotically exchanging or upgrading instruments) the human
operator on ground can support the operations by either finishing the procedure or
returning the system into a state which can be processed by autonomous procedures.
The advantage of multimodal telepresence in this connection is the fact that the
operator will not only see the remote site, but also feel it due to haptic displays. A
haptic interface presents feedback to the human operator via the sense of touch by
applying forces, vibrations or motion.
The applicability of the telepresence approach, with a human operator located in
a ground station, controlling a spacecraft, is mostly limited to Earth orbit. This is
because the round trip delay increases with the distance between operator and
teleoperator. The consequence is a decrease of the telepresence feeling, which has a
large impact on task performance. Therefore, as the distance increases, the role
of the autonomy must increase to maintain effective operations.
For an overall and significant evaluation of the benefits of multimodal telepre-
sence a representative test environment is being developed at the MIT Space
Systems Laboratory using the SPHERES satellites on ground and aboard the
International Space Station (ISS).
The SPHERES laboratory for Distributed Satellite Systems [3] consists of a set of
tools and hardware developed for use aboard the ISS and in ground based tests.
Three micro-satellites, a custom metrology system (based on ultrasound time-of-
flight measurements), communications hardware, consumables (tanks and bat-
teries), and an astronaut interface are aboard the ISS. Figure 1 shows the three
SPHERES satellites being operated aboard the ISS during the summer of 2008.
Fig. 1 SPHERES operations aboard the International Space Station (Picture: NASA)
The satellites operate autonomously, after the crew starts the test, within the US
Destiny Laboratory.
The ground-based setup consists of an analog set of hardware: three micro-
satellites, a metrology system with the same geometry as that on the ISS, a research
oriented GUI, and replenishable consumables. A “guest scientist program” [4]
provides documentation and programming interfaces which allow multiple
researchers to use the facility.
The SPHERES satellites were designed to provide the best traceability to future
formation flight missions by implementing all the features of a standard thruster-
based satellite bus. The satellites have fully functional propulsion, guidance, com-
munications, and power sub-systems. These enable the satellites to: maneuver in
6-DoF, communicate with each other and with the laptop control station, and
identify their position with respect to each other and to the experiment reference
frame. The computer architecture allows scientists to re-program the satellite with
new algorithms. The laptop control station (an ISS supplied standard laptop) is used
to collect and store data and to upload new algorithms. It uses the ISS network for
all ground data communications (downlink and uplink). Figure 2 shows a picture of
an assembled SPHERES satellite and identifies its main features. Physical proper-
ties of the satellites are listed in Table 1.
Fig. 2 An assembled SPHERES satellite and its main features (thrusters, battery)
SPHERES has been in operation aboard the ISS since May 2006. To date, 21 test
sessions have taken place. The test sessions have included research on Formation
Flight, Docking and Rendezvous, Fluid Slosh, Fault Detection, Isolation, and
Recovery (FDIR), and general distributed satellite systems autonomy. Abort
commands and collision avoidance techniques form part of this ongoing
research aboard the International Space Station.
The SPHERES Goggles is a hardware upgrade to the SPHERES satellites that adds
cameras, lights, additional processing power and a high speed wireless commu-
nications system. Even though it was designed for autonomous operations, it can be
used to support the operator with a visual feedback. The main objective of the
SPHERES Goggles is to provide a flight-traceable platform for the development,
testing and maturation of computer vision-based navigation algorithms for space-
craft proximity operations. Although this hardware was not intended to be launched
to orbit, it was designed to be easily extensible to versions that can operate both
inside and ultimately outside the ISS or any other spacecraft.
The Goggles, which are shown in Fig. 3, were designed to be able to image
objects that are within a few meters' range and to possess the computational capability
to process the captured images. They further provide a flexible software develop-
ment environment and the ability to reconfigure the optics hardware.
The SPHERES Goggles were used in several parts of the telepresence environ-
ment setup at the MIT SSL ground facilities to support the human operator with a
realistic video feedback which is representative of a camera system used on orbit.
Apart from virtual reality animations of the remote environment it serves as the
only source of visual data in the experiments.
3 Multimodal Telepresence
Unlike robotic manipulators, where haptic feedback plays an important role for
control as e.g. ETS-VII [5] or Rokviss [6], free flyers are commonly only steered
using visual feedback. That means that even though free flying experiments can be
steered with hand controllers, as for example Scamp [7] or the Mini AERCam [8],
usually no haptic information is fed back to the human operator.
The implementation of haptic feedback into the control of free flyers enriches the
telepresence feeling of the operator and helps the operator on ground to navigate. It
paves the way for new concepts of telepresent spacecraft control. Collision avoid-
ance maneuvers for example can be made perceptible for the human operator, by
placing virtual walls around other spacecraft. Equipping these virtual walls with
sufficiently high stiffness means that the operator is not able to penetrate them by
means of the haptic device, since the device exerts a high resistance force on the operator.
Areas of fuel optimal paths can be displayed to the operator by implementing an
ambient damping force, featuring a magnitude which is proportional to the devia-
tion of the actual path from the fuel optimal trajectory and area, respectively.
Docking maneuvers can be supported by virtual boundaries as a haptic guiding
cone and damping forces which are increasing with decreasing distance to the
target.
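As a concrete illustration of this kind of force rendering, the following sketch computes a spring-damper restraining force for a virtual collision-avoidance sphere and an ambient damping force that grows with the deviation from a reference path. It is a hypothetical Python example; the function names, gains, and geometry are assumptions, not the implementation used with the Novint Falcon.

```python
import numpy as np

def sphere_restraining_force(pos, vel, center, radius, k=80.0, d=5.0):
    """Spring-damper force pushing the operator out of a virtual
    collision-avoidance sphere; zero outside the sphere."""
    offset = pos - center
    dist = np.linalg.norm(offset)
    if dist >= radius or dist == 0.0:
        return np.zeros(3)
    normal = offset / dist              # outward direction
    penetration = radius - dist         # penetration depth
    v_normal = np.dot(vel, normal)      # velocity component along the normal
    return (k * penetration - d * v_normal) * normal

def ambient_damping_force(pos, vel, path_point, c=2.0):
    """Damping force whose magnitude is proportional to the deviation
    from the fuel-optimal reference path (path_point: closest point)."""
    deviation = np.linalg.norm(pos - path_point)
    return -c * deviation * vel

# Example: the commanded position lies inside a 0.5 m sphere around
# another spacecraft, so a restraining force is fed back
force = sphere_restraining_force(np.array([0.2, 0.0, 0.0]),
                                 np.array([0.1, 0.0, 0.0]),
                                 center=np.zeros(3), radius=0.5)
print(force)
```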
Summarizing the benefits, it can be seen that the application of telepresence
control will extend the range of serviceable spacecraft failures by involving a well-
trained human operator. In this connection it is proposed that the task performance
of the operator can be enhanced by feeding back high-fidelity information from the
remote work environment. Here the haptic feedback plays an important role in
human perception and will be tested in a representative test environment.
The key element of the test environment is the Novint Falcon [9], which is a 3-DoF
force feedback joystick. All degrees of freedom are of translational nature and servo
motors are used to feed forces in 3-DoF back to the user. This system has high
utility for space applications since it allows the human operator to control the
spacecraft in the three translational degrees of freedom.
Fig. 4 Structure of the test environment: commands from the Novint Falcon are passed through the HaptikLibrary and a UDP connection to the SPHERES satellite on the air table (thruster model, satellite dynamics and estimator, ultrasound metrology with beacons, position and attitude determination) and to a Matlab/Simulink virtual reality representation of the remote environment
The virtual reality representation of the remote environment provides the operator with an
approximation of reality. In the presence of time delay the predictions should give
the user a feeling for the behaviour of the system (cp. ETS-VII) and enhance the
task performance. This is an important point if a human operator on the ground is to
steer an application in space. That way the interactions between the autonomous
space operations and a telepresence controlled free flyer can be tested.
4 Experimental Setup
If OOS missions are executed in low Earth orbit (LEO) only limited time windows
are available for telecommands. The common approach for increasing those acquisition
times is the use of geostationary relay satellites. While those satellites do
not have a profound impact on autonomous missions, they will influence the task
performance of an operator on ground directly interacting with a satellite. Thus, this
human spacecraft interaction was tested using a representative test scenario at SSL,
which involved a geostationary relay satellite.
Due to the orbit height of geostationary satellites, the relay of the signal increases
the round trip delay between operator action and feedback to the operator to up to
7 s as in the case of ETS-VII. The delay between telecommand and telemetry is
usually not very intuitively manageable for the human operator and thus a special
area of interest if human spacecraft interaction is considered.
The effect on the human has already been shown for OOS missions in which the
operator on ground steers robotic manipulators via geostationary relay satellites
[11]. It has not yet been tested for multimodal human free flyer interaction.
Accordingly, for initial tests a geostationary satellite was introduced in the com-
manding chain. The UDP connection (cp. Fig. 4) was utilized to send the commands
of the Novint Falcon at SSL via a terrestrial internet connection to a ground station
at the Institute of Astronautics of Technische Universitaet Muenchen in Germany.
The telecommands were forwarded via the geostationary relay satellite ARTEMIS
(Advanced Relay Technology Mission) of the European Space Agency (ESA) to a
ground station of ESA in Redu, Belgium. The signal was mirrored in Redu and
retransmitted analogously back to MIT, where again the UDP connection was used
to feed the telecommand into the hardware on the air table and change the posi-
tion of SPHERES in the test environment. That way the SPHERES satellites were
controlled by the Novint Falcon via a geostationary satellite. The round trip delay
characteristics were logged and subsequently implemented into the scenario as a
SIMULINK block. That way the test scenarios could be evaluated in the absence of
a satellite link but with round trip delays representative for commanding a space-
craft in orbit.
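The logged delay characteristics can be replayed offline in such a simulation without the satellite link; the following hypothetical Python sketch simply shifts each command by its logged round-trip delay (the sample values and names are assumptions, not the recorded ARTEMIS data).

```python
def replay_with_delay(commands, delays_ms, dt_ms=50):
    """Pair each command, issued every dt_ms, with the time at which it
    would take effect after its logged round-trip delay."""
    arrivals = [(k * dt_ms + delay, cmd)
                for k, (cmd, delay) in enumerate(zip(commands, delays_ms))]
    return sorted(arrivals)                 # playback in arrival order

# Hypothetical samples around the measured mean of roughly 0.7 s,
# including one outlier
logged = [701.2, 689.7, 1320.4, 695.0]
for t_arrive, cmd in replay_with_delay(["cmd0", "cmd1", "cmd2", "cmd3"], logged):
    print(f"{cmd} applied at t = {t_arrive:.1f} ms")
```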
To show the benefit of multimodal feedback to the operator, two scenarios were
developed and tested. Both are based on a servicing operation, in which three
satellites are involved. The target satellite is the satellite to be serviced. Therefore,
the servicer satellite has to execute proximity operations, approach the target, and
eventually dock with it. The inspector satellite is supporting the operator on ground
with additional data of the remote environment. It carries a camera system and
can yield information on the distance between the two other satellites.
In this first scenario the control of the inspector satellite is handed over to the human
operator, while the servicer and the target dock autonomously. The task of the
operator is to ensure that the initial states of the other two satellites are appropriate
for the docking maneuver. Thus, the operator commands the inspector satellite
as depicted in Fig. 5 from its position in front of the other two satellites to a position
behind the two satellites, which is indicated by a virtual checkered marker.
For efficiently accomplishing this circumnavigation, virtual obstacles were
created to avoid collisions with the servicer, the target, and the borders of the
experimental volume. As can be seen in Fig. 5, both of the satellites to be docked feature a
virtual collision avoidance sphere. Further, on the left and the right side of the
volume, virtual (brick) walls are introduced. The Novint Falcon generates
forces in case the operator penetrates those objects. These resistance forces are fed
back to the operator and thus prevent the operator from colliding with the actual hardware on
the SSL air table.
A further benefit of using the virtual reality techniques is that the environment
can be augmented with additional data of the remote environment. For example,
arrows can be used for indicating the current velocity and rotation rate (double
arrow) of the inspector. Furthermore, two representations of the inspector satellite
can be seen in the VR environment. The dark one shows the commanded state,
whereas the pale one shows the actual state of the hardware in the remote
environment. This is of great benefit for the human operator in the presence of
time delays as they occur due to the use of relay satellites.
Similar to the first scenario, the inspector, target, and servicer satellite are again
involved in the second scenario. The servicer is supposed to dock with the target,
whereas the inspector is transmitting additional data from the remote scene. In this
scenario the target and the inspector (right upper corner in Fig. 6) are operating
autonomously and the servicer satellite (lower right corner) is controlled by the
human operator via the relay satellite.
Again, the virtual environment is enriched by collision avoidance objects (at the
inspector and the borders of the volume). The task of the operator is to accomplish a
successful docking maneuver. Therefore, the human operator is supposed to com-
mand the servicer at first to a position roughly aligned with the centre of the docking
cone, which can be seen in Fig. 6 and approx. 50 cm away from the target. In a
second step the operator is commanding the servicer along the virtual cone until the
berthing takes place.
The docking cone is a means to simplify the proximity operations for the operator.
Once the servicer has crossed the assistance horizon of the cone, a force field is
applied to the Falcon, which drives the servicer into the docking cone. Inside the
docking cone another force field drives the servicer towards the target. Here, the
forces are proportional to the distance to the target. This helps the operator to
concentrate on the precision of the docking point rather than to worry about relative
velocities and collisions.
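A hypothetical sketch of the two force fields of the docking cone is given below: once the servicer is inside the assistance horizon, an alignment force drives it toward the cone axis, and an axial force proportional to the remaining distance pulls it toward the target. The geometry and the gains are assumptions.

```python
import numpy as np

def docking_cone_force(pos, target, axis, horizon=0.5, k_align=15.0, k_pull=8.0):
    """Guidance force for a docking cone with apex at `target` and unit
    vector `axis` pointing away from the target along the cone axis."""
    rel = pos - target
    axial = float(np.dot(rel, axis))        # distance along the cone axis
    radial = rel - axial * axis             # offset from the cone axis
    if axial < 0.0 or axial > horizon:
        return np.zeros(3)                  # outside the assistance horizon
    f_align = -k_align * radial             # drive the servicer onto the axis
    f_pull = -k_pull * axial * axis         # pull toward the target (~ distance)
    return f_align + f_pull

axis = np.array([1.0, 0.0, 0.0])
print(docking_cone_force(np.array([0.3, 0.05, 0.0]), np.zeros(3), axis))
```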
The two scenarios were controlled via the German ground station [12] at the
Institute of Astronautics and the ESA relay satellite. The human operator at MIT
in Cambridge received instantaneous feedback from the haptic-visual workspace.
To have representative test conditions, the operator had only visual feedback from
the SPHERES Goggles and the Matlab Simulink virtual instance of the remote
environment. Further, the haptic device yielded additional forces for an advanced
human spacecraft interaction in the 3D environment.
The occurring round trip delays were logged since they are a first indicator for
the quality of the human task performance. Figure 7 shows an example graph of the
delay characteristics over time. The round trip delays are plotted depending on
the respective UDP packet number. They indicate that the delay in a real OOS mission
can be, except for a couple of outliers, well below 1 s. The outliers occurred due to
the use of a terrestrial internet connection and the lack of synchronization between
Fig. 7 Round trip delay characteristic of the free flyer control via ARTEMIS
the sampling rate of the hardware at MIT and the sampling rate of the satellite
modem at the Institute of Astronautics (LRT). Nonetheless, a mean of 695.5 ms with a sample
standard deviation of 24.1 ms indicates an acceptable round trip delay [13] for telepresence operations.
Figs. 8 and 9 Trajectories of the controlled spacecraft in the horizontal plane (x [m] versus y [m]); the magnitude of the force feedback (0-10 N) is indicated in gray scale
The trajectory of the spacecraft is indicated by a solid line, whereas the force feedback is labeled
by small circles in gray scale. If a collision avoidance sphere was penetrated as e.g.
in Fig. 8 a restraining force was created proportional to the penetration depth and
the velocity (spring-damper system) of the spacecraft. The same held true for
virtual walls as can be seen in Fig. 9. This figure further shows the force feedback
inside the docking cone. As can be seen, the haptic feedback prevented the human
operator from colliding with the other spacecraft or the experimental boundaries. It
gave the operator a feeling for critical areas and helped the operator to accomplish a
very smooth docking/berthing approach.
6 Summary
This work presented the first tests on haptic feedback for free flyer systems. It
proposes that multimodal feedback from servicer satellites enhances the human task
performance. This feedback supports the operator with an intuitive concept for
collision avoidance and relative navigation. That way, complex tasks in microgravity
can be safely operated from the ground.
References
8. S. Fredrickson, S. Duran, J. Mitchel, Mini AERCam inspection robot for human space
missions, in Proceedings of AIAA Space, San Diego, USA, Sept 2004
9. Novint Technologies Inc. (February 2010). http://www.novint.com
10. M. de Pascale, D. Prattichizzo, The Haptik Library: a component based architecture for
uniform access to haptic devices, IEEE Robotics Autom. Mag. 14(4), 64–75 (2007)
11. E. Stoll, Ground verification of telepresence for on-orbit servicing. Dissertation, Lehrstuhl für
Raumfahrttechnik, Technische Universität München, 2008, ISBN 978-3-89963-919-3
12. J. Letschnik, E. Stoll, U. Walter, Test environment for time delay measurements of space links
via ARTEMIS, in Proceedings of 4th ESA International Workshop on Tracking, Telemetry
and Command Systems for Space Applications TTC 2007, Darmstadt, Germany, 2007
13. E. Stoll et al., Ground verification of the feasibility of telepresent on-orbit servicing. J. Field
Robotics 26(3), 287–307 (2009)
Chapter 2
A Framework for Collaborative Aspects
of Intelligent Service Robot
Abstract The intelligent service robot is becoming one of the most interesting topics
in recent robotics research. A service robot monitors its surroundings and
provides a service to meet a user's goal. The service often becomes so complex
that a single robot may not handle it efficiently. In other words, a group of robots
may be needed to accomplish the given task(s) by collaborating with each other. We can
define this activity as robot grouping, and further study is needed to make better
groups by considering the characteristics of each robot. However, it is difficult, and
there is no formal method, to form such a specific group from many heterogeneous
robots that differ in their functions and structures. This paper describes an
intelligent service robot framework that outlines a multi-layer structure, which is
suitable for forming a particular group of robots to solve a given task by collaborating
with other robots. Simulated experimentation for grouping from several generated
heterogeneous robots is done by utilizing the entropy algorithm, and the collaboration
among the robots is done by a multi-level task planning mechanism.
J. Suh (*)
School of Computer Science, Kookmin University, 861-1 Jeongneung-Dong, Seongbuk-Gu,
Seoul, Korea
e-mail: [email protected]

1 Introduction

Ubiquitous computing [1] means that various computing objects are connected
through the network, so that the system automatically provides services at any time
and in any place. An intelligent robot is an autonomous and dynamic object in this
ubiquitous environment, and it has become one of the most interesting issues in this
area. It interacts with various surrounding computing devices, recognizes context,
and provides appropriate services to humans and the environment [2]. Context [3]
is defined by the entities that affect interactions among the computing objects in this
environment; for instance, a user, a physical location, or a robot could be such an
entity. Therefore, information describing the characteristics of such entities is
defined as context, and recognizing a situation from the context is the main focus of
a context awareness system [4]. Essentially, the robot is required to address two
main issues in this study: understanding the context and carrying out the context.
First, understanding a context can be done in many different ways, but the
general procedure is that the system perceives raw environmental information
through physical sensors, and context modeling and reasoning steps follow right
after preprocessing the raw data. The final result of this procedure is a context. The
approaches to this procedure vary, and it needs to be studied further in detail to
make the system more efficient; a large number of studies [2-4] have been reported
recently. The second issue is that the robot has to carry out the generated context to
meet the user's goal. The context can be further divided into tasks, which often
become too complex for one single robot to handle in this environment. In other
words, a group of robots may be needed to accomplish the given task(s) by
collaborating with each other. We can define this activity as robot grouping, and we
need to study further how to make better group(s) by considering the characteristics
of the robot and the task.
In this paper, we describe the development of a social robot framework for
providing an intelligent service through the collaboration of heterogeneous robots
in a context awareness environment. The framework is designed as multiple layers
suitable for understanding context, grouping robots, and collaborating among the
robots. In this study, we mainly focused on and implemented the grouping and
collaborating parts of the system. The context understanding part of the system is
designed, but will be discussed in the next study with comprehensive ontological
knowledge representations along with context modeling and context reasoning
mechanisms.
2 Related Works
With the increase of mobile computing devices, ubiquitous and pervasive
computing has become popular recently. One part of a pervasive system is the
context awareness system, which is being studied extensively in various directions.
Among the many research results on this issue, the most prominent systems are
CoBrA [5] (Context Broker Architecture), SOCAM [6] (Service Oriented Context-
Aware Middleware), and the Context-Toolkit [7]. CoBrA and SOCAM use an
ontological model aiming for the benefits of easy sharing, reusing, and reasoning
of context information. But, first, these systems were not flexible enough to extend
to other domains when the system needs to expand with other devices or services
in a new domain. Second, they were also limited in the formalized and shared
expression of the context, which is needed when the system interoperates with, or
is transplanted to, other systems. Therefore, the ontology has become one of the most
popular solutions to represent the data in recent context awareness systems. Short
reviews of these previous systems follow.
Context-Toolkit: This early context awareness middleware system gains information
from the connected devices. But, since it does not use an ontology, it lacks a
standardized representation for the context, and also interoperability between
heterogeneous systems.
CoBrA: This system is developed based on an ontology, so that a standardized
representation for the context is possible. But, since the use of the ontology is
limited to a special domain, the so-called 'Intelligent Meeting Room', it does not
guarantee any extensibility to other, diverse domains.
SOCAM: This system is based on a service-oriented structure, which is an efficient
middleware approach for finding, acquiring, and analyzing context information. But, since
it depends on OWL (Web Ontology Language) for reasoning, its reasoning capability
is limited to its own learning module and inference engine.
In our study, we adopted the merits of the previously studied systems, and designed a
framework that can overcome the above limitations, such as the limited standardized
representation or extensibility. For instance, we have adopted a context-awareness layer
based on the CONCON model [8] to provide extensibility of the ontological
representation.
Our research focuses on the collaboration among robots, which requires forming
a group. Therefore, we first need to develop a method of grouping
robots for a given task. Research on robot grouping is just beginning, but some
related work has been reported, as follows.
For instance, Rodic and Engelbrecht [9] conducted an initial investigation into the
feasibility of using 'social networks' as a coordination tool for multi-robot teams. Under
the assumption that multi-robot teams can accomplish certain tasks faster than a single
robot, they proposed multi-robot coordination techniques. Inspired by concepts
from animal colonies, Labella et al. [10] showed that simple adaptation of an
individual can lead to task allocation. They developed several small and independent
modules, called 's-bots', and the collaboration in this system is achieved by means
of communication among them. They claimed that individuals that are mechanically
better suited for retrieval are more likely to be selected. Another point of view on
collaboration is task allocation among a multi-robot colony. Mataric et al. [11]
carried out an experiment comparing simulated data with a physical mobile
robot experiment. The result showed that there is no single strategy that produces
the best performance in all cases. Other approaches address multi-robot task
allocation by planning algorithms [12-14].
All of these research efforts are being made in many different fields, and the
selection of individuals is rather random or simple, which may often result in
inadequacy in performing a given task. Using the 'entropy' of information theory
[15] could be a good alternative compared to the other, informal approaches. Goodrich
argues that behavioral entropy can predict human workload or measure
human performance in the human robot interaction (HRI) domain [16]. Balch [17]
demonstrated this successfully in his experimental evaluation of multi-robot soccer and
multi-robot foraging teams. In our study, we use the 'entropy' metric for
selecting an appropriate robot from the robot colonies by first generating a decision tree,
to minimize the complexity of adapting the entropy.
As shown in Fig. 1, the overall architecture of our system is divided into three main
layers: the context-awareness layer, the grouping layer, and the collaboration layer. There
are also two further sub-layers, the physical and network layers, which will not be discussed
here, since they are not the main issue. The overall process of the three main
layers works as follows, and the entire structure of the system can be viewed in
Fig. 2.
• The context-awareness layer generates a context from the raw information, and
then does modeling and reasoning in order to be aware of the context.
• The grouping layer creates a decision classifier based on the entropy mechanism,
and makes the necessary group.
• The collaboration layer does multi-level task planning: high-level task planning
generates a set of tasks for the context, and low-level task planning generates a
set of actions for each task.
The context awareness layer receives raw data from surrounding computing
devices including RFID, Zigbee, and so on. Then it transforms the raw data into
a meaningful semantic data by going through some preprocessing steps, and
finally it generates a situation. This work can be done from the next sub modules
as follows.
Raw Data Collector: It simply receives raw data from the physical layer and passes
it to the context provider.
Context Provider: It receives raw data from the raw data collector and transforms
the data into a standardized context as a preprocessing step according to the low
context model. The low context model means that the raw data is formalized but
is not yet semantic data.
Context Integrator: It receives standardized context from the context provider
and generates inference level context through the high-level context modeling. The
high level model supports converting the formalized context into a semantic
context.
When the situation has been generated, it is delivered through the situation acceptor
to the grouping layer to make a group. The grouping layer first receives
information about which robots are connected to the server, and stores this information
in the group info database, because the currently connected robots are the
candidates for making a group. The grouping layer consists of
three sub-modules, the Situation Acceptor, the Classifier Generator, and the Grouper, and the
details of their work are as follows.
Situation Acceptor: It simply receives information regarding a situation from
the context awareness layer, and requests the Classifier Generator to begin
grouping for this given situation.
Classifier Generator: It generates a classifier (i.e. a decision tree) to make a
group for a given specific situation. We also need a set of predefined
training data representing the characteristics of various kinds of robots. In this
study, we generated the classifier based on the ID3 decision tree algorithm.
Grouper: The Grouper has two sub-modules. The searcher requests
instance information from each connected robot through the network layer. After
the instance information for the given situation, such as 'cleaning', is acquired,
the grouper makes a group by using the classifier that was generated by the
Classifier Generator module. The generated group information is stored in the
group info repository, and will be used for collaboration in the collaboration layer
later on.
For this experiment, we can set up several virtual situations, such as a 'cleaning'
situation, a 'delivery' situation, a 'conversation' situation, and so on. The grouping
layer receives one of the situations and starts making a group that is appropriate
for the service.
Figure 3 shows the user interface for entering robot attributes and a situation.
From this interface, we can interactively enter five attributes, such as 'power',
'location', 'speed', 'possession', and 'IL', and a situation, such as 'cleaning'. By using
this interface, we can arbitrarily create as many robot instances as we want, and
they represent heterogeneous robots. We can also set up a situation by simply
selecting it from the window. This means that each robot has different characteristics
and is good for certain work. For instance, we can set up a robot instance
good for a 'cleaning' job as follows: if the robot has low power, its location is near,
its speed is low, it possesses a tray, and so on, then we can consider this robot good
for a cleaning situation. Similarly, we can create several robot instances for our
experimentation.
Each robot's attributes are described in Table 1, which shows that each robot
has different characteristics. For instance, 'power' and 'speed' are the robot's basic
characteristics, 'location' is the robot's location for performing the given
context, 'possession' is a tool that the robot can handle, and 'IL' is the robot's
capability of language interpretation. Since we are not using sensed data from
computing devices for this simulation, we can generate robot instances through
the user interface arbitrarily, as many as needed. Table 1 is a set of training data
that shows ten robot instances for the 'cleaning' situation.
The following equations are the entropy measures of information theory, which will be used to generate a tree:

$$Entropy(S) = \sum_{i=1}^{c} -p_i \log_2 p_i \qquad (1)$$

$$Gain(S, A) = Entropy(S) - \sum_{u \in Values(A)} \frac{|S_u|}{|S|}\, Entropy(S_u) \qquad (2)$$

where $p_i$ is the proportion of instances in S belonging to class i. We can compute the overall entropy using Eq. (1), and compute the information gain to select a single attribute using Eq. (2).
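A minimal Python sketch of how (1) and (2) select the most discriminative attribute for the decision tree (as in ID3) is shown below; the attribute values and class labels are illustrative and not the actual Table 1 training data.

```python
import math
from collections import Counter

def entropy(labels):
    """Entropy(S) = sum_i -p_i * log2(p_i) over the class labels in S."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total)
                for n in Counter(labels).values())

def information_gain(rows, labels, attribute):
    """Gain(S, A) = Entropy(S) - sum_v |S_v|/|S| * Entropy(S_v)."""
    gain = entropy(labels)
    for value in set(row[attribute] for row in rows):
        subset = [lab for row, lab in zip(rows, labels) if row[attribute] == value]
        gain -= (len(subset) / len(labels)) * entropy(subset)
    return gain

# Illustrative robot instances for the 'cleaning' situation
rows = [
    {"power": "low",  "location": "near", "speed": "low",  "possession": "tray"},
    {"power": "high", "location": "far",  "speed": "high", "possession": "arm"},
    {"power": "low",  "location": "near", "speed": "low",  "possession": "tray"},
    {"power": "high", "location": "near", "speed": "high", "possession": "none"},
]
labels = ["yes", "no", "yes", "no"]     # suitable for the cleaning task?

best = max(rows[0].keys(), key=lambda a: information_gain(rows, labels, a))
print("attribute with the highest information gain:", best)
```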
The collaboration layer consists of the following three sub-components on the server side,
while the action planner is on the client (robot) side. Their distinctive features are as follows.
Group Info. and Context Collector: It collects information on the selected situation
and on the grouped robots from the grouping layer.
Task Planner: It generates a set of tasks using the high-level planning rules
(a global plan) on the server side. For instance, the generated tasks for the 'cleaning'
situation can be 'sweeping' and 'mopping'.
Task Allocator: The task allocator sends the generated tasks to the appropriate
robots.
Action Planner: The tasks generated by the task planner are delivered to the client
(robot), and further refined into a set of actions by the action planner.
In the collaboration layer, the task is carried out by the multi-level task planning
mechanism as follows (see Fig. 4).
• The task planner gets the situation(s) and generates a set of tasks based on the
high-level planning rules.
• Then the system allocates the tasks to the appropriate robots that can handle the
specific tasks.
• When the task allocation is done, each assigned robot activates the action
planner to generate a set of actions.
4 Simulated Experimentation
The overall experimentation is divided into two parts, robot grouping and robot
collaboration. Robot grouping is done by generating a classifier using the entropy
metric, and the collaboration is done by the task planning algorithm.
4.1 Robot Grouping

In this study, the robot grouping simulation experiment begins with generating
virtual robot instances and a situation through the user interface, as in Fig. 3. We
can set up the characteristics of each robot by selecting five attributes and can also
set up a virtual situation through the interface. When all the selections are done, we
can send the information to the server using the start/stop button in the interface.
Figure 5 shows a snapshot of the implemented simulation result for grouping.
We designed the implementation result as six sub-windows, and the function of
each window is explained as follows.
• 'Context' window: It shows the selected virtual context, such as 'cleaning'.
• 'Training Data' window: It shows the ten training data for the selected situation.
4.2 Robot Collaboration

The task planning rules for the collaboration layer are divided into two levels, the high-level
planning rules (general task rules) and the low-level planning rules (robot-specific
action rules). A general task rule generates a set of tasks, and a robot-specific
action rule generates a set of actions to perform a task. For example, if
'cleaning' is the selected context, then the task planner generates a set of tasks for
the 'cleaning' as 'sweeping' and 'mopping'. When the task 'sweeping' is assigned
to a robot, the generated action plan is 'move', 'lift', 'sweep', and 'release'. The
task planner works on the server side, and the action planner is located on the client
side. Sample task planning rules are shown in Table 2.
Our system divides the planning mechanism into two levels, high-level planning
and low-level planning. It is a kind of hierarchical planning mechanism that is
efficient enough to keep the planning mechanism as simple as possible. The high-
level planner generates a set of subtasks to accomplish a given context, and saves
them in a stack from which they will be taken one by one by the low-level planner. When the
high-level planning is done, the low-level planner is activated with each one of
the subtasks. A single subtask becomes a goal to accomplish in the low-level
planning process, and it generates a set of actions as a result.
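The following compact Python sketch illustrates this two-level mechanism with the 'cleaning' example from the text. The rule format, the 'mopping' action list, and the round-robin task allocation are assumptions made for illustration only.

```python
# High-level rules: situation -> subtasks (server-side global plan)
TASK_RULES = {"cleaning": ["sweeping", "mopping"]}

# Low-level rules: subtask -> actions (client/robot side)
ACTION_RULES = {
    "sweeping": ["move", "lift", "sweep", "release"],
    "mopping":  ["move", "wet", "mop", "release"],   # hypothetical action set
}

def task_planner(situation):
    """High-level planner: push the subtasks for a situation onto a stack."""
    return list(reversed(TASK_RULES.get(situation, [])))

def action_planner(subtask):
    """Low-level planner: refine a single subtask into a list of actions."""
    return ACTION_RULES.get(subtask, [])

def collaborate(situation, robots):
    """Allocate each subtask to a robot of the group (round-robin here)
    and let its action planner expand the subtask into actions."""
    stack, plan, i = task_planner(situation), {}, 0
    while stack:
        subtask = stack.pop()
        robot = robots[i % len(robots)]
        plan.setdefault(robot, []).extend(action_planner(subtask))
        i += 1
    return plan

print(collaborate("cleaning", ["robot_A", "robot_B"]))
```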
5 Conclusion
Acknowledgement This work was supported by the Seoul R&BD program (10848) of Korea.
References
1. M. Weiser. The computer for the twenty-first century. Sci. Am. 265(3), 94–104 (1991)
2. G.K. Mostefaoui, J. Pasquier-Rocha, P. Brezillon, Context-aware computing: a guide for the
pervasive computing community. Proc. IEEE/ACS Int. Conf. Pervasive Serv. (ICPS’04)
19(23), 39–48 (2004)
3. A.K. Dey, G.D. Abowd, Towards a better understanding of context and context-awareness.
GVU Technical Report GIT-GVU-99-22, Georgia Institute of Technology, 1999
4. M. Baldauf, S. Dustdar, A Survey on context-aware systems. Int. J. Adhoc Ubiquitous
Comput. 2, 263–277 (2004)
5. H. Chen, An intelligent broker architecture for pervasive context-aware systems. Ph.D. thesis,
University of Maryland, 2004
6. T. Gu, H.K. Pung, D.Q. Zhang, A Service-oriented middleware for building context-aware
services. J. Netw. Comput. Appl. 28, 1–18 (2005)
7. D. Salber, A. Dey, G. Abowd, The context toolkit: aiding the development of context-enabled
applications, in Proceedings of CHI’99, Pittsburgh, PA, 1999, pp. 434–441
2 A Framework for Collaborative Aspects of Intelligent Service Robot 29
8. X.H. Wang, T. Gu, D.Q. Zhang, H.K. Pung, Ontology based context modeling and reasoning
using owl, in Proceedings of 2nd IEEE Annual Conference on Pervasive Computing and
Communications Workshops, PERCOMW’04, Orlando, Florida, USA, 2004, pp. 18–22
9. D. Rodic, A.P. Engelbrecht, Social network as a coordination technique for multi-robot
systems. International conference on system design and applications, Springer, Berlin,
2003, pp. 503–513
10. T.H. Labella, M. Dorigo, J-L. Deneubourg, Self-organized task allocation in a group of robots.
Technical Report No. TR/IRIDIA/2004–6, Universite Libre de Bruxelles, 2004
11. M. Mataric, S. Sukhatme, E. Ostergaard, Multi-robot task allocation in Uncertain Environ-
ment. Auton. Robots 14, 255–263 (2003)
12. M. Sterns, N. Windelinckx, Combining planning with reinforcement learning for multi-robot
task allocation. in Proceedings of adaptive agents and MASII, LNAI 3394, 2006, pp. 260–274
13. R. Alami, A. Clodic, V Montreuil, E. Akin, R. Chatila, Task planning for human-robot
interaction, in Proceedings of the 2005 joint conference on Smart Object and ambient
intelligence, 2005, pp. 81–85
14. B. Gerkey, and M. Mataric, Principled communication for dynamic multi-robot task alloca-
tion. Exp. Robotics II, 353–362 (2000)
15. C.E. Shannon, The Mathematical Theory of Communication (University of Illinois Press,
Illinois, 1949)
16. M.A. Goodrich, E.R. Boer, J.W. Crandall, R.W. Ricks, M.L. Quigley, Behavioral entropy in
human-robot interaction, Brigham Young University, Technical Report ADA446467, 2004
17. T. Balch, Hierarchic social entropy: an information theoretic measure of robot group diversity.
Auton. Robots 8, 209–237 (2000)
Chapter 3
Piecewise Bezier Curves Path Planning
with Continuous Curvature Constraint
for Autonomous Driving
1 Introduction
Bezier Curves were invented in 1962 by the French engineer Pierre Bezier for
designing automobile bodies. Today Bezier Curves are widely used in computer
graphics and animation. The Bezier curves have useful properties for the path
generation problem as described in Section 2 of this paper. Hence many path
planning techniques for autonomous vehicles have been discussed based on Bezier
Curves in the literature. Cornell University Team for 2005 DARPA Grand Chal-
lenge used a path planner based on Bezier curves of degree 3 in a sensing/action
feedback loop to generate smooth paths that are consistent with vehicle dynamics
[5]. Skrjanc proposed a new cooperative collision avoidance method for multiple
robots with constraints and known start and goal velocities based on Bezier curves
of degree 4 [6]. In this method, four control points out of five are placed such that
desired positions and velocities of the start and the goal point are satisfied. The fifth
point is obtained by minimizing penalty functions. Jolly described a Bezier curve
based approach for the path planning of a mobile robot in a multi-agent robot soccer
system [3]. The resulting path is planned such that constraints on the initial states of
the robot and the ball, and on obstacle avoidance, are satisfied. The velocity of the
robot along the path is varied continuously to its maximum allowable levels by
keeping its acceleration within the safe limits. When the robot is approaching a
moving obstacle, it is decelerated and deviated to another Bezier path leading to the
estimated target position.
Our previous works introduced two path planning algorithms based on Bezier
curves for autonomous vehicles with waypoints and corridor constraints [1, 2]. Both
algorithms join cubic Bezier curve segments smoothly to generate the reference
trajectory for vehicles to satisfy the path constraints. Also, both algorithms are
constrained in that the path must cross over a bisector line of the corner area such that
the tangent at the crossing point is normal to the bisector. Additionally, the constrained
optimization problem that optimizes the resulting path for a user-defined cost
function was discussed. Although the simulations provided in that paper showed
the generation of smooth routes, discontinuities of the yaw angular rate appeared at
junction nodes between curve segments. This is because the curve segments are
constrained to connect each other by only C1 continuity, so the curvature of the path
is discontinuous at the nodes. (Section 2 describes this in more detail.)
To resolve this problem, we propose new path planning algorithms. The algo-
rithms impose constraints such that curve segments are C2 continuous in order to
have curvature continuous for every point on the path. In addition, they give the
reference path more freedom by eliminating redundant constraints used in previous
works, such as the tangent being normal to the bisector and symmetry of curve
segments on the corner areas. The degree of each Bezier curve segment is determined
by the minimum number of control points needed to satisfy the imposed constraints, while
cubic Bezier curves are used for every segment in the previous works. The optimized
resulting path is obtained by computing the constrained optimization problem that
controls the tradeoff between shortest and smoothest path generation. Furthermore,
the proposed algorithms satisfy the initial and goal orientation as well as position
constraint while previous works only made the position constraint satisfied. The
numerical simulation results demonstrate the improvement of trajectory generation
in terms of smoother steering control and smaller cross track error.
2 Bezier Curve
A Bezier curve of degree n is represented by n + 1 control points and is given by

$$P(t) = \sum_{i=0}^{n} B_i^n(t)\, P_i, \qquad t \in [0, 1]$$

where $P_i$ are the control points and $B_i^n(t)$ is a Bernstein polynomial given by

$$B_i^n(t) = \binom{n}{i}\, (1 - t)^{n-i}\, t^i, \qquad i \in \{0, 1, \ldots, n\}$$
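As an illustration only (not code from the paper), the definition above can be evaluated directly from the control points:

```python
import numpy as np
from math import comb

def bernstein(n, i, t):
    """Bernstein polynomial B_i^n(t) = C(n, i) * (1 - t)^(n - i) * t^i."""
    return comb(n, i) * (1.0 - t) ** (n - i) * t ** i

def bezier_point(control_points, t):
    """Evaluate P(t) = sum_i B_i^n(t) * P_i for t in [0, 1]."""
    pts = np.asarray(control_points, dtype=float)
    n = len(pts) - 1
    return sum(bernstein(n, i, t) * pts[i] for i in range(n + 1))

# Cubic example: the curve starts and ends at the first and last control point
ctrl = [(0.0, 0.0), (1.0, 2.0), (3.0, 2.0), (4.0, 0.0)]
print(bezier_point(ctrl, 0.0), bezier_point(ctrl, 0.5), bezier_point(ctrl, 1.0))
```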
The de Casteljau algorithm subdivides a Bezier curve at an arbitrary parameter $t \in (0, 1)$ by the recursion

$$P_i^j = (1 - t)\, P_i^{j-1} + t\, P_{i+1}^{j-1}, \qquad j \in \{1, \ldots, n\}, \quad i \in \{0, \ldots, n - j\} \qquad (1)$$

with $P_i^0 = P_i$. Then $\{P_0^0, P_0^1, \ldots, P_0^n\}$ are the control points of one subdivided segment and $\{P_0^n, P_1^{n-1}, \ldots, P_n^0\}$ are those of the other (see the example in Fig. 1). Note that, by applying the properties of Bezier curves described above, both of the subdivided curves end at $P_0^n$; one is tangent to $\overline{P_0^{n-1} P_0^n}$ and the other to $\overline{P_0^n P_1^{n-1}}$ at that point. Since $P_0^n$ is chosen on $\overline{P_0^{n-1} P_1^{n-1}}$ by using (1), the three points $P_0^{n-1}$, $P_0^n$, and $P_1^{n-1}$ are collinear.
Remark 1. A Bezier curve P(t) constructed by control points $\{P_0^0, P_1^0, \ldots, P_n^0\}$ always passes through the point $P_0^n$ and is tangent to $\overline{P_0^{n-1} P_1^{n-1}}$ at $P_0^n$.

Fig. 1 Subdividing a cubic Bezier curve with t = 0.4 by the de Casteljau algorithm
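A short sketch of the subdivision in (1) is given below; it returns the control points $\{P_0^0, \ldots, P_0^n\}$ and $\{P_0^n, \ldots, P_n^0\}$ of the two subdivided curves (illustrative code, not from the paper).

```python
import numpy as np

def de_casteljau_split(control_points, t):
    """Split a Bezier curve at parameter t with the recursion
    P_i^j = (1 - t) * P_i^(j-1) + t * P_(i+1)^(j-1)."""
    layer = [np.asarray(p, dtype=float) for p in control_points]
    left, right = [layer[0]], [layer[-1]]
    while len(layer) > 1:
        layer = [(1.0 - t) * layer[i] + t * layer[i + 1]
                 for i in range(len(layer) - 1)]
        left.append(layer[0])       # P_0^j
        right.append(layer[-1])     # P_(n-j)^j
    return left, right[::-1]

# Split the cubic of Fig. 1 at t = 0.4; both halves meet at P_0^n
left, right = de_casteljau_split([(0, 0), (1, 2), (3, 2), (4, 0)], 0.4)
print(left[-1], right[0])
```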
The derivatives of a Bezier curve can be determined by its control points. For a Bezier curve $P(t) = \sum_{i=0}^{n} B_i^n(t) P_i$, the first derivative is given by

$$\dot P(t) = \sum_{i=0}^{n-1} B_i^{n-1}(t)\, D_i \qquad (2)$$

where $D_i$, the control points of $\dot P(t)$, are

$$D_i = n\, (P_{i+1} - P_i)$$

Geometrically, (2) provides us with a tangent vector. Higher order derivatives can be obtained by applying the relationship (2) iteratively.
Two Bezier curves P(t) and Q(t) are said to be $C^k$ continuous at $t_0$ if

$$P(t_0) = Q(t_0), \quad \dot P(t_0) = \dot Q(t_0), \quad \ldots, \quad P^{(k)}(t_0) = Q^{(k)}(t_0) \qquad (3)$$

The curvature of a planar curve (x(t), y(t)) is given by

$$\kappa(t) = \frac{\left|\dot x(t)\, \ddot y(t) - \dot y(t)\, \ddot x(t)\right|}{\left(\dot x^2(t) + \dot y^2(t)\right)^{3/2}} \qquad (4)$$
Lemma 1. For the path constructed by two Bezier curve segments $P(t)|_{t \in [t_0, t_1]}$ and $Q(t)|_{t \in [t_1, t_2]}$, if P(t) and Q(t) are at least $C^2$ continuous at $t_1$ then the path has continuous curvature for every point on it.

Proof. The curvature is expressed in terms of the first and the second derivative of a curve in (4). Since Bezier curves are defined as polynomial functions of t, their k-th derivatives for all k = 1, 2, . . . are continuous. Hence, they always have continuous curvature for all t. For two different Bezier curves P(t) and Q(t), the curvature $\kappa(t_1)$ at the junction node is continuous if $\dot P(t_1) = \dot Q(t_1)$ and $\ddot P(t_1) = \ddot Q(t_1)$, i.e. if the curves are $C^2$ continuous at $t_1$. □
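For a numerical check of (2) and (4), the sketch below builds the derivative control points $D_i$ and evaluates the curvature of a planar Bezier curve (illustrative code under the stated formulas, not from the paper).

```python
import numpy as np
from math import comb

def bezier(ctrl, t):
    n = len(ctrl) - 1
    return sum(comb(n, i) * (1 - t) ** (n - i) * t ** i * np.asarray(ctrl[i], float)
               for i in range(n + 1))

def derivative_ctrl(ctrl):
    """Control points D_i = n * (P_(i+1) - P_i) of the first derivative, eq. (2)."""
    pts = np.asarray(ctrl, float)
    return (len(ctrl) - 1) * (pts[1:] - pts[:-1])

def curvature(ctrl, t):
    """kappa(t) = |x' y'' - y' x''| / (x'^2 + y'^2)^(3/2), eq. (4)."""
    d1 = derivative_ctrl(ctrl)
    d2 = derivative_ctrl(d1)
    v, a = bezier(d1, t), bezier(d2, t)
    return abs(v[0] * a[1] - v[1] * a[0]) / np.hypot(v[0], v[1]) ** 3

print(curvature([(0, 0), (1, 2), (3, 2), (4, 0)], 0.5))
```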
3 Problem Statement

The course is given by a sequence of waypoints $W_1, \ldots, W_N$ with corridor widths $w_i$, and $\psi_0$ and $\psi_f$ denote the initial and goal headings of the vehicle. With this notation, the path planning problem is formulated as: given an initial position and orientation and a goal position and orientation of the vehicle, generate a path $l$ specifying a continuous sequence of positions and orientations of the vehicle satisfying the path constraints [4]. In other words, we are to find a continuous map

$$l : [0, 1] \to C$$

with

$$l(0) = q_{init}, \qquad l(1) = q_{goal}$$

where $q_{init} = (W_1, \psi_0)$ and $q_{goal} = (W_N, \psi_f)$ are the initial and goal states of the path, respectively, and C denotes the configuration space of vehicle positions and orientations.
Given a planned path, we use a path following technique with feedback corrections, as illustrated in Fig. 2. A position and orientation error is computed every 50 ms. A point z is computed with the current longitudinal velocity and heading of the vehicle from the current position. z is projected onto the reference trajectory at a point p such that $\overline{zp}$ is normal to the tangent at p. The cross track error $y_{err}$ is defined by the distance between z and p. The steering control $\omega$ uses a PID controller with respect to the cross track error $y_{err}$:

$$\delta\omega = k_p\, y_{err} + k_d\, \frac{d y_{err}}{dt} + k_i \int y_{err}\, dt$$

Fig. 2 The path following technique: the point z ahead of the vehicle is projected onto the reference trajectory at p, and the cross track error $y_{err}$ is the distance between z and p
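A discrete-time version of this steering law can be sketched as follows; the 50 ms sample time is taken from the text, while the gains are placeholders.

```python
class CrossTrackPID:
    """Discrete PID on the cross track error y_err, evaluated every 50 ms."""
    def __init__(self, kp, ki, kd, dt=0.05):
        self.kp, self.ki, self.kd, self.dt = kp, ki, kd, dt
        self.integral = 0.0
        self.prev_err = 0.0

    def step(self, y_err):
        self.integral += y_err * self.dt
        derivative = (y_err - self.prev_err) / self.dt
        self.prev_err = y_err
        # delta_omega = kp*y_err + kd*d(y_err)/dt + ki*integral(y_err)dt
        return self.kp * y_err + self.kd * derivative + self.ki * self.integral

pid = CrossTrackPID(kp=2.0, ki=0.1, kd=0.5)      # hypothetical gains
for err in [0.4, 0.3, 0.15, 0.05]:               # decaying cross track error
    print(round(pid.step(err), 3))
```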
4 Path Planning Algorithms

In this section, two path planning methods based on Bezier curves are proposed. To describe the algorithms, let us denote $\hat b_j$ as the unit vector codirectional with the outward bisector of $\angle W_{j-1} W_j W_{j+1}$ for $j \in \{2, \ldots, N-1\}$, as illustrated in Fig. 3. The planned path must cross over the bisectors under the waypoint and the corridor constraints. The location of the crossing point is represented as $W_j + d_j \hat b_j$, where $d_j \in \mathbb{R}$ is a scalar value. The course is divided into segments $G_i$ by the bisectors. $G_i$ indicates the permitted area for vehicles under corridor constraint $w_i$, from $W_i$ to $W_{i+1}$.

Bezier curves constructed by large numbers of control points are numerically unstable. For this reason, it is desirable to join low-degree Bezier curves together in a smooth way for path planning [6]. Thus both of the algorithms use a set of low-degree Bezier curves such that neighboring curves are $C^2$ continuous at their end nodes. This will lead to continuous curvature on the resulting path by Lemma 1. The Bezier curves used for the path planning are denoted as ${}^iP(t) = \sum_{k=0}^{n_i} B_k^{n_i}(t)\, {}^iP_k$ for $i \in \{1, \ldots, M\}$, $t \in [0, 1]$, where M is the total number of Bezier curves and $n_i$ is the degree of ${}^iP$. The planned path $l$ is the concatenation of all ${}^iP$ such that

$$l = \{{}^1P(t), {}^2P(t), \ldots, {}^MP(t)\}$$

Fig. 3 The course with waypoints $W_j$, bisectors $\hat b_j$, and segments $G_i$
In this path planning method (BS), the Bezier curves ${}^iP$ for $i \in \{1, \ldots, N-1\}$ are used within the segments $G_i$. The adjacent curves ${}^{j-1}P$ and ${}^jP$ are $C^2$ continuous at the crossing point $W_j + d_j \hat b_j$. The control points ${}^iP_k$ for $k \in \{0, \ldots, n_i\}$ are determined to maintain the following conditions.

• The beginning and the end point are $W_1$ and $W_N$:

$${}^1P_0 = W_1, \qquad {}^{N-1}P_{n_{N-1}} = W_N \qquad (5)$$

• The initial and goal orientations $\psi_0$ and $\psi_f$ are satisfied:

$${}^1P_1 = W_1 + l_0\, (\cos\psi_0, \sin\psi_0), \qquad l_0 \in \mathbb{R}^+ \qquad (6a)$$
$${}^{N-1}P_{n_{N-1}-1} = W_N - l_f\, (\cos\psi_f, \sin\psi_f), \qquad l_f \in \mathbb{R}^+ \qquad (6b)$$

• ${}^{j-1}P$ and ${}^jP$, $\forall j \in \{2, \ldots, N-1\}$, are $C^2$ continuous at the crossing point:

$${}^{j-1}P_{n_{j-1}} = {}^jP_0 = W_j + d_j \hat b_j \qquad (7a)$$
$${}^{j-1}\dot P(1) = {}^j\dot P(0) \qquad (7b)$$
$${}^{j-1}\ddot P(1) = {}^j\ddot P(0) \qquad (7c)$$

• The crossing points satisfy the corridor constraint:

$$|d_j| < \tfrac{1}{2}\, \min(w_{j-1}, w_j) \qquad (8)$$

• The remaining control points of each ${}^iP$ lie within the corresponding segment:

$${}^iP_1 \in G_i, \ \ldots, \ {}^iP_{n_i - 1} \in G_i \qquad (9)$$

Equations (6a, b) are derived by using the tangent property of Bezier curves at their end points. Equations (7a–c) are obtained by applying (2) and (3). Equation (9) makes the resulting Bezier curve satisfy the corridor constraint by the convex hull property. At each crossing point, three control points of each adjacent Bezier curve are dedicated to the $C^2$ continuity constraint by (2), (4), and Lemma 1. So the minimum number of control points needed to satisfy the constraints independently of the others is six for ${}^iP$, $i \in \{2, \ldots, N-2\}$. On the other hand, ${}^1P$ needs five: three for the continuity constraint at the crossing point and two for the initial position and orientation constraints (5) and (6a); likewise, ${}^{N-1}P$ needs five.
Note that ${}^1P_0$ and ${}^{N-1}P_{n_{N-1}}$ are fixed in (5). ${}^1P_1$ and ${}^{N-1}P_{n_{N-1}-1}$ are fixed in (6a, b). ${}^{j-1}P_{n_{j-1}}$ and ${}^jP_0$ rely on $d_j$ in (7a–c). ${}^{j-1}P_{n_{j-1}-1}$ and ${}^{j-1}P_{n_{j-1}-2}$ rely on ${}^jP_1$ and ${}^jP_2$. So the free variables are, $\forall j \in \{2, \ldots, N-1\}$, $P_1 = \{{}^jP_1\}$, $P_2 = \{{}^jP_2\}$, $d = \{d_j\}$, and $L = \{l_0, l_f\}$. The number of variables, or the degrees of freedom, is $5N - 8$. The variables are computed by solving the constrained optimization problem:

$$\min_{P_1, P_2, d, L} J = \sum_{i=1}^{N-1} J_i \qquad (10)$$

subject to (8) and (9), where $J_i$ is the cost function of ${}^iP(t)$, given by

$$J_i = \int_0^1 \left[\, \alpha_i \left(\dot x^2(t) + \dot y^2(t)\right) + \beta_i\, |{}^i\kappa(t)|^2 \,\right] dt \qquad (11)$$

where $\alpha_i \in \mathbb{R}$ and $\beta_i \in \mathbb{R}$ are constants that control the tradeoff between the arc length and the curvature of the resulting path.
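The cost (11) can be approximated numerically by sampling each curve segment; the sketch below uses a simple midpoint sum, with placeholder weights for $\alpha_i$ and $\beta_i$ (illustrative code, not the optimization used in the paper).

```python
import numpy as np
from math import comb

def bezier(ctrl, t):
    n = len(ctrl) - 1
    return sum(comb(n, i) * (1 - t) ** (n - i) * t ** i * np.asarray(ctrl[i], float)
               for i in range(n + 1))

def deriv(ctrl):
    pts = np.asarray(ctrl, float)
    return (len(ctrl) - 1) * (pts[1:] - pts[:-1])

def segment_cost(ctrl, alpha=1.0, beta=10.0, samples=200):
    """J_i ~ sum_t [alpha*(x'^2 + y'^2) + beta*|kappa(t)|^2] * dt."""
    d1, d2 = deriv(ctrl), deriv(deriv(ctrl))
    dt, cost = 1.0 / samples, 0.0
    for k in range(samples):
        t = (k + 0.5) * dt
        v, a = bezier(d1, t), bezier(d2, t)
        speed2 = v[0] ** 2 + v[1] ** 2
        kappa = abs(v[0] * a[1] - v[1] * a[0]) / speed2 ** 1.5
        cost += (alpha * speed2 + beta * kappa ** 2) * dt
    return cost

print(segment_cost([(0, 0), (1, 2), (3, 2), (4, 0)]))
```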
Another path planning method (BC) adds quadratic Bezier curves on the corner areas around $W_j$, $j \in \{2, \ldots, N-1\}$. The quadratic Bezier curve around $W_j$ is denoted as ${}^jQ(t) = \sum_{k=0}^{2} B_k^2(t)\, {}^jQ_k^0$ and intersects the j-th bisector. The first and the last control points, ${}^jQ_0^0$ and ${}^jQ_2^0$, are constrained to lie within $G_{j-1}$ and $G_j$, respectively. Within each segment $G_i$, another Bezier curve is used to connect the end points of the ${}^jQ$ with $C^2$ continuity and/or $W_1$ with slope $\psi_0$ and/or $W_N$ with slope $\psi_f$. Hence, the ${}^jQ$ are the curve segments with even index:

$${}^{2(j-1)}P(t) = {}^jQ(t), \qquad j \in \{2, \ldots, N-1\}$$

Let $t_c$ denote the Bezier curve parameter corresponding to the crossing point of ${}^jQ(t)$ on the bisector, such that

$${}^jQ(t_c) = W_j + d_j \hat b_j \qquad (12)$$

Let $\theta_j$ denote the angle of the tangent vector at the crossing point from the X-axis:

$${}^j\dot Q(t_c) = \left(|{}^j\dot Q(t_c)| \cos\theta_j,\ |{}^j\dot Q(t_c)| \sin\theta_j\right) \qquad (13)$$

The notation is illustrated in Fig. 4. Due to the constraint of ${}^jQ_0^0$ and ${}^jQ_2^0$ within $G_{j-1}$ and $G_j$, the feasible scope of $\theta_j$ is limited to the same direction as $W_{j+1}$ is with respect to $\hat b_j$. In other words, if $W_{j+1}$ is to the right of $\hat b_j$, then $\theta_j$ must point to the right of $\hat b_j$, and vice versa.

Given ${}^jQ_0^0$, ${}^jQ_2^0$, $d_j$, and $\theta_j$, the other control point ${}^jQ_1^0$ is computed such that the crossing point is located at $W_j + d_j \hat b_j$ and the angle of the tangent vector at the crossing point is $\theta_j$. Since each control point is two-dimensional, the degrees of freedom of ${}^jQ(t)$ are six. Since $d_j$ and $\theta_j$ are scalar, representing ${}^jQ(t)$ in terms of ${}^jQ_0^0$, ${}^jQ_2^0$, $d_j$, and $\theta_j$ does not affect the degrees of freedom. However, it brings an advantage for the corridor constraint. If we compute ${}^jQ_1^0$ as above, the points computed by applying the de Casteljau algorithm such that the two subdivided curves are separated by the j-th bisector are represented in terms of ${}^jQ_0^0$ and ${}^jQ_2^0$, as described in the following. The two subdivided curves are constructed by $\{{}^jQ_0^0, {}^jQ_0^1, {}^jQ_0^2\}$ and $\{{}^jQ_0^2, {}^jQ_1^1, {}^jQ_2^0\}$. We can test whether the convex hull of $\{{}^jQ_0^0, {}^jQ_0^1, {}^jQ_0^2\}$ lies within $G_{j-1}$ and whether that of $\{{}^jQ_0^2, {}^jQ_1^1, {}^jQ_2^0\}$ lies within $G_j$ in (26), instead of testing that of $\{{}^jQ_0^0, {}^jQ_1^0, {}^jQ_2^0\}$. (Note that ${}^jQ_1^0$ is not constrained to lie within the corridor, as shown in Fig. 4.) So, the convex hull property is tested under tighter conditions against the corridor constraint without increasing the degrees of freedom.

In order to compute ${}^jQ_1^0$, the world coordinate frame T is translated and rotated into the local frame ${}^jT$, whose origin is at the crossing point ${}^jQ(t_c)$ and whose X axis is codirectional with the tangent vector of the curve at the crossing point, ${}^j\dot Q(t_c)$.
Fig. 4 The geometry of ${}^jQ(t)$ in the corner area around $W_j$
Fig. 5 The control points of ${}^jQ(t)$ with respect to the ${}^jT$ frame (the de Casteljau points ${}^jQ_0^1$ and ${}^jQ_1^1$ lie on the X axis at distances $a t^*$ and $a(1-t^*)$ from the origin)
Let us consider the subdivision ratio t* ∈ (0, 1) such that the location of ^jQ_0^2 computed by applying the de Casteljau algorithm with it is the crossing point. In other words, t* places ^jQ_0^2 at the origin with respect to the ^jT frame. Fig. 5 illustrates the control points of ^jQ(t) with respect to the ^jT frame. Note that ^jQ_0^2 is at the origin by the definition of ^jT and t*. ^jQ_0^1 and ^jQ_1^1 are on the X axis by the definition of ^jT and Remark 1. Let the coordinates of the control points be denoted as Q_i^0 = (x_i, y_i), i ∈ {0, 1, 2}, where all coordinates are with respect to ^jT.
Lemma 2. Given d_j and θ_j, for ^jQ(t) to intersect the j-th bisector with the crossing point determined by d_j and (12), and the tangent vector at that point determined by θ_j and (13), it is necessary that y_0 y_2 ≥ 0.
Proof. Let (x(t), y(t)) denote the coordinates of ^jQ(t) with respect to ^jT. By the definition of ^jT and Remark 1, Q(t) passes through the origin with a tangent slope of zero with respect to ^jT. That is, x(t_c) = 0, y(t_c) = 0 and \dot{y}(t_c) = 0. Suppose that y_0 = y(0) < 0. Since y(t) is a quadratic polynomial, \dot{y}(t) > 0 and \ddot{y}(t) < 0 for t ∈ [0, t_c). Subsequently, \dot{y}(t) < 0 and \ddot{y}(t) < 0 for t ∈ (t_c, 1]. Thus, y_2 = y(1) < 0 and y_0 y_2 > 0. Similarly, if y_0 > 0 then y_2 > 0. If y_0 = 0 then \dot{y}(t) = 0 for t ∈ [0, 1] and y_2 = 0. Thus, y_0 y_2 = 0. □
We now calculate ^jQ_1^0 depending on whether y_0 y_2 is nonzero. For simplicity, the superscript j is dropped from now on. Without loss of generality, suppose that y_0 < 0 and y_2 < 0. Q_0^2 is represented as in (14), where a > 0 is some constant. Applying (1) with i = 0 and j = 1 and arranging the result with respect to Q_1^0 by using (14) gives

Q_1^0 = \left( -a - \frac{1 - t^*}{t^*} x_0, \; -\frac{1 - t^*}{t^*} y_0 \right)    (15)
t^* = \frac{1}{1 + \sqrt{y_2 / y_0}}, \qquad a = \frac{x_0 y_2 - y_0 x_2}{2 y_0 \sqrt{y_2 / y_0}}    (17)

Both t* and a involve the square root of y_2/y_0. So, if y_0 y_2 < 0 then t* and a are not determined and, hence, Q(t) is infeasible. That is, (17) agrees with Lemma 2.
If y_0 = y_2 = 0 then all control points of ^jQ are on the X axis (see the proof of Lemma 2). From the geometric relation of the control points and the points computed by applying the de Casteljau algorithm, as shown in Fig. 6, we obtain

x_0 = -(\alpha + \beta) t^*, \qquad x_2 = (\alpha + \gamma)(1 - t^*), \qquad \alpha = \beta (1 - t^*) + \gamma t^*    (18)

where α > 0, β > 0, γ > 0 are some constants. Using (18), Q_1^0 = (x_1, 0) is represented in terms of an arbitrary t* ∈ (0, 1):

x_1 = -\frac{1}{2} \left( \frac{1 - t^*}{t^*} x_0 + \frac{t^*}{1 - t^*} x_2 \right)    (19)
{}^1P_0 = W_1, \qquad {}^{2N-3}P_4 = W_N    (20)

{}^1P_1 = W_1 + l_0 (\cos\psi_0, \sin\psi_0), \quad l_0 \in \mathbb{R}^+    (21a)

{}^{2N-3}P_3 = W_N - l_f (\cos\psi_f, \sin\psi_f), \quad l_f \in \mathbb{R}^+    (21b)
Fig. 6 The geometric relation of the control points and the de Casteljau points on the X axis
• ^{i-1}P and ^iP, ∀i ∈ {2, ..., 2N − 3}, are C^2 continuous at the junctions:
{}^{i-1}P_{n_{i-1}} = {}^iP_0    (22a)

|d_j| < \tfrac{1}{2} \min(w_{j-1}, w_j), \quad \forall j \in \{2, \dots, N-1\}    (23)
• θ_j has the same direction as W_{j+1} is with respect to \hat{b}_j.
• ^jQ_0^0 and ^jQ_2^0 with respect to ^jT satisfy Lemma 2:

y_0 y_2 \ge 0    (25)

{}^jQ_0^0 \in G_{j-1}, \quad {}^jQ_0^1 \in G_{j-1}, \quad {}^jQ_2^0 \in G_j, \quad {}^jQ_1^1 \in G_j    (26)

{}^iP_1 \in G_i, \; \dots, \; {}^iP_{n_i - 1} \in G_i, \quad i \in \{1, 3, \dots, 2N-3\}    (27)
The free variables are, for j ∈ {2, ..., N − 1}, Q = {^jQ_0^0, ^jQ_2^0}, d = {d_j}, θ = {θ_j}, and L = {l_0, l_f}. The number of degrees of freedom is 6N − 10. The variables are computed by solving the constrained optimization problem:

\min_{Q, d, \theta, L} J = \sum_{i=1}^{N-1} J_i    (28)
subject to (23), (24a, b), (25), (26), and (27), where J_i is the cost function of ^iP(t), defined in (11). Notice that the convex hull property is tested for ^jQ_0^1 and ^jQ_1^1 of the subdivided curves instead of ^jQ_1^0 in (26). Thus, it yields tighter conditions for the curves against the corridor constraint.
5 Simulation Results
Simulations were run for a course with four waypoints and an identical corridor width of 8, as illustrated in Fig. 7. The initial and goal orientations are given by ψ_0 = ψ_f = 0. For path following, a constant longitudinal velocity v(t) = 10 m/s is used. The magnitude of ω is bounded within |ω|_max = 25 rpm. The PID gains are given by k_p = 2, k_d = 1, and k_i = 0.1.
Fig. 7 The planned paths by the BS method (top: λ^l_BS, λ^c_BS) and by the BC method (bottom: λ^l_BC, λ^c_BC). Each pair of results is obtained by minimizing arc lengths (left) and by minimizing curvatures (right)
In Fig. 7, the reference paths (dashed curves) planned by applying the BS and BC methods are placed in the top and bottom rows, respectively. The actual trajectories (solid curves) are generated by using the proposed tracking method. In the figures, stars indicate the locations of the control points of Bezier curve segments with an even number index, and empty circles indicate those of the others. The paths in the left column are planned by minimizing the summation of (11) with a_i = 1 and b_i = 0. Since only arc length is penalized, the cost function leads to paths with minimum arc length, which we denote λ^l_BS and λ^l_BC for the BS and BC methods, respectively. On the other hand, the paths in the right column are planned by minimizing the cost function with a_i = 0 and b_i = 1, so that resulting paths with larger radii of curvature are provided. We denote them λ^c_BS and λ^c_BC for the BS and BC methods, respectively. λ^c_BS and λ^c_BC generate longer but smoother trajectory guidance than λ^l_BS and λ^l_BC. Looking at the tracking results on λ^l_BS and λ^l_BC, the vehicle overshoots in the sharp turn around W_3, resulting in a large position error (see Fig. 8), due to the limit on the steering angle rate. The commanded steering angle rate on λ^l_BC, marked by '*' in Fig. 9, undergoes rapid changes and is constrained by the rate limit. However, on λ^c_BS and λ^c_BC, the vehicle tracks those parts of the planned paths accurately thanks to the larger radii of curvature. We can verify this more tangibly in the cross track error plot provided in Fig. 8. Also, the steering control signal on λ^c_BC, marked by 'o' in Fig. 9, is smoother than that on λ^l_BC.
Fig. 8 The cross track errors by BC (λ^l_BC vs. λ^c_BC)
Fig. 9 The steering controls by BC (λ^l_BC vs. λ^c_BC)
The main difference of the proposed algorithm from the previous ones of [1] is the degree of continuity at the junctions: C^2. Assuming that the vehicle tracks a reference path perfectly and v is continuous, if κ is continuous then ω is continuous, because ω = κv. When v is constant, as in this simulation, ω is proportional to κ, so the continuity characteristic of ω tends to follow that of κ. Since the previous algorithms imposed only C^1 continuity on the junction nodes of the path, κ is discontinuous at the nodes. Hence the path underwent discontinuities of the angular rate, that is, large angular accelerations, which lead to large torques on the vehicle. On the other hand, the proposed algorithm keeps κ continuous along the resulting paths. If the path has curvature small enough that the vehicle is able to track it accurately given a maximum steering angle rate, then the steering control signal will be smooth, as that of λ^c_BC in Fig. 9.
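As a small numerical illustration of the ω = κv relation (a sketch under assumed values, not the simulation code), the curvature of a sample Bezier segment can be evaluated as κ = (ẋÿ − ẏẍ)/(ẋ² + ẏ²)^(3/2) and multiplied by the constant speed to obtain the implied yaw rate; the control points below are hypothetical.

import numpy as np

def cubic_bezier(P, t):
    # Evaluate a cubic Bezier curve with control points P[0..3] at the parameters in t.
    t = np.asarray(t)[:, None]
    return ((1 - t)**3 * P[0] + 3 * t * (1 - t)**2 * P[1]
            + 3 * t**2 * (1 - t) * P[2] + t**3 * P[3])

def curvature(xy, t):
    # Numerical curvature kappa = (x' y'' - y' x'') / (x'^2 + y'^2)^(3/2).
    dx, dy = np.gradient(xy[:, 0], t), np.gradient(xy[:, 1], t)
    ddx, ddy = np.gradient(dx, t), np.gradient(dy, t)
    return (dx * ddy - dy * ddx) / (dx**2 + dy**2)**1.5

t = np.linspace(0.0, 1.0, 200)
P = np.array([[0.0, 0.0], [10.0, 0.0], [20.0, 10.0], [30.0, 10.0]])  # hypothetical segment
kappa = curvature(cubic_bezier(P, t), t)
v = 10.0                       # constant longitudinal speed, as in the simulation
omega = kappa * v              # yaw rate implied by omega = kappa * v
print(float(omega.min()), float(omega.max()))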
6 Conclusions
This paper presents two path planning algorithms based on Bezier curves for
autonomous vehicles with waypoints and corridor constraints. Bezier curves pro-
vide an efficient way to generate the optimized path and satisfy the constraints at
the same time. The simulation results also show that the trajectory of the vehicle
follows the planned path within the constraints.
References
1. J. Choi, R.E. Curry, G.H. Elkaim, in Path Planning Based on Bezier Curve for Autonomous
Ground Vehicles (Chapter 19, IEEE Computer Society, 2009), pp. 158–166
2. J. Choi, G.H. Elkaim, Bezier curves for trajectory guidance. Proceedings of the World
Congress on Engineering and Computer Science, WCECS 2008, San Francisco, CA (2008)
3. K.G. Jolly, K.R. Sreerama, R. Vijayakumar, A bezier curve based path planning in a multi-
agent robot soccer system without violating the acceleration limits. Robot. Autonom. Syst. 57,
23–33 (2009)
4. J.C. Latombe, in Robot Motion Planning (Kluwer, Norwell, MA, 1991)
5. I. Miller, S. Lupashin, N. Zych, P. Moran, B. Schimpf, A. Nathan, E. Garcia, in Cornell
University’s 2005 DARPA Grand Challenge Entry, vol. 36, chapter 12 (Springer, Heidelberg,
2007), pp. 363–405
6. I. Skrjanc, G. Klancar, Cooperative collision avoidance between multiple robots based
on bezier curves. Proceedings of the Information Technology Interfaces, 2007 (ITI 2007), pp.
451–456
Chapter 4
Combined Heuristic Approach
to Resource-Constrained Project
Scheduling Problem
1 Introduction
M. Šeda (*)
Institute of Automation and Computer Science, Faculty of Mechanical Engineering,
Brno University of Technology, Technická 2, Brno 616 69, Czech Republic
e-mail: [email protected]
2 Basic Notions
In this section we introduce the notation used in this paper and the basic concepts of CPM. We consider a network graph G with n topologically ordered vertices [14], which means that, for each edge (i, j), i appears before j (i < j), the starting vertex has number n0 = 1 and the ending vertex has number n. This ordering can be obtained as follows (a short sketch in code is given after the list):
1. Start from the origin and assign n0 to it.
2. Leave all edges outgoing from n0 and assign numbers n0 + 1, . . . , n0 + k1 to k1
vertices that have no input edge.
3. Leave all edges outgoing from the vertices numbered in the previous step and
assign numbers n0 + k1 + 1, . . . , n0 + k1 + k2 to k2 vertices that have no input edge.
4. Continue this way until all vertices are numbered.
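The sketch below (plain Python, with an assumed edge-list representation; not part of the original programme) mirrors the layer-by-layer numbering just described: vertices whose incoming edges have all been "left" are numbered, their outgoing edges are removed, and the process repeats.

from collections import deque

def topological_numbering(n_vertices, edges):
    # edges: list of (i, j) pairs over arbitrary vertex labels 0..n_vertices-1
    indeg = [0] * n_vertices
    succ = [[] for _ in range(n_vertices)]
    for i, j in edges:
        indeg[j] += 1
        succ[i].append(j)
    number = {}
    queue = deque(v for v in range(n_vertices) if indeg[v] == 0)
    next_number = 1                      # the origin receives n0 = 1
    while queue:
        v = queue.popleft()
        number[v] = next_number
        next_number += 1
        for w in succ[v]:                # "leave" the outgoing edges of v
            indeg[w] -= 1
            if indeg[w] == 0:
                queue.append(w)
    return number                        # number[i] < number[j] for every edge (i, j)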
Assume that edges represent activities of a project and vertices correspond to beginnings or ends of activities. Denote by E(G) the set of edges of a graph G and by V(G) its set of vertices. If e = (i, j) is an edge in E(G), then we denote its duration by t_ij or t(i, j), or t_e in short. Similarly, the requirement of activity (i, j) for a resource r will be denoted by r_ij, etc.
The following notions refer to the start and end vertices of the network graph: T_i^(0) represents the earliest possible start time of vertex i, T_j^(1) the latest allowable finish time of vertex j, and TS(i, j) = T_j^(1) − T_i^(0) − t_ij is the total slack of activity (i, j), or total (activity) float (the amount of time by which the start of a given activity can be delayed without delaying the completion of the project).
Finally, assume that V_i(e) denotes the starting vertex of an edge e = (i, j) and V_j(e) its ending vertex.
Further notation will be introduced when needed.
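As a compact illustration of these notions (a sketch with an assumed data layout, not the implementation described later), the earliest possible start times T_i^(0), the latest allowable finish times T_j^(1) and the total slacks TS(i, j) can be computed by the usual forward and backward CPM passes over the topologically numbered graph:

def cpm(n, activities):
    # activities: dict {(i, j): t_ij} over topologically numbered vertices 1..n
    T0 = {v: 0.0 for v in range(1, n + 1)}            # earliest start times of the vertices
    for (i, j), t in sorted(activities.items()):       # forward pass
        T0[j] = max(T0[j], T0[i] + t)
    T1 = {v: T0[n] for v in range(1, n + 1)}          # latest finish times of the vertices
    for (i, j), t in sorted(activities.items(), reverse=True):   # backward pass
        T1[i] = min(T1[i], T1[j] - t)
    TS = {(i, j): T1[j] - T0[i] - t for (i, j), t in activities.items()}  # total slack
    return T0, T1, TS

# Example (hypothetical durations): cpm(4, {(1, 2): 3, (1, 3): 2, (2, 4): 4, (3, 4): 6})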
3 Algorithm
The algorithm is based on time shifting of activities when their total requirements exceed the resource limit. This is implemented by prolonging their durations, distinguishing, for each activity, its starting duration and its current duration, which equals the starting duration plus the length of the shift. The greatest advantage of this approach is that, whenever we need to compute new earliest possible start times and latest allowable finish times for activities after shifts, or update the actual durations of some activities, we can recompute the whole project using the simple CPM method and, in spite of this, the dates of finished activities remain unchanged in the result of the new calculation. In other words, any change in the present has no effect on results in the past.
Let us denote
  ts_ij   the starting duration of activity (i, j)
  tc_ij   the current duration of activity (i, j)
  d_ij = tc_ij − ts_ij   the interval when the activity has no requirements ("sleeps")
Now we will formulate the algorithm. The symbol := stands for the assignment operator.
1. Initialization
Using the CPM method, we determine for each edge the earliest possible start time and the latest allowable finish time. Let us assign

t_1 := 0    (1)
The first condition on the right-hand side means that bk has begun in the previous
interval.
6. Right-shifting
Because of step 4, there exists j < m such that

\sum_{i=1}^{j} r_{b_i} \le \text{limit} \quad \text{and} \quad \sum_{i=1}^{j+1} r_{b_i} > \text{limit}    (7)
We shift activity b_{j+1} so that the new values of its parameters are obtained from the old values by the following assignments:

tc_{b_{j+1}} := tc_{b_{j+1}} + t_2 - T^{(0)}_{V_i(b_{j+1})} + d_{b_{j+1}}, \qquad d_{b_{j+1}} := tc_{b_{j+1}} - ts_{b_{j+1}}    (8)
For the other activities in B (according to their priorities), we either add their resource requirements and place them into the schedule, or shift them if the limit has been exceeded. Finally, we apply the CPM method again.
7. Next interval

t_1 := t_2    (9)
The algorithm described in the previous section can be very simply adapted for multiproject scheduling with limited resources. The steps of this generalised algorithm must take into account that the resources are shared by all the projects. If the number of projects is N, then, e.g., Step 3 can be modified as follows:

t_2 = \min_{(i,j) \in E(G_k), \; k \in [1, N]} \left( \left\{ T_i^{(0)} \mid T_i^{(0)} > t_1 \right\} \cup \left\{ T_i^{(0)} + tc_{ij} \right\} \right)    (10)
5 Knapsack-Based Heuristic

• Least total float, which schedules first those activities possessing the least total float
• Shortest imminent activity, which schedules first those activities requiring the least time to complete
• Greatest resource demand, which schedules first those activities requiring the greatest quantity of resources from the outset
Using the defined priorities as prices, the task investigated in one interval corresponds to the well-known knapsack problem. If the number of activities sharing a resource in an interval is low, e.g. up to 20, which is satisfied in most real situations, then this problem can be solved exactly by a branch and bound method or by dynamic programming.
Assuming only one resource with a limited capacity, we can deal with the 0–1 knapsack problem (0–1 KP), which is defined as follows: A set of n items is available to be packed into a knapsack with a capacity of C units. Item i has value v_i and uses up w_i of the capacity. We try to maximise the total value of the packed items subject to the capacity constraint:

\max \left\{ \sum_{i=1}^{n} v_i x_i \;\middle|\; \sum_{i=1}^{n} w_i x_i \le C, \; x_i \in \{0, 1\}, \; i = 1, \dots, n \right\}    (11)
\frac{v_1}{w_1} \ge \frac{v_2}{w_2} \ge \dots \ge \frac{v_n}{w_n}    (12)
We place items in the knapsack in this non-increasing sequence. Let \bar{x}_1, \bar{x}_2, ..., \bar{x}_p be fixed values of 0 or 1 and

M_k = \left\{ x \mid x \in M, \; x_j = \bar{x}_j, \; \bar{x}_j \in \{0, 1\}, \; j = 1, \dots, p \right\}    (13)

(\exists q)(p < q \le n): \quad \sum_{j=p+1}^{q-1} w_j \le C - \sum_{j=1}^{p} w_j \bar{x}_j < \sum_{j=p+1}^{q} w_j    (14)
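The following sketch (illustrative only, not the authors' programme) solves the 0–1 KP exactly by a depth-first branch and bound: the items are ordered by non-increasing v_i/w_i as in (12), and the greedy fractional relaxation of the remaining items serves as the upper bound used for pruning.

def knapsack_bb(values, weights, capacity):
    # Order items by non-increasing value/weight ratio, as in (12).
    order = sorted(range(len(values)), key=lambda i: values[i] / weights[i], reverse=True)
    v = [values[i] for i in order]
    w = [weights[i] for i in order]
    n, best = len(v), [0]

    def upper_bound(k, cap):
        # Greedy fractional bound for items k..n-1 with remaining capacity cap.
        total = 0.0
        for i in range(k, n):
            if w[i] <= cap:
                cap -= w[i]
                total += v[i]
            else:
                return total + v[i] * cap / w[i]
        return total

    def branch(k, cap, value):
        if value > best[0]:
            best[0] = value
        if k == n or value + upper_bound(k, cap) <= best[0]:
            return                               # prune: bound cannot beat the incumbent
        if w[k] <= cap:                          # branch x_k = 1
            branch(k + 1, cap - w[k], value + v[k])
        branch(k + 1, cap, value)                # branch x_k = 0

    branch(0, capacity, 0)
    return best[0]

# Example with made-up data: knapsack_bb([10, 7, 5], [4, 3, 2], capacity=5) returns 12.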
In more complex and more frequent situations, when we have more than one limited resource, we transform the resource-constrained scheduling into a sequence of multi-knapsack problem (MKP) solutions. MKP is defined as follows:

\begin{aligned} \text{maximise} \quad & \sum_{i=1}^{n} v_i x_i \\ \text{subject to} \quad & \sum_{i=1}^{n} w_{ri} x_i \le C_r, \quad r = 1, \dots, m, \\ & x_i \in \{0, 1\}, \quad i = 1, \dots, n \end{aligned}    (16)
For the solution of the task with multiple constraints, we must generalise the approaches mentioned above. The combinatorial approach can be applied without any changes, but when using the branch and bound method, we must redefine the upper bound. For its evaluation we use the following formula:

UB(M_k) = \min \left\{ UB_1(M_k), \dots, UB_m(M_k) \right\}    (17)
The branch and bound method discussed above is deterministic. Now we will pay attention to stochastic heuristic methods. These methods are used in situations where the exact methods would fail or the calculations would require a great amount of time.
A heuristic [16] is a technique which seeks good (i.e. near-optimal) solutions at a reasonable computational cost without being able to guarantee either feasibility or optimality, or even, in many cases, to state how close to optimality a particular feasible solution is. The most popular heuristics – genetic algorithms, simulated annealing, tabu search and neural networks – are reviewed in [16]. Examples of their possible use are described in [13].
Let us briefly deal with genetic algorithms now. The skeleton of a GA is as follows:

Generate an initial population.
Evaluate the fitness of the individuals in the population.
Repeat
  Select parents from the population.
  Recombine parents to produce children.
  Evaluate the fitness of the children.
  Replace some or all of the population by the children.
Until a satisfactory solution has been found.
In the following paragraphs, we briefly summarize the GA settings for our scheduling problem.
Individuals in the population (chromosomes) are represented as binary strings of length n, where a value of 0 or 1 at the i-th bit (gene) implies that x_i = 0 or 1 in the solution, respectively.
The population size N is usually chosen between n and 2n.
An initial population consists of N feasible solutions and is obtained by generating random strings of 0s and 1s in the following way: first, all bits in all strings are set to 0, and then, for each of the strings, randomly selected bits are set to 1 as long as the solutions (represented by the strings) remain feasible.
The fitness function corresponds to the objective function to be maximised:

f(x) = \sum_{i=1}^{n} v_i x_i    (18)
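A minimal sketch of such a GA applied to the 0–1 KP fitness (18) is given below; the tournament selection, one-point crossover, bit-flip mutation and repair-by-clearing-random-bits operators are illustrative assumptions rather than the settings of the implemented programme.

import random

def ga_knapsack(values, weights, capacity, pop_size=40, generations=200):
    n = len(values)

    def feasible(x):
        return sum(w for w, b in zip(weights, x) if b) <= capacity

    def repair(x):
        # Clear random set bits until the capacity constraint holds.
        while not feasible(x):
            x[random.choice([i for i, b in enumerate(x) if b])] = 0
        return x

    def fitness(x):                                   # objective (18)
        return sum(v for v, b in zip(values, x) if b)

    # Initial population: all-zero strings with randomly selected bits set to 1, then repaired.
    pop = []
    for _ in range(pop_size):
        x = [0] * n
        for i in random.sample(range(n), random.randint(0, n)):
            x[i] = 1
        pop.append(repair(x))

    for _ in range(generations):
        def select():                                 # binary tournament selection
            return max(random.sample(pop, 2), key=fitness)
        children = []
        while len(children) < pop_size:
            a, b = select(), select()
            cut = random.randint(1, n - 1)            # one-point crossover
            child = a[:cut] + b[cut:]
            if random.random() < 0.05:                # bit-flip mutation
                i = random.randrange(n)
                child = child[:i] + [1 - child[i]] + child[i + 1:]
            children.append(repair(child))
        pop = children
    return max(pop, key=fitness)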
7 Experimentation
The approach discussed in the previous paragraphs has been implemented in Borland Delphi. Its interface is similar to Microsoft Project. Although Microsoft Project is often not able to solve a constrained scheduling problem even for very simple projects (it reports that some over-allocation or under-allocation may be unavoidable), we have never come across this situation when using our programme. Besides the GA approach, the programme implements another popular heuristic method – simulated annealing – for comparison, and the user may choose between these two possibilities. This enables a verification of the quality of the computational results, as the exact results for large projects are not known.
We have tested the proposed approach in many situations. Table 1 shows the results for one real project with 183 activities, 54 vertices in the network graph and 12 limited resources. This project deals with the innovation of a Benson boiler in a local power station in the surroundings of Brno. The table contains the durations of the schedule for this project designed in 30 tests by the genetic algorithm (GA) and by simulated annealing (SA). The parameters of both methods were set in such a way that the computational time for the design of the project schedule was approximately 2.4 s. The branch and bound (BB) method found a schedule with a duration of 291 days in a CPU time several times longer. The results achieved by BB are favourable because the parallelism of activities was low. The dynamic programming approach could not be used because of insufficient memory.
While the results appear comparable, statistically the results gained by GA are better than the SA results. The well-known Kruskal–Wallis test for a balanced one-way design was used to test whether the samples were taken from the same population. It yielded the following results: average rank for SA = 39.75, for GA = 21.25, the value of the test statistic = 17.5472 and the computed significance level = 2.8 × 10^−5. Thus the hypothesis of sampling from the same population is rejected at all traditionally used significance levels (0.05; 0.01; 0.001).
8 Conclusion
References
Chapter 5
A Development of Data-Logger for Indoor Environment

Abstract This chapter describes the development of a data logger for indoor environments. The present work concentrates on environmental parameters (temperature and humidity) and major pollutant contaminants (concentration levels of CO and CO2). In this work, four channels of the data logger are used, and another four channels are open to external sensor modules. The collected data are stored in the EEPROM, and the output can be exported to a notepad file in tabular form, tagged with month/date/year, using a graphical user interface.
1 Introduction
A. Kumar (*)
Instrument Design Development Centre, Indian Institute of Technology Delhi, Electronics Lab,
New Delhi, India 110016
e-mail: [email protected]
and Acumen [3]. The drawback of these available systems is that both air quality and thermal comfort cannot be measured simultaneously. So there is a need to develop an economical system (a prototype data logger) which can help in collecting data to analyze the necessary environmental parameters simultaneously.
A prototype data logger has been developed to monitor the environmental parameters. The developed data logger consists of (i) a sensor module, (ii) an LCD, (iii) a Real Time Clock (RTC), (iv) an EEPROM and (v) PC serial communication. This data logger is operated through a PC using a graphical user interface (GUI) written in Visual Basic.
2 Sensors Module
The measuring temperature range of the instrument is between 15°C and 70°C.
Fig. 1 The operating circuit of the temperature sensor (LM35 with an OP07 amplifier)

Fig. 2 The operating circuit of the humidity sensor
The sensor circuit develops a linear voltage vs. RH (relative humidity) output, which is ratiometric to the supply voltage. This means that when the supply voltage varies, the sensor output voltage follows in the same proportion. It can operate over a supply range of 4–5.8 V. At a 5 V supply voltage (at room temperature), corresponding to a relative humidity variation from 0% to 100% (non-condensing), the output voltage varies from 0.8 to 3.9 V. The humidity sensor functions with a resolution of up to 0.5% relative humidity (RH); with a typical current draw of only 200 µA, the HIH4000 series is ideally suited for low-drain, battery-operated systems.
The operating circuit is shown in Fig. 2. A change in the RH of the surroundings causes an equivalent change in the voltage output. The output is an analog voltage proportional to the supply voltage. Consequently, converting it to relative humidity (RH) requires both the supply and the sensor output voltages (at 25°C) and is given by the following expression [7]:

RH = \frac{V_{out} / V_{supply} - 0.16}{0.0062}
For example, if the output of the humidity sensor is 2.548 V, the relative humidity is 56% at 25°C.
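A one-line helper implementing this conversion (a sketch; the 5 V supply is an assumed default) reproduces the example value quoted above:

def relative_humidity(v_out, v_supply=5.0):
    # Relative humidity (%) from the sensor output voltage, valid at about 25 C.
    return (v_out / v_supply - 0.16) / 0.0062

print(round(relative_humidity(2.548)))   # prints 56, matching the example above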
Fig. 3 The operating circuit of the CO and CO2 sensors (TGS 4161), showing the input (circuit) voltage VC, the load resistor RL, the output voltage VRL, and the heater voltage VH
The operating circuit of the CO and CO2 sensors is shown in Fig. 3. The relationship between the output voltage and the gas concentration is given by the following expression:

c = \left( \frac{(V_C R_L / V_{OUT}) - R_L}{R_0} \cdot \frac{1}{K} \right)^{2}
In this work, we are using the on-chip analog-to-digital converter of the microcontroller. This analog-to-digital converter has 12-bit resolution with a programmable acquisition time. It samples the analog signals from the sensors at a variable sampling rate (1 s to 1 h). The sensed value is converted to its digital equivalent. This digital value is displayed on the LCD (liquid crystal display), which is interfaced to the microcontroller [8, 9–12].
The IC DS1307 operates as a slave device on the I2C bus. Access is obtained by implementing a START condition and providing a device identification code followed by a register address. Subsequent registers can be accessed sequentially until a STOP condition is executed. When VCC falls below 1.25 × VBAT, the device terminates an access in progress and resets the device address counter. Inputs to the device will
not be recognized at this time, to prevent erroneous data from being written to the device from an out-of-tolerance system. When VCC falls below VBAT, the device switches into a low-current battery backup mode. Upon power-up, the device switches from battery to VCC when VCC is greater than VBAT + 0.2 V, and recognizes inputs when VCC is greater than 1.25 × VBAT. We are using the IC DS1307 as a real time clock; its features include a real-time clock that counts seconds, minutes, hours, day of month, day of week, month, and year with leap-year compensation valid up to 2100; 56 bytes of nonvolatile (NV) RAM for data storage; an I2C serial interface; a programmable square wave output signal; automatic power-fail detection and switch circuitry (it consumes less than 500 nA in battery backup mode with the oscillator running); and a temperature range of −40°C to 85°C [9, 10]. We are using I2C to interface the RTC and the EEPROM to the microcontroller. The I2C bus is the most popular of the three serial EEPROM protocols. The I2C chips include address pins as an easy way to have multiple chips on a single bus while using only two connections to the microcontroller [9].
The EEPROM will store the digital value which is coming from analog to digital
converter. We will require 52.73 MB of EEPROM if we are sampling all analog
channels at the rate of 1 sample/s. We are using the EEPROM AT24C256
(ATMEL). This will store the sample data at different instants [10–14].
The PIC18F4458 is interfaced with the PC using a MAX-232. The MAX-232 IC is used to convert TTL logic levels to RS-232 logic levels. RS-232 is a serial communication protocol that does not require a clock along with the data lines. There are two data lines, TX and RX, for serial communication. The MAX-232 has two receivers (which convert RS-232 logic levels to TTL logic) and two drivers. A separate power supply has been provided because the minimum power supply needed is 5 V and the MAX-232 consumes a considerable current during operation. External capacitors are required for the internal voltage pump that converts TTL logic levels to RS-232 levels. For battery operated applications, the MAX-3232 can be used as a level converter instead of the MAX-232; it is a low-supply, low-power-consumption logic converter IC for RS-232 [9, 10, 13].
The GUI is one of the important parts of this device, as it displays the data from the microcontroller for data monitoring and analysis. The design template has to be user friendly for best usage. For this chapter, the main objective is to display the received data in graphical form. As the transducer detects and translates an analog signal, the data will
go through a conversion at the ADC. This digital data will be stored in the EEPROM chip with the help of the Visual Basic 6.0 software. Since the data uses serial RS-232 communication, an initialization needs to be done specifying the baud rate, data bits, parity, stop bit, and the COM port at the PC. The baud rate is the number of signal changes per second, i.e. the transition speed between Mark (negative) and Space (positive), which ranges from 110 to 19,200. The data bits value is the length of the data in bits, which has one Least Significant Bit and one Most Significant Bit. The parity bit is an optional bit mainly for bit-error checking; it can be odd, even, none, Mark, or Space. The stop bit is used to frame the data bits and is usually combined with the start bit; these bits are always represented by a negative voltage and can be 1, 1.5 or 2 stop bits. The COM port is the selection of the available COM port at the PC. The commonly used settings to establish a serial RS-232 communication are a 9600 baud rate, no parity, 8 data bits, 1 stop bit, and COM port 1. This can be done by using the GUI monitoring system, where it automatically saves the received data in a notepad file. The data saved are the date and time at which the data were collected and the data value itself. Figures 4 and 5 represent the graphical user interface and the logged data file, respectively [12, 13].
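On the PC side, the serial settings described above can be reproduced, for example, with the pyserial package; the sketch below is illustrative only (the COM port, file name and number of reads are assumptions, and the actual GUI is written in Visual Basic 6.0 as stated above).

import serial  # pyserial

# 9600 baud, 8 data bits, no parity, 1 stop bit, COM port 1 (assumed port name).
ser = serial.Serial(port="COM1", baudrate=9600, bytesize=serial.EIGHTBITS,
                    parity=serial.PARITY_NONE, stopbits=serial.STOPBITS_ONE,
                    timeout=1)
with open("logged_data.txt", "a") as log:
    for _ in range(10):                       # read a few samples for illustration
        line = ser.readline().decode(errors="ignore").strip()
        if line:
            log.write(line + "\n")            # date/time tag and value sent by the logger
ser.close()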
Figure 6 shows the full schematic diagram of the data logger for the indoor environment. This data logger has four embedded sensor modules, and the other four channels are open to be used for the measurement of other environmental parameters.
This section includes the discussion of the software design for all the modules interfaced with the PIC18F4458. It also explains the functions of the software designed for the data logger [11].
The function of writing to the EEPROM is shown here as "Control IN", which represents putting the EEPROM in an "input" mode. Since we are only sending data to the EEPROM (as shown in Fig. 7), we use the "Control IN" byte here and use "Control OUT" later. Next, the EEPROM acknowledges this byte. This is shown by the "A" after the byte; it is put on the next line to indicate that it is transmitted by the EEPROM. The Address Byte contains the address of the location of the EEPROM where we want to write data. Since the address is valid, it is acknowledged by the EEPROM. Finally, we send the data we want to write. The data is then acknowledged by the EEPROM. When that finishes, we send a stop condition to complete the transfer. Remember that the "STOP" is represented as the "T" block at the end. Once the EEPROM gets the "STOP" condition, it will begin writing to its memory.
Fig. 6 The full schematic diagram of the data logger for the indoor environment (PIC18F4458 with the LM35 temperature sensor, HIH4000 humidity sensor, CO and CO2 sensors, DS1307 RTC, AT24C256 EEPROM, LCD, MAX-232/DB9 serial interface and USB connector)
The transfer will use the "Control IN" byte to load the address into the EEPROM (as shown in Fig. 8). This sends data to the EEPROM, which is why we use the "Control IN" byte. Once the address is loaded, we want to retrieve the data, so we send a "Control OUT" byte to indicate to the EEPROM that we want data FROM it. The EEPROM will acknowledge this and then send the data we requested. When we are done getting data, we send a "NACK" to tell the EEPROM that we do not want more data. If we were to send an ACK at this point, we could get the next byte of data from the EEPROM. Since we only want to read one byte, we send a "NACK".
Set RS = 0 to send a command. Send 0b0010 to the data lines three times with a delay of 2 ms. To send a byte on the four data lines, send the higher nibble first and give an RE pulse of 100 ms at RE. Send a set of instructions one after another, with a delay of 2 ms between each command, to configure the various settings as given in the instruction set of the LCD datasheet [9, 13]; then send the instruction set again.
Set RS = 1. Send the higher nibble to the four data lines and send a 100 ms RE pulse; send the lower nibble to the data lines and send another RE pulse. Keep track of the number of characters already displayed on the display panel using LCD_count, and go to line 2 or line 1 accordingly.
There are four sensor modules connected, namely temperature, humidity, CO, and CO2. Data is collected by the ADC built into the PIC. The ADC provides 12 bits of data after the conversion is completed.
Data collection from the temperature sensor needs the following actions to be carried out: (a) selecting the analog channel AN0, the sampling frequency, and the alignment of bits for ADRESH and ADRESL; (b) selecting Vref and powering on the ADC module by setting the ADCON0, ADCON1 and ADCON2 registers; (c) starting the analog-to-digital conversion by setting the ADGO bit high (and waiting until the ADIF flag indicates the completion of the conversion); and (d) copying the results from ADRESH and ADRESL to variables.
Select AN3 and set the other ADC features as for the temperature sensor; after completion of the conversion, copy the result into a variable.
Data collection from the CO sensor needs the following actions to be carried out: (a) selecting the analog channel AN1, the sampling frequency, and the alignment of bits for ADRESH and ADRESL; (b) selecting Vref and powering on the ADC module by setting the ADCON0, ADCON1 and ADCON2 registers; (c) starting the analog-to-digital conversion by setting the ADGO bit high (and waiting until the ADIF flag indicates the completion of the conversion); and (d) copying the results from ADRESH and ADRESL to variables. Now repeat the same process to collect the CO2 data on channel AN2.
The sensor modules, EEPROM, RTC, and LCD have been successfully interfaced to the microcontroller. The EEPROM successfully stores the logged data with time and date tags. The sensor data are displayed on the LCD module. A simple GUI has been designed to store the logged data to a text file so that they can be analyzed further.
11 Conclusions

We have developed a low-cost, 12-bit resolution data logger and successfully measured temperature, humidity, and the concentrations of CO and CO2 gases. The GUI designed gives an attractive look to the functioning of the data logger. The initial results of the data logger are encouraging, and we are working to improve the GUI model as well as the accuracy of the data logger.
References
1. G.L. Tang, Lecture notes on Health and the built environment: Indoor air quality (University
of Calgary, Alberta, Canada)
2. J.D. Richard, G.S. Brager, Thermal comfort in naturally ventilated buildings: revisions to
ASHRAE standard 55. Energy Buildings 34, 549–561 (2002)
3. Microdaq (March 3, 2009); http://www.microdaq.com/data-logger/
4. D.D. Lee, D.S. Lee, Environment gas sensors. IEEE Sensors J. 1(3), 214–215 (2001)
5. N. Kularatna, B.H. Sudantha, An environmental air pollution monitoring system based on the
IEEE 1451 standard for low cost requirements. IEEE Sensors J. 8(4), 415–422 (2008)
6. Sensor industry development and trends, Sensors express (Nov 2002)
7. RS India (April 4, 2009); http://www.rsonlineindia.com
8. R. Luharuka, R.X. Gao, A microcontroller-based data logger for physiological sensing. IEEE
Proc. on Instrument and Measurement Technology Conference, Anchorage, AK, USA, 21–23
May, 175–180 (2002)
9. Data acquisition logging circuits (20 Mar 2009); http://www.hobbyprojects.com/A/
acquistions_data_circuits.html
10. Data sheet of real time clock DS 1307 (8 May 2008). http://www.maxim-ic.com/products/rtc/
real-time-clocks.cfm
11. Microchip (10 Jan 2009). http://www.microchip.com
12. Introduction to data acquisition (10 May 2008). http://zone.ni.com/devzone/concepted.nsf/
webmain/
13. G. Mason, A handheld data acquisition system for use in an undergraduate data acquisition
course. Data Acquisit. 45, 338–393 (2002)
14. D. Malone, Build a versatile data logger. Popular Electronics, 35–42, July 1994
Chapter 6
Multiobjective Evolutionary Optimization
and Machine Learning: Application
to Renewable Energy Predictions
1 Introduction
It is well-known that the weather characteristics in a given time frame are deter-
mined by a relatively small number of discrete weather systems, each of which may
exhibit very different influences and patterns of development. These fundamental
K. Gill (*)
WindLogics, Inc., 1021 Bandana Blvd., E. # 111, St Paul, MN 55108, USA
e-mail: [email protected]
causes of variations in on-site wind speeds are inherent in the atmosphere and must
be understood, to the extent possible, for accurate wind resource assessment. In
order to make useful predictions of wind speed/energy, it is therefore important to
develop statistically sound relationships between those wind regimes and the
atmospheric conditions. This is the core of the current methodology described in
this chapter. Machine learning tools have gained immense popularity in the geos-
ciences community due to their success against physically-based modeling
approaches. In brief, machine learning tools are used to determine the relationship
between inputs and output in an empirical framework. These models do not employ
traditional form of equations common in physically-based models or as in regres-
sions, instead have flexible and adaptive model structures that can abstract relation-
ships from data.
The chapter describes a method for training a Support Vector Machine (SVM) in applications for wind energy prediction at the wind-farm level. The SVM is a
powerful learning algorithm developed by Vapnik and is known for its robust
formulation in solving predictive learning problems employing finite data [1].
The method is well-suited for the operational predictions and forecasting of wind
power, which is an important variable for power utility companies. The proposed
methodology employs a Multiobjective Evolutionary Optimization approach for
training the SVM. The goal of an optimization method is to efficiently converge to a
global optimum, in the case of a single objective function, and to define a trade-off
surface in the case of multiobjective problems. Overall, global optimization (GO)
methods have two main categories: deterministic and probabilistic. Deterministic
methods use well-defined mathematical search algorithms (e.g., linear program-
ming, gradient search techniques) to reach the global optimum, and sometimes use
penalties to escape from a local optimum. Probabilistic methods employ probabilistic inference to reach the global optimum. Evolutionary optimization algorithms generally fall into the probabilistic category of optimization methods.
The mathematical basis for SVM is derived from statistical learning theory. SVM training consists of a quadratic programming problem that can be solved efficiently and for which a global extremum is guaranteed [2]. It is a sparse (robust) algorithm when compared against an ANN; what makes it sparse is the use of Structural Risk Minimization (SRM) instead of Empirical Risk Minimization (ERM), as is the case in an ANN. In SRM, instead of minimizing the total error, one minimizes an upper bound on the error. The mathematical details of SVM can be found in [3–5].
The function f(x) that relates the inputs to the output has the following form:

f(x) = \sum_{i=1}^{N} w_i \phi(x_i) + b    (1)

\text{Minimize} \quad R[f] = C \, \frac{1}{K} \sum_{i=1}^{N} |y_i - f(x_i)|_\varepsilon + \|w\|^2    (2)
The first term (|y_i − f(x_i)|_ε) in the above expression is what is called the ε-insensitive loss function. The samples outside the ε-tube are penalized by a penalty term ξ. The above formulation makes the solution sparse in the sense that errors less than ε are ignored. The sparse solution is the one that gives the same error with the minimum number of coefficients describing f(x). The second term in the objective function is a regularization term, added to avoid the consequences of the ill-posedness of the inverse problem [6].
Equation (2) is solved in its dual form employing Lagrange multipliers as [3]:

f(x) = \sum_{i=1}^{N} (\alpha_i - \alpha_i^*) K(x, x_i) + b    (3)

Here α_i and α_i^* are the Lagrange multipliers, which have to be greater than zero for the support vectors i = 1, ..., N, and K(x_i, x) is a kernel function.
There are three main parameters to be determined as part of SVM training: the trade-off 'C', the degree of error tolerance 'ε', and a parameter related to the kernel, the kernel width 'γ'. In practice, these may be determined either using a trial-and-error procedure or an automatic optimization method [7]. The optimization scheme can be single-objective or multiobjective depending upon the nature of the problem.
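For illustration, an ε-SVR with exactly these three parameters can be set up with scikit-learn as in the sketch below; this is not the authors' implementation, and the data arrays, parameter values and train/test split are placeholders.

import numpy as np
from sklearn.svm import SVR

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 20))                 # placeholder predictors (e.g. 20 reanalysis variables)
y = 0.5 * X[:, 0] + np.sin(X[:, 1]) + rng.normal(scale=0.1, size=200)

# The three SVM parameters discussed above: trade-off C, tolerance epsilon, RBF width gamma.
model = SVR(kernel="rbf", C=10.0, epsilon=0.1, gamma=0.05)
model.fit(X[:150], y[:150])                    # train
print(model.score(X[150:], y[150:]))           # R^2 on held-out data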
The initial formulation for training the SVM employed a single-objective approach to optimization. It was noticed that single-objective methods of training the model can result in optimal parameter sets for the particular single-objective value, but fail to
where S ⊂ R^D is the feasible range for x (the parameter set, having D dimensions) and R^D represents the D-dimensional space of real numbers.
The current multiobjective methodology employs a Swarm Intelligence based evolutionary computing multiobjective strategy called Multiobjective Particle Swarm Optimization (MOPSO) [8]. The PSO method was developed for single-objective optimization by R. C. Eberhart and J. Kennedy [9]. It was later extended to solve multiobjective problems by various researchers, including the method of [8]. The method originates from the swarm paradigm, called Particle Swarm Optimization (PSO), and is expected to provide the so-called global or near-global optimum.
PSO is characterized by an adaptive algorithm based on a social-psychological
metaphor [9] involving individuals who are interacting with one another in a social
world. This sociocognitive view can be effectively applied to computationally
intelligent systems [10]. The governing factor in PSO is that the individuals, or
“particles,” keep track of their best positions in the search space thus far obtained,
and also the best positions obtained by their neighboring particles. The best position
of an individual particle is called “local best,” and the best of the positions obtained
by all the particles is called the “global best.” Hence the global best is what all the
particles tend to follow. The algorithmic details on PSO can be found in [8, 9,
11, 12]. The approach in [8] presents a multiobjective framework for SVM optimi-
zation using MOPSO.
A multiobjective approach differs from a single-objective method in that the objective function to be minimized (or maximized) is now a vector containing more than one objective function. The task of the optimization method is therefore to map out a trade-off surface (otherwise known as a Pareto front), unlike finding a single scalar-valued optimum in the case of single-objective problems. The multiobjective approach to the PSO algorithm is implemented by using the concept of Pareto ranks and defining the Pareto front in the objective function space. Mathematically, a Pareto optimal front is defined as follows: a decision vector \vec{x}_1 ∈ S is called Pareto optimal if there does not exist another \vec{x}_2 ∈ S that dominates it. Let P ⊂ R^m be a set of vectors. The Pareto optimal front P* ⊆ P contains all vectors \vec{x}_1 ∈ P which are not dominated by any vector \vec{x}_2 ∈ P:
P^* = \{ \vec{x}_1 \in P \mid \nexists \, \vec{x}_2 \in P : \vec{x}_2 \succ \vec{x}_1 \}    (5)
The idea of Pareto ranking is to rank the population in the objective space and separate the points with rank 1 into a set P* from the remaining points. This establishes the Pareto front defined by the set P*. All the points in the set P* are the "behavioral" points (or non-dominated solutions), and the remaining points in the set P become the "non-behavioral" points (or inferior, dominated solutions).
The reason behind using the Pareto optimality concept is that there are solutions
for which the performance of one objective function cannot be improved without
sacrificing the performance of at least one other.
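A minimal sketch of extracting the rank-1 (non-dominated) set P* from a set of objective-function vectors is shown below, assuming for illustration that both objectives are to be minimized; the function name is hypothetical.

import numpy as np

def pareto_front(F):
    # F: array of shape (n_points, n_objectives); returns a boolean mask of P*.
    n = F.shape[0]
    nondominated = np.ones(n, dtype=bool)
    for i in range(n):
        for j in range(n):
            # Point j dominates point i if it is no worse in all objectives and better in at least one.
            if i != j and np.all(F[j] <= F[i]) and np.any(F[j] < F[i]):
                nondominated[i] = False
                break
    return nondominated

# Example: pareto_front(np.array([[1.0, 4.0], [2.0, 2.0], [3.0, 3.0]]))
# marks the first two points as non-dominated and the third as dominated.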
In the MOPSO algorithm, as devised in [8], the particles will follow the nearest
neighboring member of the Pareto front based on the proximity in the objective
function (solution) space. At the same time, the particles in the front will follow the
best individual in the front, which is the median of the Pareto front. The term
follow means assignments done for each particle in the population set to decide the
direction and offset (velocity) in the subsequent iteration. These assignments are
done based on the proximity in the objective function or solution space. The best
individual is defined in a relative sense and may change from iteration to iteration
depending upon the value of objective function.
An example test problem is shown in Fig. 1. The test function presents a
maximization problem (as in [13]):
Fig. 1 The true Pareto front and the front obtained by the MOPSO algorithm for the test problem, in the (f1, f2) objective space
where

f_1(x, y) = -x^2 + y, \qquad f_2(x, y) = \frac{x}{2} + y + 1

subject to

\frac{x}{6} + y - 6.5 \le 0, \qquad \frac{x}{2} + y - 7.5 \le 0, \qquad 5x + y - 30 \le 0, \qquad x, y \ge 0
in the range 0 ≤ x, y ≤ 7. The true front and the front obtained from the MOPSO algorithm are shown in Fig. 1 after 5,000 function evaluations. It can be noticed that MOPSO was able to reproduce the true front for this test case.
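The same test problem can be checked by brute force, as in the sketch below: feasible grid points are generated, both objectives are evaluated, and the non-dominated (maximal) points approximate the true front of Fig. 1; the grid resolution is an arbitrary choice, not part of the MOPSO method.

import numpy as np

xs, ys = np.meshgrid(np.linspace(0, 7, 400), np.linspace(0, 7, 400))
x, y = xs.ravel(), ys.ravel()
feasible = (x / 6 + y <= 6.5) & (x / 2 + y <= 7.5) & (5 * x + y <= 30)
F = np.column_stack([(-x**2 + y)[feasible], (x / 2 + y + 1)[feasible]])

# Keep the non-dominated points (both objectives maximized): sweep in order of
# decreasing f1 (ties broken by decreasing f2) and keep points that improve f2.
order = np.lexsort((-F[:, 1], -F[:, 0]))
front, best_f2 = [], -np.inf
for f1, f2 in F[order]:
    if f2 > best_f2:
        front.append((f1, f2))
        best_f2 = f2
print(len(front), front[0], front[-1])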
The unique formulation of MOPSO helps it to avoid getting stuck in local optima when searching the multi-dimensional parameter domain. In the current research, MOPSO is used to parameterize the three parameters of the SVM, namely the trade-off or cost parameter 'C', the epsilon 'ε', and the kernel width 'γ'. The MOPSO method uses a population of parameter sets that compete against each other through a number of iterations in order to improve the values of specified multiobjective criteria (objective functions), e.g., root mean square error (RMSE), bias, histogram error (BinRMSE), correlation, etc. The optimum parameter search is conducted in an intelligent manner by narrowing down the desired regions of interest, and it avoids getting stuck in local optima.
In earlier efforts, a single-objective optimization methodology was employed for the optimization of the three SVM parameters. The approach was tested on a number of sites and the results were encouraging. However, it has been noticed that using a single-objective optimization method can result in sub-optimal predictions when looking at multiple objectives. The single-objective formulation (using PSO) employed the coefficient of determination (COD) as the only objective function, but it was noticed that the resulting distributions were highly distorted when compared to the observed distributions. The coefficient of determination (COD) is linearly related to the RMSE and can range between −1 and 1, the value of 1 being a perfect fit. This is shown in Fig. 2, where a trade-off curve is presented between BinRMSE and COD. It can be noticed that the COD value increases with an increase in the BinRMSE value. The corresponding histograms are shown in Fig. 3 for each of the extreme ends (maximum COD and minimum BinRMSE) and for the "compromise" solution from the curve. The histograms shown in Fig. 3 make it clear that the best COD (or RMSE) is the one with the highest BinRMSE and indeed misses the extreme ends of the distribution.
Fig. 2 The trade-off curve between BinRMSE and COD, with the best BinRMSE, the best COD, and the compromise solutions marked
Fig. 3 Histogram comparison between the observed data, the best BinRMSE solution, the best COD solution, and the best compromise solution
Thus, no matter how tempting it is to achieve the best COD value, it does not cover the extreme ends of the distribution. On the other hand, the best BinRMSE comes at the cost of the lowest COD (or highest RMSE) and is not desired either. Thus a multiobjective scheme is required that simultaneously minimizes these objectives and provides a trade-off surface, so that a compromise solution can be chosen between the two objectives. Figure 3 also shows the histogram for the "compromise" solution, which provides a decent histogram when compared to the observed data.
3 Application
The current procedures primarily employ SVM for building regression models for
assessing and forecasting wind resources. The primary inputs to the SVM come
from the National Center for Environmental Prediction (NCEP)’s reanalysis gridded
data [14] centered on the wind farm location. The target is the measurements of wind
farm aggregate power. The current training uses a k-fold cross validation scheme
referred to as “Round-Robin strategy”. The idea within the “Round-Robin” is to
divide the available training data into two sets; use one for training and hold the
other for testing the model. In this particular “Round-Robin strategy” data is divided
into months. The training is done on all the months except one, and the testing is
done on the hold-out month. The previous operational methods employ manual
calibration for the SVM parameters in assessment projects and a simple grid-based
parameter search in forecasting applications.
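A minimal sketch of this "Round-Robin" (leave-one-month-out) strategy using scikit-learn is given below; the arrays, month labels and SVR parameter values are placeholders rather than the operational configuration.

import numpy as np
from sklearn.model_selection import LeaveOneGroupOut
from sklearn.svm import SVR

X = np.random.rand(17 * 120, 20)              # placeholder 6-hourly predictors
y = np.random.rand(17 * 120)                  # placeholder aggregate power target
months = np.repeat(np.arange(17), 120)        # month label for each sample

predictions = np.empty_like(y)
for train_idx, test_idx in LeaveOneGroupOut().split(X, y, groups=months):
    # Train on all months except one, predict the hold-out month.
    model = SVR(kernel="rbf", C=10.0, epsilon=0.1, gamma=0.05)
    model.fit(X[train_idx], y[train_idx])
    predictions[test_idx] = model.predict(X[test_idx])
# "predictions" now holds a full 17 months of out-of-sample test values.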
The goal in using MOPSO is to explore the regions of interest with respect to the specific multiobjective criteria in an efficient way. Another attractive feature of MOPSO is that it results in a so-called Pareto parameter space, which accounts for the parameter uncertainty between the two objective functions. Thus the result is an ensemble of parameter sets clustered around the so-called global optimum with respect to the multiobjective space. This ensemble of parameter sets also gives the trade-offs on the different objective criteria. The MOPSO-SVM method is tested on data from an operational assessment site in North America. The results are compared with the observed data using a number of evaluation criteria on the validation sets.
As stated above, the data from four NCEP grid points, each consisting of five variables (20 variables in total), are used. It has been noticed that various normalization and pre-processing techniques applied to the data may help to improve the SVM's prediction capabilities. In the current study, the input data have also been tried after pre-processing them using Principal Component Analysis (PCA). In that case, the PCs explaining 95% of the variance are included as inputs to the SVM. The comparison is made with an SVM that does not use PCA as a pre-processing step.
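The PCA pre-processing variant can be sketched with scikit-learn as follows, retaining the components that explain 95% of the variance; the input array and the standardization step are illustrative assumptions, not the operational code.

import numpy as np
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

X = np.random.rand(2000, 20)                  # placeholder for the 20 NCEP input variables
X_std = StandardScaler().fit_transform(X)     # standardize before PCA (assumed step)
pca = PCA(n_components=0.95)                  # keep 95% of the explained variance
X_pc = pca.fit_transform(X_std)               # principal components fed to the SVM
print(pca.n_components_, X_pc.shape)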
MOPSO requires a population of parameter sets to be evolved through a number of iterations, competing against each other to obtain an optimum (minimum in this case) value for the BinRMSE and RMSE. In the current formulation, a 50-member population is evolved for 100 iterations within MOPSO for the wind power predictions at the wind farm.
The wind farm site is located in Canada and has 29 months of energy data available. MOPSO is used to train the SVM over the available 17 months of training data (at a 6-hourly time resolution) using the "Round-Robin" cross-validation strategy. This gives an opportunity to train on 16 months and test the results on a 'hold-out' 1-month test set. Repeating the process for all 17 months gives a full 17 months
of test data to compare against the observed data. Since there are 29 months of data available for this site, a full year of data is used for validation (completely unseen data). The results that follow are the predictions on the validation set. The results are shown on the normalized (between −1 and 1) dataset. The MOPSO-SVM results are shown with and without PCA pre-processing.
The trade-off curve for the MOPSO SVM optimization for the two objectives is shown in Fig. 4. The trade-off between BinRMSE and RMSE is shown for the SVM using the original inputs compared against the SVM using Principal Components (PCs) as inputs. It can be noticed that there is little difference between the two approaches. The PCA-SVM produced a better objective function result for BinRMSE, whereas the simple SVM provided a better objective function result for RMSE.
Figure 5 shows the monthly mean wind power for the 12 months compared against the observed data. The results are shown for the SVM prediction with and without the pre-processing using PCA. As stated above, there is very little difference between the two approaches, and a good fit has been found. It can be noticed that the predictions are in reasonable agreement with the observed data. The results in Fig. 6 show the histograms of the observed vs. the predicted wind power data at the 6-hourly time resolution (the prediction time step). The results are shown for the SVM prediction with and without the pre-processing using PCA. It can be noticed that the distributions are well maintained using the MOPSO methodology, and a reasonable agreement between the observed and predicted power is evident from Fig. 6. A number of goodness-of-fit measures are evaluated in Table 1: monthly root mean square
ness-of-fit measures are evaluated in Table 1, which are monthly root mean square
error (RMSE), monthly coefficient of determination (COD), instantaneous RMSE,
instantaneous COD, and BinRMSE (histogram bin RMSE). The results in Table 1
are presented for SVM prediction with and without the pre-processing using PCA.
Both monthly and instantaneous wind power are of significant interest, and thus are
Fig. 4 Trade-off curve between the two objectives BinRMS vs. RMSE for SVM and PCA-SVM
Fig. 5 Scatter plot for the mean monthly wind energy data for SVM and PCA-SVM
Fig. 6 Histograms of the observed and predicted wind power data for SVM and PCA-SVM
included in the current analysis. It can be noticed that not only monthly but also
instantaneous power predictions are in close agreement with the observed.
5 Conclusions
models which are usually over-parameterized and require immense amounts of data
to calibrate. Support Vector Machines are well-suited for the problems exhibiting
high degrees of spatial and temporal variability, issues of nonlinearity, conflicting
scales, and hierarchical uncertainty.
In the current chapter, a multiobjective evolutionary computing method MOPSO
is used to optimize the three parameters of SVM for wind energy predictions. The
approach has been tested on data from a wind farm using NCEP’s re-analysis grid
data. The prediction strategy employs SVM which is parameterized for the two
objective functions. The approach is also tested by pre-processing the input data
using PCA. A number of graphical and tabular results in the form of goodness-of-fit
measures are presented for wind energy predictions. The results also show a trade-
off curve for the two objectives employed in the MOPSO. The trade-off curve is
helpful in identifying the appropriate parameter set for SVM in order to achieve
the desired accuracy for the two objective problem. The SVM predictions at the
farm level produced excellent agreement with the observed data for the validation
set. Overall, the results have been encouraging and it is recommended to use
MOPSO-SVM approach for other operational projects in the area of renewable
energy predictions and forecasting. While further modifications and advancements
are underway, the current procedure is sound enough to be applied in operational
settings.
References
1. V. Cherkassky, F. Mulier, Learning from Data - Concepts, Theory, and Methods (Wiley, New
York, 1998), p. 441
2. B. Schölkopf, K. Sung, C.J.C. Burges, F. Girosi, T. Poggio, V. Vapnik, Comparing support vector machines with Gaussian kernels to radial basis function classifiers. IEEE Trans. Signal Process. 45(11), 2758–2765 (1997)
3. V. Vapnik, The Nature of Statistical Learning Theory (Springer, New York, 1995), p. 188
4. V. Vapnik, Statistical Learning Theory (Wiley, Hoboken, NJ, 1998), p. 736
5. N. Cristianini, J. Shawe-Taylor, An Introduction to Support Vector Machines and Other Kernel-Based Learning Methods (Cambridge University Press, Cambridge, 2000), p. 189
6. A. Tikhonov, V. Arsenin, Solution of Ill-posed Problems (W.H. Winston & Sons, Washington,
DC, 1977), p. 258
7. M.K. Gill, T. Asefa, Y. Kaheil, M. McKee, Effect of missing data on performance of learning
algorithms for hydrologic predictions: Implications to an imputation technique. Water Resour.
Res. 43, W07416 (2007). doi:10.1029/2006WR005298.
82 K. Gill et al.
8. M.K. Gill, Y.H. Kaheil, A. Khalil, M. McKee, L. Bastidas, Multiobjective particle swarm
optimization for parameter estimation in hydrology. Water Resour. Res. 42, W07417 (2006).
doi:10.1029/2005WR004528 (2006)
9. R.C. Eberhart, J. Kennedy, A new optimizer using particle swarm theory, in Proceedings of
the Sixth International Symposium on Micro Machine and Human Science, 1995, MHS’95.
doi:10.1109/MHS.1995.494215 (IEEE Press, Piscataway, NJ (1995), pp. 39–43
10. J. Kennedy, R.C. Eberhart, Particle swarm optimization, in Proceedings of IEEE International
Conference on Neural Networks, IV, vol. 4. doi:10.1109/ICNN.1995.488968 (IEEE Press,
Piscataway, NJ (1995), pp. 1942–1948
11. J. Kennedy, R.C. Eberhart, Y. Shi, Swarm Intelligence (Morgan Kaufmann, San Francisco,
CA, 2001)
12. R. C. Eberhart, R. W. Dobbins, P. Simpson, Computational Intelligence PC Tools (Elsevier,
New York, 1996)
13. H. Kita, Y. Yabumoto, N. Mori, Y. Nishikawa, Multi-objective optimization by means of the
thermodynamical genetic algorithm, in Parallel Problem Solving From Nature—PPSN IV,
eds. by H.-M. Voigt, W. Ebeling, I. Rechenberg, H.-P. Schwefel (Springer-Verlag Berlin,
Germany, 1996), Sept. Lecture Notes in Computer Science, pp. 504–512
14. Kalnay et al. The NCEP/NCAR 40-year reanalysis project. Bull. Am. Meteor. Soc. 77,
437–470 (1996)
Chapter 7
Hybriding Intelligent Host-Based
and Network-Based Stepping Stone Detections
Abstract This paper discusses the idea of hybriding intelligent host-based and
network-based stepping stone detections (SSD) in order to increase detection
accuracy. Experiments to measure the True Positive Rate (TPR) and False Positive
Rate (FPR) for both Intelligent-Network SSD (I-NSSD) and Intelligent-Host SSD
(I-HSSD) are conducted. In order to overcome the weaknesses observed from each
approach, a Hybrid Intelligent SSD (HI-SSD) is proposed. The advantages of
applying both approaches are preserved. The experimental results show that HI-SSD not only increases the TPR but at the same time also decreases the FPR. A high TPR means that the accuracy of the SSD approach increases, and this is the main objective of the creation of HI-SSD.
1 Introduction
When the Internet was created, security was not a priority. The TCP/IP protocol security mechanism was thought to be sufficient at the beginning. However, as Internet usage has increased, its security mechanism has become more and more problematic [1]. In fact, as the Internet is used more widely, the number of attacks also continues to increase. Therefore, attacks or intrusions occur from time to time. There are many techniques that can be used by an attacker or intruder to execute network attacks or network intrusions. A persistent attacker usually employs stepping stones as a way to avoid being detected [2]. By using Stepping Stone Detection (SSD), the attacker can be detected.
However, due to the complicated patterns of the stepping stones used by the
attackers, detection of these stepping stones becomes a challenging task. More
2 Research Terms
Before we start in more detail on the experiment, there are several research terms or
terminologies used in this work which need to be clarified. In Fig. 1, there are five
hosts involved in an SSD environment. Host A is a source of attack and Host E is a
victim.
From Fig. 1, the Stepping Stones (SS) are hosts A, B, C, D and E: SS = {A, B, C, D, E}, where hosts A, B, C, D and E carry the same packet that flows through each host. A Connection Chain (CC), on the other hand, is the set of connections between hosts A, B, C, D and E; therefore CC = {a, b, c, d}. In Stepping Stone Detection (SSD) research using the Network-based approach (N-SSD), either SS = {A, B, C, D, E} or CC = {a, b, c, d} can be used to denote the existence of stepping stones.
(Fig. 1 shows hosts A, B, C, D and E connected in a chain by connections a, b, c and d; the inbound and outbound packets of Host C are indicated.)
3 Related Works
Research by Thames et al. [12] applied the hybrid concept by combining a Bayesian Learning Network (BLN) and a Self-Organizing Map (SOM) to classify network-based and host-based data collected within a LAN for network security purposes. The
experiment was conducted by using four types of analyses (i) BLN with network
and host-based data, (ii) BLN with network data, (iii) hybrid BLN-SOM analysis
with host and network-based data and (iv) hybrid BLN-SOM analysis with net-
work-based data. The four different types of analyses were required to compare one
result to another.
Meanwhile, Bashah et al. [13] proposed a system that combines anomaly, misuse and host-based detection for an Intrusion Detection System (IDS). That research only proposed an architecture combining fuzzy logic and the SOM approach, without any implementation or experiments.
Inspired by these two research works, we hybridize the I-HSSD and I-NSSD approaches and compare the hybrid approach (HI-SSD) with the non-hybrid approaches (I-HSSD and I-NSSD) in terms of accuracy.
The main components of HI-SSD are intelligence and hybridization. The intelligence component comes from the use of the Self-Organizing Map (SOM) approach, and the hybrid component is created from the combination of host-based SSD and network-based SSD. Both components are discussed in detail in our previous research [7, 8, 14]. Figure 2 shows the HI-SSD architecture.
Figure 2 shows the overall HI-SSD architecture that involves I-HSSD and
I-NSSD. In this architecture an intrusion detection system (IDS) is used as a trigger
to detect any network intrusion. When an intrusion occurs, I-NSSD starts to capture
the network packet in a defined range. At the same time, each host also captures the
(Fig. 2 shows the HI-SSD architecture: an IDS triggers when an attack is detected; I-NSSD and the per-host I-HSSD each produce an SSD list, and the two lists are checked for similarity.)
network packet as well. When I-NSSD finishes its process to detect stepping stones,
information about related hosts involved in the chain of stepping stones is pro-
duced. Each host listed in I-NSSD as a stepping stone node then executes a self-
examination to check whether it is being used as a stepping stone or not. The I-NSSD list and the I-HSSD lists are then compared; similarity between these lists indicates the real stepping stone hosts.
For testing purposes, only the functions of HI-SSD, I-NSSD and I-HSSD are
involved in the experiment. The development of a fully-functional HI-SSD will
become our future work.
The HI-SSD will contain a stepping stone list each from I-NSSD and I-HSSD,
while the I-NSSD and I-HSSD will contain a stepping stone list for every network
and host respectively. Comparisons will be measured on the TPR and the FPR on
each component.
5 Experiment
For the dataset arrangement, Telnet Scripting Tool v.1.0 [15] is used. This is to
guarantee a uniform pattern of telnet operations during the execution of the
experiment. Here, Telnet represents the interactive connection most frequently
used by SSD-related research. Moreover, there is no other dataset that is suitable for this research; Jianhua and Shou-Hsuan [16] used their own dataset, and Staniford-Chen and Heberlein [3] also agreed with this.
The experiment is run in a controlled environment so as to avoid any interfer-
ence with outside networks. Wireshark [17], on the other hand, is used to capture
network packets that flow in each host. After the Telnet Scripting Tool has ended its run, information pertaining to the packets is converted into text-based form.
(Figure: the experimental topology, with Hosts 1–4 linked by connection chains c1, c2 and c3.)
Table 1 Host and its relationship
Host      No. of connection chains    Connection chain list
Host 1    3                           c1, c2, c3
Host 2    1                           c1
          2                           c2, c3
Host 3    2                           c1, c2
          1                           c3
Host 4    3                           c1, c2, c3
A chain with just one connection can be eliminated, because a single connection does not mean that it is a stepping stone connection. In fact, research on RTT-based SSD [19–21] agreed that only connection chains with more than three connections can be considered stepping stone connections. Based on our previous research [8], however, the existence of two or more connections is enough for a chain to be identified as a possible stepping stone chain.
To calculate the effectiveness of the tested approaches, the TPR and FPR are used. The False Positive Rate (FPR) is the fraction of negative instances that are falsely reported by the algorithm as positive; in this situation, the algorithm reports connection chains that do not actually exist. The True Positive Rate (TPR) is the fraction of true instances detected by the algorithm out of all possible true instances. The results and their analysis are discussed below.
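To make these two rates concrete, the following minimal sketch (in Python) reproduces the counts reported later in this section; the way false detections are normalised here is an assumption inferred from the reported 33.3% figure, not a formula quoted from the chapter.

# Illustrative sketch: TPR/FPR as used for the connection-chain counts.
# Assumption: the FPR is taken as (spuriously detected chains) / (actual chains),
# which is what reproduces the 33.3% figure quoted for I-NSSD.

def rates(n_detected, n_actual):
    true_pos = min(n_detected, n_actual)        # correctly detected chains
    false_pos = max(n_detected - n_actual, 0)   # spurious detections
    tpr = true_pos / n_actual
    fpr = false_pos / n_actual
    return tpr, fpr

print(rates(4, 3))  # I-NSSD: (1.0, 0.333...) -> 100% TPR, 33.3% FPR
print(rates(2, 3))  # I-HSSD on Host 4: (0.666..., 0.0) -> 66.7% TPR, 0% FPR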
Results for each type of SSD are discussed in the following sub-sections. As described before, we have chosen SOM as the intelligent approach; as a result, a SOM graph represents each compared solution.
In I-NSSD, the results are obtained by executing the SOM approach on the arrival times of all the data captured in the network. In this case, only one graph is produced, unlike I-HSSD, which needs one graph for each host involved. Figure 4 shows the result.
In contrast to the I-HSSD approach that depends on the number of possible
straight lines which can be created to determine the number of connection chains,
the I-NSSD approach is based on the number of possible groups generated. In
Fig. 4, four possible groups of SOM nodes can be counted (labeled as a, b, c and d).
Thus, there are four connection chains involved in the experiment. However, the
true number of connection chains involved in this research is only three. Therefore,
there exist false positive reports in the I-NSSD experiment. Using formula (1), the FPR for I-NSSD is 33.3%, while the TPR for I-NSSD is 100%.
As described previously, I-HSSD involves every host that has been listed by I-NSSD. In this case, the arrival times of each host are processed with the SOM approach, and each host produces its own result.
Figure 5 shows that three possible directions can be traced (e, f and g). That
means there are three possible connections which exist in the originating host,
Host 1. Based on the value from Table 1, Host 1 obtains 100% TPR and 0% FPR.
Figure 6 on the other hand shows that there are two connection chains (h and i)
that could possibly exist in Host 2 as the monitored host. Based on the number of
connection chains from Table 1, Host 2 got 100% TPR and 0% FPR.
In Fig. 7, similar to Fig. 6, there are two connection chains (j and k) that
could possibly exist in this graph. Based on the number of connection chains
from Table 1, Host 3 also got 100% TPR and 0% FPR.
Figure 8 shows the last node of SOM that needs to be observed. From the graph,
there are two possible obtainable directions. That means that two connection chains
(l and m) have been detected. However, based on the number of connection chains in Table 1, three connection chains should have been detected in Host 4. I-HSSD therefore fails to detect one connection in Host 4, and the TPR drops to 66.7%. The FPR, on the other hand, remains 0%.
The per-host results show that only Host 4 does not achieve 100% TPR; this can be considered a weakness of I-HSSD. Table 2 shows the result of the I-HSSD experiment.
Based on the overall result of the I-HSSD experiment, connection chains have been successfully detected in Hosts 1, 2 and 3. In Host 4, although connection chains have also been detected, the number detected is less than the actual number involved, which puts the TPR at just 66.7%. The FPR, on the other hand, is the same (0%) for all hosts.
Table 3 Experiment result
Type of stepping stone detection approach    TPR (%)    FPR (%)
I-NSSD                                       100        33.3
I-HSSD                                       91.67      0
HI-SSD                                       100        0
0% FPR has been achieved. For a clear picture on the overall HI-SSD, the related
algorithm is given as follows.
Begin
  I-NSSD:
    capture network packets
    collect arrival times
    execute SOM
    count the groups of nodes, n
    list the involved hosts, l
    send n and l to I-HSSD
  I-HSSD:
    activate I-HSSD on the selected hosts based on l
    for 1 to n
      execute SOM
      count the possible straight lines
      identify the existence of a connection chain for the host, e
    end for
End
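The decision step of HI-SSD can be sketched compactly as follows; this is only an illustration of the comparison between the two lists, with hypothetical host names, and the SOM-based detection itself abstracted away.

# Illustrative Python sketch of the HI-SSD decision step: the stepping stone
# candidates produced by I-NSSD are cross-checked against the per-host
# self-examination results of I-HSSD. Detection internals (SOM) are abstracted.

def hi_ssd(i_nssd_hosts, i_hssd_results):
    """i_nssd_hosts: hosts flagged by the network-based SSD.
       i_hssd_results: dict host -> True/False from host-based self-examination."""
    confirmed = []
    for host in i_nssd_hosts:
        # a host is reported only if both approaches agree (list similarity)
        if i_hssd_results.get(host, False):
            confirmed.append(host)
    return confirmed

# Hypothetical example with the hosts of Fig. 1
i_nssd_hosts = ["B", "C", "D"]                       # flagged by I-NSSD
i_hssd_results = {"B": True, "C": True, "D": False}  # per-host I-HSSD verdicts
print(hi_ssd(i_nssd_hosts, i_hssd_results))          # ['B', 'C']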
Result of the experiment according to the percentage of TPR and FPR obtained
from I-NSSD, I-HSSD and HI-SSD respectively is tabulated in Table 3.
Table 3 shows the experiment results for I-NSSD, I-HSSD and HI-SSD. As described previously, HI-SSD is a combination of I-NSSD and I-HSSD, so its TPR and FPR derive from the TPR and FPR of both approaches. The resulting 100% TPR and 0% FPR of HI-SSD are better than those of I-NSSD or I-HSSD alone. Although I-NSSD shows 100% TPR, its 33.3% FPR can adversely affect the overall function of stepping stone detection. For I-HSSD, the 0% FPR makes the approach free of false detections, but its 91.67% TPR is not enough for it to be considered fully functional. Combining I-NSSD and I-HSSD to form HI-SSD therefore balances the TPR and the FPR at the same time.
The goal of this research is to demonstrate the effectiveness of the proposed HI-SSD, which is a hybrid of I-NSSD and I-HSSD. The experiment shows that HI-SSD is more accurate than either I-NSSD or I-HSSD used alone.
In the future, we will improve our approach to be more robust against active perturbation attacks such as delay, packet drops and chaffing. Testing against active perturbation would not only be performed on our proposed solution but would also involve the closest related research, which used a data mining approach. In this way, we can not only measure the capabilities of the proposed approach but also compare it with other similar approaches.
Only one dataset is used in this experiment. To verify the capability of the proposed HI-SSD, we need different types of dataset. Our observation of the datasets used in SSD-based research shows that two kinds of datasets can be used: datasets generated by ourselves and public datasets. By using various kinds of datasets instead of just one type, the true capabilities of the approach can be examined more thoroughly. Testing the proposed HI-SSD on different datasets is therefore one of our future works.
Another future plan is to compare the proposed HI-SSD approach with statistical-based approaches; including the different approaches in the same experiment would help ascertain whether the proposed approach is indeed better.
Lastly, for a true experience of a fully-functional stepping stone detection, the
overall approach should be translated into a full system. If a complete system is
created, comparisons with other approaches could be made more easily.
References
10. Y. Zhang, V. Paxson, Detecting stepping stones, in Proceedings of the 9th USENIX Security
Symposium, Denver, CO, 2000, pp. 67–81.
11. K. Yoda, H. Etoh, Finding connection chain for tracing intruders, in Proceedings of the 6th
European Symposium on Research in Computer Security (LNCS 1985), Toulouse, France,
2000, pp. 31–42
12. J.L. Thames, R. Abler, A. Saad, Hybrid intelligent system for network security, in Proceedings of the ACM Southeast Regional Conference, 2006
13. N. Bashah, I.B. Shanmugam, A.M. Ahmed, Hybrid intelligent intrusion detection system,
Proc. World Acad. Sci. Eng. Technol. 6 (2005)
14. M.N. Omar, L. Serigar, R. Budiarto, Hybrid stepping stone detection method, in Proceedings of the 1st International Conference on Distributed Framework and Application (DFmA 2008), Universiti Sains Malaysia, Penang, 21–22 Oct 2008, pp. 134–138
15. Wareseeker (2008) [Online]. Available: http://wareseeker.com/freeware/telnet-scripting-tool-1.0/19344/TST10.zip. Accessed 8 Feb 2008
16. Y. Jianhua, S.H. Shou-Hsuan, A real-time algorithm to detect long connection chains of interactive terminal sessions, in Proceedings of the 3rd International Conference on Information Security, China, 2004, pp. 198–203
17. Wireshark (2009) [Online]. Available: http://www.wireshark.org. Accessed 8 Feb 2009
18. H. Duane, L. Bruce, Mastering MATLAB A Comprehensive Tutorial and Reference (Prentice-
Hall, New Jersey, 1996)
19. K.H. Yung, Detecting long connection chains of interactive terminal sessions, in Proceedings
of the International Symposium on Recent Advance in Intrusion Detection (RAID 2002),
Zurich, Switzerland, 2002, pp. 1–16
20. J. Yang, S.S. Huang, A real-time algorithm to detect long connection chains of interactive
terminal sessions, in Proceedings of the 3rd International Conference on Information Security
(INFOSECU 2004), Shanghai, China, 2004, pp. 198–203.
21. J. Yang, S.S. Huang, Matching TCP packets and its application to the detection of long
connection chains on the internet, in Proceedings of the 19th International Conference on
Advance Information Networking and Applications (AINA 2005), Tamkang University,
Taiwan, 2005, pp. 1005–1010
Chapter 8
Open Source Software Use in City Government
Is Full Immersion Possible?
Abstract The adoption of open source software (OSS) by government has been a
topic of interest in recent years. National, regional, and local government are using
OSS in increasing numbers, yet the adoption rate is still very low. This study
considers if it is possible from an organizational perspective for small to
medium-sized cities to provide services and conduct business using only OSS.
We examine characteristics of municipal government that may influence the
adoption of OSS for the delivery of services and to conduct city business. Three
characteristics are considered to develop an understanding of city behavior with
respect to OSS: capability, discipline, and cultural affinity. Each of these general
characteristics contributes to the successful adoption and deployment of OSS by
cities. Our goal was to determine the organizational characteristics that promote the
adoption of OSS. We conducted a survey to support this study, resulting in 3,316 responses representing 1,286 cities in the United States and Canada. We found that most cities do not have the requisite characteristics to successfully adopt OSS on a comprehensive scale and that most cities not currently using OSS have no future plans for OSS.
1 Introduction
All city governments seek to deliver services in the most efficient manner possible. Whether in direct support of service delivery or in support of conducting the business of government, Information Technology (IT) has become an integral component of operations at all levels of government.
In the past 5 years there has been a trend by some national, regional, and local
governments toward use of open source software (OSS) and open standards as a
first choice rather than a curiosity. Recently, The Netherlands has mandated that all national government agencies use documents in open standard formats by April 2009 [1]. The U.S. Navy has clarified the category and status of OSS to
promote a wider use of OSS to provide seamless access to critical information [2].
The U.S. Congress is recognizing the potential value of OSS in the National
Defense Authorization Act for fiscal year 2009. OSS is identified as an objective
in the procurement strategy for common ground stations and payloads for manned
and unmanned aerial vehicles, and in the development of technology-neutral
information technology guidelines for the Department of Defense and Department
of Veteran Affairs.
Small to medium-sized cities (populations of less than 500,000 [3]) may have serious limitations in funding IT efforts. Escalating costs of service delivery coupled with reduced revenue will force governments to seek novel ways to reduce operating costs. With limited revenue, cities tend to underfund IT infrastructure in favor of applying resources to the growing labor requirements of service delivery. Careful and deliberate selection of IT solutions can reduce the labor required for service delivery, freeing that labor for other purposes.
Considerable research has been conducted on the topic of e-government and the
development of models to explain e-government maturity. There have also been
ample studies of the trends in adoption of OSS by government at various levels.
However, little research has been done on the characteristics of regional and local government that would promote adoption and successful deployment of OSS.
The support of city leadership and management, as well as of the IT staff, is required for the successful adoption of any technology, not just OSS. Vision, strategy and government support are important for the success of IT projects, while insufficient funding and poor infrastructure are major factors for failure [4].
This research focused on the perspectives of city leadership, management, and
IT staff with respect to OSS and its adoption by city government. While OSS was
the focus, the approach used in this study can be applied to investigating the
adoption of most technologies or practices.
2 Related Research
The literature review revealed little research with characteristics similar to this study. While much work can be found on OSS in a wide variety of topics, the body
of research covering municipal OSS adoption is relatively limited. Of the literature
found relating to OSS adoption by government, a significant portion of that work
examines the adoption of OSS by developing countries.
A fair number of studies have been conducted to examine the extent to which
government entities are using OSS. The studies tend to collect, categorize, and
report the current state of open source adoption [5, 6]. A study of Finnish munici-
palities [7] had a methodology similar to the methodology used in this study.
While the Finnish study considered only IT managers as survey subjects, our survey
considered city leaders, managers, and IT staff for subjects, as our focus is examin-
ing organizational behavioral influences on OSS adoption rather than technical and
budgetary influences.
In the current economic climate, governments at all levels are facing funding crises as the costs of operations increase and revenue decreases. The question of what can be done to reduce IT operating costs is now a very important one. There are
more benefits to using OSS than just reduced acquisition costs [8]. Restrictive
licensing, vendor lock-in, and high switching costs can be eliminated, which, in
the long term, also may reduce costs.
The level of knowledge of a user with respect to OSS and Closed Source
Software (CSS) will influence their decision to use OSS.
There are two typologies of consumers: a) “informed” users, i.e. those who know about the existence of both CSS and OSS and make their adoption decision by comparing the utility given by each alternative, and b) “uninformed” users, i.e. those who ignore the existence of OSS and therefore when making their adoption decision consider only the closed source software [9].
3 Research Goals
4 Methodology
The following sections describe the methodology used for this study. We begin by
describing in general the process, followed by the survey design, survey execution,
and subject selection.
A survey was conducted to collect data from municipal IT managers, IT staff,
city leadership, city management, and city employees. The survey was adminis-
tered online using SurveyMonkey.com. The collection period was initially sched-
uled for 30 days from June 1, 2008 through June 30, 2008.
The survey required soliciting responses from subjects to provide insight into the
characteristics of their cities with respect to IT capability, organizational discipline,
and cultural affinity to OSS. Presenting direct questions would not produce useful
data, as subjects may not have the requisite knowledge in the subject areas. One
goal in the design of the survey was to reduce the number of aborted attempts by
subjects. An aborted attempt is the failure to complete the survey once started.
5 Survey Execution
For the announcement strategy we used three channels to contact potential subjects: magazines related to city management, municipal associations, and direct e-mail.
Announcing the survey through a magazine was deemed to have the potential to generate a significant level of exposure among subjects who are more likely to
read city government related magazines. Several magazines were contacted for
assistance to announce the survey. Two magazines responded, the Next American
City Magazine and American City and County Magazine. The Next American City
magazine provided a half page ad space to announce this survey. American City and
County Magazine announced the survey in an article posted on the front page of
its website. Although the potential exposure was thought to be high, the magazine announcement channel produced only 20 responses, of which only one was valid for analysis.
Municipal associations were thought to be the best vehicle for reaching the
largest number of subjects. The rationale behind this was the assumption that
individuals affiliated with municipal associations might be more inclined to respond
to a survey announcement received from their association. The expectation was
that the greatest number of responses would result from municipal associations.
Individuals affiliated with municipal associations may also have greater interest
in supporting this research as they may see a potential benefit for their city.
A total of 116 municipal associations were contacted to assist with announcing the survey to their members; 28 associations approved the request for assistance and forwarded the announcement to their members.
The municipal associations were identified via a search of the Internet. Most of the associations found were regional, providing representation within a county.
6 Survey Results
Analysis of the survey data indicates few cities have all the characteristics that
would enable successful comprehensive adoption and deployment of OSS. Of the
1,206 distinct cities in the sample set, just ten cities satisfied all characteristics
within the three dimensions.
Ten cities, listed in Table 1, satisfied the following criteria: has an IT department, handles IT support in-house, currently uses OSS, has a well-defined IT strategy, has an IT line item in the budget, funds IT sufficiently, follows a total-cost-of-ownership acquisition strategy, and uses a budget for software acquisition.
Largo, Florida is of particular interest. Largo has embraced the use of OSS: it has deployed Linux as the operating system for its 400 desktop clients, saving the city an estimated $1 million per year in hardware, software, licensing, maintenance, and staff costs [17].
Of the 460 Municipal IT managers and staff in the sample set, 56% indicated their
city was not currently using OSS while 39% indicated their city was using OSS.
Considering the widespread use of OSS in the commercial sector, the relatively
high percentage of cities in this survey not currently using OSS required further
investigation (Fig. 4).
Of the cities currently using OSS, 76% are planning to use OSS in the future and 10% have no plans to use OSS in the future. A high percentage of future-use plans among cities that already use OSS is to be expected: it is more likely that an organization will continue to use a software product once it is deployed and established than abandon it (Fig. 5).
The cities currently not using OSS provide a more interesting observation. Of the 259 IT managers and staff indicating their city is currently not using OSS, 82% indicated their city has no plans to use OSS in the future, 9% indicated their city did plan to use OSS in the future, and 9.7% (25) did not know.
The number of dedicated IT staff at the respondent cities may not be an
influencing factor in decisions to use OSS in the future. While 74% of the cities
not planning to use OSS in the future have IT staff numbering ten or less, 71% of
cities currently using OSS also have ten or less IT staff.
The organizational support for using OSS appears to be a significant influencing
factor for a city’s future plans for OSS use. The survey design included questions
regarding the respondent’s perception of the Leadership, Management, and IT staff
views of OSS. The subjects were asked if the city leadership, management, and IT
staff support the use of OSS. For the cities not planning to use OSS in the future
only 6% of the respondents indicated the city leadership supports the use of OSS,
8% of respondents indicated city management supports the use of OSS, and 33%
indicated city IT staff supports use of OSS. For cities currently using OSS the
responses were 22% leadership, 33% management, and 71% IT staff.
IT managers and staff report a significant difference between the perceived current support of OSS and the support of OSS if using it would reduce IT operating costs. Eleven percent of IT managers and staff agree that their city leadership currently supports the use of OSS. The IT managers’ and staff’s perception of city management’s current support of OSS is similar to, if somewhat higher than, their perception of city leadership: sixteen percent agree their city management currently supports OSS.
The IT managers’ and staff’s perception of city IT staff’s current support of OSS,
that is their perception of themselves, was significantly higher than their perception
of city leadership and management support of OSS with 26% agreeing the city IT
staff supports the use of OSS (Fig. 6).
When asked about support of OSS if it meant saving money, 36% of the IT staff agree their city leadership would support OSS if it would save money, a threefold increase. Forty-one percent agree their city management would support OSS if it would save money, a 150% increase. However, only 36% agreed the city IT staff would support OSS to save money, just a 50% increase. While these results indicate city leadership and management may be motivated to support OSS given the potential cost savings, IT staff may not share those same motivations (Fig. 7).
Of note is the drop in frequency (from 70% to 50%) of respondents indi-
cating a neutral position or those who did not know. The possibility of reducing
the costs of information technology is a significant influence on IT strategy and
technology adoption.
The survey data suggests a discrepancy between the subject’s own awareness of OSS
and their perception of city leadership, management, and IT staff’s awareness of OSS.
Within the sample set 69% of the respondents indicated they are aware of OSS.
However, their responses regarding their city leadership, management, and IT staff’s
awareness of OSS show that most respondents perceive the leadership, management
and IT staff as generally unaware of OSS. The high frequency of those individually
aware of OSS could be attributed to the survey attracting individuals interested in OSS.
8 Conclusion
The results indicate cities in general do not have the necessary characteristics to
successfully adopt OSS to deliver services and conduct city business on a compre-
hensive scale. The key indicators point to significant deficiencies in the three
domains: capability, discipline, and cultural affinity.
While a majority of cities in the study show some characteristics that indicate the
adoption of OSS is possible, and indeed on a trivial level (with a few notable
exceptions) some cities are using OSS, still most cities lack key characteristics in
the three domains to enable a successful comprehensive adoption of OSS.
The data suggest many cities may have an adequate level of discipline to support
open source adoption with IT line items in the city budget and sufficient IT funding.
However, a significant number of cities make software purchases on an ad hoc
basis, indicating potential lack of organizational planning capability.
A city’s Culture, with respect to IT decision making, appears to be a significant
barrier to open source adoption. City leadership and management of cities that do not
support the use of OSS are generally unaware of OSS as an alternative to commercial
software. Cities currently using OSS are highly likely to continue to use OSS in the
future while cities not presently using OSS have no future plans to use OSS.
Because the cities represented in this study in general do not exhibit the indicators in the three domains examined, we conclude that most cities do not have the capability, discipline, and cultural affinity to successfully adopt OSS on more than a trivial level.
References
1. Associated Press (14 Dec 2007) Dutch Government Ditches Microsoft, Moves to Open-
Source Software. Foxnews.com. Retrieved 7 Feb 2008, from Fox News Web site: http://
www.foxnews.com/story/0,2933,316841,00.html
2. R.J. Carey, Department of the Navy Open Source Software Guidance, Washington, DC, U.S. Navy. Retrieved 8 Feb 2008, from Open Source Software Institute Web site: http://oss-institute.org/Navy/DONCIO_OSS_User_Guidance.pdf
3. V. Henderson, Medium size cities. Reg. Sci. Urban Econ. 27(6), 583–612 (1997)
Chapter 9
Pheromone-Balance Driven Ant Colony Optimization
with Greedy Mechanism
Masaya Yoshikawa
Abstract Ant colony optimization (ACO), which is based on the feeding behavior of ants, has a powerful solution searching ability. However, since its processing must be repeated many times, the computation also requires a very long time. In this chapter, we discuss a new ACO algorithm that incorporates an adaptive greedy mechanism to shorten the processing time. The proposed algorithm switches between two selection techniques adaptively according to the generation. In addition, new pheromone update rules are introduced in order to control the balance of intensification and diversification. Experiments using benchmark data prove the validity of the proposed algorithm.
1 Introduction
M. Yoshikawa
Department of Information Engineering, Meijo University, 1-501 Shiogamaguchi,
Tenpaku, Nagoya 468-8502, Japan
e-mail: [email protected]
Using these two elements, ants can search out the shortest route from their nest to a feeding spot. Figure 1 shows the actual method for searching for the shortest route.
Figure 1a shows the case where two ants A and B have returned from a feeding
spot to the nest via different routes. In this case, ants A and B have secreted
pheromone on their routes from the nest to the feeding spot and from the feeding
spot to the nest. At this point, a third ant C moves from the nest to the feeding spot,
relying on the pheromone trail that remains on the route.
ACO is performed based on the following three preconditions: (1) the amount of
pheromone secreted from all ants is the same, (2) the moving speed of all ants is the
same, and (3) the secreted pheromone evaporates at the same rate. In the above
case, since the moving distance from the nest to the feeding spot is longer for route
A than for route B, a larger amount of pheromone will have evaporated along route
A than along route B, as shown in Fig. 1b. Therefore, ant C will select route B
because a larger amount of pheromone remains on this route. However, ant C does
not go to the feeding spot by a route that is the same as route B; rather, ant C goes to
the feeding spot by route C and then returns to the nest, as shown in Fig. 1c. Another
ant D then goes to the feeding spot either by route B or by route C (on which a larger
amount of pheromone remains). The fundamental optimization mechanism of ACO
is to find a shorter route by repeating this process.
For this reason, ACO has a powerful solution searching ability. However, since
processing must be repeated many times, the computation process also requires a
very long time. In order to shorten the processing time, this study proposes a new
ACO algorithm that incorporates adaptive greedy selection. The validity of the
proposed algorithm is verified by performing evaluation experiments using bench-
mark data.
This chapter is organized as follows: Section 2 indicates the search mechanism
of ACO, and describes related studies. Section 3 explains the proposed algorithm
with modified pheromone update rules. Section 4 reports the results of computer
simulations applied to the travelling salesman problem (TSP) benchmark data.
We summarize and conclude this study in Section 5.
Fig. 1 Example of pheromone communication: (a) two routes in the initial state, (b) positive feedback reinforcement using pheromone information, and (c) example of another route
2 Preliminaries
ACO is a general term for algorithms obtained by technologically modeling the feeding behavior of ants. The basic model of ACO is the ant system (AS) [1] designed by M. Dorigo. The ant colony system (ACS) [2] is one of the modified versions of the AS; it provided better solutions to the TSP than a genetic algorithm (GA) [9, 10] or a simulated annealing (SA) [11] method. Therefore, this study adopts the ACS as its basic algorithm, and henceforth ACO also denotes the ACS in this paper.
The processing procedure of ACO is explained using an example in which ACO
is applied to the TSP. In ACO, each of several ants independently creates a
travelling route (visiting every city just one time). At this time, each ant determines
the next destination according to the probability pk calculated from formula (1).
Here, η(i,j) in formula (1) is called the static evaluation value; it is the reciprocal of the distance between city i and city j and is a fixed value. On the other hand, τ(i,j) is referred to as the dynamic evaluation value; it expresses the amount of pheromone on the route between city i and city j, and it changes during the optimization process. The term β represents a parameter and n_k represents the set of unvisited cities.
In ACO, the probability is high for selecting a route whose distance is short and
whose pheromone amount is large. That is, selection using a roulette wheel is
performed, as shown in Fig. 2.
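Formula (1) itself is not reproduced in this text; the sketch below therefore assumes the standard transition probability p_k(i,j) proportional to τ(i,j)·η(i,j)^β over the unvisited cities, with η(i,j) = 1/d(i,j), and implements the roulette-wheel selection on top of it. All numerical values are illustrative.

import random

# Sketch of roulette-wheel city selection driven by pheromone (tau, dynamic)
# and inverse distance (eta, static), assuming the standard form
#   p(i,j) = tau(i,j) * eta(i,j)**beta / sum over unvisited cities.

def select_next_city(i, unvisited, tau, dist, beta=2.0):
    weights = [tau[i][j] * (1.0 / dist[i][j]) ** beta for j in unvisited]
    total = sum(weights)
    r = random.uniform(0.0, total)
    acc = 0.0
    for j, w in zip(unvisited, weights):
        acc += w
        if acc >= r:
            return j
    return unvisited[-1]

# Hypothetical 4-city example
dist = [[0, 2, 9, 10], [2, 0, 6, 4], [9, 6, 0, 8], [10, 4, 8, 0]]
tau = [[1.0] * 4 for _ in range(4)]          # uniform initial pheromone
print(select_next_city(0, [1, 2, 3], tau, dist))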
In formula (2), c is a decay parameter of the local update rule and τ0 is the initial pheromone value. The global update rule is expressed by formula (3); it is applied after each ant completes a travelling route.
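Formulas (2) and (3) are not reproduced in this text. For orientation, the standard ACS update rules have the following form (a reconstruction under that assumption, with ρ and α the local and global decay parameters and L_best the length of the best route found so far):

\tau(i,j) \leftarrow (1-\rho)\,\tau(i,j) + \rho\,\tau_0 \qquad \text{(local update, when an ant traverses edge } (i,j))

\tau(i,j) \leftarrow (1-\alpha)\,\tau(i,j) + \alpha\,\frac{1}{L_{\text{best}}} \qquad \text{(global update, on the edges of the best route)}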
Many studies [1–8] have been reported for the use of ACO, such as those that have
applied ACO to network routing problems [3] and to scheduling problems [4].
A number of studies [5, 6] have also been reported on pheromone, which is an
important factor for controlling the intensification and diversification of a search.
Hybridization with other algorithms, such as a GA [7] or SA [8] method, has also
been studied. However, no study has yet been reported that incorporates adaptive
greedy selection into an ACO, as is proposed in this study.
In ACO, when x represents the number of cities, y the total number of ants, and z the number of processing repetitions, the numbers of calculations for formulae (1), (2), and (3) are (x − 1)·y·z, x·y·z, and z, respectively. Since the number of processing repetitions in ACO is large, a processing time problem is inherent in ACO.
incorporating a greedy selection mechanism into the ACO reduced the number
of calculations in formulae (1) and (2). Consequently, the processing time was
shortened.
A greedy selection approach only employs static evaluation values, with no use
of dynamic evaluation values, when selecting the next destination city. That is, an
ant moves from the present city to the nearest city. Since static evaluation values are
constant in the optimization process, once the calculation is performed in the early
stage, it is not necessary to re-calculate. Therefore, the processing time can be
shortened by performing a greedy selection. However, since greedy selection favors
the local optimal solution, the balance between formula (1) and the greedy selection
becomes important.
In order to control this trade-off relationship, the greedy selection is adaptively
used in this study. The adaptive greedy selection is explained using the TSP of six
cities, as shown in Fig. 3.
In Fig. 3, when city A is set as the start city, the following three selections are
performed using formula (1): (1) movement from city A to city B; (2) movement
from city B to city C; and (3) movement from city C to city D. The greedy selection
is also applied to the other movements; namely, from city D to city E and from
city E to city F.
In contrast, when city D is set as the start city, the following three selections
are performed using the greedy selection: (1) movement from city D to city E,
(2) movement from city E to city F, and (3) movement from city F to city A.
Formula (1) is applied to the other movements; namely, from city A to city B and
from city B to city C. In this study, the proportion of the greedy selection and the
selection by formula (1) is changed according to generation, as shown in Fig. 4.
Figure 4 shows an example obtained when the total number of processing
repetitions was set at 1,000, and the proportion of the greedy selection and the
selection by formula (1) was changed every 200 repetitions. In this example, the
selection by formula (1) was performed until the 200th repetition. From the 201st
repetition to the 400th repetition, the selection by formula (1) was applied to 80%
of cities, and the greedy selection was applied to 20% of cities. In other words,
for the problem of 100 cities, the selection by formula (1) was performed on 80
cities and the greedy selection was performed on 20 cities. Similarly, from the
401st repetition to the 600th repetition, the selection by formula (1) was applied
to 60% of cities, and the greedy selection was applied to 40% of cities. Thus, by
changing the proportion of two selection methods according to generation, the
trade-off relationship between the accuracy of the solution and the processing
time can be controlled.
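The generation-dependent switching described above can be sketched as follows; the schedule mirrors the 1,000-repetition example in the text, while the function names and the behaviour after the 600th repetition are illustrative assumptions.

# Sketch: proportion of cities chosen by formula (1) versus greedy selection,
# switched every 200 repetitions as in the 1,000-repetition example above.

def formula1_ratio(iteration, total=1000, step=200):
    # 1.0 for iterations 1-200, 0.8 for 201-400, 0.6 for 401-600, ...
    stage = min((iteration - 1) // step, total // step - 1)
    return 1.0 - 0.2 * stage

def split_cities(n_cities, iteration):
    n_formula1 = round(formula1_ratio(iteration) * n_cities)
    n_greedy = n_cities - n_formula1
    return n_formula1, n_greedy

print(split_cities(100, 150))   # (100, 0): only formula (1)
print(split_cities(100, 300))   # (80, 20): 80 cities by formula (1), 20 greedy
print(split_cities(100, 500))   # (60, 40)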
In optimization algorithms, controlling the trade-off relationship between the
intensification and diversification of search is also important. In the adaptive greedy
selection proposed in this study, the pressure of intensification is strong. Therefore,
in order to strengthen the pressure of diversification, the local update rule and the
global update rule are partially changed.
First, the local update rule is generally applied whenever an ant moves. How-
ever, it is not applied when the greedy selection is performed. Next, the global
update rule is generally applied to the best travelling route; i.e., the shortest
travelling route. It is also applied to the second shortest travelling route. By adding
these two modifications, the proposed algorithm can realize a balance between the
intensification and diversification of the search.
Fig. 5 Comparisons of selective techniques (d), (e), (f), and (g)
The horizontal axis represents the number of processing repetitions and the vertical axis represents the travelling route length of the optimal solution.
As shown in Fig. 5, when the number of processing repetitions was small as the
termination condition, techniques (d) and (e) showed excellent performance.
When the number of processing repetitions was sufficient, techniques (f) and
(g) also exhibited excellent performance. Thus, the selection of technique accord-
ing to the given experimental condition (termination condition) was clearly
important.
5 Conclusion
References

Chapter 10
Study of Pitchfork Bifurcation in Discrete Hopfield
Neural Network
1 Introduction
The purpose of this paper is to present some results on the analysis of the dynamics
of a discrete recurrent neural network. The particular network in which we are
interested is the Hopfield network, also known as a Discrete Hopfield Neural
Network in [1]. Its state evolution equation is
x_i(k+1) = \sum_{n=1}^{N} w_{in} f(x_n(k)) + \sum_{m=1}^{M} w'_{im} u_m(k) + w''_i    (1)
where
x_i(k) is the ith neuron output,
u_m(k) is the mth input of the network,
w_{in}, w'_{im} are the weight factors of the neuron outputs and of the network inputs, respectively,
w''_i is a bias weight, and
N is the number of neurons.
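As a small illustrative sketch of iterating the state Eq. (1), with arbitrary placeholder weights and inputs (none of the values below come from the paper):

import numpy as np

# Sketch: one-step state update of Eq. (1) for N neurons and M inputs,
# x(k+1) = W f(x(k)) + W' u(k) + w'', with f = tanh. Values are placeholders.

def step(x, u, W, W_in, bias, f=np.tanh):
    return W @ f(x) + W_in @ u + bias

N, M = 2, 1
W = np.array([[0.5, 0.1], [0.2, 0.4]])   # neuron-output weights w_in
W_in = np.array([[0.3], [0.1]])          # input weights w'_im
bias = np.zeros(N)                       # bias weights w''_i
x = np.zeros(N)
u = np.array([1.0])

for k in range(5):
    x = step(x, u, W, W_in, bias)
print(x)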
R. Marichal (*)
System Engineering and Control and Computer Architecture Department, University of La
Laguna, Avda. Francisco Sánchez S/N, Edf. Informatica, 38206 Tenerife, Canary Islands, Spain
e-mail: [email protected]
X_h = W X_{WZ}
where
X_h are the Hopfield states,
X_{WZ} are the Williams-Zipser states, and
W is the weight matrix without the bias and input weight factors.
We will consider the Williams-Zipser model in order to simplify the mathematical calculations.
The neural network presents different classes of equivalent dynamics. A system
will be equivalent to another if its trajectories exhibit the same qualitative behavior.
This is made mathematically precise in the definition of topological equivalence [2].
The simplest trajectories are those that are equilibrium or fixed points that do not
change in time. Their character or stability is given by the local behavior of nearby
trajectories. A fixed point can attract (sink), repel (source) or have directions of
attraction and repulsion (saddle) of close trajectories [3]. Next in complexity are
periodic trajectories, quasi-periodic trajectories or even chaotic sets, each with its
own stability characterization. All of these features are similar in a class of
topologically equivalent systems. When a system parameter is varied, the system
can reach a critical point at which it is no longer equivalent. This is called a
bifurcation, and the system will exhibit new behavior. The study of how these
changes can be carried out will be another powerful tool in the analysis.
With respect to discrete recurrent neural networks as systems, several results on
their dynamics are available in the literature. The most general result is derived
using the Lyapunov stability theorem in [4], and establishes that for a symmetric
weight matrix, there are only fixed points and period two limit cycles, such as
stable equilibrium states. It also gives the conditions under which only fixed-point
attractors exist. More recently, Cao [5] proposed other, less restrictive and more complex, conditions. In [6], chaos is found even in a simple two-neuron network with a specific weight configuration, by demonstrating its equivalence with a one-dimensional chaotic system (the logistic map). In [7], the same author describes
another interesting type of trajectory, the quasi-periodic orbits. These are closed
orbits with irrational periods that appear in complex phenomena, such as frequency-
locking and synchronization, which are typical of biological networks. In the same
paper, conditions for the stability of these orbits are given. These can be simplified,
as we shall show below.
Pasemann [8] obtains some experimental results, such as the coexistence of periodic, quasi-periodic and chaotic attractors. Additionally, [9] gives the position,
number and stability types of fixed points of a two-neuron discrete recurrent
network with nonzero weights.
There are also works that analyze continuous Hopfield neural networks [10, 11], such as [12–15]; these papers show the stability of Hopf bifurcations in systems with two delays.
Firstly, we analyze the number and stability-type characterization of the fixed
points. We then continue with an analysis of the Pitchfork bifurcation. Finally, the
simulations are shown and conclusions are given.
For the sake of simplicity, we studied the two-neuron network. This allows for an
easy visualization of the problem. In this model, we considered zero inputs so as to
isolate the dynamics from the input action. Secondly, and without loss of generality
with respect to dynamics, we used zero bias weights. The activation function is the
hyperbolic tangent.
With these conditions, the network mapping function is
x_1(k+1) = \tanh(w_{11} x_1(k) + w_{12} x_2(k))
x_2(k+1) = \tanh(w_{21} x_1(k) + w_{22} x_2(k))
The point (0, 0) is always a fixed point for every value of the weights. The number of fixed points is odd because, for every fixed point (x_{1,p}, x_{2,p}), the point (−x_{1,p}, −x_{2,p}) is also a fixed point.
To graphically determine the configuration of fixed points, we redefine the above
equations as
Depending on the diagonal weights, there are two qualitative behavior functions.
We are going to determine the number of fixed points using the graphical represen-
tation of the above Eq. (4). First, we can show that the graph of the F function has a
maximum and a minimum if wii > 1 or, if the opposite condition holds, is like the
hyperbolic arctangent function (Fig. 1).
Fig. 1 The two possible behaviors of the F function. The left figure corresponds to the respective
diagonal weight lower than unity. The right shows the opposite condition
The combination of these two possibilities with another condition on the ratio of
slopes at the origin of the two curves (4) gives the number of fixed points. The latter
condition can be expressed as
|W| = w_{11} + w_{22} − 1
In the process below, a two-neuron neural network is considered. It is usual for the activation function to be a sigmoid or a hyperbolic tangent function.
Considering the fixed point Eq. (3), the elements of the Jacobian matrix at a fixed point (x_1, x_2) are
J = \begin{pmatrix} w_{11} f'(x_1) & w_{12} f'(x_1) \\ w_{21} f'(x_2) & w_{22} f'(x_2) \end{pmatrix}
where w_{11}, w_{22} and |W| are the diagonal elements and the determinant of the weight matrix, respectively.
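The following short numerical check of this Jacobian is a sketch only: it assumes the Williams-Zipser form with tanh activation, so that f'(x_i) = 1 − x_i^2 at a fixed point (consistent with the expressions used later in this chapter), and uses arbitrary example weights.

import numpy as np

# Sketch: Jacobian of the two-neuron map at a fixed point (x1, x2), using
# f'(xi) = 1 - xi**2 for the tanh activation; the eigenvalues classify the point
# as a sink (|lambda| < 1 for both) or a source/saddle otherwise.

def jacobian(W, x):
    fp = 1.0 - x**2                       # f'(x1), f'(x2)
    return np.array([[W[0, 0] * fp[0], W[0, 1] * fp[0]],
                     [W[1, 0] * fp[1], W[1, 1] * fp[1]]])

W = np.array([[0.5, 0.2], [0.2, 0.5]])    # arbitrary example weights
x_fixed = np.array([0.0, 0.0])            # the origin is always a fixed point
eigvals = np.linalg.eigvals(jacobian(W, x_fixed))
print(eigvals, np.all(np.abs(eigvals) < 1))   # sink if all |eigenvalues| < 1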
We can define new variables σ_1 and σ_2, with
σ_2 = |W| f'(x_1) f'(x_2)
Fig. 2 The stability regions (sink and source zones in the (σ_1, σ_2) plane) and the Pitchfork bifurcation line |W| = w_{11} + w_{22} − 1 at the fixed point (0, 0)
c(0) = \frac{1}{6} \langle p, C(q, q, q) \rangle    (7)
where C denotes the third-derivative terms of the Taylor expansion, \langle \cdot , \cdot \rangle represents the scalar product, and q, p are eigenvectors of the Jacobian matrix and of its transpose, respectively. These vectors satisfy the normalization condition
\langle p, q \rangle = 1
The above coefficients are evaluated for the critical parameter of the system
where the bifurcation takes place. The c(0) sign determines the bifurcation direc-
tion. When c(0) is negative, a stable fixed point becomes an unstable fixed point and
two additional stable symmetrical fixed points appear. In the opposite case, c(0)
positive, an unstable fixed point becomes a stable fixed point and two additional
unstable symmetrical fixed points appear.
In the neural network mapping, p and q are
q = \left( \frac{d\,e}{e + d\,w_{21} X_{2,0}},\; 1 \right)    (8)
p = \left( \frac{e}{w_{12} X_{1,0}},\; 1 \right)    (9)
where
d = w_{11} X_{1,0} − 1
e = w_{22} X_{2,0} − 1
X_{1,0} = 1 − x_{1,0}^2
X_{2,0} = 1 − x_{2,0}^2
and x_{1,0} and x_{2,0} are the coordinates of the fixed point where the bifurcation appears.
C_i(q, q, q) = \sum_{j,k,l=1}^{2} \frac{\partial^3 f_i}{\partial x_j \partial x_k \partial x_l}\, q_j q_k q_l
= f'''(0) \sum_{j,k,l=1}^{2} \delta_{jk}\,\delta_{jl}\, w_{ij} w_{ik} w_{il}\, q_j q_k q_l
= f'''(0) \sum_{j=1}^{2} w_{ij}^3\, q_j^3    (10)
\frac{\partial^3 f_i}{\partial x_j \partial x_k \partial x_l} = 2\,(1 − x_i^2)(3x_i^2 − 1)\, w_{ij} w_{ik} w_{il}
C_i(a, b, c) = 2 \sum_{j,k,l=1}^{2} (1 − x_i^2)(3x_i^2 − 1)\, w_{ij} w_{ik} w_{il}\, a_j b_k c_l.
Taking into account the previous equations and the q eigenvector of Eq. (8),
C(q, q, q) = 2 \begin{bmatrix} \dfrac{2 X_{1,0} (3x_{1,0}^2 − 1)\, w_{12}^3}{d^3} \\[6pt] \dfrac{1 − 3x_{2,0}^2}{X_{2,0}^2} \end{bmatrix}    (11)
It can be shown from Eq. (3) that zero is always a fixed point. Replacing the expressions for q, p and C(q,q,q) given by Eqs. (8), (9) and (11), respectively, and evaluating them at the zero fixed point yields the c(0) coefficient. Consider now the particular case in which the bifurcation condition
|W| = w_{11} + w_{22} − 1
holds and, in addition,
w_{12} w_{21} = 0.
In this particular case, the eigenvalues match the diagonal elements of the weight matrix,
\lambda_1 = w_{11}, \qquad \lambda_2 = w_{22},
the eigenvectors are
q = \{1,\; 0\}
p = \left\{1,\; -\frac{w_{12}}{w_{22} − 1}\right\}
and the normal form coefficient becomes
c(0) = \frac{1}{6} \langle p, C(q, q, q) \rangle = -\frac{1}{3} w_{11}^3 = -\frac{1}{3}.
Therefore, in this particular case, the coefficient of the normal form c (0) is
negative, a stable fixed point becomes a saddle fixed point and two additional stable
symmetrical fixed points appear.
5 Simulations
In order to examine the results obtained, a simulation was carried out; it shows the Pitchfork bifurcation (Fig. 3). The Pitchfork bifurcation is produced by varying the diagonal weight-matrix element w_{11}. Figure 3a shows the dynamic configuration before the bifurcation is produced, with only one stable fixed point. Subsequently, when the bifurcation is produced (Fig. 3b), two additional stable fixed points appear and the zero fixed point changes its stability from stable to unstable (the normal form coefficient c is negative).
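A minimal numerical sketch of this scenario is given below. It uses the particular case analysed in the previous section (w12·w21 = 0, so the critical eigenvalue is w11) together with the assumed tanh map; the weight values are illustrative, not those of Fig. 3.

import numpy as np

# Sketch: pitchfork in the decoupled case w12*w21 = 0. For w11 < 1 the origin
# is the only stable fixed point of x1 -> tanh(w11*x1); for w11 > 1 the origin
# loses stability and two symmetric stable fixed points appear.

def attractor(w11, x0=0.3, iters=500):
    x = x0
    for _ in range(iters):
        x = np.tanh(w11 * x)
    return x

for w11 in (0.9, 1.1):
    print(w11, attractor(w11, +0.3), attractor(w11, -0.3))
# w11 = 0.9: both initial conditions converge to ~0.0
# w11 = 1.1: they converge to two symmetric nonzero fixed points (about +/- 0.5)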
Fig. 3 The dynamic behavior when the Pitchfork bifurcation is produced. + and Δ are the saddle and source fixed points, respectively. (a) w11 = 0.9, w12 = 0.1, w21 = 1 and w22 = 0.5; (b) w11 = 1.1, w12 = 0.1, w21 = 1 and w22 = 0.5
6 Conclusion
The results obtained for the two-neuron network may carry over to larger discrete Hopfield neural networks. There is also the possibility of generalizing some of these results to higher dimensions and of using them to design training algorithms that avoid the problems associated with the learning process.
References
1. D.R. Hush, B.G. Horne, Progress in supervised neural networks. IEEE Signal Process. Mag. 8–39
(1993)
2. Y.A. Kuznetsov, Elements of Applied Bifurcation Theory, 3rd edn. (Springer-Verlag,
New York, 2004)
3. K.T. Alligood, T.D. Sauer, J.A. Yorke, Chaos: An Introduction to Dynamical Systems
(Springer Verlag, New York, 1998)
4. C.M. Marcus, R.M. Westervelt, Dynamics of iterated-map neural networks. Phys. Rev. A 40,
501–504 (1989)
5. J. Cao, On stability of delayed cellular neural networks. Phys. Lett. A 261(5–6), 303–308
(1999)
6. X. Wang, Period-doublings to chaos in a simple neural network: an analytical proof. Complex
Syst. 5, 425–441 (1991)
7. X. Wang, Discrete-time dynamics of coupled quasi-periodic and chaotic neural network
oscillators. International Joint Conference on Neural Networks (1992)
8. F. Pasemann, Complex dynamics and the structure of small neural networks. Netw. Comput.
Neural Syst. 13(2), 195–216 (2002)
9. Tino et al., Attractive periodic sets in discrete-time recurrent networks (with emphasis on
fixed-point stability and bifurcations in two-neuron networks). Neural Comput. 13(6),
1379–1414 (2001)
10. J. Hopfield, Neurons with graded response have collective computational properties like those
of two-state neurons. Procs. Nat. Acad. Sci. USA 81, 3088–3092 (1984)
11. D.W. Tank, J.J. Hopfield, Neural computation by concentrating information in time, Proc.
Nat. Acad. Sci. USA, 84, 1896–1991 (1984)
12. X. Liao, K. Wong, Z. Wu, Bifurcation analysis on a two-neuron system with distributed
delays. Physica D 149, 123–141 (2001)
13. S. Guo, X. Tang, L. Huang, Hopf bifurcating periodic orbits in a ring of neurons with delays,
Physica D, 183, 19–44 (2003)
14. S. Guo, X. Tang, L. Huang, Stability and bifurcation in a discrete system of two neurons with
delays. Nonlinear Anal. Real World. doi:10.1016/j.nonrwa.2007.03.002 (2007)
15. S. Guo, X. Tang, L. Huang, Bifurcation analysis in a discrete-time single-directional network
with delays. Neurocomputing 71, 1422–1435 (2008)
16. R. Marichal et al., Bifurcation analysis on hopfield discrete neural networks. WSEAS Trans.
Syst. 5, 119–124 (2006)
Chapter 11
Grammatical Evolution and STE Criterion
Statistical Properties of STE Objective Function
1 Introduction
R. Matousek (*)
Dept. of Mathematics and Applied Computer Science, Brno University of Technology, FME,
Technicka 2896/2, 61600 Brno, Czech Republic
e-mail: [email protected]
Table 1 Grammar G which was used with respect to the approximation tasks
Approximation task                Grammar and example of generating function
Trigonometric (data: ET20x50)     G = {+, −, *, /, sin, cos, variables, constants}
                                  notes: unsigned integer constants ∈ [0, 15], real constants ∈ [0, 1]
                                  y = sin(x) + sin(2.5x)
Polynomial (data: ET10x50)        G = {+, −, *, /, variables, constants}
                                  notes: unsigned integer constants ∈ [0, 15], real constants ∈ [0, 1]
                                  y = 3x^4 − 3x + 1
where ε is the radius of the epsilon tube, i indexes the control points, and e_ε is an objective function determining whether the approximated value lies within the given epsilon tube. The workings of this criterion may be seen as a tube of diameter epsilon that stretches along the entire approximation domain (Fig. 2). The axis of such a tube is determined by the points corresponding to the approximated values; such points may be called control points.
The actual evaluating function then checks whether an approximation point lies within or outside the tube (2). The epsilon value changes dynamically during the optimization process, being set to its highest value at the beginning and reduced whenever the adaptation condition is met.
The variable parameters of the STE method are listed in Table 2. At the beginning of the evolution process, the algorithm sets the diameter of the epsilon tube. If condition (3), which specifies the minimum number of points that need to be contained in the epsilon tube, is met, the epsilon tube is adapted, that is, the current epsilon value is reduced. This value can be reduced either using an interest-rate model, with the current epsilon value reduced by a given percentage of the previous value, or using a linear model, with the current epsilon value reduced by a constant amount.
The condition for the adaptation of the epsilon tube to be finished may either
be chosen as a limit epsilon value (the steEpsStop parameter) or be given by the
number of adaptive changes (the steIteration parameter).
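Since formula (2) and condition (3) are not reproduced in this text, the sketch below implements the STE count and the tube adaptation in their most direct reading: a control point inside the tube contributes one to the count (consistent with Table 3), and the tube shrinks by the interest-rate model while enough points remain inside. The parameter names and values are assumptions.

# Sketch: Sum of epsilon-Tube Error and adaptive shrinking of the tube.
# e_eps(r) = 1 if the residuum lies inside the tube, else 0; the STE value is
# then the number of control points inside the tube (assumed reading).

def ste(residuals, eps):
    return sum(1 for r in residuals if abs(r) <= eps)

def adapt_epsilon(residuals, eps, min_inside, rate=0.8, eps_stop=0.01):
    # shrink the tube (interest-rate model) while enough points stay inside
    while eps > eps_stop and ste(residuals, eps) >= min_inside:
        eps *= rate
    return eps

residuals = [0.05, -0.2, 0.4, -0.6, 1.5]     # hypothetical approximation errors
print(ste(residuals, 0.5))                    # 3 points inside a tube of radius 0.5
print(adapt_epsilon(residuals, eps=2.0, min_inside=3))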
A random variable has a Laplace(m, b) distribution if its probability density function is (4). Here, m is a location parameter and b > 0 is a scale parameter. If m = 0 and b = 1, the positive half-line is exactly an exponential distribution scaled by 1/2.
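Formula (4) is not reproduced in this text; the standard Laplace density it refers to is

f(x \mid m, b) = \frac{1}{2b} \exp\!\left(-\frac{|x - m|}{b}\right), \qquad b > 0.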
An intuitive approach to approximation shows better results for the STE method (Fig. 1), mainly because there are no big differences between the approximating functions (Figs. 3 and 4). The reason for this is obvious: very distant points influence the result of approximation a great deal for SSE while, for STE, the weight of such points is relatively insignificant.
Next, the main advantages and disadvantages of the minimization methods using the SSE and STE criteria are summarized.
Fig. 1 Comparison of SSE and STE principles. Common principle of residual sum of squares
(above). The principle of the Sum of epsilon-Tube Error (below)
Fig. 2 Histogram of the standardized residua (from the experiment ET10x50) and the probability density function of the Laplace distribution
Fig. 3 The final approximations (symbolic regression) of the polynomial data (see Table 1). The STE method was used (left) and the SSE method (right)
Fig. 4 The final approximations (symbolic regression) of the trigonometric data (Table 1). The STE method was used (left) and the SSE method (right)
+ a metric is defined.
+ a N(μ = 0, σ²) distribution of the residual errors may be assumed, with descriptive statistics for this distribution being available.
− more time-consuming.
− being used excessively, this method hardly provides an incentive for users to gain new insights [5].
This part shows how the results obtained by an SSE-based minimization method may be used to derive a result corresponding to the values of an STE-based minimization criterion. In other words, a procedure will be shown for using SSE to obtain a corresponding STE. The procedure is based on the following:
l It is assumed that, when applying the least-squares method, the residua are normally distributed with N(μ = 0, σ²). Here, the standard deviation may be arbitrary, but a reduction to the standardized normal distribution N(0, 1) is possible.
l In order to reach a sufficient statistical significance, the experiment simulated 10,000 instances of a solution to an approximation problem (nRUN). This problem was discretized using 100 control points.
l Different sizes eps = {2, 1, 0.5, 0.25, 0.125} of epsilon tubes were chosen to simulate possible adaptations of the epsilon tube by (3).
l The frequencies of the values lying within the given epsilon tubes were calculated.
This simulation and the subsequent statistical analysis were implemented using
the Minitab software package. Table 3 provides partial information on the proce-
dure used.
Here r denotes the residua (r ∼ N(0, 1)), i is the control point index (i ∈ [1, 100] in our particular case), and nRUN is the index of the given run (nRUN ∈ [1, 10,000] in our particular case).
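The Minitab-based simulation summarized in Table 3 can be reproduced in a few lines; the following NumPy sketch is only an illustration of the procedure, with hypothetical variable names:

import numpy as np

rng = np.random.default_rng(1)
n_points, n_runs = 100, 10_000
eps_values = (2.0, 1.0, 0.5, 0.25, 0.125)

# Standardized normal residua for every control point of every run (cf. Table 3).
residua = rng.standard_normal((n_runs, n_points))

# Per run: number of residua lying inside each epsilon tube.
ste_counts = {eps: (np.abs(residua) <= eps).sum(axis=1) for eps in eps_values}

for eps, counts in ste_counts.items():
    print(f"eps = {eps}: mean STE over runs = {counts.mean():.1f}")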
Next, for each residuum value, the number of cases in which the residuum lies in the interval given by a particular epsilon tube will be determined. This evaluation criterion, denoted STE (Sum of epsilon-Tube Error), is defined by (2) and, for the particular test values, corresponds to:

STE_ε = Σ_{i=1}^{100} e_ε(r_i),   ε = {2, 1, 0.5, 0.25, 0.125}   (5)
Table 3 A fragment of the table of simulated residua values with standardized normal distribution and the corresponding e_ε(r_i) values

r         i     nRUN     ε = 2   ε = 1   ε = 0.5   ε = 0.25   ε = 0.125
0.48593   1     1        1       1       1         0          0
0.07727   2     1        1       1       1         1          1
1.84247   3     1        1       0       0         0          0
1.29301   4     1        1       0       0         0          0
0.34326   5     1        1       1       1         0          0
...       ...   ...      ...     ...     ...       ...        ...
0.02298   100   10,000   1       1       1         1          1
[Fig. 5: distribution plot of the normal density with mean 0 and standard deviation 1 over X; the probability of the interval −2 ≤ X ≤ 2 is 0.954]
K ∼ Bi(n, p_ε),   (6)

where n is the number of control points and p_ε is the probability that a single residuum lies within the ε-tube.
For the epsilons chosen, the STE characteristic has a binomial distribution given by (6), where the probabilities p_ε are calculated from Table 4. For the case of ε = 2, Fig. 5 shows the probability calculation.
Figure 6 shows the probability functions for the epsilon tubes, with the value k indicating, with a given probability, the number of control points for which the e(r) function takes on a value of one, that is, the number of control points contained in a given epsilon tube.
The correctness of the calculation was verified empirically. The STE values simulated (10,000 runs) were compared with distribution (6). The following figure (Fig. 7) displays an empirical probability density function obtained from Table 3 by (5), compared with the probability density function calculated by (6) for ε = 1. It can be seen in Fig. 7 that the empirical distribution is close to the calculated one, which was verified using a goodness-of-fit test.
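Under the normality assumption, p_ε in (6) is the probability that a standardized residuum lies inside the tube, p_ε = 2Φ(ε) − 1, which reproduces the values 0.9545, 0.6827, 0.3829, 0.1974 and 0.0995 used in Fig. 6. A short SciPy sketch of this check (illustrative only, not the original Minitab analysis):

from scipy import stats

def ste_binomial_pmf(k, n_points, eps):
    # Probability that exactly k of n_points independent N(0, 1) residua
    # lie inside the epsilon tube, cf. (6).
    p_eps = 2.0 * stats.norm.cdf(eps) - 1.0   # 0.6827 for eps = 1, 0.9545 for eps = 2
    return stats.binom.pmf(k, n_points, p_eps)

# Example: probability mass at k = 68 contained points for eps = 1 and 100 control points.
print(ste_binomial_pmf(68, 100, 1.0))

The empirical STE frequencies obtained from the simulation above can be plotted against this probability mass function, which is essentially the comparison shown in Fig. 7.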
Fig. 6 STE probability density functions p(k) for ε-tubes with ε = {2, 1, 0.5, 0.25, 0.125}, i.e. the binomial distributions Bi(100, 0.9545), Bi(100, 0.6827), Bi(100, 0.3829), Bi(100, 0.1974) and Bi(100, 0.0995)
Fig. 7 Comparison of the empirical and the calculated probability density functions of occurrences within the epsilon tube for ε = 1
The analysed residua (1,000 values ranging from −5.12124 to 5.11919) come from the goniometric (trigonometric) problem described in Table 1. The fitted distribution is a Laplace distribution with mean 0.00198 and scale 1.55075. This analysis shows the results of fitting a Laplace distribution to the ET10x50 data (Tables 5 and 6). Whether the Laplace distribution fits the data adequately can be tested by selecting Goodness-of-Fit Tests from the list of Tabular Options. The visual fit of the Laplace distribution is shown in Fig. 2.
This pane shows the results of tests run to determine whether ET10x50 can be
adequately modeled by a Laplace distribution. The chi-squared test divides the
range of ET10x50 into nonoverlapping intervals and compares the number of
observations in each class to the number expected based on the fitted distribution.
The Kolmogorov-Smirnov test computes the maximum distance between the
cumulative distribution of ET10x50 and the CDF of the fitted Laplace distribution.
In this case, the maximum distance is 0.0540167. The other statistics compare the
empirical distribution function to the fitted CDF in different ways.
Since the smallest P-value amongst the tests performed is greater than or equal to 0.05, we cannot reject the hypothesis that ET10x50 comes from a Laplace distribution at the 95% confidence level. An analogous procedure can be used for the ET20x50 and other data sets.
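An analogous goodness-of-fit check can be sketched outside Minitab/Statgraphics, for example with SciPy; the snippet below only illustrates the procedure and is not a reproduction of the reported numbers:

from scipy import stats

def laplace_goodness_of_fit(residua):
    # Fit location and scale (the reported values were 0.00198 and 1.55075)
    # and compare the empirical distribution with the fitted Laplace CDF
    # using the Kolmogorov-Smirnov statistic.
    loc, scale = stats.laplace.fit(residua)
    ks_stat, p_value = stats.kstest(residua, "laplace", args=(loc, scale))
    return loc, scale, ks_stat, p_value

# The Laplace hypothesis is not rejected at the 5% level if p_value >= 0.05.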
The fundamental difference between the STE and SSE optimality criteria is the metric used in Eqs. (1) and (2). The SSE, contrary to the STE, defines a Euclidean metric (in this case the triangle inequality holds). Hence, an exact relation from the STE to the SSE optimality criterion does not exist. However, if a probabilistic relation is used, a relationship for the transformation of the Laplace distribution into the normal distribution can be obtained. In other words, an STE solution can be transformed into an SSE solution from a probabilistic point of view.
We suppose (from Section 4 and on the basis of the chi-square goodness-of-fit test) that the residua have a Laplace distribution in the case of the STE criterion. The Laplace distribution is a typical distribution for some numerical methods. It is also called the double exponential distribution: it is the distribution of the difference between two independent variates with identical exponential distributions. The probability density is given by (7)
P(x) = (1/(2b)) e^(−|x − m|/b)   (7)
and the moments about the mean μ_n are related to the moments about 0 by (8)

μ_n = Σ_{j=0}^{n} Σ_{k=0}^{⌊j/2⌋} C(n, j) C(j, 2k) (−1)^(n−j) b^(2k) m^(n−2k) Γ(2k + 1) = { n! b^n for n even; 0 for n odd }   (8)
Commonly used moments are given in Table 7. Because the Mean Square Error (MSE), given by MSE = SSE/n, is an estimate of the variance σ², the Sum of Square Errors can be derived from Table 7 and the given parameter b by Eq. (9).
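Since Table 7 is not reproduced here, the step can be illustrated with the second central moment of the Laplace distribution, μ₂ = 2b² (the n = 2 case of (8)); equating MSE = SSE/n with this variance gives the following sketch of the transformation, which should be read as an illustration of the idea rather than as the exact form of (9):

def sse_from_laplace_scale(b, n):
    # Second central moment of the Laplace distribution: mu_2 = 2 * b**2
    # (the n = 2 case of (8)). Taking MSE = SSE / n as an estimate of this
    # variance gives an SSE value consistent with the fitted scale b.
    mse = 2.0 * b ** 2
    return n * mse

# Example with the fitted scale from the analysis above:
# sse_from_laplace_scale(1.55075, 1000)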
7 Conclusion
l A new STE evaluation criterion for GE was introduced using epsilon tubes.
Based on the experiments conducted, this criterion seems to be more appropriate
for the symbolic regression problem as compared with the SSE method.
l Using statistical tools the method of transforming the results obtained by SSE
minimisation into results obtained by STE has been shown.
l Using a particular class implemented for approximations obtained by an STE minimization, the empirically obtained residua were found to be Laplace distributed. This was confirmed by statistical goodness-of-fit tests.
l The properties of such residua and the transformation of the STE-based minimization criterion into an SSE-based one were shown by means of a probabilistic relationship.
Acknowledgement This work was supported by the Czech Ministry of Education in the frame
of MSM 0021630529 and by the GACR No.: 102/091668.
References

Chapter 12
Data Quality in ANFIS Based Soft Sensors
Abstract Soft sensors are used to infer critical process variables that are otherwise difficult, if not impossible, to measure in a broad range of engineering fields. The Adaptive Neuro-Fuzzy Inference System (ANFIS) has been employed to develop successful ANFIS based inferential models that represent the dynamics of the targeted system. In addition to the structure of the model, the quality of the training as well as of the testing data also plays a crucial role in determining the performance of the soft sensor. This paper investigates the impact of data quality on the performance of an ANFIS based inferential model that is designed to estimate the average air temperature in distributed heating systems. The results of two experiments are reported. They show that the performance of ANFIS based sensor models is sensitive to the quality of data. The paper also discusses how to reduce this sensitivity by an improved mathematical algorithm.
1 Introduction
A soft sensor computes the value of the process variables, which are difficult to
directly measure using conventional sensors, based on the measurement of other
relevant variables [1]. The performance of soft sensors is sensitive to the inferential
model that represents the dynamics of the targeted system. Such inferential models
can be based on one of the following modeling techniques:
l Physical model
l Neural network
l Fuzzy logic
l Adaptive Neuro-Fuzzy Inference System
S. Jassar (*)
Ryerson University, 350 Victoria Street, Toronto M5B2K3, Canada
e-mail: [email protected]
Grid partition divides the data space into rectangular sub-spaces using axis-parallel partitions based on a pre-defined number of MFs and their types in each dimension. The wider application of grid partition in FIS generation is blocked by the curse of dimensionality: the number of fuzzy rules increases exponentially with the number of input variables. For example, if there are m MFs for each input variable and a total of n input variables, the total number of fuzzy rules is m^n. According to Jang, grid partition is therefore only suitable for cases with a small number of input variables (e.g. fewer than 6). In this research, the average air temperature estimation problem has three input variables, so it is reasonable to apply grid partition to generate the FIS structure, ANFIS-GRID.
Gaussian type MFs are used for characterizing the premise variables. Each input has 4 MFs, thus there are 64 rules. The developed structure is trained using the hybrid learning algorithm [5]. The parameters associated with the MFs change during the training process.
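The exponential growth of the rule base can be made concrete with a short, purely illustrative sketch that enumerates the rule premises of a grid-partition FIS (it does not use the ANFIS implementation of this work):

from itertools import product

def grid_partition_rules(n_inputs, n_mfs):
    # Each rule premise picks one membership function per input,
    # so a full grid partition produces m**n rules.
    return list(product(range(n_mfs), repeat=n_inputs))

print(len(grid_partition_rules(n_inputs=3, n_mfs=4)))   # 64 rules, as in ANFIS-GRID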
Experimental data obtained from a laboratory heating system is used for training
and testing of the developed model [7]. The laboratory heating system is located in
Milan, Italy. The details of experimental data collection for the four variables, Qin,
Qsol, T0 and Tavg, are given by the authors [2, 6]. The dataset used for the training of
ANFIS-GRID has 1800 input-output data pairs and is shown in Fig. 1.
The experimental data used for checking the performance of the developed model is shown in Fig. 2. The testing dataset has 7,132 data pairs, which is large compared to the training dataset used for the development of the model.
[Fig. 1: training dataset, Qin (W), Qsol (W), T0 (Deg C) and Tavg (Deg C) plotted against time (hour of the year)]
[Fig. 2: testing dataset, Qin (W), Qsol (W), T0 (Deg C) and Tavg (Deg C) plotted against time (hour of the year)]
Data errors may affect the accuracy of the ANFIS based models in two ways. First, the data used to build and train the model may contain errors. Second, even if the training data are free of errors, once the developed model is used for estimation tasks a user may feed input data containing errors to the model.
The research in this area has assumed that data used to train the models and data
input to make estimation of the processes are free of errors. In this study we relax
this assumption by asking two questions: (1) What is the effect of errors in the test
data on the estimation accuracy of the ANFIS based models? (2) What is the effect
of errors in the training data on the predictive accuracy of the ANFIS based models?
While many sources of error in a dataset are possible, we assume that the underlying cause of errors affects data items randomly rather than systematically. One source of inaccuracy that may affect a dataset in this way is measurement error caused by reading the equipment. This type of error may affect any data item
in the dataset and may understate or overstate the actual data value. This study does
not address the effect of systematic data errors on the estimations made by the
ANFIS based models.
Two experiments are conducted to examine the research targets. Both the
experiments used the same application (estimation of average air temperature)
and the same dataset.
Experiment 1 examines the first question: What is the effect of errors in the test
data on the estimation ability of the ANFIS based models? Experiment 2 examines
the second question: How do errors in the training data affect the accuracy of the ANFIS based models?
There are two factors in each experiment: (1) fraction-error and (2) amount-error.
Fraction-error is the percent of the data items in the appropriate part of the dataset
(the test data in experiment 1 and the training data in experiment 2) that are
perturbed. Amount-error is the percentage by which the data items identified in
the fraction-error factor are perturbed.
3.2.1 Fraction-Error
Since fraction-error is defined as a percent of the data items in a dataset, the number
of data items that are changed for a given level of fraction-error is determined by
multiplying the fraction-error by the total number of data items in the dataset.
Experiment 1: The test data used in experiment 1, shown in Fig. 2, has 4 data
items (1 value for each of the 4 input and output variables for 1 entry of the total
7,132 data pairs). This experiment examines all of the possible number of data
items that could be perturbed. The four levels of the fraction-error factor are: 25% (one data item perturbed), 50% (two data items perturbed), 75% (three data items perturbed), and 100% (four data items perturbed).
Experiment 2: The training data used in experiment 2 contains 1,800 data pairs
(1 value for each of the 4 input and output variables for 1,800 entries). Four levels of
the fraction-error factor are tested: 5% (90 data items are perturbed), 10% (180 data
items are perturbed), 15% (270 data items are perturbed), and 20% (360 data items
are perturbed).
3.2.2 Amount-Error
For both the experiments, the amount-error factor has two levels: (1) plus or minus
5% and (2) plus or minus 10%. The amount-error applied to the dataset can be
represented by the following set of equations:
y′ = y ± 0.05 y   (1)

y′ = y ± 0.1 y   (2)
For Eqs. (1) and (2), y′ is the value of the variable after adding or subtracting the noise error to the unmodified variable y.
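The combined effect of the two factors can be sketched as a perturbation routine; the function below is only an illustration of (1) and (2) together with a random selection of data items, and the names fraction_error and amount_error are hypothetical:

import numpy as np

def perturb(data, fraction_error, amount_error, rng=None):
    # data: 2-D array with one row per data pair and one column per variable.
    # fraction_error: share of the data items to perturb (e.g. 0.05 ... 0.20).
    # amount_error: relative error applied with a random sign, cf. (1) and (2).
    rng = np.random.default_rng() if rng is None else rng
    data = np.array(data, dtype=float)
    n_perturb = int(round(fraction_error * data.size))
    flat_idx = rng.choice(data.size, size=n_perturb, replace=False)
    rows, cols = np.unravel_index(flat_idx, data.shape)
    signs = rng.choice((-1.0, 1.0), size=n_perturb)
    data[rows, cols] *= 1.0 + signs * amount_error        # y' = y +/- amount_error * y
    return data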
The experimental design is shown in Table 1. Both the experiments have four levels
for the fraction-error factor and two levels for the amount-error. For each combina-
tion of fraction-error and amount-error, four runs with random combinations of the input and output variables are performed.
Although the levels of the fraction-error are different in the two experiments, the
sampling procedure is the same. For each fraction-error level, the variables are
randomly selected to be perturbed. This is repeated a total of four times per level.
Table 2 shows the combinations of the variables for experiment 1.
Second, for each level of the amount-error factor, each variable is randomly
assigned either a positive or negative sign to indicate the appropriate amount-error
to be applied. Table 3 shows the randomly assigned amount-error levels in Experiment 1.
Table 2 Four combinations of the variables for each fraction-error level in Experiment 1

Fraction-error level (%)   Combination 1            Combination 2           Combination 3           Combination 4
25                         (Qin)                    (Qsol)                  (T0)                    (Tavg)
50                         (Qin, T0)                (Qin, Tavg)             (Qsol, T0)              (Tavg, T0)
75                         (Qin, T0, Qsol)          (Qin, Tavg, T0)         (Qin, Tavg, Qsol)       (T0, Tavg, Qsol)
100                        (Qin, T0, Qsol, Tavg)    (Qin, T0, Qsol, Tavg)   (Qin, T0, Qsol, Tavg)   (Qin, T0, Qsol, Tavg)
Table 3 Randomly assigned percentage increase (+) or decrease (−) for a given amount-error level in Experiment 1

Fraction-error level (%)   Combination 1            Combination 2           Combination 3           Combination 4
25                         (Qin)                    (Qsol)                  (T0)                    (Tavg)
                           +                        +
50                         (Qin, T0)                (Qin, Tavg)             (Qsol, T0)              (Tavg, T0)
                           +, −                     −, −                    +, +                    −, +
75                         (Qin, T0, Qsol)          (Qin, Tavg, T0)         (Qin, Tavg, Qsol)       (T0, Tavg, Qsol)
                           +, −, −                  −, +, −                 +, +, −                 +, −, +
100                        (Qin, T0, Qsol, Tavg)    (Qin, T0, Qsol, Tavg)   (Qin, T0, Qsol, Tavg)   (Qin, T0, Qsol, Tavg)
                           −, +, −, +               +, +, +, +              −, −, +, +              −, −, −, +
For both the experiments, the measured average air temperature values and ANFIS-
GRID estimated average air temperature values are compared using Root Mean
Square Error (RMSE) as a measure of estimation accuracy.
Estimation accuracy results, using the simulated inaccuracies for amount-error and fraction-error for the average air temperature estimation, are given in Fig. 3. As fraction-error increases from 25% to 100%, RMSE increases, indicating a decrease in predictive accuracy. As amount-error increases from 5% to 10%, RMSE also increases, again indicating a decrease in estimation accuracy. Both fraction-error and amount-error therefore have an effect on predictive accuracy.
Predictive accuracy results, using the simulated inaccuracies for amount-error and fraction-error for the average air temperature estimation, are given in Fig. 4. As fraction-error increases from 5% to 20%, RMSE increases, indicating a decrease in predictive accuracy.
Data quality analysis results show that errors in the training data as well as in the testing data affect the predictive accuracy of ANFIS based soft sensor models. This section discusses an efficient algorithm, the TANE algorithm, for identifying noisy data pairs in the dataset.
The TANE algorithm, which deals with discovering functional and approximate dependencies in large data files, is effective in practice [13]. It partitions the set of tuples into equivalence partitions with respect to the attributes. By checking whether the tuples that agree on the left-hand side of a dependency also agree on its right-hand side, one can determine whether the dependency holds or not. By analyzing the identified approximate dependencies, one can identify potentially erroneous data in the relations.
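The partition-based check can be illustrated with a small sketch (a simplified reading of the idea behind TANE, not the algorithm of [13] itself); the attribute names are only examples:

from collections import defaultdict

def approximate_dependency_error(rows, lhs, rhs):
    # rows: list of dicts, e.g. {'Qin': 1200, 'Qsol': 300, 'T0': -2, 'Tavg': 19}
    # lhs:  tuple of attribute names, e.g. ('Qin', 'Qsol', 'T0')
    # rhs:  single attribute name, e.g. 'Tavg'
    # Returns the fraction of tuples that conflict with the dependency lhs -> rhs.
    groups = defaultdict(list)                      # equivalence classes over lhs
    for row in rows:
        groups[tuple(row[a] for a in lhs)].append(row[rhs])
    conflicts = 0
    for values in groups.values():
        majority = max(set(values), key=values.count)
        # tuples disagreeing with the majority rhs value in their class are
        # counted as conflicting (candidate erroneous data pairs)
        conflicts += sum(v != majority for v in values)
    return conflicts / max(len(rows), 1)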
In this research, the relationship between the three input parameters (Qin, Qsol, and T0) and the average air temperature (Tavg) is analyzed using the TANE algorithm. For the equivalence partitioning, all four parameters are rounded to zero decimal places.
After data pre-processing, four approximate dependencies are discovered, as shown in Table 4. Although all these dependencies reflect relationships among the parameters, the first dependency is the most important one: it shows that, except for a few data pairs, the selected input parameters have a consistent association with the average air temperature, which is essential for average air temperature estimation.
To identify exceptional tuples by analyzing the approximate dependencies, the equivalence partitions of both the left-hand and the right-hand side of an approximate dependency have to be investigated. This is non-trivial work that can lead to the discovery of problematic data. By analyzing the first dependency, conflicting tuples are identified. The total dataset has 7,132 data pairs. For the first approximate dependency from Table 4, 42 conflicting data pairs are present, which were fixed using data interpolation to improve the performance of the ANFIS-GRID model.
5 Results
The developed ANFIS-GRID model is validated using experimental results [7]. The
model performance is measured using RMSE
RMSE = sqrt( (1/N) Σ_{i=1}^{N} (Tavg(i) − T̂avg(i))² )   (3)
[Fig. 5: measured and ANFIS-GRID estimated average air temperature, Tavg (Deg C) over time (hour of the year), for the model trained and tested with raw data (top) and with cleaned data (bottom)]
For Eq. (3), N is the total number of data pairs, T̂avg is the estimated and Tavg the experimental value of the average air temperature.
Initially, the ANFIS-GRID model uses the raw data for both training and testing. Figure 5 compares the ANFIS-GRID estimated average air temperature values with the experimental results. The first plot in Fig. 5 shows that the ANFIS-GRID estimates are in agreement with the experimental results, with an RMSE of 0.56 °C. However, there are some points at which the estimation does not follow the experimental results; for example, around 152–176 and 408–416 h of the year there is a significant difference between the estimated and the experimental results.
For checking the effect of data quality on ANFIS-GRID performance, the
training and testing datasets are cleaned using TANE algorithm. The conflicting
data pairs are replaced with the required data pairs. Then the cleaned dataset is
applied for the training and the testing of ANFIS-GRID model. A comparison of
the model output with clean data and the experimental results is shown in second
plot of Fig. 5.
6 Conclusion
An ANFIS-GRID based inferential model has been developed to estimate the average air temperature in distributed space heating systems. This model is simpler than the subtractive clustering based ANFIS model [2] and may be used as the air temperature estimator for the development of an inferential control scheme. A grid partition based FIS structure is used since there are only three input variables, and the training dataset is large compared to the number of modifiable parameters of the ANFIS. As experimental data are used for both the training and the testing of the developed model, it is expected that the data can contain some discrepancies. The TANE algorithm is used to identify approximate functional dependencies among the input and the output variables. The most important approximate dependency is analyzed to identify the data pairs with uneven patterns. The identified data pairs are fixed, and the developed model is trained and tested again with the cleaned data. Figure 6 shows that the RMSE is improved by 37.5% and R² by 12%. Therefore, it is highly recommended that the quality of datasets be analyzed before they are applied in ANFIS based modelling.
References
1. M.T. Tham, G.A. Montague, A.J. Morris, P.A. Lant, Estimation and inferential control.
J. Process Control 1, 3–14 (1993)
2. S. Jassar, Z. Liao, L. Zhao, Adaptive neuro-fuzzy based inferential sensor model for estimat-
ing the average air temperature in space heating systems. Build. Environ. 44, 1609–1616
(2009)
3. Z. Liao, A.L. Dexter, An experimental study on an inferential control scheme for optimising
the control of boilers in multi-zone heating systems. Energy Build. 37, 55–63 (2005)
4. B.D. Klein, D.F. Rossin, Data errors in neural network and linear regression models:
an experimental comparison. Data Quality 5, 33–43 (1999)
5. J.S.R. Jang, ANFIS: adaptive-network-based fuzzy inference system. IEEE Trans. Syst.
Man Cybern. 2, 665–685 (1993)
6. Z. Liao, A.L. Dexter, A simplified physical model for estimating the average air temperature
in multi-zone heating systems. Build. Environ. 39, 1013–1022 (2004)
7. BRE, ICITE, Controller efficiency improvement for commercial and industrial gas and
oil fired boilers, A craft project, Contract JOE-CT98-7010, 1999–2001
8. J. Wang, B. Malakooti, A feed forward neural network for multiple criteria decision making.
Comput. Oper. Res. 19, 151–167 (1992)
9. Y. Huh, F. Keller, T. Redman, A. Watkins, Data quality. Inf. Softw. Technol. 32, 559–565
(1990)
10. Bansal, R. Kauffman, R. Weitz, Comparing the modeling performance of regression and
neural networks as data quality varies. J. Manage. Inf. Syst. 10, 11–32 (1993)
11. D. O’Leary, The impact of data accuracy on system learning. J. Manage. Inf. Syst. 9, 83–98
(1993)
12. M. Wei, et al., Predicting injection profiles using ANFIS. Inf. Sci. 177, 4445–4461 (2007)
13. Y. Huhtala, J. Karkkainen, P. Porkka, H. Toivonen, TANE: an efficient algorithm for
discovering functional and approximate dependencies. Comput. J. 42, 100–111 (1999)
Chapter 13
The Meccano Method for Automatic Volume
Parametrization of Solids
1 Introduction
Many authors have devoted great effort to solving the automatic mesh generation
problem in different ways [4, 15, 16, 28], but the 3-D problem is still open [1].
In the past, the main objective has been to achieve high quality adaptive meshes
of complex solids with minimal user intervention and low computational cost.
At present, it is well known that most mesh generators are based on Delaunay triangulation and the advancing front technique, but problems related to mesh quality or mesh conformity with the solid boundary can still appear for complex geometries. In addition, an appropriate definition of element sizes is demanded for
R. Montenegro (*)
Institute for Intelligent Systems and Numerical Applications in Engineering (SIANI), University
of Las Palmas de Gran Canaria, 35017 Las Palmas de Gran Canaria, Spain
e-mail: [email protected]
obtaining good quality elements and mesh adaption. Particularly, local adaptive
refinement strategies have been employed to mainly adapt the mesh to singularities
of numerical solution. These adaptive methods usually involve remeshing or nested
refinement.
We introduced the new meccano technique in [2, 3, 23, 24] for constructing adaptive tetrahedral meshes of solids. We have given this name to the method because the process starts with the construction of a coarse approximation of the solid, i.e. a meccano composed of connected polyhedral pieces. The method builds a 3-D triangulation of the solid as a deformation of an appropriate tetrahedral mesh of the meccano. A particular case is when the meccano is composed of connected cubes, i.e. a polycube.
The new automatic mesh generation strategy uses neither Delaunay triangulation nor the advancing front technique, and it simplifies the geometrical discretization problem for 3-D complex domains whose surfaces can be mapped to the meccano faces.
The main idea of the meccano method is to combine a local refinement/derefine-
ment algorithm for 3-D nested triangulations [20], a parameterization of surface
triangulations [8] and a simultaneous untangling and smoothing procedure [5]. At
present, the meccano technique has been implemented by using the local refine-
ment/derefinement of Kossaczky [20], but the idea could be implemented with
other types of local refinement algorithms [17]. The resulting adaptive tetrahedral
meshes with the meccano method have good quality for finite element applications.
Our approach is based on the combination of several former procedures (refine-
ment, mapping, untangling and smoothing) which are not in themselves new, but
the overall integration is an original contribution. Many authors have used them in
different ways. Triangulations for convex domains can be constructed from a coarse
mesh by using refinement/projection [25]. Adaptive nested meshes have been
constructed with refinement and derefinement algorithms for evolution problems
[7]. Mappings between physical and parametric spaces have been analyzed by
several authors. Significant advances in surface parametrization have been done
in [8, 10, 11, 22, 27, 29], but the volume parametrization is still open. Floater et al.
[12] give a simple counterexample to show that convex combination mappings over
tetrahedral meshes are not necessarily one-to-one. Large domain deformations can
lead to severe mesh distortions, especially in 3-D. Mesh optimization is thus key for
keeping mesh shape regularity and for avoiding a costly remeshing [18, 19]. In
traditional mesh optimization, mesh moving is guided by the minimization of
certain overall functions, but it is usually done in a local fashion. In general, this
procedure involves two steps [13, 14]: the first is for mesh untangling and the
second one for mesh smoothing. Each step leads to a different objective function. In
this paper, we use the improvement proposed by [5, 6], where a simultaneous
untangling and smoothing guided by the same objective function is introduced.
Some advantages of the meccano technique are that: the surface triangulation is automatically constructed, the final 3-D triangulation is conforming with the object boundary, inner surfaces are automatically preserved (for example, interfaces between several materials), the node distribution is adapted in accordance with the object geometry, and parallel computations can easily be developed for meshing.
The main steps of the general meccano tetrahedral mesh generation algorithm are
summarized in this section. A detailed description of this technique can be analyzed
in [2, 23, 24]. The input data are the definition of the solid boundary (for example by
a given surface triangulation) and a given tolerance (corresponding to the solid
surface approximation). The following algorithm describes the whole mesh gener-
ation approach.
Meccano tetrahedral mesh generation algorithm
1. Construct a meccano approximation of the 3-D solid formed by polyhedral pieces.
2. Define an admissible mapping between the meccano boundary faces and the solid
boundary.
3. Build a coarse tetrahedral mesh of the meccano.
4. Generate a local refined tetrahedral mesh of the meccano, such that the mapping of the
meccano boundary triangulation approximates the solid boundary for a given precision.
5. Move the boundary nodes of the meccano to the object surface with the mapping defined
in 2.
6. Relocate the inner nodes of the meccano.
7. Optimize the tetrahedral mesh with the simultaneous untangling and smoothing proce-
dure.
In this section, we present the application of the meccano algorithm to the case in which the solid surface is genus-zero and the meccano is formed by a single cube. We assume as datum a triangulation of the solid surface.
We introduce an automatic parametrization between the surface triangulation of the solid and the cube boundary. To that end, we automatically divide the surface triangulation into six patches, with the same topological connection as the cube faces, so that each patch is mapped to a cube face. These parametrizations have been
done with GoTools core and parametrization modules from SINTEF ICT, available
in the website http://www.sintef.no/math_software. This code implements Floater’s
parametrization in C++. Specifically, in the following application we have used the
mean value method for the parametrization of the inner nodes of the patch triangula-
tion, and the boundary nodes are fixed with chord length parametrization [8, 10].
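A convex combination parametrization of a single disk-like patch can be sketched as follows. For brevity the sketch uses uniform (Tutte) weights instead of the mean value weights of [10] and assumes that the boundary parameter values (e.g. from the chord length parametrization on the cube face) are already given, so it is only an outline of the mapping step, not the GoTools implementation used here.

import numpy as np

def convex_combination_parametrization(n_vertices, triangles, boundary_uv):
    # boundary_uv: dict {vertex index: (u, v)} with fixed boundary positions.
    # Interior vertices are placed at the average of their neighbours
    # (uniform weights), i.e. a convex combination map for a disk-like
    # patch whose parameter boundary is convex.
    neighbours = [set() for _ in range(n_vertices)]
    for a, b, c in triangles:
        neighbours[a].update((b, c))
        neighbours[b].update((a, c))
        neighbours[c].update((a, b))

    interior = [i for i in range(n_vertices) if i not in boundary_uv]
    index = {v: k for k, v in enumerate(interior)}

    A = np.zeros((len(interior), len(interior)))
    rhs = np.zeros((len(interior), 2))
    for v in interior:
        A[index[v], index[v]] = len(neighbours[v])
        for w in neighbours[v]:
            if w in boundary_uv:
                rhs[index[v]] += boundary_uv[w]
            else:
                A[index[v], index[w]] -= 1.0

    uv = np.zeros((n_vertices, 2))
    for v, val in boundary_uv.items():
        uv[v] = val
    if interior:
        uv[interior] = np.linalg.solve(A, rhs)
    return uv

Replacing the uniform weights with mean value weights [10] changes only the entries of A and rhs; the structure of the linear system stays the same.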
We have implemented the meccano method by using the local refinement of ALBERTA. This code is an adaptive multilevel finite element toolbox [26] developed in C. This software can be used to solve several types of 1-D, 2-D or 3-D problems. ALBERTA uses the Kossaczky refinement algorithm [20] and requires an initial mesh topology [25]. The recursive refinement algorithm may not terminate for general meshes. The meccano technique constructs meshes that satisfy the restrictions imposed by ALBERTA in relation to topology and structure. The minimum quality of the refined meshes is a function of the initial mesh quality.
The performance of our novel tetrahedral mesh generator is shown in the following applications. The first corresponds to a Bust, the second to the Stanford Bunny and the third to a Bone. We have obtained the surface triangulations of these objects from the Internet.
The original surface triangulation of the Bust has been obtained from the website http://shapes.aimatshape.net, i.e. the AIM@SHAPE Shape Repository. It has 64,000 triangles and 32,002 nodes. The bounding box of the solid is defined by the points (x, y, z)min = (−120, −30.5, −44) and (x, y, z)max = (106, 50, 46).
We consider a cube, with an edge length equal to 20, as meccano. Its center is placed inside the solid at the point (5, 3, 4). We obtain an initial subdivision of the Bust surface into seven maximal connected subtriangulations by using the Voronoi diagram associated with the centers of the cube faces. In order to get a compatible decomposition of the surface triangulation, we apply an iterative procedure to reduce the current seven patches to six.
We map each surface patch S_i^S to the cube face S_i^C by using the Floater parametrization [8]. The definition of the one-to-one mapping between the cube and Bust boundaries is straightforward once the global parametrization of the Bust surface triangulation is built.
Fixing a tolerance ε2 = 0.1, the meccano method generates a tetrahedral mesh of the cube with 147,352 tetrahedra and 34,524 nodes; see a cross section of the cube mesh in Fig. 1a. This mesh has 32,254 triangles and 16,129 nodes on its boundary and has been reached after 42 Kossaczky refinements from the initial subdivision of the cube into six tetrahedra. The mapping of the cube external nodes to the Bust surface produces a 3-D tangled mesh with 8,947 inverted elements, see Fig. 1b. The location of the cube is shown in this figure. The relocation of inner nodes by using volume parametrizations reduces the number of inverted tetrahedra to 285. We apply our mesh optimization procedure [5] and the mesh is untangled in two iterations. The mesh quality is improved to a minimum value of 0.07 and an average quality of 0.73 after 10 smoothing iterations.
We note that the meccano technique generates a high quality tetrahedral mesh (see Fig. 1c, d): only one tetrahedron has a quality lower than 0.1, 13 lower than 0.2 and 405 lower than 0.3.
The CPU time for constructing the final mesh of the Bust is 93.27 s on a Dell Precision 690 with two Dual Core Xeon processors and 8 GB RAM. More precisely, the CPU time of each step of the meccano algorithm is: 1.83 s for the subdivision of the initial surface triangulation into six patches, 3.03 s for the Floater
Fig. 1 Cross sections of the cube (a) and the Bust tetrahedral mesh before (b) and after (c) the
application of the mesh optimization procedure. (d) Resulting tetrahedral mesh of the Bust
obtained by the meccano method
parametrization, 44.50 s for the Kossaczky recursive bisections, 2.31 s for the exter-
nal node mapping and inner node relocation, and 41.60 s for the mesh optimization.
The original surface triangulation of the Stanford Bunny has been obtained from the website http://graphics.stanford.edu/data/3Dscanrep/, i.e. the Stanford Computer Graphics Laboratory. It has 12,654 triangles and 7,502 nodes. The bounding box of the solid is defined by the points (x, y, z)min = (−10, −3.5, −6) and (x, y, z)max = (6, 2, 6).
We consider a unit cube as meccano. Its center is placed inside the solid at the point (4.5, 10.5, 0.5). We obtain an initial subdivision of the Bunny surface into eight maximal connected subtriangulations using the Voronoi diagram. We reduce the surface partition to six patches and we construct the Floater parametrization from each surface patch S_i^S to the corresponding cube face S_i^C. Fixing a tolerance ε2 = 0.0005, the meccano method generates a cube tetrahedral mesh with 54,496 tetrahedra and 13,015 nodes, see Fig. 2a. This mesh has 11,530 triangles and 6,329 nodes on its boundary and has been reached after 44 Kossaczky refinements from the initial subdivision of the cube into six tetrahedra.
The mapping of the cube external nodes to the Bunny surface produces a 3-D tangled mesh with 2,384 inverted elements, see Fig. 2b. The relocation of inner nodes by using volume parametrizations reduces the number of inverted tetrahedra to 42. We apply eight iterations of the tetrahedral mesh optimization and only one inverted tetrahedron cannot be untangled. To solve this problem, we allow the movement of the external nodes of this inverted tetrahedron and apply eight new optimization iterations. The mesh is then untangled and, finally, we apply eight smoothing iterations fixing the boundary nodes. The resulting mesh quality is improved to a minimum value of 0.08 and an average quality of 0.68, see Fig. 2c, d. We note that the meccano technique generates a high quality tetrahedral mesh: only one tetrahedron has a quality below 0.1, 41 below 0.2 and 391 below 0.3.
The CPU time for constructing the final mesh of the Bunny is 40.28 s on a Dell Precision 690 with two Dual Core Xeon processors and 8 GB RAM. More precisely, the CPU time of each step of the meccano algorithm is: 0.24 s for the subdivision of the initial surface triangulation into six patches, 0.37 s for the Floater parametrization, 8.62 s for the Kossaczky recursive bisections, 0.70 s for the external node mapping and inner node relocation, and 30.35 s for the mesh optimization.
The original surface triangulation of the Bone has been obtained from http://www-c.inria.fr/gamma/download/..., and it can be found in the CYBERWARE Catalogue. This surface mesh contains 274,120 triangles and 137,062 nodes.
Fig. 2 Cross sections of the cube (a) and the Bunny tetrahedral mesh before (b) and after (c) the
application of the mesh optimization procedure. (d) Resulting tetrahedral mesh of the Bunny
obtained by the meccano method
Steps of the meccano technique are shown in Fig. 3. The resulting mesh has 47,824 tetrahedra and 11,525 nodes. This mesh has 11,530 triangles and 5,767 nodes on its boundary and has been reached after 23 Kossaczky refinements from the initial subdivision of the cube into six tetrahedra. A tangled tetrahedral mesh with 1,307 inverted elements appears after the mapping of the cube external nodes to the Bone surface. The node relocation process reduces the number of inverted tetrahedra to 16. Finally, our mesh optimization algorithm produces a high quality tetrahedral mesh: the minimum mesh quality is 0.15 and the average quality is 0.64.
Fig. 3 Cross sections of the cube (a) and the Bone tetrahedral mesh before (b) and after (c) the
application of the mesh optimization procedure. (d) Resulting tetrahedral mesh of the Bone
obtained by the meccano method
The meccano technique is a very efficient adaptive tetrahedral mesh generator for solids whose boundary is a surface of genus zero. We remark that the method requires minimal user intervention and has a low computational cost. The procedure is fully automatic and is defined only by a surface triangulation of the solid, a cube and a tolerance that fixes the desired approximation of the solid surface.
A crucial consequence of the new mesh generation technique is the resulting discrete parametrization of a complex volume (solid) to a simple cube (meccano).
Acknowledgements This work has been partially supported by the Spanish Government, “Secre-
tarı́a de Estado de Universidades e Investigación”, “Ministerio de Ciencia e Innovación”, and
FEDER, grant contract: CGL2008-06003-C03.
References
1. Y. Bazilevs, V.M. Calo, J.A. Cottrell, J. Evans, T.J.R. Hughes, S. Lipton, M.A. Scott, T.W.
Sederberg, Isogeometric analysis: toward unification of computer aided design and finite
element analysis. Trends in Engineering Computational Technology (Saxe-Coburg Publica-
tions, Stirling, 2008), pp. 1–16
2. J.M. Cascón, R. Montenegro, J.M. Escobar, E. Rodrı́guez, G. Montero, A new meccano
technique for adaptive 3-D triangulations. Proceedings of the 16th International Meshing
Roundtable (Springer, New York, 2007), pp. 103–120
3. J.M. Cascón, R. Montenegro, J.M. Escobar, E. Rodrı́guez, G. Montero, The Meccano method
for automatic tetrahedral mesh generation of complex genus-zero solids. Proceedings of the
18th International Meshing Roundtable (Springer, New York, 2009), pp. 463–480
4. G.F. Carey, in Computational Grids: Generation, Adaptation, and Solution Strategies (Taylor
& Francis, Washington, 1997)
5. J.M. Escobar, E. Rodrı́guez, R. Montenegro, G. Montero, J.M. González-Yuste, Simultaneous
untangling and smoothing of tetrahedral meshes. Comput. Meth. Appl. Mech. Eng. 192,
2775–2787 (2003)
6. J.M. Escobar, G. Montero, R. Montenegro, E. Rodrı́guez, An algebraic method for smoothing
surface triangulations on a local parametric space. Int. J. Num. Meth. Eng. 66, 740–760 (2006)
7. L. Ferragut, R. Montenegro, A. Plaza, Efficient refinement/derefinement algorithm of nested
meshes to solve evolution problems. Comm. Num. Meth. Eng. 10, 403–412 (1994)
8. M.S. Floater, Parametrization and smooth approximation of surface triangulations. Comput.
Aid. Geom. Design 14, 231–250 (1997)
9. M.S. Floater, One-to-one piecewise linear mappings over triangulations. Math. Comput. 72, 685–696 (2003)
10. M.S. Floater, Mean value coordinates. Comput. Aid. Geom. Design 20, 19–27 (2003)
11. M.S. Floater, K. Hormann, Surface parameterization: a tutorial and survey. Advances in
Multiresolution for Geometric Modelling, Mathematics and Visualization (Springer, Berlin,
2005), pp. 157–186
12. M.S. Floater, V. Pham-Trong, Convex combination maps over triangulations, tilings, and
tetrahedral meshes. Adv. Computat. Math. 25, 347–356 (2006)
13. L.A. Freitag, P.M. Knupp, Tetrahedral mesh improvement via optimization of the element
condition number. Int. J. Num. Meth. Eng. 53, 1377–1391 (2002)
14. L.A. Freitag, P. Plassmann, Local optimization-based simplicial mesh untangling and
improvement. Int. J. Num. Meth. Eng. 49, 109–125 (2000)
15. P.J. Frey, P.L. George, in Mesh Generation (Hermes Sci. Publishing, Oxford, 2000)
16. P.L. George, H. Borouchaki, in Delaunay Triangulation and Meshing: Application to Finite
Elements (Editions Hermes, Paris, 1998)
17. J.M. González-Yuste, R. Montenegro, J.M. Escobar, G. Montero, E. Rodrı́guez, Local refine-
ment of 3-D triangulations using object-oriented methods. Adv. Eng. Soft. 35, 693–702 (2004)
18. P.M. Knupp, Achieving finite element mesh quality via optimization of the Jacobian matrix
norm and associated quantities. Part II-A frame work for volume mesh optimization and the
condition number of the Jacobian matrix. Int. J. Num. Meth. Eng. 48, 1165–1185 (2000)
19. P.M. Knupp, Algebraic mesh quality metrics. SIAM J. Sci. Comput. 23, 193–218 (2001)
20. I. Kossaczky, A recursive approach to local mesh refinement in two and three dimensions.
J. Comput. Appl. Math. 55, 275–288 (1994)
21. X. Li, X. Guo, H. Wang, Y. He, X. Gu, H. Qin, Harmonic volumetric mapping for solid
modeling applications. Proceedings of the ACM Solid and Physical Modeling Symposium,
Association for Computing Machinery, Inc., 2007, pp. 109–120
22. J. Lin, X. Jin, Z. Fan, C.C.L. Wang, Automatic PolyCube-Maps. Lecture Notes in Computer
Science 4975, 3–16 (2008)
23. R. Montenegro, J.M. Cascón, J.M. Escobar, E. Rodrı́guez, G. Montero, Implementation in
ALBERTA of an automatic tetrahedral mesh generator. Proceedings of the 15th International
Meshing Roundtable (Springer, New York, 2006), pp. 325–338
24. R. Montenegro, J.M. Cascón, J.M. Escobar, E. Rodrı́guez, G. Montero, An automatic strategy
for adaptive tetrahedral mesh generation. Appl. Num. Math. 59, 2203–2217 (2009)
25. A. Schmidt, K.G. Siebert, in Design of Adaptive Finite Element Software: The Finite Element
Toolbox ALBERTA. Lecture Notes in Computer Science and Engineering, vol. 42. (Springer,
Berlin, 2005)
26. A. Schmidt, K.G. Siebert, ALBERTA – an adaptive hierarchical finite element toolbox. http://
www.alberta-fem.de/
27. M. Tarini, K. Hormann, P. Cignoni, C. Montani, Polycube-Maps. ACM Trans. Graph. 23,
853–860 (2004)
28. J.F. Thompson, B. Soni, N. Weatherill, in Handbook of Grid Generation (CRC Press, London,
1999)
29. H. Wang, Y. He, X. Li, X. Gu, H. Qin, Polycube splines. Comput. Aid. Geom. Design 40,
721–733 (2008)
Chapter 14
A Buck Converter Model for Multi-Domain
Simulations
1 Introduction
Fig. 1 Topology of a
conventional buck converter
including power electronic devices with switching frequencies around 100 kHz
require at least four calculation points within simulation times of around 10 ms for
calculating the switching events. However, if the energy flow in an electromechan-
ical system has to be investigated by simulation it is not necessary to calculate the
switching events in the power electronic model as long as the relevant losses are
considered.
In this work two different buck converter models are described. The first model,
model A, which is state-of-the-art describes the behavior of a conventional buck
converter, as shown in Fig. 1, including the calculation of switching events. This
means that in model A the switching of the semiconductors in the circuit is imple-
mented with if-clauses. Therefore, model A directly calculates the ripple of the current
through the storage inductor, and the ripple of the voltage across the buffer capacitor.
Due to the if-clauses in model A, its computation time is very long.
The second model in this work, indicated as model B, describes the behavior of
the buck converter without calculating the switching events with if-clauses. Only
the mean and RMS values of the voltages and currents are calculated. Therefore, the
computation times of model B are significantly shorter than the computation times
of model A.
In both models the conduction losses are considered by an ohmic resistance of
the storage inductor, the knee voltage and the on-resistance of the diode, and the on-
resistance of the MOSFET. Linear temperature dependence is implemented for the
ohmic resistances of the storage inductor, the knee voltage and the on-resistance of
the diode and the on-resistance of the MOSFET in both buck converter models.
The switching losses are calculated assuming a linear dependency on the switch-
ing frequency, the blocking voltage and the commutating current between the
MOSFET and the diode. A controlled current source connected to the positive
and the negative pin of the supply side of the buck converter is used to model the
switching losses. This current source assures that the energy balance between the
supply side and the load side of the buck converter is guaranteed.
ton = d Ts   (1)
Algorithm 1 Pseudo code of a buck converter model for calculating switching events in CICM

Model:             BuckConverter
Parameters:        L, C, RS, RL, RD, VD, fs
Real variables:    vin, vout, iS, iL, iD, iload, t, d
Boolean variables: scontrol
Equations:
  if (scontrol = true) then
    consider the equations corresponding to the equivalent circuit of state 1 (Fig. 2)
  else
    consider the equations corresponding to the equivalent circuit of state 2 (Fig. 3)
  end if
effort that is caused by the if-clauses. Strictly speaking, the whole set of equations
describing the circuit changes whenever the converter switches from state 1 to
state 2 and vice versa. In such a model the relevant conduction losses are considered
inherently. For the consideration of the switching losses a model expansion as
described in Section 4 is necessary.
If the dynamic behavior of the buck converter is not of interest but the energy flow
needs to be investigated it is possible to model the buck converter without calculat-
ing the switching events. Assuming the buck converter is in steady state the integral
of the inductor voltage vL over one switching period Ts equals zero [9]. Hence,
∫_0^Ts vL dt = ∫_0^ton vL dt + ∫_ton^Ts vL dt = 0.   (3)
During the time ton the equivalent circuit of state 1 describes the behavior of the
buck converter. In the circuit in Fig. 2 the inductor voltage is given by (4), where

vRL = iL RL   (5)

vRS,state 1 = iL RS.   (6)
The equivalent circuit of state 2 (shown in Fig. 3) describes the behavior of the buck converter during the time toff = Ts − ton. In state 2 the inductor voltage is determined analogously, with

vRD,state 2 = iL RD.   (9)
From (10) it is possible to find the mean value of the output voltage by (11).
vout is a function of the duty cycle d, the input voltage vin, and the mean value of the load current iload. Consequently, it is possible to calculate the average output voltage while considering the conduction losses, provided relations for d, vin, and iload are available in other models, which is usually the case. The result of (11) can be used as the input of a voltage source that is linked to the connectors of the load side of the buck converter model. Please note that with (11) only the influence of the conduction losses on the average output voltage is considered. In order to calculate the influence of the conduction losses on the supply current, the conduction losses of the individual elements (MOSFET, diode, and inductor) need to be known. From the equivalent circuits in Figs. 2 and 3 it appears that, by approximation (with the assumption that vout changes only insignificantly within one switching period), in state 1 the inductor current rises with a time constant
τstate 1 = L / (RS + RL)   (12)

and in state 2 it falls with a time constant

τstate 2 = L / (RD + RL).   (13)
Provided that the time constants τstate 1 and τstate 2 are much larger than the switching period Ts (which applies practically to all buck converters with proper
design), the instantaneous current through the inductor can be assumed to have a
triangular waveform such as
iL = iL,state 1   if n Ts < t ≤ (n + d) Ts
     iL,state 2   if (n + d) Ts < t ≤ (n + 1) Ts   (14)

with n = 0, 1, 2, 3, . . . and

iL,state 1 = iload − ΔIL/2 + (ΔIL / (d Ts)) t   (15)

iL,state 2 = iload + ΔIL/2 − (ΔIL / ((1 − d) Ts)) t,   (16)

ΔIL = ((vout + VD) / L) (1 − d) Ts.   (17)
Considering the two states of the buck converter circuit and using (14)–(17), the waveform of the current through the MOSFET is

iS = iL,state 1   if n Ts < t ≤ (n + d) Ts
     0            if (n + d) Ts < t ≤ (n + 1) Ts   (18)
For calculating the conduction losses of the individual elements in the converter,
the RMS values of the current through the MOSFET IS,rms, the current through the
diode ID,rms and the inductor current IL,rms have to be available. Applying the
general relation
Irms = sqrt( (1/T) ∫_{t0}^{t0+T} i(t)² dt )   (20)
with

IL,min = iload − ΔIL/2   (22)

and

ID,rms = sqrt( (1 − d) (IL,max² − IL,max ΔIL + ΔIL²/3) ),   (23)

with

IL,max = iload + ΔIL/2.   (24)

iL = iS + iD   (25)
The conduction losses of the MOSFET, PS,con, and of the storage inductor, PL,con, can be calculated from the respective resistances (RS and RL) and RMS currents.
When calculating the conduction losses of the diode, the portion of the power dissipation contributed by the knee voltage also has to be taken into account. Since the knee voltage is modeled as a constant voltage source, the mean value of the current through the diode has to be used to calculate this contribution. The total conduction losses in the diode can then be written as the sum of the ohmic and the knee-voltage contributions (30).
Using (27), (28), and (30), the total amount of conduction losses Ptot,con can be calculated by (31).
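The bookkeeping of (17) and (22)–(25) can be condensed into a short numerical sketch. It is written in Python purely for illustration; the MOSFET RMS current and the mean diode current are not printed in the text above and are added here under the usual triangular-waveform assumption, so the block should not be read as the authors' Modelica implementation.

def conduction_losses(d, v_out, i_load, V_D, L, f_s, R_S, R_L, R_D):
    # Average-value bookkeeping of the conduction losses (model B style);
    # parameter names follow the text.
    T_s = 1.0 / f_s
    dI_L = (v_out + V_D) * (1.0 - d) * T_s / L             # current ripple, cf. (17)
    I_L_min = i_load - dI_L / 2.0                          # cf. (22)
    I_L_max = i_load + dI_L / 2.0                          # cf. (24)

    # Squared RMS currents for the triangular waveform; the diode expression
    # is (23), the MOSFET one is the assumed analogous counterpart, and the
    # inductor value follows from i_L = i_S + i_D with non-overlapping currents.
    I_S_rms_sq = d * (I_L_min**2 + I_L_min * dI_L + dI_L**2 / 3.0)
    I_D_rms_sq = (1.0 - d) * (I_L_max**2 - I_L_max * dI_L + dI_L**2 / 3.0)
    I_L_rms_sq = I_S_rms_sq + I_D_rms_sq
    I_D_avg = (1.0 - d) * i_load                           # assumed mean diode current

    P_S_con = R_S * I_S_rms_sq                             # MOSFET conduction losses
    P_L_con = R_L * I_L_rms_sq                             # inductor conduction losses
    P_D_con = R_D * I_D_rms_sq + V_D * I_D_avg             # diode: ohmic + knee voltage
    return P_S_con + P_L_con + P_D_con                     # total conduction losses

Together with the switching losses of the next section, this total enters the controlled current source of model B, cf. (34).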
Models on different levels of detail for switching loss calculation have been
published. In many MOSFET models the parasitic capacitances are considered
and in some also the parasitic inductances at the drain and at the source of the
MOSFET are taken into account. In [1] a model considering the voltage depen-
dence of the parasitic capacitances is proposed. A model in which constant parasitic
capacities as well as parasitic inductances are considered is suggested in [2] and in
[4] voltage dependent parasitic capacities together with the parasitic inductances
are used for the calculation.
In data sheets such as [6] an equation combining two terms is used. In the first
term constant slopes of the drain current and the drain source voltage are assumed
and in the second the influence of the output capacitance is taken into account. Also
for this approach the parasitic capacities as well as the switching times (or at least
the gate switch charge and the gate current) have to be known. In [10] it is stated that the approach in [6] leads to an overestimation of the switching losses in the MOSFET.
A general approach for switching loss calculation in power semiconductors
using measurement results with linearization and polynomial fitting is presented
in [3]. In [12] the switching losses are considered to be linearly dependent on the blocking voltage, the current through the switch, and the switching frequency. This
approach was initially developed for modeling switching losses in IGBTs but it can
also be applied to the calculation of MOSFET switching losses. In [8] a modified
version of the model proposed in [12] is presented. The difference is that in [8] the
switching losses are dependent, with higher order, on the blocking voltage, the current through the switch, and the switching frequency.
In the presented work the approach described in [12] is used to model the
switching losses. The two buck converter models in Section 2 and 3 can be
expanded with switching loss models using
Pswitch = Pref,switch (fs / fref,s) (iload / iref,load) (vin / vref,in),   (32)
where Pswitch represents the sum of the actual switching losses in the MOSFET and the diode of the buck converter, fs denotes the actual switching frequency, iload is the actual commutating current between the diode and the MOSFET, and vin is the actual blocking voltage of the diode and the MOSFET. Pref,switch represents the reference switching losses measured at the reference operating point given by fref,s, iref,load and vref,in.
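Equation (32) itself is a plain proportional scaling of a measured reference loss and can be sketched directly (again only as an illustration of the relation, with the reference quantities taken as known parameters):

def switching_losses(f_s, i_load, v_in,
                     P_ref_switch, f_ref_s, i_ref_load, v_ref_in):
    # Linear scaling of the reference switching losses with switching
    # frequency, commutating current and blocking voltage, cf. (32).
    return (P_ref_switch
            * (f_s / f_ref_s)
            * (i_load / i_ref_load)
            * (v_in / v_ref_in))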
The buck converter models described in Sections 2 and 3 were implemented in the Modelica modeling language [5] using the Dymola programming and simulation environment. Modelica is an open and object oriented modeling language that
allows the user to create models of any kind of physical object or device which
can be described by algebraic equations and ordinary differential equations.
In both models the conduction losses with linear temperature dependence and
the switching losses (calculated according to Section 4) are considered.
In Fig. 4 the scheme of model A, the model calculating the switching events (as
explained in Section 2) is shown.
The conduction losses are inherently considered in Alg. 1 and the switching
losses are considered by means of a controlled current source with
imodel A = Pswitch / vin   (33)
In (34) Pswitch is given by (32), Ptot,con is calculated from (31), and Pout is the
output signal of the power meter in Fig. 5.
The approach applied in model A is well established. Therefore, the results of model A are used as a reference for the verification of model B. For the comparison of the two models, a buck converter (fs = 100 kHz) supplied with a constant voltage of 30 V and loaded with a constant load current of 40 A was simulated using model A and model B. In the two simulations the duty cycle was decreased step by step with Δd = 0.1 every 0.03 s, starting from d = 0.8 down to d = 0.2.
The purpose of model B is to calculate the efficiency and the electric quantities
in steady state. The supply current signals and the load voltage signals in Fig. 6
show that after the transients decay both models reach the same operation point.
Please note that in Fig. 6 the instantaneous supply current signal computed with
model A is averaged over a switching period.
Both simulations were computed on a state-of-the-art PC with 3 GHz dual core
and 3 GB RAM. It took only 2.8 s to process the results of the simulation with
model B whereas the CPU time for the simulation with model A was 36 s. The large
difference between the CPU times indicates that it is much more efficient to use
model B if the energy flow through a converter is the focus of the simulation.
For the validation of the two simulation models several laboratory tests were conducted. In order to avoid core losses, an air-cored coil was implemented as the storage inductor. Two IRFPS3810 power MOSFETs were chosen as the active and the passive switch, and the body diode of one of the MOSFETs was used as the freewheeling diode. The temperatures of the two MOSFETs and the air-cored coil were measured with type-K thermocouples.
Fig. 6 Supply current (top) and load voltage (bottom) simulated with model A and model B
Fig. 7 Measured and simulated efficiency of the buck converter versus duty cycle
The buck converter used in the laboratory tests was operated with a similar duty
cycle reference signal as used for the results in Fig. 6. However, the step time of the
duty cycle signal in the laboratory test was significantly longer compared to the
signal used for the results in Fig. 6. Because of this, the temperatures of the
semiconductors increased significantly. Figure 7 shows the measured efficiency
of the circuit under test and the respective (steady state) simulation results of model
A and B. The measured and simulated results show satisfactory coherence.
In Fig. 8 the measured losses of the buck converter operated with d = 0.2 and d = 0.8 during warm-up tests are compared with the results of a simulation carried out with model B. In the top diagram it can be seen that the conduction losses decrease with increasing time and temperature. This is because the knee voltage of the freewheeling diode has a negative temperature coefficient and, at d = 0.2, the freewheeling diode conducts 80 % of the time in a switching period. In the bottom diagram of Fig. 8 the conduction losses rise with increasing time and temperature. The reason for this is the positive linear temperature coefficient of the on-resistance of the MOSFET and the longer duration in which the MOSFET conducts during a
Fig. 8 Measured and simulated losses of the buck converter with v_in = 30 V and i_load = 40 A for d = 0.2 (top) and d = 0.8 (bottom)
switching period. Please note that the MOSFET dissipates more energy and reaches
higher temperatures if the buck converter is operated with high duty cycles.
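Equation (31) for the conduction losses is not reproduced in this excerpt. As an illustration only, the Python sketch below uses a generic linear-temperature-coefficient model for the MOSFET on-resistance and the diode knee voltage (all coefficient values are hypothetical, not taken from the chapter); it reproduces the opposite temperature trends discussed above:

```python
def mosfet_conduction_loss(i_load, T, r_on_25=9e-3, alpha=0.006):
    """On-state loss with a positive linear temperature coefficient of R_DS(on) (assumed values)."""
    r_on = r_on_25 * (1.0 + alpha * (T - 25.0))
    return r_on * i_load**2

def diode_conduction_loss(i_load, T, v_knee_25=0.7, beta=-2e-3, r_d=5e-3):
    """Knee-voltage model with a negative temperature coefficient of the knee voltage (assumed values)."""
    v_knee = v_knee_25 + beta * (T - 25.0)
    return v_knee * i_load + r_d * i_load**2

def buck_conduction_losses(i_load, d, T_mosfet, T_diode):
    """Weight each device's loss by its conduction interval: MOSFET for d*T_s, diode for (1-d)*T_s."""
    p_mosfet = d * mosfet_conduction_loss(i_load, T_mosfet)
    p_diode = (1.0 - d) * diode_conduction_loss(i_load, T_diode)
    return p_mosfet + p_diode

# warming devices: MOSFET losses rise with temperature, diode losses fall
print(buck_conduction_losses(i_load=40.0, d=0.2, T_mosfet=25.0, T_diode=25.0))
print(buck_conduction_losses(i_load=40.0, d=0.2, T_mosfet=80.0, T_diode=80.0))
```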
7 Conclusion
References
2. Y. Bai, Y. Meng, A.Q. Huang, F.C. Lee, A novel model for MOSFET switching loss
calculation. The 4th International Power Electronics and Motion Control Conference,
IPEMC, 3, 1669–1672 (2004)
3. U. Drofenik, J.W. Kolar, A general scheme for calculating switching- and conduction-losses
of power semiconductors in numerical circuit simulations of power electronic systems.
Proceedings of the International Power Electronics Conference, IPEC, 2005
4. W. Eberle, Z. Zhang, Y.-F. Liu, P. Sen, A simple switching loss model for buck voltage
regulators with current source drive. The 39th Annual Power Electronics Specialists Confer-
ence, IEEE PESC, 2008, pp. 3780–3786
5. P. Fritzson, Principles of Object-Oriented Modeling and Simulation with Modelica 2.1. (IEEE
Press, Piscataway, NJ, 2004)
6. Datasheet of the IRF6603, N-Channel HEXFET Power MOSFET. International Rectifier,
2005
7. H. Kapeller, A. Haumer, C. Kral, G. Pascoli, F. Pirker, Modeling and simulation of a large
chipper drive. Proceedings of the 6th International Modelica Conference, 2008, pp. 361–367
8. K. Mainka, J. Aurich, M. Hornkamp, Fast and reliable average IGBT simulation model with
heat transfer emphasis. Proceedings of the International Conference for Power Conversion,
Intelligent Motion and Power Quality, PCIM, 2006
9. N. Mohan, T.M. Undeland, W.P. Robbins, Power Electronics – Converters, Applications, and
Design, 2nd edn. (Wiley, New York, 1989)
10. Z.J. Shen, Y. Xiong, X. Cheng, Y. Fu, P. Kumar, Power MOSFET switching loss analysis:
A new insight. Conference Record of the IEEE Industry Applications Conference, 41st IAS
Annual Meeting, 3, 1438–1442 (2006)
11. D. Simic, T. Bäuml, F. Pirker, Modeling and simulation of different hybrid electric vehicles in Modelica using Dymola. Proceedings of the International Conference on Advances in Hybrid
Powertrains, IFP, 2008
12. D. Srajber, W. Lukasch, The calculation of the power dissipation of the IGBT and the inverse
diode in circuits with the sinusoidal output voltage. Conference Proceedings of Electronica,
1992, pp. 51–58
Chapter 15
The Computer Simulation of Shaping in Rotating
Electrical Discharge Machining
Abstract The effect of the tool electrode wear on accuracy is a very important problem in Rotating Electrical Discharge Machining (REDM). Two mathematical
models of REDM are presented: the first one considers machining with the face of
the end tool electrode and the second one considers EDM with the lateral side of the
electrode. The software for computer simulation of EDM machining with the side
and face of the electrodes has been developed. This simulation model for NC
contouring EDM using rotating electrode may also be applied for tool electrode
path optimization. The experimental results confirm the validity of the proposed
mathematical models and the simulation software.
1 Introduction
J. Kozak (*)
Warsaw University of Technology, ul. Narbutta 85, 02-524 Warsaw, Poland
e-mail: [email protected]
\nu = \frac{TWR}{MRR}    (1)

Depending upon the operating parameters of REDM, the relative wear may be 0.01–2. Changes in the dimensions of the tool due to wear during machining are expected to be reflected in the actual depth of cut and, finally, in the profile and dimensional accuracy of the machined parts.
Results of investigations of EDM with a rotating electrode reported in [9, 10] show a sloped, curvilinear profile of the bottom surface of the machined groove due to the wear of the disk electrode. Controlling the path of the electrode can reduce this shape error. In [3], a preliminary analysis of shape error indicates that one of the main factors leading to shape errors is wheel wear. A more extended study of this problem, based on mathematical modeling and experiments, is reported in [11, 12], where a general differential equation describing the relationship between tool wear and the initial and final shapes of the machined surface has been derived. The effect of wheel wear on the dimensional accuracy of grinding is known from the theory of tolerances. When parts loaded together in a pocket are machined in one pass of the rotary tool, the achieved height of the parts is a random variable in the machined set. Based on the
Fig. 1 Example of EDM using a rotating tool electrode: (a) machining with the face of the end electrode, (b) machining by the side of the rotating electrode
assumption of a constant wear rate, a uniform probability density function (PDF) has been obtained for the height of the machined parts [11].
Two cases of machining operations are taken for mathematical modeling: the first one considers machining with the face of the tool electrode and the second one considers EDM with the side of the electrode (Fig. 1).
In the first case, although the technique of integrating the Uniform Wear Method with CAD/CAM software has been successful in generating very complex 3D cavities, it involves a time-consuming, empirical approach for selecting tool paths and machining parameters. Therefore, it is necessary to develop a theoretical model which accounts for the effect of tool wear on the surface profile generation during each pass. As the number of passes can be very large, corresponding computer simulation software also needs to be developed.
The principal scheme of shaping using the end tool electrode is presented in Fig. 2. The initial profile of the workpiece is given by the function y = f(x). The electrode is controlled to move along the tool head path y = g(x). However, the
longitudinal tool wear results in the profile of the machined surface y = F(x), which is different from g(x).
The purpose of this mathematical modeling and computer simulation is to determine the surface profile y = F(x), taking into account the change in tool length which occurs due to the wear of the electrode. The final profile depends on input parameters such as the depth of cut a0, the initial profile y = f(x), the diameter of the tool d0 and the tool head path y = g(x).
Let us consider the case of machining presented in Fig. 2, when the initial surface is y = f(x) and g(x) = constant.
In determining the profile of the machined surface, the following assumptions are made:
– Changes in tool shape are neglected because the Uniform Wear Method is applied.
– The gap between the tool electrode and the workpiece is neglected.
– The material removal rate MRR is equal to:
– The tool wear rate TWR, defined as the volume of tool material removed per unit time, is, for g(x) = const.:
TWR = \frac{\pi d_0^2}{4}\,\frac{dF}{dt}    (3)

or

TWR = \frac{\pi d_0^2}{4}\,\frac{dF}{dx}\,\frac{dx}{dt} = \frac{\pi d_0^2}{4}\,V_f\,\frac{dF}{dx}    (4)

\frac{dF}{dx} + m\,F = m\,f(x)    (5)

with the initial condition F(0) = -a(0) (Fig. 2), where m = 4\nu/(\pi d_0) is the wear factor.
In many cases it is possible to assume that the relative wear during machining is constant (m = const.), i.e. it is not dependent on the actual depth of cut. For this condition and for f(x) = 0, the solution of (5) becomes:

y = g(x) = H_0 - a_0\,m\,x    (10)

where H_0 is the initial position of the tool head.
For machining with a constant feed rate V_f along the x-axis, the compensation of the tool wear can be obtained by adding a relative motion of the tool/workpiece with a constant feed rate V_y along the y-axis equal to:

V_y = a_0\,m\,V_f = \frac{4\nu\,a_0}{\pi d_0}\,V_f    (11)
This theoretical conclusion about linear path of tool electrode for compensation
of wear has been confirmed by the experiments [5].
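A minimal numerical sketch (in Python; not the authors' simulation software) of how the face-machining model (5) can be integrated to obtain the machined profile F(x) for an arbitrary initial surface f(x):

```python
import math

def simulate_face_profile(f, a0, d0, nu, x_end, dx=1e-3):
    """Integrate dF/dx + m*F = m*f(x), F(0) = -a0, with m = 4*nu/(pi*d0), cf. Eq. (5)."""
    m = 4.0 * nu / (math.pi * d0)
    x, F = 0.0, -a0
    xs, Fs = [x], [F]
    while x < x_end:
        dFdx = m * (f(x) - F)      # explicit Euler step of the linear ODE
        F += dFdx * dx
        x += dx
        xs.append(x)
        Fs.append(F)
    return xs, Fs

# example (hypothetical values, in mm): initially flat surface f(x) = 0,
# tool diameter 10 mm, depth of cut 0.6 mm, relative wear 0.3
xs, Fs = simulate_face_profile(lambda x: 0.0, a0=0.6, d0=10.0, nu=0.3, x_end=60.0)
```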
The principal scheme of the shaping process using the lateral side of the tool electrode is presented in Fig. 4. The purpose of this mathematical modeling and computer simulation is to determine the profile of the generated surface y = F(x), taking into account the change in tool diameter which occurs due to the wear of the rotary tool during machining (Fig. 4). The final surface profile depends on the input data, such as the depth of cut a0, the initial surface profile y = f(x), the diameter of the tool d0 and the curvilinear path of the center of the tool y = g(x).
For the modeling purpose the following assumptions were made:
– The inter-electrode gap s is constant and included in the effective radius of the tool, i.e. R = R(electrode) + s.
– The actual depth of cut, a, is determined by the position of points A and B, which are the point of tangency of the tool electrode to the generated profile and the intersection point of the tool and the initial profile y = f(x), respectively (Fig. 4).
– The feed rate along the y-axis is significantly lower than the feed rate V_f, i.e. dg/dx << 1.
– Changes in the tool shape along the axis of rotation are neglected.
After machining for a certain time t, the center of the tool has reached the coordinate x_C = \bar{x}, the feed rate is V(t) = V_f and the effective tool radius due to the wear is R(t).
The material removal rate MRR at dg/dx << 1 is:

TWR = 2\pi\,b\,R(t)\,\frac{dR}{dt}    (14)

Since

\frac{dR}{dt} = \frac{dR}{dx}\,\frac{dx}{dt} = V_f\,\frac{dR}{dx}    (15)
Substituting (12) and (17) into (1) and performing the transformation, the profile of the machined surface can be described by (18):

\frac{dF}{dx} + \frac{\nu F}{2\pi\,[g(\bar{x})-F]}\,\frac{d\bar{x}}{dx} = \frac{\nu f(B)}{2\pi\,[g(\bar{x})-F]}\,\frac{d\bar{x}}{dx} + \frac{dg}{d\bar{x}}\left(1 - \frac{\bar{x}-x}{[g(\bar{x})-F]}\,\frac{d\bar{x}}{dx}\right)    (18)

An additional equation to (18) and (13) can be derived from the condition of tangency of the tool to the machined profile at the point A as follows:

[g(\bar{x}) - F(x)]\,\frac{dF}{dx} = \bar{x} - x    (19)

\frac{dF}{dx} + \frac{\nu F}{2\pi\,[g(\bar{x})-F]} = \frac{\nu f(\bar{x})}{2\pi\,[g(\bar{x})-F]} + \frac{dg}{d\bar{x}}    (20)
The developed software supports process design for REDM. The software is an easy-to-use, menu-driven application that allows evaluating the effect of various parameters on the process performance.
Programming of the kinematics of the tool electrode is achieved by supplying the program with a description of the tool path in the x direction. The geometry of the surface at the beginning of machining can be defined in two ways:
– By supplying the software with the surface equation, y = f(x)
– By supplying the software with coordinates of the surface (for example, a surface just obtained from a machining process cycle can be used as the starting surface for the next machining operation)
An important feature of the software is the capability to define the relative tool wear as a function of the depth of cut.
Input data are entered in the Data window. The windows for input data are shown in Fig. 5.
Results of the simulation can be viewed in two ways: as tables of coordinates of points of the workpiece surface and as 2D graphs.
Figure 6 shows an example of a graph of a REDM machined surface defined by the function y = a/(1 + b sin(2x/…)). The tool path was given by g(x) = H_0 - a_0 + R_0 = constant (in Fig. 7 the tool path is shown shifted to the x-axis).
Fig. 6 The profiles from the software screen (R_0 = 10 mm, a_0 = 0.6, \nu = 0.3)
5 Experimental Verification
The theoretical model and simulation results were verified in Rotary Electrical Discharge Machining (REDM) on a Mitsubishi M35K NC-EDM machine tool using electrodes with diameters of 8.2 and 18 mm. The range of workpiece movement was 0–160 mm along the x-axis. Experiments were performed on tool steel P233330 (HRC 56) with a copper electrode using the following process parameters: pulse voltage U: 40 V, pulse current I: 55 and 120 A, pulse on time t_p = pulse off
Fig. 7 The surface profile and tool path at EDM with compensation of tool wear (R_0 = 10 mm, a_0 = 0.6, \nu = 0.3)
Fig. 8 Comparison of experimental and simulated surface profiles, Y (mm) versus X (mm), for three sets of machining parameters: (a), (b), (c)
time t_0: 60 and 120 ms, depth of cut a_0: 1.2 and 1.5 mm, feed rate V_f: 0.15, 1.0, 2.0, and 6.0 mm/min, rotation speed n: 200 rpm. For the different combinations of setting parameters, the non-dimensional coordinates X = \nu x/(2\pi R_0) and Y = y/a_0 were used for the verification. The results are presented in Fig. 8.
Further experiments were carried out on an EDIOS-16 machine tool using a copper electrode with a diameter of 4 mm, a pulse on time of 160 ms and a pulse off time of 10 ms; the maximal travel distance along the x-axis was 0–50 mm.
The machining conditions are shown in Table 1.
Experimental results are compared with the simulation predictions in Fig. 9.
The experimental verifications show the high accuracy of the developed mathematical model and computer simulation. An overall average deviation of 6% (of the initial depth of cut) was found between the simulation and experimental results.
6 Conclusion
Acknowledgements This work was supported in part by the European Commission project
“Micro-Technologies for Re-Launching European Machine Manufacturing SMEs (LAUNCH-
MICRO)”.
References
1. M. Kunieda, B. Lauwers, K.P. Rajurkar, B.M. Schumacher, Advancing EDM through funda-
mental insight into the process. Ann. CIRP 54(2), 599–622
2. P. Bleys, J.P. Kruth, B. Lauwers, Milling EDM of 3D shapes with tubular electrodes,
Proceedings of ISEM XIII, 2001, pp. 555–567
3. P. Bleys, J. P. Kruth, Machining complex shapes by numerically controlled EDM. Int.
J. Electric. Mach. 6, 61–69 (2001)
4. J.P. Kruth, B. Lauwers, W. Clappaert, A study of EDM pocketing, Proceedings of ISEM X,
1992, pp. 121–135
5. Z.Y. Yu, T. Masuzawa, M. Fujino, Micro-EDM for three-dimensional cavities, development
of uniform wear method. Ann. CIRP 47(1), 169–172 (1998)
6. K.P. Rajurkar, Z.Y. Yu, 3D micro EDM using CAD/CAM., Ann. CIRP 49(1), 127–130 (2000)
7. Z.Y. Yu, T. Masuzawa, M. Fujino, A basic study on 3D Micro-EDM. Die Mould Technol. 11
(8), 122–123 (1996) (in Japanese)
8. Z.Y. Yu, J. Kozak, K.P. Rajurkar, Modeling and simulation of micro EDM process. Ann.
CIRP 52(1), 143–146 (2003)
9. Y. Uno, A. Okada, M. Itoh, T. Yamaguchi, EDM of groove with rotating disk electrode, Int.
J. Electr.Mach. 1, 13–20 (1996)
10. J. Quian, H. Ohmori, T. Kato, I. Marinescu, Fabrication of micro shapes of advanced materials
by ELID- Grinding. Trans. NAMRI/SME 27, 269–278 (2000)
11. J. Kozak, Effect of the wear of rotating tool on the accuracy of machined parts, in Proceedings
of the 2nd International Conference on Advances in Production Engineering APE-2, vol. I.
Warsaw, 2001, pp. 253–262
12. J. Kozak, Z. Gulbinowicz, D. Gulbinowicz, Computer simulation of rotating electrical
machining (REDM). Arch. Mech. Eng. XL(1), 111–125 (2004)
Chapter 16
Parameter Identification of a Nonlinear Two
Mass System Using Prior Knowledge
Abstract This article presents a new method for system identification based on
dynamic neural networks using prior knowledge. A discrete chart is derived from a
given signal flow chart. This discrete chart is implemented in a dynamic neural
network model. The weights of the model correspond to physical parameters of the
real system. Nonlinear parts of the signal flow chart are represented by nonlinear
subparts of the neural network. An optimization algorithm trains the weights of the
dynamic neural network model. The proposed identification approach is tested with
a nonlinear two mass system.
1 Introduction
C. Endisch (*)
Institute for Electrical Drive Systems, Technical University of Munich, Arcisstraße 21, 80333 München, Germany
e-mail: [email protected]
Fig. 1 Example of a three-layer GDNN with feedback connections in all layers – the output of a tapped delay line (TDL) is a vector containing delayed values of the TDL input. Below the matrix boxes and below the arrows the dimensions are shown. R^m and S^m respectively indicate the dimension of the input and the number of neurons in layer m. ŷ is the output of the GDNN
the Jacobian matrix using real time recurrent learning (RTRL) [4, 9, 11, 16]. In this article we follow these conventions suggested by De Jesus. The simulation equation for layer m is calculated by

n^m(t) = \sum_{l \in Lf^m} \sum_{d \in DL^{m,l}} LW^{m,l}(d)\, a^l(t-d) + \sum_{l \in I^m} \sum_{d \in DI^{m,l}} IW^{m,l}(d)\, p^l(t-d) + b^m,    (1)
where n^m(t) is the summation output of layer m, p^l(t) is the l-th input to the network, IW^{m,l} is the input weight matrix between input l and layer m, LW^{m,l} is the layer weight matrix between layer l and layer m, b^m is the bias vector of layer m, DL^{m,l} is the set of all delays in the tapped delay line between layer l and layer m, DI^{m,l} is the set of all input delays in the tapped delay line between input l and layer m, I^m is the set of indices of input vectors that connect to layer m, and Lf^m is the set of indices of layers that directly connect forward to layer m. The output of layer m is

a^m(t) = f^m(n^m(t)),    (2)

where f^m(\cdot) are either nonlinear tanh or linear activation functions. At each point
in time, Eqs. 1 and 2 are iterated forward through the layers. Time is incremented from t = 1 to t = Q. (See [10] for a full description of the notation used here.) In order to construct a flexible model structure, it is necessary that only particular weights in the weight matrices exist. This is realized by the introduction of administration matrices.
For each weight matrix there exists one weight administration matrix that marks which weights are used in the GDNN model. The layer weight administration matrices A_L^{m,l}(d) have the same dimensions as the layer weight matrices LW^{m,l}(d), the input weight administration matrices A_I^{m,l}(d) have the same dimensions as the input weight matrices IW^{m,l}(d), and the bias weight administration vectors A_b^m have the same dimensions as the bias weight vectors b^m. The elements of the administration matrices can have the Boolean values 0 or 1, indicating whether a weight is valid or not. If, e.g., the layer weight lw^{m,l}_{k,i}(d) = [LW^{m,l}(d)]_{k,i} from neuron i of layer l to neuron k of layer m with a d-th-order time delay is valid, then [A_L^{m,l}(d)]_{k,i} = al^{m,l}_{k,i}(d) = 1. If the element in the administration matrix equals zero, the corresponding weight has no influence on the GDNN. With these definitions the k-th output of layer m can be computed by
n^m_k(t) = \sum_{l \in Lf^m} \sum_{d \in DL^{m,l}} \left( \sum_{i=1}^{S^l} lw^{m,l}_{k,i}(d)\, al^{m,l}_{k,i}(d)\, a^l_i(t-d) \right) + \sum_{l \in I^m} \sum_{d \in DI^{m,l}} \left( \sum_{i=1}^{R^l} iw^{m,l}_{k,i}(d)\, ai^{m,l}_{k,i}(d)\, p^l_i(t-d) \right) + b^m_k\, ab^m_k,    (3)
where S^l is the number of neurons in layer l and R^l is the dimension of the l-th input. By setting certain entries of the administration matrices to one, a particular GDNN structure is generated. As this model uses structural knowledge of the system, it is called a Structured Dynamic Neural Network (SDNN).
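A minimal sketch (Python/NumPy, not the authors' C++ S-function) of how administration matrices mask the weights when the net input of one layer is accumulated; the sizes, delays and values below are hypothetical:

```python
import numpy as np

def layer_net_input(LW, AL, a_delayed, IW, AI, p_delayed, b, Ab):
    """Accumulate n^m(t); the 0/1 administration matrices mask unused weights, cf. Eq. (3)."""
    n = b * Ab                                  # bias contribution, masked elementwise
    for d, a in a_delayed.items():              # {delay: a^l(t-d)} from a connected layer
        n = n + (LW[d] * AL[d]) @ a             # masked layer weights
    for d, p in p_delayed.items():              # {delay: p^l(t-d)} from a network input
        n = n + (IW[d] * AI[d]) @ p             # masked input weights
    return n

# toy structure: one recurrent layer connection with delays 0 and 1, one external input
S, R = 3, 2
LW = {0: np.random.randn(S, S), 1: np.random.randn(S, S)}
AL = {0: np.eye(S), 1: np.zeros((S, S))}        # only the undelayed recurrent weights are valid
IW = {0: np.random.randn(S, R)}
AI = {0: np.ones((S, R))}
b, Ab = np.random.randn(S), np.ones(S)

n = layer_net_input(LW, AL, {0: np.ones(S), 1: np.ones(S)}, IW, AI, {0: np.ones(R)}, b, Ab)
a = np.tanh(n)                                  # layer output, cf. Eq. (2)
```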
2.2 Implementation
For the simulations throughout this paper the graphical programming language Simulink (Matlab) is used. The SDNN and the optimization algorithm are implemented as S-functions in C++.
3 Parameter Optimization
E(w_k) = \frac{1}{2} \sum_{q=1}^{Q} (y_q - \hat{y}_q(w_k))^T (y_q - \hat{y}_q(w_k)) = \frac{1}{2} \sum_{q=1}^{Q} e_q^T(w_k)\, e_q(w_k),    (4)
where q denotes one pattern in the training set, and y_q and \hat{y}_q(w_k) are the desired target and the actual model output of the q-th pattern, respectively. The vector w_k is composed of all weights in the SDNN. The cost function E(w_k) is small if the training process performs well and large if it performs poorly. The cost function forms an error surface in an (N+1)-dimensional space, where N is equal to the number of weights in the SDNN. In the next step this space has to be searched in order to reduce the cost function.
All Newton methods are based on the second-order Taylor series expansion about the old weight vector w_k:

E(w_k + \Delta w_k) \approx E(w_k) + g_k^T\,\Delta w_k + \frac{1}{2}\,\Delta w_k^T H_k\,\Delta w_k    (5)

\nabla E(w_{k+1}) = g_k + H_k\,\Delta w_k = 0    (6)

\Delta w_k = -H_k^{-1} g_k, \qquad w_{k+1} = w_k - H_k^{-1} g_k    (7)

The vector -H_k^{-1} g_k is known as the Newton direction, which is a descent direction if the Hessian matrix H_k is positive definite. The LM approach approximates the Hessian matrix by [5]

H_k \approx J^T(w_k)\,J(w_k)    (8)

and the gradient by

g_k = J^T(w_k)\,e(w_k),    (9)
The Jacobian matrix includes first derivatives only. N is the number of all weights in the neural network and Q is the number of evaluated time steps. With Eqs. 4, 5 and 6 the LM method can be expressed with the scaling factor \mu_k as
w_{k+1} = w_k - \left[ J^T(w_k)\,J(w_k) + \mu_k I \right]^{-1} J^T(w_k)\,e(w_k),    (11)
To create the Jacobian matrix, the derivatives of the errors have to be computed, see Eq. 10. The GDNN has feedback elements and internal delays, so the Jacobian cannot be calculated by the standard backpropagation algorithm. There are two general approaches to calculating the Jacobian matrix for dynamic systems: backpropagation through time (BPTT) [15] or real time recurrent learning (RTRL) [16]. For Jacobian calculations the RTRL algorithm is more efficient than the BPTT algorithm [11]. Accordingly, the RTRL algorithm is used in this paper. The interested reader is referred to [2, 8, 11] for further details.
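For illustration, a compact Python/NumPy sketch of the Levenberg–Marquardt iteration (11); the Jacobian is assumed to be supplied by an external sensitivity computation (RTRL in this chapter), and the simple damping schedule is a common heuristic rather than the authors' exact implementation:

```python
import numpy as np

def lm_step(w, e, J, mu):
    """One Levenberg-Marquardt update, Eq. (11)."""
    H_approx = J.T @ J                       # Gauss-Newton Hessian approximation, Eq. (8)
    g = J.T @ e                              # gradient, Eq. (9)
    dw = np.linalg.solve(H_approx + mu * np.eye(w.size), g)
    return w - dw

def train(w, residual_and_jacobian, mu=1e-2, mu_up=10.0, mu_down=0.1, iters=100):
    """residual_and_jacobian(w) must return the error vector e and the Jacobian J."""
    e, J = residual_and_jacobian(w)
    cost = 0.5 * e @ e                       # cost function, Eq. (4)
    for _ in range(iters):
        w_new = lm_step(w, e, J, mu)
        e_new, J_new = residual_and_jacobian(w_new)
        cost_new = 0.5 * e_new @ e_new
        if cost_new < cost:                  # accept the step and decrease damping
            w, e, J, cost, mu = w_new, e_new, J_new, cost_new, mu * mu_down
        else:                                # reject the step and increase damping
            mu *= mu_up
    return w
```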
4 Two-Mass-System
Fig. 2 Signal flow chart of the two-mass system (TMS) with the spring constant c, the damping d and the inertias J_1, J_2
where the spring constant c and the damping d model the shaft between the two machines [14]. \dot{\varphi}_1 and \dot{\varphi}_2 denote the rotation speeds of the main engine and the work machine, respectively. The torques of inertia of the machines are denoted by J_1 and J_2. The motor torque is M_1 and the torque at the work machine is M_2. M_B1 and M_B2 are the acceleration torques of the main engine and the work machine, respectively. The difference of the rotation speeds is denoted by \Delta\dot{\varphi}, and \Delta\varphi is the difference of the angles. The torque of the spring and the damping torque are denoted by M_C and M_D, respectively. The friction torques of the engine and the work machine are M_R1 and M_R2, respectively. The objective in this paper is to identify the linear TMS parameters and the characteristics of the two friction torques.
Fig. 5 Discrete signal flow chart of the TMS implemented as an SDNN; the trainable physical parameters appear as boxes and the neurons are numbered layer.neuron
For instance, 15.1 marks the first neuron of the 15th layer. The friction of the engine and of the work machine can be modeled by an arbitrary number of neurons in the 5th layer and in the 11th layer, respectively. These are the only neurons with tanh transfer functions. All other neurons have linear transfer functions. The connections in Fig. 5 which belong neither to a linear parameter (depicted as a box) nor to a friction subpart are initialized with 1 or –1. The optimization algorithm is able to tune the parameters corresponding to the spring constant c, the damping d and the torque of inertia J_2, as well as the friction weights of the work machine. As it is not possible to identify the two torques of inertia and the two characteristic friction curves of the TMS simultaneously, the engine parameters are determined in a first no-load identification, which is conducted in idle running, see Section 6.
6 Identification
Fig. 6 Signal flow chart of the engine (upper left side), redrawn signal flow chart of the engine (upper right side) and resulting SDNN (lower drawing)
t = 100 s identified by the SDNN. We observe that the network is able to model the jump due to the static friction with just three neurons. The following identification of the whole TMS uses this friction curve from Fig. 7 and the result J_1 = 0.1912 from Table 1 for the torque of inertia.
To identify the parameters of the whole TMS, we excite the plant with the torque signal described in Section 6 and use the SDNN model constructed in Section 5. The torque of inertia of the engine J_1 and its friction are initialized according to the results of Section 6. These weights remain unchanged during the whole identification process, whereas the weights corresponding to the torque of inertia of the work machine J_2, the spring constant c, the damping d and the work machine friction are trained. The work machine friction is modeled by three neurons with tanh functions in the 11th layer, see Fig. 5. The six weights of this nonlinear subpart are initialized
Fig. 7 Identification of the engine parameters – torque of inertia and friction curve identified by the nonlinear subpart in the 5th layer
randomly between –0.5 and 0.5. The upper panel of Fig. 8 displays the outputs of the TMS and the SDNN model during the identification process for the first set of initial values of Table 2. The lower panel of this figure shows only 5 s for a detailed view. The identification process starts after 5 s. The sampling time is set to T = 1 ms. The quasi-online calculated cost function E(w_k) of Eq. 4 with Q = 25000 is minimized by the LM optimization algorithm of Eq. 11 and is depicted in the middle panel of Fig. 8. Due to the quasi-online approach the cost function value increases until t = 25 s, when the training data window is completely filled. The results in Table 2 are mean values of the last 25000 optimization steps.
Figure 9 displays the development of the damping, the spring constant and the torque of inertia during the identification process for the first initialization of Table 2. Figure 10 shows the characteristic curve of the work machine friction
Fig. 8 Rotation speed of the TMS and of the SDNN model (top), cost function (middle) and detailed view of the rotation speed (bottom) during the identification
Table 2 Initial values and final values for the identification of the TMS

                        J2 [kgm2]   d [Nms/rad]   c [Nm/rad]   Error (mean value)
Initial value           0.7         0.4           100          –
Result (mean value)     0.3838      0.2406        477.2        3.792 · 10^-2
Initial value           0.1         0.7           800          –
Result (mean value)     0.3881      0.0919        477.6        6.400 · 10^-2
identified by the SDNN after 100 s. In addition, Table 2 shows the results of a second identification run with another initialization. The resulting torque of inertia and spring constant are almost equal. Only the damping shows different final values. The higher mean error (compared to the first identification) implies that the second optimization process ended in a local minimum.
Fig. 9 Development of the damping d, the spring constant c and the torque of inertia J_2 during the identification process
Fig. 10 Friction curve identified by the nonlinear subpart in the 11th layer
7 Conclusion
References
1. M.J. Brache, Identification of dynamic systems using previous knowledge, Diploma Thesis, Lehrstuhl für Elektrische Antriebssysteme, Technische Universität München, 2008
2. C. Endisch, C. Hackl, D. Schröder, Optimal brain surgeon for general dynamic neural networks, in Lecture Notes in Artificial Intelligence (LNAI) 4874 (Springer, Berlin, 2007), pp. 15–28
3. C. Endisch, C. Hackl, D. Schröder, System identification with general dynamic neural networks and network pruning. Int. J. Computat. Intel. 4(3), 187–195 (2008)
4. C. Endisch, P. Stolze, C. Hackl, D. Schröder, Comments on backpropagation algorithms for a broad class of dynamic networks. IEEE Trans. Neural Networks 20(3), 540–541 (2009)
5. M. Hagan, B.M. Mohammed, Training feedforward networks with the Marquardt algorithm. IEEE Trans. Neural Networks 5(6), 989–993 (1994)
6. C. Hintz, B. Angerer, D. Schröder, Online identification of mechatronic systems with structured recurrent neural networks. Proceedings of the IEEE-ISIE 2002, L'Aquila, Italy, pp. 288–293
7. C. Hintz, Identifikation nichtlinearer mechatronischer Systeme mit strukturierten rekurrenten Netzen. Dissertation, Lehrstuhl für Elektrische Antriebssysteme, Technische Universität München, 2003
8. O. De Jesus, M. Hagan. Backpropagation Algorithms Through time for a general class of
recurrent network. IEEE International Joint Conference Neural Network, Washington, 2001,
pp. 2638–2643
9. O. De Jesus, M. Hagan, Forward perturbation algorithm for a general class of recurrent
network. IEEE International Joint Conference Neural Network, Washington, 2001, pp.
2626–2631
10. O. De Jesus, Training general dynamic neural networks. Ph.D. dissertation, Oklahoma State
University, Stillwater, OK, 2002
11. O. De Jesus, M. Hagan, Backpropagation algorithms for a broad class of dynamic networks. IEEE Trans. Neural Networks 18(1), 14–27 (2007)
Chapter 17
Adaptive and Neural Learning for Biped Robot Actuator Control

Abstract Many robotics problems do not take the dynamics of the actuators into
account in the formulation of the control solutions. The fallacy is in assuming that
forces/torques can be instantaneously and accurately generated. In practice, actua-
tor dynamics may be unknown. This paper presents a Model Reference Adaptive
Controller (MRAC) for the actuators of a biped robot that mimics a human walking
motion. The MRAC self-adjusts so that the actuators produce the desired torques.
A Lyapunov stability criterion and a rate-of-convergence analysis are provided. The
control scheme for the biped robot is simulated on a sagittal plane to verify the
MRAC scheme for the actuators. Next, the paper shows how a neural network (NN)
can learn to generate its own walking gaits using successful runs from the adaptive
control scheme. In this case, the NN learns to estimate and anticipate the reference
commands for the gaits.
1 Introduction
Biped walking dynamics is highly nonlinear, has many degrees of freedom and requires developing a complicated model to describe the walking behavior. Many novel approaches have emerged in the field of biped walking to address this complicated control mechanism. Existing biped walking methods [3, 6, 7, 10] give precise stability control for walking bipeds. However, these methods require highly precise biped walking dynamics. In recent years, biped walking through imitation has become a promising approach, since it avoids developing the complex kinematics of the human walking trajectory and gives the biped a human-like gait.
2 Problem Description
2.1 Objective
Consider the objective of controlling a biped robot so that it imitates the movement of a person. Figure 1 shows the basic idea, where the human movement is represented by y_d and the biped movement by y. The biped motion is determined by the actuators, which are controlled by the inputs u_a. The overall objective is to find the adaptive u_a such that y → y_d.
The actuator dynamics have uncertainties including nonlinearities, unknown parameter values and delays, which have not been widely addressed. Figure 2 shows the adaptive actuator objective, where the actuator output moment M is made to follow a required M_d, which is computed from the requirement that y tracks y_d. Figure 2 also shows an NN scheme for generating an estimate ŷ_d of
Fig. 1 Human movements, biped robot, its actuators
the reference command signal. This paper deals with the formulation and
simulation aspects of the MRAC actuator and neural network learning schemes.
Equations describing the dynamics of a biped robot were introduced in [4], and can be summarized as follows:

A_q(q)\,\ddot{q} = B(q, \dot{q}, M, F)    (1)
The literature often assumes that M can be readily generated without considering the dynamics of the actuators. For example, if a set of desired torques is calculated as M_d, then it is assumed that M = M_d and applied directly as inputs to the robot. However, this is not a valid assumption, since in practice the moments M will be generated by actuators which normally have unknown parameters, time delays and nonlinearities. The moments M can be modelled as the states of

\dot{x}_a(t) = A_a x_a(t) + B_a u_a(t) + d_a(q(t), \dot{q}(t), \tau_{ext}(t)),
M = C_a x_a    (2)

where u_a are the inputs of the actuators, and d_a(q, \dot{q}, \tau_{ext}) represents disturbance torques on the actuators due to robot movements. \tau_{ext} is an external disturbance torque. We assume that the moments/torques M can be measured; for example, by measuring the currents in motors or the pressure in hydraulics. In pre-tuned actuators, we can assume that M = x_a, i.e., C_a = I.
The desired moments M_d can be derived as the output of a controller that operates on y_d and y. For example,

M_d(s) = G_c(s)\,[y_d(s) - y(s)]    (3)

where s is the Laplace variable and G_c(s) is the controller transfer function. The controller G_c is designed to generate the desired moments M_d, required for the adaptive actuator scheme, by using the information from y and y_d.
3 Solution
To apply the MRAC approach to the actuator (2), a reference model for the actuator is needed as follows. The computed desired moments M_d (3) will be represented as the states of the reference model

\dot{x}_m(t) = A_m x_m(t) + B_m u_m(t), \qquad M_d = x_m    (4)

where A_m and B_m represent the desired dynamics for the actuator to follow. u_m(t) represents the command input to the reference model of the actuator and is required for the MRAC.
However, we do not know the input u_m. So the approach here is to estimate the unknown u_m knowing x_m. The unknown u_m can be represented as the output of a waveform-shaping model, i.e.,

\dot{x}_u = A_u x_u + w_u,
u_m = C_u x_u    (5a)
The adaptive actuator scheme is shown in Fig. 3, where the reference and control models are specified by

\dot{x}_m = A_m x_m + B_m \hat{u}_m,
u_a = L x_a + N \hat{u}_m    (6)

The adaptation algorithm in the MRAC adjusts the gains L and N based on Lyapunov stability criteria, as illustrated in Fig. 3.
The error e = x_m - x_a between the actuator torque x_a = M and the desired torque x_m = M_d behaves according to

\dot{e} = A_m e + [A_m - A_a - B_a L]\,x_a + [B_m - B_a N]\,\hat{u}_m - d_a(q, \dot{q}, \tau_{ext})    (7)
where P = P^T > 0, Q = Q^T > 0 and R = R^T > 0 are positive definite matrices. Then

\dot{v} = \dot{e}^T P e + e^T P \dot{e}    (9)

Based on the analysis of \dot{v}, we choose

B_a \dot{L} = Q^{-1} P e x_a^T,
B_a \dot{N} = R^{-1} P e \hat{u}_m^T    (10)

so that \dot{v} = e^T[P A_m + A_m^T P]\,e - 2 e^T A_m^T d_a(q, \dot{q}, \tau_{ext}). We next choose an S = S^T > 0 and solve P from

P A_m + A_m^T P = -S    (11)
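A scalar toy example of these update laws in Python (all numerical values are hypothetical; the adaptation corresponds to the scalar form of (10)/(19) together with the control law (6)):

```python
# true (unknown) actuator:  x_a' = a_a*x_a + b_a*u_a,   reference: x_m' = a_m*x_m + b_m*u_m
a_a, b_a = -3.0, 2.0           # plant parameters (unknown in practice; b_a appears in the published law)
a_m, b_m = -10.0, 10.0         # reference-model parameters
p, q, r = 1.0, 0.5, 0.5        # Lyapunov design constants
dt, T_end = 1e-3, 20.0

x_a = x_m = 0.0
L = N = 0.0                    # adaptive gains
for k in range(int(T_end / dt)):
    u_m = 1.0 if (k * dt) % 1.0 < 0.5 else -1.0   # estimated reference command (square wave)
    u_a = L * x_a + N * u_m                        # control law, Eq. (6)
    x_m += dt * (a_m * x_m + b_m * u_m)            # reference model (desired torque M_d)
    x_a += dt * (a_a * x_a + b_a * u_a)            # actuator (actual torque M)
    e = x_m - x_a
    L += dt * (p / (q * b_a)) * e * x_a            # gain adaptation, scalar form of Eq. (10)/(19)
    N += dt * (p / (r * b_a)) * e * u_m

print(L, N)    # gains after adaptation
```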
The bipedal model has five links with four pin joints as shown in Fig. 4. One link
represents the upper body and two links are for each lower limb. The biped has two
hip joints, two knee joints and two ankles at the tips of the lower limbs. There is an actuator located at each joint, and all of the joints are considered to rotate only in the sagittal plane. The system contains five links and seven degrees of freedom, selected according to Fig. 4a and given by

q = [x_0, y_0, \alpha, \beta_L, \beta_R, \gamma_L, \gamma_R]^T    (12)

The coordinates (x_0, y_0) fix the position of the center of mass of the torso, and the rest of the coordinates describe the joint angles. The link lengths are denoted by (h_0, h_1, h_2) and the masses by (m_0, m_1, m_2). The centers of mass of the links are located at the distances (r_0, r_1, r_2) from the corresponding joints.
The model is actuated with four moments; two of them act between the torso and both thighs, and two act at the knee joints, as illustrated in Fig. 4b.
The walking surface is modelled using external forces that affect the leg tips, as shown in Fig. 4b:

F = [F_{Lx}, F_{Ly}, F_{Rx}, F_{Ry}]^T    (14)
Fig. 4 (a) Biped robot with the corresponding seven coordinates q. (b) Biped robot with the
moments M and reaction forces F
When the leg should touch the ground, the corresponding forces are switched on to support the leg. As the leg rises, the forces are zeroed.
Using Lagrangian mechanics, the dynamic equations for the biped system can be derived as shown in (1), where A(q) ∈ R^{7×7} is the inertia matrix and b(q, \dot{q}, M, F) ∈ R^{7×1} is a vector containing the right-hand sides of the seven partial differential equations. The closed-form formulas for both A and b are presented in [4].

y = [\alpha, (\beta_L - \beta_R), \gamma_L, \gamma_R]^T = C_y q    (16)
The desired torque is derived as the output of a controller that operates on the error between y_d and y. That is,

M_d(s) = G_C(s)\,[y_d(s) - y(s)],
M_d = [M_{d1}\; M_{d2}\; M_{d3}\; M_{d4}]^T    (17)
We note that the actuator states x_a = M are the torques that will drive the biped robot. The goal is to find u_a such that M → M_d. We assume that DC motors are used as actuators. It follows that (2) can be decoupled into individual motors representing the first-order stator–rotor dynamics (a_{ai}, b_{ai}) that generate the output torque (x_{ai}) while subject to the disturbance (d_{ai}), that is

x_a = [x_{a1}\; x_{a2}\; x_{a3}\; x_{a4}]^T, \quad u_a = [u_{a1}\; u_{a2}\; u_{a3}\; u_{a4}]^T
A_a = diag\{a_{a1}, a_{a2}, a_{a3}, a_{a4}\}, \quad B_a = diag\{b_{a1}, b_{a2}, b_{a3}, b_{a4}\}
d_a(q, \dot{q}, \tau_{ext}) = [d_{a1}\; d_{a2}\; d_{a3}\; d_{a4}]^T
u_a = L x_a + N \hat{u}_m
L = diag\{l_1, l_2, l_3, l_4\}, \quad N = diag\{n_1, n_2, n_3, n_4\}    (18)
It follows from the Lyapunov design that the gains are adjusted according to

\dot{l}_i = \frac{1}{b_{ai}\,q_i}\,p_i\,(M_{di} - x_{ai})\,x_{ai}, \qquad \dot{n}_i = \frac{1}{b_{ai}\,r_i}\,p_i\,(M_{di} - x_{ai})\,\hat{u}_{mi}    (19)

\dot{v}_i = -s_i e_i^2 - 2\,a_{mi}\,e_i\,d_{ai}(q, \dot{q}, \tau_{ext}) < 0    (21)
We can now compute pi, qi, and ri for the algebraic Lyapunov function as
The purpose of the NN is to eventually replace the necessity of the human input (y_d). After the MRAC converges, the NN is trained to generate an estimate ŷ_d of y_d using y and \dot{y} as inputs. The basis for the mapping comes from the closed-loop system formed by (1) and (3). The closed-loop characteristic can be expressed in a generic form by f(q, \dot{q}, \ddot{q}, x_a, y_d, F) = 0. The NN is an approximate mapping of f_1(y, \dot{y}, y_d) = 0, where f_1 is a reduced version of f.
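A sketch of this imitation step using scikit-learn in Python (the chapter trains an 8-15-1 network on data logged from successful MRAC runs; the random arrays below merely stand in for that logged data):

```python
import numpy as np
from sklearn.neural_network import MLPRegressor

# placeholder training data: Y stacks [y, y_dot] into 8 columns, yd3 is the logged
# reference command y_d3 produced during successful MRAC walking runs
Y = np.random.randn(5000, 8)
yd3 = np.random.randn(5000)

# 8-15-1 architecture: 8 inputs, one hidden layer with 15 tanh neurons, 1 output
net = MLPRegressor(hidden_layer_sizes=(15,), activation="tanh", max_iter=2000)
net.fit(Y, yd3)                      # learn the mapping f1(y, y_dot) -> y_d3

yd3_hat = net.predict(Y[:10])        # anticipated reference commands for new measurements
```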
5 Simulation Results
The biped movements from (16) are measured and used to generate the desired torques from (17); (18) and (19) were implemented as the MRAC scheme. Three sets of simulation runs are shown below. The first simulation does not include the disturbance torque d_a(q, \dot{q}, \tau_{ext}). We introduced the disturbance in the second simulation [13]. The third simulation shows the estimation performance of the NN in predicting the reference command.
Figure 5 shows MR1 converging to Md2 for one step cycle. Similarly, it can be noted
that the rest of the biped moments converge to the reference model’s behavior.
Therefore, this gives the biped walker a human-like gait as described by Md.
Figure 6 plots the stable torso height y0 (see Fig. 4) of the biped walking as the result
of the adapted torques M.
Equation (2) includes the disturbance torques d_a(q, \dot{q}, \tau_{ext}), which represent torque feedback and other external moments. These cause the dynamics of the actuator to vary. Figure 7 shows the biped torso height y_0 recovering from the impact due to the external disturbance to the biped walker, introduced by four impulses at 0.05, 0.3, 1.34, and 2.2 s with magnitudes of 5 and 6 Nm.
Figure 8 shows an 8-15-1 neural network model with y and \dot{y} as the inputs and the estimate ŷ_{d3} of the reference command signal y_{d3} as the output.
Fig. 7 Height of the biped torso without disturbance and with disturbance torques d_a(q, \dot{q}, \tau_{ext}) applied at t = 0.05, t = 0.3, t = 1.34, and t = 2.2 s
6 Conclusions
In this paper, we presented an MRAC technique to ensure that the actuators reliably
produce desired torques necessary for a walking robot. An observer was used to
predict an anticipated state of the desired torque, thus causing the adaptive actuators
to anticipate motion torques. We provided a proof to show that the MRAC scheme
results in a stable system in the sense of Lyapunov when errors between the desired
and actuator torques are significant. Also, the convergence analysis for tuning p, q,
and r is provided. Simulation results verify that the system is robust when tested
with various parameters and unknown coefficients. This paper also investigates and
presents a neural network estimation of the human gait reference inputs. In the
future, we plan to implement the MRAC and neural network schemes to control a
real-time biped walker with transport delay.
References
Chapter 18
Modeling, Simulation, and Analysis for Battery Electric Vehicles

Abstract Steady state and dynamic vehicle models are derived for analyses of
requirements on motors and batteries for battery electric vehicles. Vehicle level
performance requirements such as driving range, maximum cruise speed, maximum
gradeability, and maximum acceleration are used to analyze the requirements on
motors and batteries including motor power and torque, battery weight and specific
energy. MATLAB simulation tools are developed to allow validation of these
vehicle level performance requirements for a given set of motor/battery.
1 Introduction
Even though the internal combustion engine (ICE) is currently still the dominant
power source for automobiles, the cost of fuel and more stringent government
regulations on greenhouse gas emissions have led to more active interest in hybrid
and electric vehicles. Hybrid vehicles have better gas mileage than ICE-powered
vehicles. But they still have greenhouse gas emissions and the dual power sources
make them more complex and expensive. Battery electric vehicles (BEV) have
zero emission with a single power source that makes their design, control, and
maintenance relatively simple compared to hybrid vehicles. In addition, the
wide use of BEVs will reduce dependence on foreign oil, lower the cost per mile
of driving, and can potentially reduce the cost of electricity by using the vehicle-
to-grid power capability.
W. Zhan (*)
Department of Engineering Technology and Industrial Distribution, Texas A&M University,
College Station, TX 77843-3367, USA
e-mail: [email protected]
The main limitations on BEVs lie in the battery technology. The low energy and
power densities of batteries compared to hydrocarbon fuels significantly reduce the
driving range and other vehicle level performances. Initial cost is another factor that
slows the commercialization of electric vehicles. However, the latest developments
in battery technologies are making BEVs more and more attractive. From lead-acid
batteries to nickel metal-hydride (NiMH) batteries, lithium-ion cell technology [1],
and the latest nano-technology based batteries [2], the energy and power densities
have improved drastically.
The U.S. government is investing heavily to support battery technologies and
infrastructure for electric vehicles. It has set a target of one million electric vehicles
on U.S. roads by 2012. Tax credits up to $7,500 for buyers of plug-in electric
vehicles are offered by the U.S. government. The private sector is also investing
billions of dollars in electric vehicles. GM and Ford are both planning to roll out
plug-in electric vehicles in 2010. All these developments point to a trend toward
electric vehicles in the auto industry.
Not only are an increasing number of new BEVs being manufactured, there is
also significant interest in converting existing ICE-powered vehicles to electric
power. A Google search for “conversion to EV” results in millions of websites
and books, many of them providing Do it Yourself kits with focus on removal
and addition of components [3]. There is increasing interest in academia in the
development of BEVs; for example, an undergraduate senior design project devel-
oped an electric vehicle conversion [4]. However, many of these efforts lack
consideration of detailed system design requirements. Most of the components
for conversions are selected to provide similar output to that of the ICE or
by using the vehicle weight to determine the energy and power requirements. The
conversion to electric propulsion is a complex process and requires analysis that can
be very different from that of an ICE-powered vehicle [5]. Many performance
objectives impose conflicting demands. If not designed carefully, the resulting
electric vehicle can have many problems such as driving range shorter than
expected, battery and motor lacking enough power for desired acceleration, and
safety-related design problems.
In this paper, first principle models are derived for electric vehicles and used to
establish quantitative design requirements. Software tools are developed in
MATLAB [6, 7] to allow users to quickly determine expected vehicle level
performances for a given set of motors and batteries. They can also be used to
conduct trade-off studies for many design parameters.
One of the system-level requirements for a BEV is the driving range at constant speed. This requirement can be used to derive the motor power and the battery power and energy requirements. Since the vehicle is assumed to be moving at a constant speed, steady state analysis can be used to study this problem.
The gravity force is decomposed into two components, one in the direction of travel and the other in the direction perpendicular to the surface. In order to move the vehicle up the inclined surface, the vehicle must overcome the gravity force component in the direction of travel. This is given by

W_x = W \sin(\theta)    (1)

where W is the gravity force, \theta is the angle of the inclined surface, and W_x is the component of the gravity force in the direction of travel.
The drag is a function of speed for any given vehicle. At low speed the drag force is negligible. At high speed, the drag becomes a significant factor. For simplicity, a semi-empirical model is used here [9]:

D_A = \frac{1}{2}\,\rho\,V^2\,C_D\,A    (2)

where V is the vehicle speed (ft/s), A is the frontal area of the vehicle (ft²), C_D is the aerodynamic drag coefficient, D_A is the aerodynamic drag (lb), and \rho is the air density.
Rolling resistance of the tires is a major vehicle resistance force. It is the dominant motion resistance force at low speed (<50 mph). The rolling resistance can be modeled as the load on the tires multiplied by the coefficient of rolling resistance f_r:

R_x = f_r W    (3)
Based on the above analysis, the power required to drive the vehicle at a given
speed V (mph) is given by the total road load forces multiplied by the vehicle
speed, i.e.,
HP = 0.00267\,(D_A + R_x + W_x)\,V    (4)

where W_x can be calculated using (1), D_A is given by (2), R_x can be calculated using (3) with f_r = 0.015, and 0.00267 is the conversion factor to horsepower, HP. To calculate these quantities, we need the following inputs (a numerical sketch of the calculation follows the list):
– Vehicle speed (mph)
– Vehicle weight (including trailer if there is one) (lb)
– Frontal area of the vehicle (including trailer if there is one) (ft²)
– Aerodynamic drag coefficient (including trailer if there is one)
– Coefficient of rolling resistance
– Surface incline angle (degree)
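The following Python sketch evaluates (1)–(4) for one operating point (illustrative only; the air density and the drag coefficient used in the example call are assumed values, not taken from the chapter):

```python
import math

def road_load_power_hp(v_mph, w_lb, area_ft2, cd, fr=0.015, grade_deg=0.0, rho=0.00238):
    """Steady-state road-load power, Eqs. (1)-(4). rho is air density in slug/ft^3 (assumed)."""
    v_fps = v_mph * 5280.0 / 3600.0                 # mph -> ft/s
    wx = w_lb * math.sin(math.radians(grade_deg))   # grade force, Eq. (1)
    da = 0.5 * rho * v_fps**2 * cd * area_ft2       # aerodynamic drag, Eq. (2)
    rx = fr * w_lb                                  # rolling resistance, Eq. (3)
    return 0.00267 * (da + rx + wx) * v_mph         # Eq. (4), result in HP

# example: 40 mph on level ground, 4,000 lb vehicle, 34 ft^2 frontal area, Cd = 0.4 (assumed)
print(road_load_power_hp(v_mph=40.0, w_lb=4000.0, area_ft2=34.0, cd=0.4))
```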
Energy is power integrated over time. If the total distance traveled is long enough, the initial acceleration and final deceleration have a negligible effect on the total energy calculation. Also, since this is a steady state analysis, the aerodynamic drag is constant. Noting that W_x = W \sin(\theta) and V = dx/dt, it follows that

\int W_x\,\frac{dx}{dt}\,dt = \int W \sin(\theta)\,\frac{dx}{dt}\,dt = \int W \sin(\theta)\,dx = W\,\Delta h
where \Delta h is the change in elevation between the starting and ending points. Thus, the energy required to move a vehicle for a distance of d (miles) at a speed V (mph) with a change in elevation of \Delta h (miles) is given by

E\,(kWh) = 0.00267\,[(D_A + R_x)\,d + W\,\Delta h] \cdot 0.746\,(kW/HP) = 0.002\,[(D_A + R_x)\,d + W\,\Delta h]    (5)

Define \theta^* as the average slope, i.e., \sin\theta^* = \Delta h/d, and W_x^* = W \sin\theta^*, so that

E\,(kWh) = 0.002\,(D_A + R_x + W_x^*)\,d    (6)
If the speed is not constant, Eqs. (5) and (6) do not apply and the power consumed to overcome drag must be evaluated as an integral.
The energy calculation in (6) can be converted to MJ (1 kWh = 3.6 MJ):

E\,(MJ) = 0.0072\,(D_A + R_x + W_x^*)\,d    (7)
The total energy required for driving a vehicle at constant speed over a given range can be used to derive the requirement on the battery specific energy \Delta_{se} (MJ/kg). Let the battery weight be W_b (lb) and the battery/motor delivery efficiency be \eta. The total available energy E_T (MJ) from the battery/electric motor is then

E_T = 0.4536\,\eta\,\Delta_{se}\,W_b    (8)

where 0.4536 converts the battery weight from lb to kg. This amount must be greater than or equal to the total energy required as given in (7), i.e.,

0.4536\,\eta\,\Delta_{se}\,W_b \ge 0.0072\,(D_A + R_x + W_x^*)\,d    (9)

From this, one can solve for the \Delta_{se} required to travel a given distance:

\Delta_{se} \ge 0.0158\,\frac{D_A + R_x + W_x^*}{\eta\,W_b}\,d    (10)
Alternatively, we can calculate the maximum distance d_{max} the vehicle can travel when the specific energy is given:

d_{max} = 63.29\,\eta\,W_b\,\frac{\Delta_{se}}{D_A + R_x + W_x^*}    (11)
Note that the battery weight W_b is part of the vehicle weight W, which is used in the calculation of R_x and W_x^*. Denoting the vehicle weight without the battery by W_0 (lb), we have

W = W_0 + W_b    (12)
Since the right-hand side is positive, and the battery weight must be positive, we must have

d < \frac{63.29\,\eta\,\Delta_{se}}{\sin\theta^* + f_r}    (14)
The right-hand side of (14) provides a theoretical upper bound for the distance a vehicle can travel with infinite battery weight (energy), regardless of the speed, vehicle weight, and aerodynamic drag.
Under the assumption that (14) holds, one can determine the weight of a battery needed to travel a distance d:

W_b \ge \frac{D_A + (\sin\theta^* + f_r)\,W_0}{63.29\,\eta\,\Delta_{se} - d\,(\sin\theta^* + f_r)}\;d    (15)
One can conclude that, as long as (14) holds, a sufficiently heavy battery will always enable the vehicle to travel a given distance d. In practice, there are other constraints such as the volumetric limitation of the battery and the vehicle load capacity.
Using (12) in (10) yields

\Delta_{se} \ge \frac{0.0158}{\eta}\left[\frac{D_A}{W_b} + (f_r + \sin\theta^*)\left(1 + \frac{W_0}{W_b}\right)\right] d    (16)
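The constraint (15) can be tabulated directly. The Python sketch below (hypothetical operating point; efficiency η = 0.75 as assumed for Fig. 2) computes the minimum battery weight over a range of specific energies, which is how a curve like the one in Fig. 2 can be generated:

```python
def min_battery_weight_lb(d_miles, delta_se, da_lb, w0_lb, fr=0.015, sin_theta=0.0, eta=0.75):
    """Minimum battery weight from Eq. (15); returns None if the range violates the bound (14)."""
    denom = 63.29 * eta * delta_se - d_miles * (sin_theta + fr)
    if denom <= 0.0:
        return None                                   # requested range exceeds the bound of Eq. (14)
    return d_miles * (da_lb + (sin_theta + fr) * w0_lb) / denom

# minimum battery weight for a 100-mile range on level ground (assumed D_A = 56 lb at 40 mph)
for dse in (0.3, 0.6, 0.9, 1.2):                      # specific energy in MJ/kg
    print(dse, min_battery_weight_lb(d_miles=100.0, delta_se=dse, da_lb=56.0, w0_lb=4000.0))
```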
Fig. 2 Design constraint among specific energy, driving range, and battery weight for a given speed
The relationship between the minimum battery weight and the specific energy is plotted in Fig. 2 for different values of the driving range. The efficiency of the battery and motor is assumed to be 75% [10] and the speed is 40 mph. For a given driving range, the battery weight and specific energy must be chosen so that the design point lies above the curve corresponding to that driving range.
Similarly, one can plot the design constraints for any fixed driving range and various speeds.
The maximum cruise speed that the vehicle is required to maintain imposes
requirements on the power delivered by the motor and batteries. The example
below is for a cruise speed of 80 mph.
Using the following vehicle parameters in the steady state model: vehicle speed = 80 mph; vehicle weight (including motors, but without engine, transmission, and battery) = 4,000 lb; frontal area of the vehicle = 34 ft²; air temperature = 59 °F; atmospheric
Fig. 3 Motor power requirement (kW) versus maximum cruise speed (mph)
The maximum cruise speed scenario is used to determine the rated power for
batteries and motors. In other words, the battery and motor are required to provide
the power over a long period of time.
3 Dynamic Analysis
When the vehicle is driven over a short period of time or the time spent in
accelerating/decelerating is a significant portion of the total time, steady state
analysis is not adequate. Instead, dynamic analysis is needed. The dynamic model
developed in this section will be used to derive requirements on motor and battery
outputs.
The forces acting on the vehicle are illustrated in Fig. 4, where
– W is the gravity force.
– R_xf and R_xr are the front and rear rolling resistance forces, with R_xf + R_xr = R_x.
– W_f and W_r are the front and rear normal forces.
– D_A is the aerodynamic drag.
– L_A is the aerodynamic lift.
– F_x is the tractive force (rear wheel drive is assumed).
Newton's Second Law is applied in the direction of the vehicle movement and in the direction perpendicular to the road surface:
F_x - W \sin\theta - D_A - \frac{W}{g}\,a - R_x = 0    (17)

W_f + W_r + L_A - W \cos\theta = 0    (18)

L_A = \frac{1}{2}\,\rho\,V^2\,C_L\,A    (19)
Fig. 4 Vehicle dynamics model
The typical range [6] of values for the aerodynamic lift coefficient is C_L = 0.0–0.5. The lift force is applied at the center of the wheel base.
A moment equation about the contact point at the front wheels can also be written.
During acceleration, the motor, transmission, and wheel dynamics have significant
impact on the vehicle acceleration.
Newton's Second Law can be applied to the motor to get

T_m - T_t = I_m\,\alpha_m    (21)

where T_m is the total motor torque (Nm) (if two motors are used, the torque from each motor needs to be multiplied by 2), T_t is the torque input to the drive train (transmission and differential) (Nm), I_m is the total motor rotational inertia (kg m²), and \alpha_m is the motor angular acceleration (rad/s²).
The torque delivered at the output of the drive train is amplified by the gear ratio of the transmission times the gear ratio of the differential, but is decreased by the torque required to accelerate the gears and shafts. If the drive train inertia is characterized by its value on the input side, we have the following equation:

T_t - T_w/G = I_t\,\alpha_m    (22)

where G is the gear ratio of the combined transmission and differential, i.e., the ratio between the angular velocities of the motor shaft and the driven wheels, T_w is the torque at the output of the drive train, and I_t is the rotational inertia of the drive train.
Applying Newton's Second Law to the driven wheels, one gets

T_w - F_x\,r = I_w\,\alpha_w    (23)

where F_x is the total tractive force for the two driven wheels, r is the tire radius, I_w is the rotational inertia of the wheels (I_w includes the rotational inertia of everything downstream of the drive train), and \alpha_w is the angular acceleration (rad/s²) of the wheels.
By the definition of the gear ratio, we have

G\,\alpha_w = \alpha_m    (24)
Equations (21)–(24) can be combined to solve for the tractive force available at the ground. Recognizing that for power-limited operation the vehicle acceleration, a, is the wheel rotational acceleration, \alpha_w, times the tire radius, yields:

F_x = \frac{G}{r}\,T_m - \frac{(I_m + I_t)\,G^2 + I_w}{r^2}\,a    (25)

where r is the radius of the tire.
The effect of mechanical losses can be approximated by including an efficiency factor, \eta_t, in the first term on the right-hand side of (25); as a result we have

F_x = \eta_t\,\frac{G}{r}\,T_m - \frac{(I_m + I_t)\,G^2 + I_w}{r^2}\,a    (26)
In order to see more clearly the effect of the rotational inertia on the vehicle acceleration, (17) and (26) are combined to give
(M + M_r)\,a = \eta_t\,\frac{G}{r}\,T_m - W\sin\theta - D_A - R_x \qquad (27)
where M = W/g and Mr is the equivalent mass of the rotating components, given by
M_r = \frac{(I_m + I_t)G^2 + I_w}{r^2} \qquad (28)
In some cases, the tractive force calculated by (26) exceeds the maximum tractive force that the road surface and the tire can generate, which is determined by
F_x = \mu\,W_r \qquad (29)
where μ is the surface friction coefficient. For truck tires on dry asphalt, μ is approximately 1.
From (17), (20), and (29), we can solve for the maximum tractive force Fxmax:
F_{x\,\max} = \frac{\mu}{L - \mu h}\left[D_A h_a - (D_A + R_x)\,h - \frac{L}{2}\,L_A + W b\cos\theta\right] \qquad (30)
In the model, these two cases can be combined by calculating the minimum of
the two forces in (26) and (30).
The requirement on 0–60 mph time determines the maximum outputs from the
batteries and the motors. A 10 s 0–60 mph time is used as a vehicle acceleration
requirement. During the 10 s while the vehicle accelerates to 60 mph from a
standing start, the maximum power generated by the motor and battery can be
significantly higher than the rated values.
Figure 5 shows the characteristic of a typical AC induction motor. This can be
used together with the vehicle dynamics model to derive the maximum power
requirements on the motor and battery. The combined model can be used
to determine the 0–60 mph time. Based on the simulation result, one can determine whether a specific motor/battery combination meets the 0–60 mph time requirement.
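The combined model described above lends itself to a simple time-stepping implementation. The following Python sketch integrates the longitudinal dynamics of (27) on level ground, limiting the tractive force to the smaller of the power-limited value from (26) and the traction-limited value from (29); the motor curve (constant torque up to a base speed, then constant power), the vehicle weight, the drag and rolling-resistance parameters, and the static 45% rear-axle load (55% on the front per the parameter list) are illustrative assumptions, not the exact data of this chapter.

```python
import math

# Assumed illustrative parameters (not the chapter's exact values)
W = 3000 * 4.448          # vehicle weight, N (assumed 3000 lbf)
g = 9.81
M = W / g                 # vehicle mass, kg
G = 6.0                   # combined transmission/differential gear ratio
r = 13.82 * 0.0254        # loaded wheel radius, m
eta_t = 0.96              # drive train efficiency
mu = 0.95                 # tire-road friction coefficient
f_r = 0.015               # assumed rolling resistance coefficient
rho, Cd, A = 1.2, 0.4, 2.2            # assumed air density, drag coefficient, frontal area
Im, It, Iw = 0.36, 0.045, 2 * 1.69    # rotational inertias, kg m^2
Mr = ((Im + It) * G**2 + Iw) / r**2   # equivalent rotating mass, Eq. (28)

def motor_torque(speed_rpm):
    """Assumed AC induction motor curve: constant torque, then constant power."""
    T_max, base_rpm = 150.0, 3000.0
    if speed_rpm <= base_rpm:
        return T_max
    return T_max * base_rpm / speed_rpm

def tractive_force(v):
    """Available tractive force: min of the power limit (26) and traction limit (29)."""
    speed_rpm = v / r * G * 60.0 / (2.0 * math.pi)
    Fx_power = eta_t * G / r * motor_torque(speed_rpm)   # first term of Eq. (26)
    Fx_traction = mu * 0.45 * W                          # Eq. (29), static rear-axle load
    return min(Fx_power, Fx_traction)

# Explicit Euler integration of Eq. (27) with theta = 0
v, t, dt = 0.0, 0.0, 0.01
target = 60.0 * 0.44704          # 60 mph in m/s
while v < target and t < 30.0:
    DA = 0.5 * rho * Cd * A * v**2   # aerodynamic drag
    Rx = f_r * W                     # rolling resistance
    a = (tractive_force(v) - DA - Rx) / (M + Mr)
    v += a * dt
    t += dt

print(f"estimated 0-60 mph time: {t:.1f} s")
```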
In addition to the parameters used in the derivation of power requirements for
maximum speed, the following parameters are used:
Gear ratio = 6; motor inertia = 0.36 kg m² = 3.18 lb in s²; rotational inertia of the drive train = 0.045 kg m² = 0.40 lb in s²; rotational inertia of each driven wheel = 1.69 kg m² = 14.92 lb in s²; drive train efficiency = 96%; wheel base = 126 in; CG height = 3.3 ft; height of aerodynamic drag = 3.5 ft; radius of loaded wheel = 13.82 in; weight percentage on front wheels = 55%; motor torque and motor efficiency curves as given in Fig. 5; battery power limit = 140 kW; surface μ = 0.95,
Fig. 5 Motor torque versus motor speed (rpm) for a typical AC induction motor
where the gear ratio is the ratio between the angular velocities of the motor and the
driven wheels.
The simulation result in Fig. 6 shows that the vehicle can accelerate from 0 to 60
mph in 9.5 s.
By varying the battery weight and repeating the simulation, the 0–60 mph time
as a function of the battery weight is plotted in Fig. 7. It can be seen that the battery
weight has a significant impact on the 0–60 mph time.
Maximum gradeability is defined as the largest surface incline that a vehicle can
overcome. It is an important vehicle level performance requirement. Lower level
requirements on motor torque and battery power can be developed based on the
maximum gradeability requirement.
In the analysis of maximum gradeability, the following conditions hold:
a \approx 0, \qquad D_A = 0, \qquad L_A = 0
Fig. 6 Vehicle speed (mph) versus time (s)
Fig. 7 0–60 mph time (s) versus battery weight (lbs)
There are two cases one must consider: traction limited and power limited.
When the vehicle is traction limited, from (17), (20), and (29) we have
F_x = \frac{\mu}{L}\,W\,(h\sin\theta + b\cos\theta) \ \ge\ W\sin\theta + R_x \qquad (31)
From the above inequality, we can solve for the maximum surface incline when the vehicle is traction limited:
\theta \ \le\ \tan^{-1}\!\left(\frac{\mu b - L f_r}{L - \mu h}\right) \qquad (32)
where f_r is the rolling resistance coefficient (R_x = f_r W\cos\theta).
When the vehicle is power limited, from (17) and (26) we have
\frac{G}{r}\,T_m \ \ge\ W\sin\theta + R_x \qquad (33)
Note that η_t = 1 is assumed for the maximum gradeability analysis. Combining (32) and (33), one can plot the maximum grade as a function of motor torque.
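A minimal sketch of this calculation is given below: for each candidate motor torque it evaluates the power-limited grade from (33) and caps it with the traction-limited grade from (32). The weight, geometry, and rolling-resistance values are assumptions for illustration; b is taken as the distance from the front axle to the CG implied by a 55% front weight split.

```python
import math

# Assumed illustrative parameters
W = 3000 * 4.448       # vehicle weight, N (assumed 3000 lbf)
L = 126 * 0.0254       # wheel base, m
h = 3.3 * 0.3048       # CG height, m
b = 0.45 * L           # distance from front axle to CG (55% of weight on the front axle)
mu = 0.95              # surface friction coefficient
f_r = 0.015            # assumed rolling resistance coefficient
G, r = 6.0, 13.82 * 0.0254   # gear ratio and wheel radius

# Traction-limited maximum grade, Eq. (32) -- independent of motor torque
theta_traction = math.atan((mu * b - L * f_r) / (L - mu * h))

def power_limited_grade(Tm):
    """Largest theta satisfying (G/r) Tm >= W sin(theta) + f_r W cos(theta), Eq. (33)."""
    lhs = G / r * Tm
    theta, step = 0.0, math.radians(0.01)
    while (theta + step < math.pi / 2 and
           lhs >= W * math.sin(theta + step) + f_r * W * math.cos(theta + step)):
        theta += step
    return theta

for Tm in (50, 100, 150, 200):   # motor torque, N m
    theta_max = min(theta_traction, power_limited_grade(Tm))
    print(f"Tm = {Tm:3d} N m -> max grade = {math.degrees(theta_max):5.1f} deg "
          f"= {100 * math.tan(theta_max):5.1f} %")
```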
4 Conclusion
Several design requirements for electric vehicles are discussed in this paper. System-level design requirements are used to derive requirements on motor power and torque, and on battery power, weight, and specific energy, based on simulation tools.
References
1. G.A. Nazri, G. Pistoia, Lithium Batteries: Science and Technology (Kluwer/Plenum, Boston,
2004)
2. C.K. Chan, H. Peng, G. Liu, K. McIlwrath, X.F. Zhang, R.A. Huggins, Y. Cui,
High-performance lithium battery anodes using silicon nanowires. Nat. Nanotechnol. V(3)
31–35 (2008)
3. T. Lucas, F. Riess, How to Convert to an Electric Car (Crown Publishers, New York, 1980)
4. J. Dave, J. Dong, Conversion of an existing car to a rechargeable electric vehicle, ASEE
Annual Conference, 2009
5. I. Husain, Electric and Hybrid Vehicles (CRC Press, Boca Raton, FL, 2003)
6. R. Pratap, Getting Started with MATLAB 7: A Quick Introduction for Scientists and Engineers
(Oxford University Press, New York, 2006)
7. The MathWorks, Inc. MATLAB®7 Getting Started Guide (2008)
8. M. Ehsani, Y. Gao, S.E. Gay, A. Emadi, Modern Electric, Hybrid Electric, and Fuel Cell
Vehicles (CRC Press LLC, London, 2005)
9. T.D. Gillespie, Fundamentals of Vehicle Dynamics (Society of Automotive Engineers, Inc.,
Warrendale, PA, 1992)
10. Formula Hybrid Rules, SAE International, August 26, 2008. Available at http://www.formula-
hybrid.org/pdf/Formula-Hybrid-2009-Rules.pdf. Last accessed on 29 Dec 2009
11. W. Zhan, M. McDermott, M. Hasan, EVSim Software Design (Aug 2009)
12. W. Gao, E. Solodovnik, R. Dougal, Symbolically-aided model development for an induction
machine in virtual test bed. IEEE Trans. Energy Conver. 19(1) 125–135 (March 2004)
Chapter 19
Modeling Confined Jets with Particles and Swirl*
1 Overview
*The support of this work by U.S. DOE Existing Plants Emissions and Capture program is
gratefully acknowledged. The first author was supported in part by an appointment to the U.S.
Department of Energy (DOE) Postgraduate Research Program at the National Energy Technology
Laboratory administered by the Oak Ridge Institute for Science and Education.
O.A. Marzouk (*)
United States Department of Energy, National Energy Technology Laboratory, 3610 Collins Ferry
Road, Morgantown, WV 26507-0880, USA
e-mail: [email protected]
are one of many sub-models which are required to calculate the behavior of these
gas–solid flow systems.
The particle-laden swirling flow experiment of Sommerfeld and Qiu [2] was
selected as a test-case to assess the performance of three implementations of
the k–ε turbulence model for gas–solid flows. Previous numerical investigations
of this experiment include, Euler–Lagrange (EL)/Reynolds-averaged Navier–
Stokes (RANS-steady) [3], EL/large eddy simulations (LES) [4], Euler–Euler
(EE)/RANS-unsteady [5], and EE/LES [6]. The extensive experimental measure-
ments make this experiment a good test-case for gas–solid CFD.
A schematic of the experiment is shown in Fig. 1. The coaxial flow consists of
a primary central jet, laden with particles at a loading of 0.034 kg-particles/kg-air
and an annular secondary jet with a swirl number of 0.47 based on the inlet velocity.
Coaxial combustors have a similar configuration to this system. Generating a
swirling flow is an approach used to stabilize combustion and maintain a steady
flame [7]. Swirl entrains and recirculates a portion of the hot combustion products.
It also enhances the mixing of air and fuel. The inlet swirl number was calculated as the ratio of the axial flux of angular momentum to the axial flux of linear momentum:
S = \frac{2\int_0^{R_{sec}} \rho\,U_y U_x\,r^2\,dr}{D_{cyl}\int_0^{R_{sec}} \rho\,U_x^2\,r\,dr} \qquad (1)
where U_x and U_y are the axial and tangential (swirl) velocities, R_sec = 32 mm is the outer radius of the swirling secondary jet, and D_cyl = 197 mm is the inner diameter of the pipe into which the jets enter. The inlet Reynolds number (Re) is approximately 52,400 based on the outer diameter of the secondary jet; thus
Re = \frac{\rho\,\hat{U}_x\,(2 R_{sec})}{\mu} \qquad (2)
where ρ is the density, μ is the dynamic viscosity, and Û_x is an equivalent axial velocity that accounts for the total volume flow rate from both the primary and secondary jets. The particles were small spherical glass beads with a density of 2,500 kg/m³. The beads were injected according to a log-normal distribution with a mean number diameter of 45 μm.
The continuity and momentum equations for the resolved gas-phase fields are
expressed and solved in Cartesian coordinates as
\frac{\partial \rho}{\partial t} + \frac{\partial(\rho U_j)}{\partial x_j} = 0 \qquad (3)
where, in the accompanying momentum equation, U_i is the velocity vector, p is the pressure, σ_ij and τ_ij are the viscous and Reynolds (or turbulent) stress tensors, g_i is the gravitational vector (here only g_1 = 9.81 m/s²), and S_p is a source term accounting for the momentum exchange with the dispersed phase. As for Newtonian fluids, σ_ij is calculated as
\sigma_{ij} = 2\mu\,S^{dev}_{ij}
where S^{dev}_{ij} is the deviatoric (traceless) part of the strain-rate tensor S_ij:
S_{ij} = \frac{1}{2}\left(\frac{\partial U_i}{\partial x_j} + \frac{\partial U_j}{\partial x_i}\right)
S^{dev}_{ij} = \frac{1}{2}\left(\frac{\partial U_i}{\partial x_j} + \frac{\partial U_j}{\partial x_i}\right) - \frac{1}{3}\,\frac{\partial U_k}{\partial x_k}\,\delta_{ij}
The tensor τ_ij is not resolved directly. Instead, its effects are approximated using the gradient transport hypothesis
\tau_{ij} = 2\mu_t\,S^{dev}_{ij} - \frac{2}{3}\,\rho k\,\delta_{ij} \qquad (5)
where mt is the turbulent (or eddy) viscosity. The spherical tensor on the right-hand
side is not considered here [3, 8]. Different eddy-viscosity turbulence models propose
different strategies to calculate μ_t. In the case of k–ε models, μ_t is calculated as
\mu_t = C_\mu\,\rho\,\frac{k^2}{\varepsilon} \qquad (6)
where k is the turbulent kinetic energy per unit mass and ε is its dissipation rate.
They are calculated by solving two coupled transport equations. The forms of these
equations vary depending on the model implementation. We consider here three
implementations, which are described in the following subsections.
The standard k–ε model refers to the Jones–Launder form [9], without wall
damping functions, and with the empirical constants given by Launder and Sharma
[10]. The k and ε equations are
\frac{\partial(\rho k)}{\partial t} + \frac{\partial(\rho U_j k)}{\partial x_j} = \frac{\partial}{\partial x_j}\left[\left(\mu + \frac{\mu_t}{\sigma_k}\right)\frac{\partial k}{\partial x_j}\right] + P - \rho\varepsilon \qquad (7)
\frac{\partial(\rho\varepsilon)}{\partial t} + \frac{\partial(\rho U_j \varepsilon)}{\partial x_j} = \frac{\partial}{\partial x_j}\left[\left(\mu + \frac{\mu_t}{\sigma_\varepsilon}\right)\frac{\partial \varepsilon}{\partial x_j}\right] + \frac{\varepsilon}{k}\left(C_{\varepsilon 1} G - C_{\varepsilon 2}\,\rho\varepsilon\right) - \left(\frac{2}{3}C_{\varepsilon 1} + C_{\varepsilon 3}\right)\rho\,\varepsilon\,\frac{\partial U_k}{\partial x_k} \qquad (8)
where P is the production rate of kinetic energy (per unit volume) due to the gradients in the resolved velocity field,
P = \tau_{ij}\,\frac{\partial U_i}{\partial x_j}
which is evaluated as
P = G - \frac{2}{3}\,\rho k\,\frac{\partial U_k}{\partial x_k}
with
G = 2\mu_t\,S^{dev}_{ij}\,\frac{\partial U_i}{\partial x_j} = 2\mu_t\left[S_{ij}S_{ij} - \frac{1}{3}\left(\frac{\partial U_k}{\partial x_k}\right)^{2}\right]
The addition of C_{ε3} in the last term on the right-hand side of (8) is not included in the standard model; it was proposed [8, 11] for compressible turbulence. However, we will refer to this implementation as the standard model. The model constants are the empirical values of Launder and Sharma [10].
The RNG model was developed [12, 13] using techniques from renormalization group theory with scale expansions for the Reynolds stress. The k and ε equations have the same form as (7) and (8), but the constants have different values. In addition, the constant C_{ε1} is replaced by C*_{ε1}, which is no longer a constant but is determined from an auxiliary function as
C^*_{\varepsilon 1} = C_{\varepsilon 1} - \frac{\eta\,(1 - \eta/\eta_0)}{1 + \beta\,\eta^3}
where
\eta = \frac{k}{\varepsilon}\,\sqrt{2\,S_{ij}S_{ij}}
is the expansion parameter (the ratio of the turbulent time scale to the mean-strain time scale). The model constants take the standard RNG values [12, 13].
The realizable k–ε model was formulated [14] such that the calculated normal (diagonal) Reynolds stresses are positive definite and the shear (off-diagonal) Reynolds stresses satisfy the Schwarz inequality. Similar to the RNG model, the form of the k equation is the same as in (7). In addition to altering the model constants, the two main modifications are replacing the constant C_μ used in calculating the eddy viscosity in (6) by a function, and changing the right-hand side (the production and destruction terms) of the ε equation. The last term in (8) is dropped. With this, the ε equation becomes
\frac{\partial(\rho\varepsilon)}{\partial t} + \frac{\partial(\rho U_j\varepsilon)}{\partial x_j} = \frac{\partial}{\partial x_j}\left[\left(\mu + \frac{\mu_t}{\sigma_\varepsilon}\right)\frac{\partial\varepsilon}{\partial x_j}\right] + C_1\,\rho\,\varepsilon\,S - C_{\varepsilon 2}\,\rho\,\frac{\varepsilon^2}{k + \sqrt{(\mu/\rho)\,\varepsilon}} \qquad (9)
where
C_1 = \max\left(0.43,\ \frac{\eta}{\eta + 5}\right), \qquad \eta = S\,\frac{k}{\varepsilon}, \qquad C_\mu = \frac{1}{A_0 + A_S\,(U^* k/\varepsilon)}
U^* = \sqrt{S_{ij}S_{ij} + \Omega_{ij}\Omega_{ij}}, \qquad \Omega_{ij} = \frac{1}{2}\left(\frac{\partial U_i}{\partial x_j} - \frac{\partial U_j}{\partial x_i}\right)
A_S = \sqrt{6}\,\cos\phi, \qquad \phi = \frac{1}{3}\arccos\left(\sqrt{6}\,W\right)
W = \min\left[\max\left(\frac{2\sqrt{2}\,S_{ij}S_{jk}S_{ik}}{S^3},\ -\frac{1}{\sqrt{6}}\right),\ \frac{1}{\sqrt{6}}\right]
3 Dispersed Phase
\frac{dx}{dt} = u \qquad (10a)
m\,\frac{du}{dt} = f \qquad (10b)
where m is the mass of the particle, u is the particle velocity, and f is the force acting
on the particle. In this study; the drag, gravity, and buoyancy are considered, thus
the force f has the following form [15]:
f = -\frac{\pi d^2}{8}\,\rho\,C_D\,|u - U|\,(u - U) + m\,g - \rho\,\frac{\pi d^3}{6}\,g \qquad (11)
where the fluid velocity entering (11) is the sum of its resolved and fluctuating parts,
U \leftarrow U + U' \qquad (12)
The vector U is the resolved velocity of the fluid (interpolated at the particle
location) which is calculated after solving the governing equations of the flow,
coupled with the turbulence model. The fluctuating velocity, U0 , is estimated using
the discrete random walk algorithm [16, 17]. In this algorithm, uncorrelated eddies
are generated randomly, but the particle trajectory is deterministic within an eddy.
The fluctuating velocity affects the particle over an interaction time, Tinterac, which
is the shortest of the eddy life time (Lagrangian integral time scale of turbulence),
Teddy, and the residence or transit time, Tresid. The latter is the time needed by the
particle to traverse the eddy. These characteristic times are calculated as
T_{eddy} = \frac{k}{\varepsilon} \qquad (13a)
T_{resid} = C_{resid}\,\frac{k^{3/2}}{\varepsilon\,|u - U - U'|} \qquad (13b)
In (13b), U' is lagged from the previous time step, and C_resid = 0.09^{3/4} = 0.16432. The turbulence information, and thus the characteristic times in (13), are updated every time step to account for the fact that the turbulence encountered by a particle along its trajectory is not homogeneous.
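A minimal sketch of one step of this eddy-interaction procedure is given below. Sampling each component of U' from a Gaussian with variance 2k/3 is a common isotropic choice and is an assumption of this sketch; the chapter does not state its sampling rule.

```python
import math
import random

def drw_step(k, eps, u_rel_prev, C_resid=0.09**0.75, rng=random.Random(0)):
    """One discrete-random-walk step: sample a fluctuating velocity U' and return
    the interaction time T_interac = min(T_eddy, T_resid) from Eqs. (13a)-(13b).

    u_rel_prev is |u - U - U'| lagged from the previous time step, as in the text.
    """
    sigma = math.sqrt(2.0 * k / 3.0)                 # assumed rms of each U' component
    u_prime = [rng.gauss(0.0, sigma) for _ in range(3)]
    T_eddy = k / eps                                 # eddy lifetime, Eq. (13a)
    T_resid = C_resid * k**1.5 / (eps * max(u_rel_prev, 1e-12))   # transit time, Eq. (13b)
    return u_prime, min(T_eddy, T_resid)

u_prime, T_interac = drw_step(k=0.211, eps=0.796, u_rel_prev=2.0)
print(u_prime, T_interac)
```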
The drag coefficient for the spherical particles is determined from the following two-region formula [11], which is very similar to the Schiller–Naumann [18] expression for Re_d ≤ 1,000 and uses a constant Newton drag coefficient for Re_d > 1,000:
C_D = \begin{cases} \dfrac{24}{Re_d}\left(1 + \dfrac{1}{6}\,Re_d^{2/3}\right), & Re_d \le 1000 \\[6pt] 0.424, & Re_d > 1000 \end{cases} \qquad (14)
Re_d = \frac{\rho\,|u - U|\,d}{\mu} \qquad (15)
Combining (10) and (11), and using m = ρ_p π d³/6 (ρ_p is the particle density), the particle equations of motion become
\frac{dx}{dt} = u \qquad (16a)
\frac{du}{dt} = -\frac{u - U}{\tau} + \left(1 - \frac{\rho}{\rho_p}\right) g \qquad (16b)
where
\tau = \frac{4}{3}\,\frac{\rho_p\,d}{\rho\,C_D\,|u - U|} = \frac{\rho_p d^2}{18\,\mu}\,\frac{24}{Re_d\,C_D}
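These relations translate directly into a per-particle update. The sketch below applies (14)–(16) with an explicit Euler step; the air properties, the local gas velocity, and the time step are illustrative assumptions (the gravity vector follows the g_1-only convention stated earlier).

```python
import numpy as np

def drag_coefficient(Re_d):
    """Two-region drag law of Eq. (14)."""
    if Re_d <= 1000.0:
        return 24.0 / max(Re_d, 1e-12) * (1.0 + Re_d**(2.0 / 3.0) / 6.0)
    return 0.424

def advance_particle(x, u, U_fluid, dt, d=45e-6, rho_p=2500.0, rho=1.2, mu=1.8e-5):
    """Explicit Euler step of Eqs. (16a)-(16b); rho and mu are assumed air properties."""
    g = np.array([9.81, 0.0, 0.0])                       # only g_1 is nonzero
    u_rel = u - U_fluid
    Re_d = rho * np.linalg.norm(u_rel) * d / mu          # Eq. (15)
    Cd = drag_coefficient(Re_d)
    tau = 4.0 * rho_p * d / (3.0 * rho * Cd * max(np.linalg.norm(u_rel), 1e-12))
    u_new = u + dt * (-(u - U_fluid) / tau + (1.0 - rho / rho_p) * g)   # Eq. (16b)
    x_new = x + dt * u_new                                              # Eq. (16a)
    return x_new, u_new

x = np.zeros(3)
u = np.array([12.5, 0.0, 0.0])         # injection velocity of the primary jet
U_fluid = np.array([10.0, 0.0, 0.0])   # illustrative local gas velocity
for _ in range(100):
    x, u = advance_particle(x, u, U_fluid, dt=1e-4)
print(x, u)
```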
4 Simulation Settings
The problem is treated as axisymmetric (although the results are mirrored in some figures for better visualization). The domain starts at the expansion location, x = 0 in Fig. 1, and extends to x = 1.0 m. The domain is a 3D wedge (full opening angle 5°) spanning 1.0 m axially and 0.097 m radially, with 240 and 182 mesh points in the axial and radial directions, respectively. The mesh is nonuniform both axially and
radially, with finer resolution near walls and between the two jets. The mesh has
40,080 cells. The inlet condition for the velocity in the primary (inner) jet is
specified in terms of the mass flow rate (9.9g/s). For the secondary (outer) jet, the
inlet velocity is specified using the experimental velocity profile. A zero-gradient
condition is applied to the pressure at the inlet. The specific turbulent kinetic
energy, k, is set to 0.211 m²/s² and 0.567 m²/s² in the primary and secondary jets, respectively, and the dissipation rate, ε, is set to 0.796 m²/s³ and 3.51 m²/s³ in the primary and secondary jets, respectively. The inlet k was estimated assuming 3% turbulence intensity (the experimental value was not specified, but 3% is a reasonable medium-turbulence level [20]), and the inlet ε was then estimated from [21]
\varepsilon = C_\mu^{3/4}\,\frac{k^{1.5}}{l} \qquad (17)
where the standard value 0.09 is used for C_μ, and l is the turbulence length scale,
which is approximated as 10% of the pipe diameter (l = 0.02 m). At the outlet,
zero-gradient conditions are applied for all variables except the pressure, where a
constant value of 10^5 N/m² is imposed. At the walls, the wall-function treatment is
used for the turbulence, and a zero-gradient condition is used for the pressure.
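The inlet turbulence values quoted above can be reproduced with a few lines. The estimate k = 1.5 (I U)² for the turbulent kinetic energy is a standard isotropic assumption (the chapter states only the 3% intensity); the primary-jet velocity of 12.5 m/s is taken from the injection velocity given below, and the secondary-jet k is reused as stated.

```python
# Inlet turbulence estimates: k from the intensity, eps from Eq. (17)
C_mu = 0.09      # standard value used for C_mu
I = 0.03         # 3% turbulence intensity
l = 0.02         # turbulence length scale, m (10% of the pipe diameter)

U_primary = 12.5                      # nominal axial inlet velocity of the primary jet, m/s
k_primary = 1.5 * (I * U_primary)**2  # assumed isotropic estimate
eps_primary = C_mu**0.75 * k_primary**1.5 / l
print(f"primary jet:   k = {k_primary:.3f} m2/s2, eps = {eps_primary:.3f} m2/s3")

k_secondary = 0.567                   # value quoted above, m2/s2
eps_secondary = C_mu**0.75 * k_secondary**1.5 / l
print(f"secondary jet: eps = {eps_secondary:.2f} m2/s3")
```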
The PISO (pressure implicit splitting of operators) scheme is used to solve the
governing flow equations. A variable time step is adjusted dynamically to limit the
maximum convective CFL to 0.3. The backward Euler scheme is used for the time
integration of the flow equations. Upwind differencing is used for the convective
terms. Linear (second-order central difference) interpolation is used to find the mass
fluxes at the face centers of the cells from the cell-center values, and is also used for
the diffusion terms.
The particle mass flow rate is 0.34 g/s, which corresponds to 0.00472 g/s for our 5° wedge. The particles are injected at a velocity of 12.5 m/s, which is the
nominal axial inlet velocity in the primary jet.
We used version 1.5 of the finite volume open source code OpenFOAM [22, 23]
to perform all the simulations presented here. Similar particles are grouped into a
parcel (or a particle cloud), where they have a common velocity, to reduce the
amount of computation for the dispersed phase. The number of particles per parcel
is not uniform.
5 Results
The simulated flow time is 0.6 s for the results presented in this chapter. We have
found that this time interval is sufficient for all particles to traverse the domain and
for the gas phase to achieve stationary conditions. The last 0.1 s is used to obtain the
mean gas-phase velocities.
Figure 2 shows three snapshots of the parcels after 0.05, 0.1, and 0.15 s using the standard k–ε model (the diameters of the particles associated with a parcel are evenly
scaled by a factor of 100). This figure illustrates that the model captures well the
expected dynamics of the dispersed phase. The larger particles (with larger inertia)
maintain their axial motion and penetrate the central recirculation bubble. They are
not affected strongly by the swirl and radial velocity of the gas phase. Smaller
particles are entrained due to smaller relaxation times, and directed to the walls.
The mean axial, radial, and tangential velocities of the gas phase are shown in
Fig. 3. The negative mean axial velocity along the centerline and near the walls identifies the regions of recirculation. The strong variations in all velocities are confined to a
distance of 150 mm after the inlet. The axial and tangential velocities exhibit initial
decay, whereas the radial velocity increases at the upstream boundary of the central
recirculation bubble.
A comparison between the mean streamlines obtained with the three turbu-
lence models is given in Fig. 4. Besides the central bubble, there are secondary
recirculation zones (due to the sudden expansion) at the top of the pipe. The
standard model gives the shortest recirculation bubble, with best agreement with
the experimental results. The realizable model gives a longer bubble, but with a
qualitatively similar structure. The RNG model resolves, in addition to the central
Fig. 2 Parcel motion with the standard k–ε model at t = 0.05 s (left), t = 0.10 s (middle), and t = 0.15 s (right)
Fig. 3 Mean gas-phase velocities near the inlet with the standard k–ε model
and two secondary recirculation zones, two noticeable tertiary recirculation zones
at the beginning (top) of the central bubble. This feature was not reported in the
experimental results.
Fig. 4 Comparison of the mean streamlines with three k–ε models: standard (left), RNG (middle), and realizable (right)
The mean gas-phase velocity fields are sampled at different axial stations. We
compare the mean velocities obtained using the three turbulence models with the
measured values at two stations in Fig. 5, located at x = 3 and 112 mm. The first
station is located close to the inlet, thus upstream of the central bubble, whereas the
second one spans the bubble. At x = 3 mm, all models predict axial and tangential
velocities that are in agreement with the measured ones. The radial velocity using
the realizable model shows considerable disparity, with excessively-negative
(inward) velocity in the region away from the jets, followed by an outward flow
near the wall, which is opposite to the inward flow observed experimentally. The
standard model has a slightly better agreement with the measurements than the
RNG model at this axial location. At x = 112 mm, the RNG predictions deviate
considerably from the measurements. This is a direct consequence of the tertiary
recirculation shown in Fig. 4. As in the earlier station, the standard and realizable
models provide similar results except for the radial velocity, with the realizable
model failing to capture the measured peak at r ≈ 80 mm. The standard model
provides better prediction of the axial velocity in the vicinity of the wall than the
other two models.
On a computing machine with two quad-core Intel Xeon L5335 2.00 GHz processors, a simulation interval of 0.5 s required 17.13 h of CPU time for the standard
model, 19.44 h for the RNG model, and 24.81 h for the realizable model. The
standard model has the lowest computational demand due to its relative simplicity.
The realizable model is the most computationally-expensive implementation, with
CPU time equal to 145% and 128% of the CPU times in the case of the standard and
RNG models, respectively.
Fig. 5 Comparison of the predicted (standard, RNG, and realizable models) and measured mean gas-phase velocities (m/s) versus radial distance (m) at the two axial stations
6 Conclusions
the radial velocity. The main differences in the predicted velocity profiles were
related to the different flow structures and mean streamlines. We should point out
that our finding that the more complex models did not outperform the simpler one
should not be considered a universal statement. In fact, more work is needed to investigate the reasons for this unexpected outcome. This includes examining the effect of C_{ε3}, and the effect of dropping the spherical component in the modeled
Reynolds stresses (which causes violation of one of the realizability constraints).
References
1. T.F. Wall, Combustion processes for carbon capture. P. Combust. Inst. V31(N1), pp. 31–47
(2007)
2. M. Sommerfeld, H.-H. Qiu, Detailed measurements in a swirling particulate two-phase flow
by a phase-Doppler Anemometer. Int. J. Heat Fluid Fl. V12(N1), pp. 20–28 (1991)
3. M. Sommerfeld, A. Ando, D. Wennerberg, Swirling, particle-laden flows through a pipe
expansion. J. Fluids Eng. V114(N4), pp. 648–656 (1992)
4. S.V. Apte, K. Mahesh, P. Moin, J.C. Oefelein, Large-eddy simulation of swirling particle-
laden flows in a coaxial-jet combustor. Int. J. Multiphase Flow V29, pp. 1311–1331 (2003)
5. Y. Yu, L.X. Zhou, C.G. Zheng, Z.H. Liu, Simulation of swirling gas-particle flows using
different time scales for the closure of two-phase velocity correlation in the second-order
moment two-phase turbulence model. J. Fluids Eng. V125(N2), pp. 247–250 (2003)
6. M. Boileau, S. Pascaud, E. Riber, B. Cuenot, L.Y.M. Gicquel, T.J. Poinsot, M. Cazalens,
Investigation of two-fluid methods for large eddy simulation of spray combustion in gas
turbines. Flow Turbul. Combust. V80(N3), pp. 291–321 (2008)
7. G. Monnot, in Principles of Turbulent Fired Heat, Editions Technip, 1985
8. F.P. Kärrholm, Numerical modelling of diesel spray injection, turbulence interaction and combustion. Ph.D. Dissertation, Chalmers University of Technology, Göteborg, Sweden, 2008
9. W.P. Jones, B.E. Launder, The prediction of laminarization with a two-equation model. Int. J.
Heat Mass Tran. V15(N2), pp. 301–314 (1972)
10. B.E. Launder, B.I. Sharma, Application of the energy-dissipation model of turbulence to the
calculation of flow near a spinning disk. Lett Heat Mass Tran., V1(N2), pp. 131–138 (1974)
11. P.A.N. Nordin, Complex chemistry modeling of diesel spray combustion. Ph.D. Dissertation, Chalmers University of Technology, Göteborg, Sweden, 2001
12. V. Yakhot, S.A. Orszag, Renormalization group analysis of turbulence: 1. Basic theory. J. Sci.
Comput. V1(N1), pp. 3–51 (1986)
13. V. Yakhot, S.A. Orszag, S. Thangam, T.B. Gatski, C.G. Speziale, Development of turbulence
models for shear flows by a double expansion technique. Phys. Fluids A. V4(N7), pp.
1510–1520 (1992)
14. T.-H. Shih, W.W. Liou, A. Shabbir, J. Zhu, A new k–ε eddy-viscosity model for high Reynolds number turbulent flows – model development and validation. Comput. Fluids V24(N3), pp. 227–238 (1995)
15. S. Elghobashi, On predicting particle-laden turbulent flows. Appl. Sci. Res. V52(N4), pp.
309–329 (1994)
16. D. Milojević, Lagrangian stochastic-deterministic (LSD) predictions of particle dispersion in
turbulence. Part Part Syst. Charact., V7(N1-4), pp. 181–190 (1990)
17. D. Lakehal, On the modelling of multiphase turbulent flows for environmental and hydrody-
namic applications. Int. J. Multiphase Flow V28(N5), pp. 823–863 (2002)
18. L. Schiller, A. Naumann, Über die grundlegenden Berechnungen bei der Schwerkraftaufbereitung. Zeit Ver Deut Ing V77, pp. 318–320 (1933)
19. G.B. Macpherson, N. Nordin, H.G. Weller, Particle tracking in unstructured, arbitrary poly-
hedral meshes for use in CFD and molecular dynamics. Commun. Numer. Meth. Eng. V25
(N3), pp. 263–273 (2009)
20. C. Sicot, P. Devinanta, S. Loyera, J. Hureaua, Rotational and turbulence effects on a wind
turbine blade. Investigation of the stall mechanisms. J. Wind Eng. Ind. Aerod. V96(N8-9), pp.
1320–1331 (2008)
21. J.H. Ferziger, M. Perić, in Computational Methods for Fluid Dynamics, 3rd edn. (Springer,
2002)
22. OpenFOAM: the open source CFD toolbox. Available from:http://www.openfoam.org
23. H.G. Weller, G. Tabor, H. Jasak, C. Fureby, A tensorial approach to computational continuum
mechanics using object-oriented techniques. Comput. Phys. V12(N6), pp. 620–631 (1998)
Chapter 20
Robust Tracking and Control of MIMO
Processes with Input Saturation and
Unknown Disturbance
Abstract In this chapter, the design of robust stabilization and output tracking performance for multi-input multi-output processes with input saturation and unknown disturbance is considered. The proposed control technique is the robust
anti-windup generalized predictive control (RAGPC) scheme for multivariable
processes. The proposed control scheme embodies both the optimal attributes of
generalized predictive control and the robust performance feature of operator-based
theoretic approach. As a result, a strongly robust stable feedback control system
with disturbance rejection feature and good tracking performance is achieved.
1 Introduction
research focus in recent times [1, 4, 5]. In practice, control systems have to deal with
disturbances of all kinds, such as stepwise load disturbances, high frequency sensor or
thermal noise, input saturations etc. These undesirable effects are systematically dealt
with using predictive control.
The new proposed control strategy under the generalized predictive control
scheme is known as multivariable robust anti-windup generalized predictive
control (MRAGPC). It is the multivariable extension of the robust anti-windup
generalized predictive control (RAGPC) scheme for SISO systems in earlier work
[8]. Like the RAGPC scheme, MRAGPC also shares the optimal and robust performance attributes characteristic of a good control strategy. This work proposes a design procedure for multivariable systems having both input saturation and disturbance. The proposed MRAGPC design procedure can be implemented in four different modes depending on the nature of the problem at hand and the design control objective. The design entails, first, the design of a compensator to counteract the effects of non-minimum phase behavior and strict properness, as these are characteristics of MIMO industrial processes. The second step is to construct stable coprime factors for the compensated system. Subsequently, anti-windup controllers are designed to achieve a stable closed-loop feedback control system. The procedure is completed by the design of a tracking operator using an operator-based theoretic approach. In order to estimate the unknown disturbance in the system, an unknown disturbance estimation mechanism for MIMO systems is also proposed. Based on the outlined procedure, the robust performance of the closed-loop system in the presence of input saturation and disturbances is ensured.
In this proposed design, the coupling effect of interactions between components
of the MIMO systems is curtailed by the appropriate choice of MRAGPC design
parameters. The improved performance of the proposed scheme is confirmed by
simulation of the model of a two-input two-output MIMO system.
In order to control MIMO processes subject to all these design limitations, the MRAGPC scheme is implemented in three different modes depending on the type and nature of the problem at hand.
Mode 1 – known as the anti-windup mode – is applicable to the case of input saturation with no disturbance load on the input to the system.
Mode 2 – otherwise called the compensation mode – is applicable to the case of input saturation, non-minimum phase behavior, and a disturbance load on the input to the system.
Mode 3 – referred to as the fault detection mode – is applicable to systems with both input saturation and unknown disturbance.
In this paper, all these modes of MRAGPC application are implemented.
The proposed linear multivariable system model is given by Deng et al. [4]:
H_0(p) = D_0^{-1}(p)\,N_0(p) \qquad (3)
where H_0(p) is the nominal multivariable plant, and D_0(p) and N_0(p) are coprime factors of H_0(p) defined as
N_0(p) = \tilde{D}^{-1}[p]\,\tilde{B}[p], \qquad D_0(p) = \tilde{D}^{-1}[p]\,\tilde{A}[p] \qquad (4)
where D̃(p) is a Hurwitz polynomial matrix with degree equal to that of Ã(p). The polynomial matrix B̃(p) is the same as B(p) for a non-strictly proper and minimal system. The control input u_j(t) is subject to the saturation constraint shown in Fig. 1.
Fig. 1 Input saturation model: the limiter output u(t) = σ(v) saturates at u_min and u_max
where u1 is the ideal controller output vector. The objective is to design stable
controllers using coprime factorization and Youla-Kucera parameterization [5] for
the above process.
The proposed design scheme for the MIMO systems follows four steps as high-
lighted below.
By adapting the MCGPC design method to the anti-windup design approach, the
proposed controllers are given by Youla-Kucera parameterization as follows.
where Q(p) ∈ RH_∞ is a design parameter matrix for ensuring strongly stable feedback controllers and is given by
Q(p) = U_d^{-1}(p)\,U_n(p) \qquad (9)
Fig. 2 Feedback control structure with reference r(t) ∈ Y, disturbance Δ(t) ∈ U, input saturation σ(v), and the coprime-factor plant D_0^{-1} N_0
N(p), D(p) are the coprime factor representation of D_0^{-1}(p); X(p), Y(p) ∈ RH_∞ are operators chosen to satisfy the Bezout identity
\hat{N}(p)\,\hat{X}(p) + \hat{D}(p)\,\hat{Y}(p) = I_m \qquad (11)
\hat{X}(p),\ \hat{Y}(p),\ X(p),\ Y(p) \in RH_\infty \qquad (12)
where the coprime factorizations N(p), D(p) and N̂(p), D̂(p) of the CAGPC plant can be chosen as follows:
H_0(p) = D_0^{-1}(p) = \tilde{D}[p]\,A^{-1}[p] = A^{-1}[p]\,\tilde{D}[p] \qquad (13)
\hat{N}(p) = \tilde{D}[p]\,\hat{T}_c^{-1}[p]\,\tilde{D}[p], \qquad N(p) = T_c^{-1}[p]\,\tilde{D}[p] \qquad (14)
\hat{D}(p) = A[p]\,\hat{T}_c^{-1}[p]\,\tilde{D}[p], \qquad D(p) = T_c^{-1}[p]\,A[p] \qquad (15)
Tc[p] is the closed-loop characteristic polynomial matrix for the control system
in Fig. 2 and is given by
T_c[p] = A[p] + K_e\,L(p) + K_e R_e\,\tilde{D}[p] \qquad (18)
where Ke, Ce, Re, G[p] and F[p] are given in [4].
The predictive anti-windup part is quasi-linear with respect to the effect of the input constraint. The closed-loop transfer function of this part of the control structure is always stable and invertible. Therefore, the plant retains a robust right coprime factorization (rcf) and is stable. We can then design a Bezout operator S(p) to satisfy the corresponding Bezout identity [7].
A tracking operator M(p) is then designed for the reference signal r(t) such that the plant output y(t) tracks the setpoint signal r(t), as depicted in Fig. 3. Based on the lemma and proof proposed in [7], the operator M(p) is designed to satisfy the tracking condition.
In the proposed control scheme, the possible interactions between different control loops are identified as input–input, disturbance–input, and input–output. In order to minimize the effect of these interactions, the following design steps are proposed:
- Choose larger values for the control orders and/or smaller values for the maximum prediction horizons, and/or use reference models for the output in design step 1.
- Design a decoupling matrix GD(p) such that the loop interactions are decoupled. The elements of GD(p) can be selected in a number of ways to achieve an optimally decoupled control system; this will be investigated further in future work.
In this section, we present two other design features that enhance the performance of the proposed method for MIMO processes, similarly to the SISO case [8]: the robust parallel compensator and the unknown fault detection scheme.
MIMO processes. Here, an RPC, F(p) (see Fig. 4), is designed for all classes of systems which exhibit non-minimum phase behavior and strict properness. The conditions for this design to satisfy static output feedback (SOF) stabilization are the same as in the SISO case [10], that is:
- The process must be inversely stable.
- Its relative degree must be zero, i.e., it must be non-strictly proper.
- The leading coefficient must be positive.
For the process represented by (2), we propose the following GRPC design scheme:
F(p) = F_d^{-1}[p]\,F_n[p]\,P_p^{\,d} \qquad (21)
where F_n[p] and F_d[p] are (γ_i)th- and (γ_i + 1)th-order monic stable polynomial matrices, respectively.
\tilde{H}(p) = H(p) + F(p) \qquad (24)
Equivalently,
\tilde{H}(p) = \tilde{A}^{-1}[p]\,\tilde{B}[p] \qquad (25)
Fig. 4 Augmented plant H̃ = H + F with the robust parallel compensator F, input saturation σ(v), disturbance Δ(t) ∈ U, and the anti-windup part
In summary, the coefficients b_{k,i,j} > 0 and a_{k,i,j} > 0 in (25) are determined such that:
1. B̃[p], Ã[p] ∈ RH_∞ for sufficiently large b_{0,i,j}.
2. The augmented plant H̃(p) has the same steady-state gain as the nominal plant H(p).
3. There exist positive constants a_{k,i,j} such that the augmented system H̃(p) in (25) satisfies the sufficient condition of SOF for any small positive integer value of d and a_{0,i,j} > 0.
For the RAGPC design scheme, it is sufficient to choose d = 1 and γ as small as possible, while the unknown coefficients are designed to satisfy the above conditions using Kharitonov's theorem [11]. Without loss of generality, it follows from (25) that H̃(p) has a relative degree of zero and a positive leading coefficient, thereby satisfying all the conditions for H̃(p) to be SOF-stabilizable and almost strictly positive real (ASPR).
As noted in earlier work [2], the unknown disturbance of the system is estimated as a fault signal acting on the process input. If stable low-order operators S_0 and R_0 can be designed such that the pair [S_0, R_0] approximates [S, R] (see Fig. 5), then the following equations enable us to detect the fault signal [2]:
u_0 = R_0(u_d)(t) + S_0(y_a)(t)
y_d = D_0(u_0)(t) \qquad (26)
Fig. 5 Unknown disturbance (fault) estimation scheme: the low-order operators R_0 and S_0 act on the input u_0(t) and output y_a(t) of the compensated system to produce the detected fault signal Δ̃(t)
4 Simulation Examples
The parameters of the 2-input 2-output system are given in Table 1 below.
The process model (4.1) is considered here with disturbance load in the input. The
process and control parameters are as shown in Table 2.
Fig. 6 Controlled inputs step responses for MIMO system without disturbance load
Fig. 7 Step responses of controlled outputs to step references of +10 and +2 respectively for
MIMO process without disturbance load
Fig. 8 Controlled inputs step responses for MIMO system with disturbance load
Fig. 9 Step responses of controlled outputs to step references of +10 and +2 respectively for
MIMO process with disturbance load
Simulation results for the two cases considered are shown in Figs. 6–9. The control inputs saturate at 0.1, and the controlled outputs track the given step reference inputs of +10 and +2 for the two outputs, respectively.
5 Conclusion
It is evident that the present control performance is better than that of the earlier work [12] and that the coupling effects have been considerably reduced by choosing appropriate design parameters. In addition, the unknown disturbance estimation scheme confers another important feature on MRAGPC, allowing it to serve as a fault detection scheme for MIMO processes.
References
1 Introduction
Shop scheduling has attracted researchers for many decades and is still of great interest because of its practical relevance and its NP-complete character. Multi-
or dual-resource problems are significantly less analyzed, despite being more
realistic. Scheduling with priority rules is quite appealing for various reasons
(see Section 2.2) and often used, especially when considering more realistic and
complex manufacturing systems.
This work analyses the quality of priority rules in the dual-resource constrained
case. Different combinations of rules are tested in order to analyze their interde-
pendencies. The analysis combines simulation with the optimal solution of static
instances. To evaluate the results of the simulation, a mixed integer linear program
(MILP) has been developed to calculate optimal solutions. The paper is organized
B. Scholz-Reiter (*)
Bremer Institut f€ur Produktion und Logistik (BIBA) at the University of Bremen, Hochschulring
20, D-28359 Bremen, Germany
e-mail: [email protected]
as follows: Section 2 gives a short literature review about shop scheduling, priority
rules and dual constrained scheduling. A problem description follows in Section 3.
Section 4 analyses small instances in comparison with optimal solutions and
Section 5 describes a long term simulation study. The paper ends with a conclusion
and description of future research.
2 Literature Review
In this chapter a literature review is given considering the job shop problem, priority
rules and dual-constrained scheduling.
Priority rules are applied to assign a job to a resource. This is done each time the
resource gets idle and there are jobs waiting or a new job arrives. The priority rule
assigns a priority to each job. This priority can be based on attributes of the job, the
resource or the system. The job with the highest priority is chosen to be processed
next.
Priority-scheduling rules have been developed and analyzed for many years
[1, 3]. They are widely used in industry, especially in complex manufacturing
systems, e.g. semiconductor manufacturing. Their popularity is derived from the
fact that they perform reasonably well in a wide range of environments, are
relatively easy to understand and need minimal computational time.
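In concrete terms, a priority rule is just a key function used to pick the next job whenever a resource becomes idle. The sketch below shows this mechanism with two generic rules (FIFO and shortest processing time); the job attributes and the rules themselves are illustrative and are not the exact rule set used in this study.

```python
from dataclasses import dataclass
from typing import Callable, List

@dataclass
class Job:
    job_id: int
    release_time: float
    proc_time: float        # processing time of the job's next operation

# A priority rule maps a waiting job to a priority value; the job with the
# smallest value is processed next.
fifo = lambda job: job.release_time     # first in, first out
spt = lambda job: job.proc_time         # shortest processing time first

def select_next(queue: List[Job], rule: Callable[[Job], float]) -> Job:
    """Called each time a resource becomes idle and jobs are waiting."""
    return min(queue, key=rule)

queue = [Job(1, 0.0, 40.0), Job(2, 5.0, 10.0), Job(3, 2.0, 25.0)]
print(select_next(queue, fifo).job_id)   # -> 1
print(select_next(queue, spt).job_id)    # -> 2
```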
Depending on the manufacturing system and the various objectives (e.g. flow
time, tardiness, etc.) no single rule has been found, which outperforms all others [4].
As a result, there are approaches to find new priority rules according to the system’s
current state (e.g. [3, 5]).
Most research has been done on the machine-only constrained problem, where
machines are the only limiting resource. Nevertheless, closer to the ‘real’ world are
multi- or dual-constrained (DRC) problems, where more than one resource restricts
the output of the system and impacts shop performance. Gargeya and Deane [6]
defined the multiple resource constrained job shop as “a job shop in which two or
more resources are constraining output. The resources may include machines, labor
and auxiliary resources. Dual-constrained job shops are constrained by two resources
(e.g. machine and laborers). Dual-constrained job shops are thus a specific type of
multiple resource constrained job-shop.”
To solve the multi-resource constrained problem different approaches were
proposed. Mati and Xie [7] developed a greedy heuristic. This heuristic is guided
by a genetic algorithm in order to identify effective job sequences. Dauzère-Pérès
et al. [8, 9] developed a disjunctive graph representation of the multi-resource
problem and proposed a connected neighborhood structure, which can be used to
apply a local search algorithm such as taboo search. Patel et al. [10] proposed a
genetic algorithm for dual resource constrained manufacturing and compared
different priority rules against different performance measures. In the study of
Chen et al. [11], an integer optimization formulation with a separable structure is
developed where both machines and operators are modeled as resources with finite
capacities. By relaxing resource capacity constraints and portions of precedence
constraints, the problem is decomposed into smaller sub-problems that are effec-
tively solved by using a dynamic programming procedure.
3 Problem Description
Figure: MiniFab model – jobs enter at start, are processed on machines Ma–Me in process steps 1–7, and leave at end
Figure: machine and operator occupation over time (W = waiting, L = loading, P = processing, U = unloading; a–f denote points in time)
To determine not only the differences between priority rules and their combinations, but also their quality in general, it is interesting to compare their performance with optimal solutions.
We simulated instances from the same scenario with only 2–50 jobs in the system.
All jobs were released at the same time and known in advance. This simplification
makes it possible to calculate optimal solutions for the instances with few jobs in
the system. For larger instances feasible schedules (solutions of the MILP) could be
found, but they were not proven optimal. Due to the complexity of the model, there
still was a gap between the found solution (upper bound) and the theoretically
possible solution (lower bound).
The optimum results are calculated with CPLEX 11 solving a MILP-formulation,
which is an extension of the advanced job-shop formulation used by Pan and Chen
[15]. Operators and re-entrant processes were added to their flexible job-shop
MILP, so that the MiniFab model as described above can be solved and optimal
schedules are calculated. The notation used in the model is as follows:
Fig. 3 Mean flow time (minutes) for the static instances with 2–50 jobs: optimum (with remaining lower/upper-bound gap), best and worst rule combinations, span between best and worst, and the FSFO SSPT[FSFO] combination

Number of jobs: 2, 3, 4, 5, 6, 7, 8, 9, 10, 20, 50

NO_OPERATOR
  optimum: 483 520 568 622 678 733 785 870 918
  gap (lower/upper bound): 0% 0% 0% 0% 0% 8% 0% 27% 37%
  best rule: 483 520 568 628 684 739 799 861 908 1424 2989
  worst rule: 483 520 568 637 716 794 881 956 1040 1940 4616
  span best–worst: 0% 0% 0% 1% 5% 7% 10% 11% 15% 36% 54%
  FSFO SSPT[FSFO]: 483 520 568 628 684 739 799 861 908 1424 2989

MACHINE_CONSTRAINED
  optimum: 483 520 568 622
  best rule: 483 520 568 628 684 739 809 869 919 1437 3039
  worst rule: 483 520 568 637 716 804 905 998 1095 2085 5050
  span best–worst: 0% 0% 0% 1% 5% 9% 12% 15% 19% 45% 66%
  FSFO SSPT[FSFO]: 483 520 568 628 684 739 809 869 930 1437 3042

DUAL_CONSTRAINED
  optimum: 483 524 569 640 746 838 930
  gap (lower/upper bound): 0% 1% 0% 3% 9% 19% 16%
  best rule: 483 525 589 655 708 781 854 912 967 1527 3173
  worst rule: 483 530 620 724 823 916 999 1135 1229 2325 5500
  span best–worst: 0% 1% 5% 11% 16% 17% 17% 24% 27% 52% 73%
  FSFO SSPT[FSFO]: 483 528 598 656 708 789 875 924 979 1547 3173

OPERATOR_CONSTRAINED
  optimum: 483 520 568 929 1088 1272
  gap (lower/upper bound): 0% 0% 0% 33% 38% 47%
  best rule: 535 643 756 917 1013 1118 1225 1351 1456 2524 5695
  worst rule: 548 685 842 1041 1243 1423 1616 1807 2022 4024 10010
  span best–worst: 2% 6% 11% 13% 23% 27% 32% 34% 39% 59% 76%
  FSFO SSPT[FSFO]: 540 643 803 918 1013 1118 1229 1351 1486 2541 5695
In our study, we were able to calculate some optimal solutions, which in most
cases were only slightly better than the best rule combination. In Fig. 3 we list the
solver results with the remaining gap and the performance of the best and worst rule
combination taken from any of the 72 combinations (see Section 3). For a detailed
analysis of the static runs, we chose the FSFO ShortestOpStepLength [FSFO] rule, which seems to be the best-performing one. In Fig. 4 the corresponding graphs are plotted. The FSFO ShortestOpStepLength [FSFO] rule is indicated by the black
line. The grey area defines the corridor between best and worst rule solutions. Black
dots correspond with the solver results, gaps between found solutions and lower
bounds are printed as vertical lines.
With about four or five jobs in the system, the first differences between good and bad rule combinations can be found. An example is the DUAL_
CONSTRAINED case with five jobs in the system: The rule combination FSFO
(machine) ShortestOpStepLength [FSFO] (operator) has a mean flow time of 656
min and the combination FIFO (machine) FSFO (operator) has 724 min. This is
already a difference of more than 10%. The difference to the solver solution is only
around 2%.
The results of the NO_OPERATOR scenario and the MACHINE_CONSTRAINED scenario are very similar, as Figs. 3 and 4 show.
Fig. 4 Results for all static cases: (a) NO_OPERATOR, (b) MACHINE_CONSTRAINED, (c) DUAL_CONSTRAINED, (d) OPERATOR_CONSTRAINED; mean flow time (minutes) versus number of jobs. Black line: FSFO ShortestOpStepLength [FSFO] rule, grey area: best/worst rule combination result, black points: solver results with gap (line)
When there are more jobs in the system, the performance differences between the rule combinations arise. In the DUAL_CONSTRAINED case, the performance differences between rule combinations are much higher and appear already for smaller instances. This result was expected, because in the dual-constrained case the interdependencies between the machine priority rule and the operator priority rule are the strongest and amplify each other.
In the OPERATOR_CONSTRAINED case the shop is restricted the most, which leads very quickly to high mean flow times. The differences between the rule combinations are also very high in this case. The solutions the solver provided for the cases with three and four jobs indicate that either the rule performance or the order of selection (machines first, operators second) does not work well in this scenario. It seems more likely that the order of selection is responsible for this effect. The operator is clearly the bottleneck, and the scheduling should be arranged in a way that utilizes the operator as well as possible, which is clearly not the case here.
In instances with more jobs we were not able to prove the same effect, because the solver provided no optimal solutions even after days of calculation. The solutions found were comparable to those of the rules used.
5 Long-Term Simulation
To validate the results from our static analyses, an extensive simulation study was
performed with a time horizon of 10 years.
To assess the performance of the various rule combinations in the long term, we
simulate the system under three load conditions for each of the four scenarios: 70%,
80% and 90% bottleneck utilization. Given the scenario and load level, we
determine the appropriate arrival rate based on a static analysis of system capacity.
Inter-arrival times follow an exponential distribution. We simulate a time span
of 10 years, ignoring data from the first year. Altogether we determine system
performance for 864 different parameter settings (4 scenarios, 3 load levels, 72 rule
combinations). For the different parameter settings we use common random num-
bers as a variance reduction technique [16]. The results presented are the averages
of mean flow times achieved in 20 independent replications.
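A minimal sketch of how such arrival streams can be generated with common random numbers is given below: the seed depends only on the replication index, so every parameter setting within one replication reuses the same underlying random numbers. The mean inter-arrival time in the example is a placeholder; the actual rates follow from the capacity analysis described above.

```python
import random

def interarrival_times(mean, n, replication):
    """Exponential inter-arrival times with a replication-dependent seed
    (common random numbers across parameter settings)."""
    rng = random.Random(1000 + replication)
    return [rng.expovariate(1.0 / mean) for _ in range(n)]

# The same replication yields proportional streams for two different load levels
base = interarrival_times(mean=1.0, n=3, replication=7)
print(base)
print([60.0 * x for x in base])   # rescaled to an assumed mean of 60 minutes
```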
Figure 5 shows graphically the scenario, the load level setting, and the best rule result achieved in each case. The four lines correspond to our scenarios (NO_OPERATOR, MACHINE_CONSTRAINED, DUAL_CONSTRAINED, and OPERATOR_CONSTRAINED).
Fig. 5 Best rule performance for the four scenarios and three load level settings
Selecting one of the scenarios fixes the achievable combination of machine and operator utilization along the respective line. To give an example: if MACHINE_CONSTRAINED is chosen with a load level of 90% bottleneck utilization (in this case, machine utilization), then this results in an operator utilization of 60%. The grey bar at this point shows the best flow time achieved, i.e., the best out of the 72 rule combinations investigated.
The best results can obviously be achieved if no operator constraints are present
at all. This corresponds to the dotted line (NO_OPERATOR) in our simulation; we
also have the lowest flow times in this scenario, compared to the other scenarios
with the same load level. The DUAL_CONSTRAINED case (long dashes) on the
other hand is the most difficult, operator and machines are equally critical.
Figure 6 lists our simulation results in more detail. Mean flow time as measured
in our simulation experiment is expressed there as a flow factor, i.e. flow time
divided by total processing time. The best and worst rule performances for a given
scenario and load level are highlighted. To summarize our results, FSFO is the best
machine rule irrespectively of the scenario and load level. Results concerning the
best operator rule are a bit more complex: the MACHINE_CONSTRAINED and
OPERATOR_CONSTRAINED scenarios where the operator rule SSPT[FSFO],
i.e. Shortest Step Processing Time first with FSFO is used as a tie breaker, the best
results are gained if used together with FSFO as a machine rule.
Fig. 6 Results for selected (worst/best) rule combinations investigated. Except for the last row (span), all values are flow factors, i.e. mean flow time divided by the total processing time (445 min). The last column contains the mean performance averaged over all scenario/load level combinations (excluding DUAL_CONSTRAINED/90%, see text); 'so' denotes system overflow, i.e. more incoming than outgoing jobs
The FIFO rule performs quite badly in our experiments. In most cases it does not
yield better results than random selection, i.e. choosing an arbitrary job to process
next.
In the DUAL_CONSTRAINED scenario, SSPT[FIFO], i.e. SSPT with FIFO as a tie breaker, is the best operator rule for the 70% and 80% load levels. This changes if the load is further increased to 90%. Most rule combinations are not able to handle this high load for both machines and operators - the system is running full, meaning more jobs are entering the system than leaving it. Such cases are marked with "so" in the table. Only combinations with MQL as an operator rule are able to handle this high load. The best results in this case are produced by the combination of the FSFO machine rule and MQL[FIFO] as the operator rule. The MQL rule prefers machines with long queues, which means that operators go to highly loaded machines first. Although this rule combination gives by far the best results for this case, the increase in average flow time is very high.
Comparing the results for MACHINE_CONSTRAINED and NO_OPERATOR,
the increase in mean flow time is only small, especially for lower load levels. If only
flow time estimates are of interest, it seems viable to simplify the model by ignoring
the operator requirements.
The experiments with small instances show that in the OPERATOR_CONSTRAINED cases, the optimal solutions are much better than the results produced by our heuristics. In the long-term simulation, the best rule results for this scenario are higher than for DUAL_CONSTRAINED at the moderate load levels of 70% and 80%. Only the 90% load level result is smaller than its DUAL_CONSTRAINED equivalent, but it is still about 56% higher than in the MACHINE_CONSTRAINED scenario. This seems to be an effect of the heuristic
procedure used (machines choose next job first, then operators choose between the
machines that are ready to process). In future research more simulation runs with a
different heuristic setup will be performed to analyze this effect in more detail.
In summary, all four scenarios show that the performance of priority rules differs
tremendously in some cases. The interdependencies of the rules, especially in the
DUAL_CONSTRAINED scenarios, lead to high performance differences. Other
system factors, e.g. the utilization rate, also affect the results.
References
Abstract A hybrid fault diagnosis method based on analytical and fuzzy logic theory is proposed in this paper. Analytical redundancy is employed by using statistical analysis. Fuzzy logic is then used to maximize the signal-to-threshold ratio of the residual and to detect different faults. The method was successfully demonstrated experimentally on a hydraulically actuated system test rig. Real data and simulation results have shown that the sensitivity of the residual to the faults is maximized, while that to the unknown input is minimized. The decision of whether 'a fault has occurred or not' is upgraded to 'what is the severity of that fault' at the output. Simulation results show that fuzzy logic is more sensitive and informative regarding the fault condition, and less sensitive to uncertainties and disturbances.
1 Introduction
Hydraulic systems are very commonly used in industry, and like any other system they are prone to different types of faults. Proportional valves are much less expensive than servo valves in hydraulic control applications and are more suitable for industrial environments. Since proportional valves do not contain sensitive, precision com-
ponents, they offer various advantages over servo valves because they are less
prone to malfunction due to fluid contamination. However, these advantages are
offset by their nonlinear response characteristics. Since proportional valves have
less precise manufacturing tolerances, they suffer from performance degradation.
Larger tolerances on spool geometry result in response nonlinearities, especially in
the vicinity of neutral spool position. Proportional valves lack the smooth flow
properties of “critical center” valves, a condition closely approximated by servo
valves at the expense of high machining cost. As a result, small changes in spool
geometry (in terms of lapping) may have large effects on the hydraulic system
dynamics [1]. In particular, the closed-center (overlapped) spool of a proportional valve, which usually provides the motion of the actuator in a proportional hydraulic system, may result in steady-state error because of the dead-zone characteristic of its flow gain [1]. Figure 1 illustrates the characteristics of the proportional valve. Continuous online monitoring of faults in hydraulic systems is becoming increasingly important.
The characteristic of the proportional valve with dead-zone, g(u), is described as follows (Fig. 1):
g(u) = \begin{cases} a(u - b) & \text{if } u \ge b \\ 0 & \text{if } -b \le u \le b \\ a(u + b) & \text{if } u < -b \end{cases} \qquad (1)
where b, a > 0; a represents the slope of the response outside the dead-zone, while the width of the dead-zone equals 2b.
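A direct implementation of (1) is straightforward; in the sketch below, the slope and dead-zone half-width used in the example loop are arbitrary illustrative values.

```python
def g(u, a, b):
    """Dead-zone characteristic of the proportional valve, Eq. (1)."""
    if u >= b:
        return a * (u - b)
    if u < -b:
        return a * (u + b)
    return 0.0          # inside the dead-zone: -b <= u <= b

# Example with an assumed slope a = 2.0 and dead-zone half-width b = 0.5
for u in (-1.0, -0.3, 0.0, 0.4, 1.2):
    print(u, g(u, a=2.0, b=0.5))
```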
Remarkable efforts have been devoted to developing controllers. However, PID controllers are not robust to parameter variations of the plants being controlled. Moreover, it takes time for automatically self-tuned PID controllers to adapt themselves to their final stable (steady) state. The fault detection problem can be solved using different approaches, such as Wald's sequential test, as in [1], which is a conventional approach, or using innovative approaches such as genetic algorithms as in [2], neural networks as in [3, 4], and fuzzy logic as in [5], each having its own advantages and disadvantages.
Human experts play central roles in troubleshooting or fault analysis. In power
systems, it is required to diagnose equipment malfunctions as well as disturbances.
The information available to perform equipment malfunction diagnosis is most of
the time incomplete. In addition, the conditions that induce faults may change with
time. Subjective conjectures based on experience are necessary. Accordingly, the expert systems approach has proved to be useful.
[Fig. 1 Schematic of the proportional valve in a hydraulic circuit and its flow characteristic Q (l/min) versus spool input u, showing the dead-zone between −b and b]
As stated previously, fuzzy theory lends itself to the representation of knowledge and the building of an expert system.
In this paper we use fuzzy logic to detect the severity of the fault at the output. The concept of fuzzy logic was first introduced by Professor Lotfi Zadeh [6], who represented the vagueness of human concepts in terms of linguistic variables. After the introduction of fuzzy sets, much attention was devoted to their application to real-world problems [7, 8]. Reference [9] concentrates on robust fault detection for an aircraft flight control system. A model-based fault diagnosis method for an industrial robot is proposed in [10], where residuals are calculated by an observer using a dynamic robot model and later evaluated using fuzzy logic. In this paper we demonstrate a similar model-based approach for evaluating the severity of faults in a hydraulic actuator, starting from a fixed-threshold approach [11]. The objective knowledge of the system is represented by mathematical modeling (calculating the residuals using a nonlinear observer) [1], while the subjective knowledge is represented using fuzzy logic (fuzzy rules and membership functions).
where $h(x) = \lambda(x) + \varphi(x)u + \beta(t - T)f(x)$; $\varphi$ and $\lambda$ are unknown smooth functions, $x = (x_1, x_2, x_3, \ldots, x_n)$ is the state vector, $x \in \mathbb{R}^n$, $u$ and $y \in \mathbb{R}$, and $z = \hat{y}$ is the observer output vector. The uncertainty in the system dynamics may include parameter perturbations, external disturbances, noise, etc. Throughout this study we consider an abrupt fault at time $T$; accordingly, $\beta(t - T)$ is the time profile of the failure, as shown in Fig. 2, and $f(x)$ is a function that represents the failure in the system. The mathematical description of the nonlinear observer is as follows:

$$\begin{cases} \dot{\hat{x}} = A\hat{x} + B\hat{h}(\hat{x}) + K(y - C\hat{x}) \\ z = \hat{y} = C^{T}\hat{x} \end{cases} \qquad (3)$$

where $\hat{h}(\hat{x}|\theta_f) = \hat{\lambda}(\hat{x}|\theta_f) + \hat{\varphi}(\hat{x}|\theta_f)u + \hat{f}(\hat{x}|\theta_f)$, and $K$ is the observer gain matrix selected such that $A = A_0 - KC^{T}$ is strictly a Hurwitz matrix.

In the discrete-time system, consider from (2) and (3) the error $e(k) = y(k) - z(k)$. The actual state of the system $y(k)$ is known through the sensors. The residual $e(k)$ is calculated as follows:
[Fig. 2 Time profile of the failure: $\beta(t - T) = 1$ for $t \ge T$ and $0$ otherwise]
The fixed-threshold approach does not give any information about the fault in between the thresholds. To address this condition, we replace this binary logic by a multi-valued one using fuzzy logic. Evaluating the residuals with fuzzy logic replaces the yes/no fault decision by the severity of the fault at the output.
The inputs of the fuzzy system are associated with the premise, and the outputs
are associated with the consequent. As already seen, the difference between the
expected state z(k) and the actual state of the system y(k) gives the residual e(k).
[Block diagram: the hydraulic plant with input u(k) and output y(k) and the observer output z(k) form the residual e(k), which (together with its sum Σe(k) and the threshold) is evaluated by the fuzzy system: fuzzification, rule base, inference mechanism, and defuzzification]
The value of the residual is added over a period of time, which gives the cumulative residual $\sum e(k)$. This value is subtracted from the predicted threshold, and the result is called the cumulative residual difference.
The lower the value of this cumulative residual difference, the higher the fault severity, indicating that the cumulative residual is approaching the threshold, and vice versa. The threshold is determined through observations and will vary depending on the fault tolerance of the application in which the hydraulic system is used. Even when there is no fault, modeling errors or noise may drive several residuals beyond their threshold; this is usually indicated by all suspect residuals being weak. The residual is bounded between the upper and the lower threshold. As soon
weak. The residual is bounded between the upper and the lower threshold. As soon
as it approaches these thresholds, the fault severity increases. Thus, the residual and
the cumulative residual difference are given as two inputs to the fuzzy logic
controller. Based on these two inputs, the controller decides the fault severity at
the output. One of the equations of fuzzy equality can be written as:
$$S_{A,B} = \frac{\sum_{i=1}^{n} \min\{m_A(i), m_B(i)\}}{\max\left(\sum_{i=1}^{n} m_A(i),\ \sum_{i=1}^{n} m_B(i)\right)} \qquad (6)$$

$$s_i(k) = \frac{|e_i(k)|}{e_i} \qquad (7)$$
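For readers who want to experiment with the similarity measure of Eq. (6), a minimal Python sketch is given below; the discrete membership vectors in the example are hypothetical placeholders.

```python
import numpy as np

def fuzzy_similarity(m_a, m_b):
    """Fuzzy similarity of Eq. (6): sum of element-wise minima divided by
    the larger of the two membership sums."""
    m_a, m_b = np.asarray(m_a, float), np.asarray(m_b, float)
    return np.minimum(m_a, m_b).sum() / max(m_a.sum(), m_b.sum())

# Example with two hypothetical membership vectors over the same universe
print(fuzzy_similarity([0.2, 0.8, 1.0, 0.4], [0.1, 0.9, 0.7, 0.4]))  # ~0.833
```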
On the one hand, the fuzzy reference rule sets cover almost all the faults; on the other hand, the rule set to be detected may produce undefined symptoms that cannot be distinguished from the fuzzy reference rule sets, as shown in Fig. 3. How can this problem be solved?
4.1 Inputs
Figure 4 illustrates the actual and the estimated velocities. The difference is due to the error introduced in the actual system by adding random noise to the velocity during simulation.
The plots of the residual, the cumulative residual, and the cumulative residual difference along with the thresholds can be seen in Figs. 5 and 6. As seen earlier, the residual and the cumulative residual difference are the two inputs to the fuzzy logic controller. Fault isolation thresholds are very important parameters; their values are also decided by statistical analysis of the fault credit degrees [13]. For unknown fault types, the fault isolation thresholds are selected based on our knowledge of the system. The detection results for the normal data of the space propulsion system are shown in Fig. 5.
[Fig. 4 Observed and actual shaft velocity (m/s) versus time (s)]
Fig. 5 Graph showing 'Residual' along with the upper and lower thresholds vs 'number of observations'
Fig. 6 Cumulative residual and the cumulative residual difference along with the upper and lower thresholds vs the number of observations
Because the normal credit degree does not exceed the threshold, the detection result is that no fault exists and the working conditions are normal.
The first input, the residual, is divided into seven membership functions, namely Big Negative (BN), Negative (N), Small Negative (SN), Zero (Z), Small Positive (SP), Positive (P), and Big Positive (BP), shown in Fig. 7. Similarly, we developed five membership functions for the second input, the cumulative residual difference: Large Negative (LNeg), Medium Negative (MNeg), Small Negative (SNeg), Zero (Zero), and Positive (POS), as seen in Fig. 8. As already seen, there are four parameters that can be used to calculate the residuals; among them, the velocity is the parameter of most concern in this case study.
Hence, the velocity residual is selected to determine the fault severity at the output. The membership functions for the output, i.e. the fault severity, are F0, F1, F2, F3, F4, F5, and F6, where F0 represents the lowest fault severity and F6 the highest (Fig. 9).
The shapes of the membership functions, which are triangular and trapezoidal, were selected based on the simple guidelines suggested in [12], as can be seen in Fig. 9.
[Fig. 7 Membership functions for the first input 'Residual' (BN, N, SN, Z, SP, P, BP) over the range −0.1 to 0.1]
[Fig. 8 Membership functions for the second input 'Cumulative Residual Difference' (LNeg, MNeg, SNeg, Zero, POS)]
[Fig. 9 Membership functions for the output variable 'Fault Severity' (F0–F6) over the range 0 to 100]
Inference rules were developed that relate the two inputs to the output; they are summarized in Table 1. As seen from the table, there are 35 rules in all. For example, if the residual is Big Positive (BP) and the cumulative residual difference is Large Negative (LNeg), then the output fault severity is the highest (F6). Similarly, if the residual is Zero (Z) and the cumulative residual difference is Positive (POS), then the output fault severity is the lowest (F0).
Table 1 Inference rules: fault severity as a function of the residual (columns) and the cumulative residual difference (rows)

Cumulative residual difference | BN | NEG | SN | Z  | SP | POS | BP
POS   | F3 | F2 | F1 | F0 | F1 | F2 | F3
Zero  | F4 | F3 | F2 | F1 | F2 | F3 | F4
SNeg  | F5 | F4 | F3 | F2 | F3 | F4 | F5
MNeg  | F6 | F5 | F4 | F3 | F4 | F5 | F6
LNeg  | F6 | F6 | F5 | F4 | F5 | F6 | F6
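Since the 35 rules of Table 1 form a simple two-input lookup, they can be stored compactly. The sketch below reproduces the table in Python; the data are taken directly from Table 1, while the function name is illustrative.

```python
# Fault-severity rule base of Table 1: rows are the cumulative residual
# difference terms, columns are the residual terms (35 rules in total).
RESIDUAL_TERMS = ["BN", "N", "SN", "Z", "SP", "P", "BP"]
RULES = {
    "POS":  ["F3", "F2", "F1", "F0", "F1", "F2", "F3"],
    "Zero": ["F4", "F3", "F2", "F1", "F2", "F3", "F4"],
    "SNeg": ["F5", "F4", "F3", "F2", "F3", "F4", "F5"],
    "MNeg": ["F6", "F5", "F4", "F3", "F4", "F5", "F6"],
    "LNeg": ["F6", "F6", "F5", "F4", "F5", "F6", "F6"],
}

def severity(residual_term, crd_term):
    """Return the consequent fault-severity label for one rule."""
    return RULES[crd_term][RESIDUAL_TERMS.index(residual_term)]

print(severity("BP", "LNeg"))  # -> 'F6' (highest severity)
print(severity("Z", "POS"))    # -> 'F0' (lowest severity)
```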
4.4 Defuzzification
After converting the crisp information into fuzzy form, the last step is to reverse the process: converting the fuzzy information back into a crisp value is known as defuzzification. The center of area (centroid) method was used to defuzzify these sets, which can be represented mathematically as follows:

$$\text{Defuzzified value} = \frac{\sum f_i\, m(f_i)}{\sum m(f_i)} \qquad (9)$$

where $f_i$ is the fault severity at the output and $m(f_i)$ is the output membership function.
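A minimal sketch of the centroid defuzzification of Eq. (9) over a discretized severity universe is shown below; the aggregated output membership used in the example is hypothetical.

```python
import numpy as np

def defuzzify_centroid(f, mu):
    """Centre-of-area defuzzification of Eq. (9):
    sum(f_i * mu(f_i)) / sum(mu(f_i)) over the discretized output universe."""
    f, mu = np.asarray(f, float), np.asarray(mu, float)
    return np.sum(f * mu) / np.sum(mu)

# Hypothetical aggregated output membership over the 0-100 % severity range
f = np.linspace(0.0, 100.0, 101)
mu = np.clip(1.0 - np.abs(f - 20.0) / 15.0, 0.0, None)  # triangle centred at 20 %
print(defuzzify_centroid(f, mu))                        # -> ~20 (crisp severity in %)
```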
The rules can also be inspected with the rule viewer in the MATLAB fuzzy logic toolbox. When the residual is 0.01, it is far away from both the upper and lower thresholds (almost at the center) and hence corresponds to a low fault severity. The cumulative residual difference is nine, which means the difference between the actual value of the cumulative residual and the threshold is large, i.e. the cumulative residual is far away from the threshold. Hence, the fault severity should be low. The combination of these values of residual and cumulative residual difference gives a fault severity of 9.96%, which is low. Similarly, when the residual is 0.089 it is very close to the threshold, and a cumulative residual difference of −9 indicates that the threshold has already been crossed by the cumulative residual (hence it is negative). Both of these conditions lead to a very high fault severity of 98.4%. These examples are shown in Figs. 10 and 11, respectively, with the help of the rule viewer facility in the fuzzy logic toolbox.
5 Simulation
This simulation was carried out in MATLAB SIMULINK using fuzzy logic
controller from the fuzzy logic toolbox as shown in Figs. 10 and 11. The upper
subsystem represents the actual system (actual state of the hydraulic system) and
the lower subsystem is the nonlinear observer (which predicts the state of the
system). The SIMULINK diagram is the implementation of the block diagram
shown in Fig. 1. The simulation is carried out for a unit step input. A fault is introduced by adding noise to the velocity in the actual system, and different fault severities are tested at the output.
6 Conclusion
With the looming reliability challenges of future new technologies, the ability to provide practicing engineers with online information about a system's health is vital for decision-making. The approach proposed in this study showed that fuzzy logic, when used in combination with analytical methods such as a nonlinear observer, can enhance the output. Simulation results clearly demonstrated that, whatever fault type the plant generates, its symptoms always depart from the characteristics of the fuzzy reference rule set standing for the normal working condition. Moreover, when the fuzzy rule sets are set up, the fuzzy reference rule set generated to represent the normal working condition is taken as the first fuzzy reference rule set. Thus, with the credit degree representing the normal working condition, we judged whether the plant working condition was normal and further obtained the fault degree. This study helped to confirm that a plant fault existed and to report the system conditions. Future work will aim to identify the fault type and predict the remaining equipment life.
Acknowledgement The present work has been performed in the scope of the activities of Grant In
Aid project. The financial support of the University of Minnesota, Grant In Aid FY 2008 is
gratefully acknowledged.
References
1. H. Khan, S.C. Abou, N. Sepehri, Nonlinear observer-based fault detection technique for
electro hydraulic servo positioning systems. Mechatronics 15(9), 1037–1059 (Nov 2005)
2. D. Xuewu, L. Guangyuan, L. Zhengji, Discrete-time robust fault detection observer design: a
genetic algorithm approach, in IEEE Proceedings of the 7th World Congress on Intelligent
Control and Automation, Chongqing, June 2008
3. Q.-J. Guo, H.-B. Yu, A.-D. Xu, Modified Morlet wavelet neural networks for fault detection. Int. Conf. Control Automat. (ICCA 2005) 2, 1209–1214 (June 2005)
Abstract A multigrid finite volume method has been developed to accelerate the
solution process for simulating complex biological transport phenomena, involving
convection, diffusion, and reaction. The method has been applied to a computational
model which includes media flow in a cell-lined cylindrical vessel and fibroblast
growth factor-2 (FGF-2) within the fluid capable of binding to receptors and heparan
sulfate proteoglycans (HSPGs) on the cell surface. The differential equations were
discretized by the finite volume method and solved with the multigrid V-cycle
algorithm. Our work indicates that the multigrid finite volume method may allow
users to investigate complex biological systems with less CPU time.
1 Introduction
Basic fibroblast growth factor (FGF-2) is the prototypical member of the family of fibroblast growth factors [14]. It has been demonstrated to be important in normal physiology and in pathologies such as the cancer development process [4]. In addition to its target cell-surface tyrosine kinase receptor, FGF-2 also binds with high affinity to heparan sulfate proteoglycan (HSPG), which consists of a protein core with O-linked carbohydrate chains that are polymers of repeating disaccharides [6]. HSPGs are present on almost all cell surfaces and generally in much higher numbers than FGF surface receptors. FGF-2 is present in the circulation, and elevated blood levels are used in clinical settings as a criterion for treatment strategies such as interferon alpha therapy for infantile hemangioma [5]. Although FGF-2 binding interactions have been the subject of numerous studies in static systems, far less is known about their behavior in flow.
W. Shen (*)
SUNY College at Brockport, Brockport, NY 14420, USA
e-mail: [email protected]
2 Methods
Fig. 1 Sketch of growth factor binding to receptors and HSPGs and the formation of various
compounds on the surface of a capillary. The symbols in the sketch are as follows: L¼FGF-2,
R¼FGFR, P¼HSPG, C¼FGF-2-FGFR complex, G¼FGF-2-HSPG complex, C2¼FGF-2-FGFR
dimer, G2¼FGF-2-HSPG dimer, T¼FGF-2-FGFR-HSPG complex, and T2¼FGF-2-FGFR-HSPG
dimer. The arrows represent velocity vectors, which are uniform at the entrance and evolve to a parabolic profile downstream
The equations governing fluid mass and momentum transfer in conservative form for incompressible flow can be written as:

$$\frac{\partial \rho}{\partial t} + \frac{1}{r}\frac{\partial (r\rho v_r)}{\partial r} + \frac{\partial(\rho v_z)}{\partial z} = 0, \qquad (1)$$

$$\frac{\partial(\rho v_r)}{\partial t} + \frac{1}{r}\frac{\partial(r\rho v_r v_r)}{\partial r} - \frac{1}{r}\frac{\partial}{\partial r}\!\left(r\mu\frac{\partial v_r}{\partial r}\right) + \frac{\partial(\rho v_r v_z)}{\partial z} - \frac{\partial}{\partial z}\!\left(\mu\frac{\partial v_r}{\partial z}\right) = -\frac{\partial p}{\partial r} - \mu\frac{v_r}{r^2}, \qquad (2)$$

$$\frac{\partial(\rho v_z)}{\partial t} + \frac{1}{r}\frac{\partial(r\rho v_r v_z)}{\partial r} - \frac{1}{r}\frac{\partial}{\partial r}\!\left(r\mu\frac{\partial v_z}{\partial r}\right) + \frac{\partial(\rho v_z v_z)}{\partial z} - \frac{\partial}{\partial z}\!\left(\mu\frac{\partial v_z}{\partial z}\right) = -\frac{\partial p}{\partial z}, \qquad (3)$$
where r is the density, m the dynamic viscosity, p the dynamic pressure, and vr and
vz are velocity components in the radial and axial directions, respectively. In the
above equation set, the independent variables are time t, radial coordinate r, and
axial coordinate z.
The mass of each species must be conserved. If binding or reactions occur within
the fluid, the coupling of mass transport and chemical kinetics in a circular pipe can
be described by the following equations:
$$\frac{\partial(\rho\phi_i)}{\partial t} + \frac{1}{r}\frac{\partial(r\rho u\phi_i)}{\partial r} + \frac{\partial(\rho v\phi_i)}{\partial x} = \frac{1}{r}\frac{\partial}{\partial r}\!\left(K_r\, r\,\frac{\partial(\rho\phi_i)}{\partial r}\right) + \frac{\partial}{\partial x}\!\left(K_x\frac{\partial(\rho\phi_i)}{\partial x}\right) + F_i(\phi_1, \ldots, \phi_n), \quad 1 \le i \le n, \qquad (4)$$
where fi is the concentration of species i, u and v are the radial and longitudinal
components of velocity, Kr and Kz the molecular diffusion coefficients, and Fi the
rate of change due to kinetic transformations for each species i. The basic model
however has only FGF-2 within the fluid (fi is simply FGF-2) and, thus, reactions
(i.e., binding and dissociation) occur only on the capillary surface. That is to say
that Fi is valid merely on the tube surface. The reactants and products involved in
the chemical kinetics include FGF-2, FGFR, HSPG, FGF-FGFR complex and its
dimer, FGF-HSPG complex and its dimer, FGF-HSPG-FGFR complex and its
dimer, with a total of nine species (n ¼ 9) [10].
where x and y are axial and radial coordinates, u and v are the velocity components
in the axial and radial directions, respectively. It is worth noticing that the mass
conservation equation is a special case of Eq. 5 in which f, G, and S are taken as
f ¼ 1, G ¼ 0, and S ¼ 0.
To achieve higher order temporal accuracy, we use a quadratic backward appro-
ximation for the time derivative term. Such arrangement gives us second order
temporal accuracy. Integrating Eq. 5, the corresponding finite volume equations can
be derived,
where the interpolation factors are $a_w = \dfrac{x_P - x_w}{x_P - x_W}$, $a_n = \dfrac{r_N - r_n}{r_N - r_P}$, and $a_s = \dfrac{r_P - r_s}{r_P - r_S}$. The diffusion fluxes are $D_e = \dfrac{K_x \Delta r_j\, r_P(\phi_E - \phi_P)}{x_E - x_P}$, $D_w = \dfrac{K_x \Delta r_j\, r_P(\phi_P - \phi_W)}{x_P - x_W}$, $D_n = \dfrac{K_r \Delta x_i\, r_n(\phi_N - \phi_P)}{r_N - r_P}$, and $D_s = \dfrac{K_r \Delta x_i\, r_s(\phi_P - \phi_S)}{r_P - r_S}$. The notation for the spatial discretization in Eq. 6 is illustrated in Fig. 2, where the uppercase letters indicate the centers of the control volumes and the lowercase letters indicate the interfaces between neighboring control volumes.
Substituting the numerical fluxes into Eq. 6 and collecting terms, a set of algebraic equations of the following form is obtained:

$$A_S\phi_S + A_W\phi_W + A_P\phi_P + A_E\phi_E + A_N\phi_N = b. \qquad (7)$$
[Fig. 2 Notation for the spatial discretization: control-volume centers W, P, E, S, N; interfaces w, e, s, n; grid spacings Δx_{i−1}, Δx_i, Δx_{i+1} and Δr_{j−1}, Δr_j, Δr_{j+1}]
$$A_W = -\max\!\left((\rho u)_w \Delta r_j,\ 0\right) - \frac{K_x \Delta r_j\, r_P}{x_P - x_W}, \qquad (8b)$$

$$A_N = \min\!\left((\rho v)_n \Delta x_i,\ 0\right) - \frac{K_r \Delta x_i\, r_n}{r_N - r_P}, \qquad (8c)$$

$$A_E = \min\!\left((\rho u)_e \Delta r_j,\ 0\right) - \frac{K_x \Delta r_j\, r_P}{x_E - x_P}, \qquad (8d)$$

$$A_P = \frac{3\rho\, r_P \Delta x_i \Delta r_j}{2\Delta t} - (A_W + A_S + A_E + A_N), \qquad (8e)$$

$$b = (S_C + S_P)\, r_P \Delta x_i \Delta r_j + \frac{4\rho\phi_P^{n} - \rho\phi_P^{n-1}}{2\Delta t}\, r_P \Delta x_i \Delta r_j - \lambda\!\left(F_e^c - F_e^u - F_w^c + F_w^u + F_n^c - F_n^u - F_s^c + F_s^u\right). \qquad (8f)$$
With the finite volume method on a structured grid, the multigrid version in 2D can
be constructed such that each coarse grid control volume is composed of four
control volumes of the next finer grid [8]. To do this, a grid generator that is able to
generate multiple grids is used. The grid generator takes input of the number of grid
levels, the number of lines along each boundary, the coordinates of starting and
ending points for each line, the line type, etc., for the coarsest grid. The data for the remaining grid levels are computed automatically by subdividing each control volume into finer ones and are saved as binary files to be loaded by the flow solver.
The algebraic Eq. 7 can be written more abstractly as
Af ¼ b; (9)
where A is a square matrix, f the unknown vector, and b the source vector. After the
kth outer iterations on a grid with spacing h, the intermediate solution satisfies the
following equation,
where $r_h^k$ is the residual vector after the $k$th iteration. Once the approximate solution at the $k$th iteration is obtained, we restrict both the approximate solution and the residual to the next coarser grid by the restriction operators $I_h^{2h}$ and $\hat{I}_h^{2h}$. An approximate solution to the coarse-grid problem can be found by solving the following system of equations,

After the solution on the coarse grid is obtained, the correction $\Delta\phi = \phi_1^{2h} - \phi_0^{2h}$ is transferred to the fine grid by interpolation (prolongation), where $\phi_0^{2h} = I_h^{2h}\phi_h^k$. The difference between $I_h^{2h}$ and $\hat{I}_h^{2h}$ is as follows: $I_h^{2h}$ takes the mean value of the states in a set of cells, while $\hat{I}_h^{2h}$ performs a summation of the residuals over a set of cells. The value of $\phi_h^k$ is updated by

$$\phi_h^{k+1} = \phi_h^k + I_{2h}^{h}\,\Delta\phi, \qquad (12)$$

where $I_{2h}^{h}$ is a prolongation operator. This procedure is repeated until the approxi-
[16]. The relaxation technique used in the multigrid method is Stone’s strong
implicit procedure (SIP), which is a modification from the standard ILU decompo-
sition [19]. This paper has adapted the multigrid method for incompressible
Navier–Stokes equations provided by [16], and extended it to include the mass
transport. Since the concentration of ligand is very small, in the order of 1011 to
1010 M, we may assume that the momentum and mass transfer equations are
independent to each other. Consequently the momentum transfer equation can be
solved first to obtain the velocity distribution, which is then put into the mass
transfer equation.
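The restriction and prolongation operators described above can be sketched compactly for a 2-D cell-centered grid. The Python fragment below uses cell averaging for $I_h^{2h}$, residual summation for $\hat{I}_h^{2h}$, and a piecewise-constant prolongation as a simple stand-in for the interpolation used here; the array shapes are illustrative.

```python
import numpy as np

def restrict_solution(phi_h):
    """I_h^{2h}: cell-average restriction -- each coarse control volume takes
    the mean of the four fine-grid control volumes it contains (2-D)."""
    ny, nx = phi_h.shape
    return phi_h.reshape(ny // 2, 2, nx // 2, 2).mean(axis=(1, 3))

def restrict_residual(r_h):
    """I_h^{2h} (hat): residual restriction -- sum of the four fine-grid residuals."""
    ny, nx = r_h.shape
    return r_h.reshape(ny // 2, 2, nx // 2, 2).sum(axis=(1, 3))

def prolongate(dphi_2h):
    """I_{2h}^h: piecewise-constant prolongation of the coarse-grid correction
    back to the fine grid (bilinear interpolation is also commonly used)."""
    return np.kron(dphi_2h, np.ones((2, 2)))

# One coarse-grid correction step: phi_h^{k+1} = phi_h^k + I_{2h}^h * dphi  (Eq. 12)
phi_h = np.random.rand(8, 16)     # fine-grid iterate (illustrative size)
dphi = np.random.rand(4, 8)       # correction computed on the coarse grid
phi_h_new = phi_h + prolongate(dphi)
```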
3 Results
The flow model in the current paper is the well-known incompressible Navier–
Stokes equation, which has been widely used to solve low speed fluid mechanics
problems. A convection-diffusion equation is applied to address the mass transport
of growth factor in solution. A similar mass transport model has been used by [12]
to describe the biological interaction of antigen and antibody, and by [15] to
investigate the interactions of a variety of biomolecules in BIACORE, where the
flow cell was of rectangular geometry. However, application of the 2D Navier–
Stokes equation to the study of growth factor binding to cell surface receptors and
HSPGs lining a cylindrical capillary has not, to the best of our knowledge, been
reported. The numerical procedure and simulation results for a single grid are
presented in a previous paper [18], while the main purpose of the current paper is
to demonstrate a multigrid solution to the convection-diffusion-reaction model of
mass transport and chemical kinetics of protein ligands.
The dimensions of the capillary under consideration were length L = 0.1 m and radius R = 0.00035 m, corresponding to those of a Cellmax Cartridge System bioreactor (FiberCell Systems, Inc., Frederick, MD, USA). The ratio of length over diameter is 286. To simulate the capillary flow, four types of boundary conditions are used in the numerical simulation: inlet boundary on the left-hand side, outlet boundary on the right-hand side, symmetry boundary at the bottom, and impermeable wall boundary at the top. For the inlet boundary, all quantities are prescribed, and the incoming convective flux is calculated as well. For the outlet boundary, a zero gradient along the grid line is applied. A three-level multigrid method is considered, as shown in Fig. 3, and the numbers of control volumes are 40 × 5, 80 × 10, and 160 × 20, respectively. Each time the grid is refined, one
control volume is divided into four smaller control volumes. The ligand interactions
within the capillary were modeled as a problem of mass transport with a reactive
boundary condition. For the mass transport equation, the boundary conditions are:
given concentration on the west, zero flux on the east, symmetrical condition on the
south, and reaction boundary on the north. An existing model of biochemical
reactions on the capillary surface from [10] is used in our simulations (illustrated in
Fig. 1). The reaction rates for all species are represented by a set of ordinary
differential equations (ODEs), as shown in Table 1, and the parameters are primarily
determined by experiments [10]. The system of ordinary differential equations is
solved by a variable-coefficient ODE solver VODE for stiff and nonstiff systems of
initial value problems [1]. The code is developed to solve the time-dependent
Navier–Stokes and convection-diffusion-reaction equations with a particular appli-
cation in growth factor transport and binding. The computation is performed on a
Sun-Blade-100 machine with a single 500 MHz SPARC processor and 2 GB mem-
ory. A typical numerical solution is plotted in Fig. 4, where the concentration
distribution of fibroblast growth factor inside the capillary is shown at time t = 10 min. The solution in Fig. 4 corresponds to the finest grid arrangement
in the multigrid system. Note that the ligand concentration in the figure is non-
Fig. 3 Computational grids for capillary flow and protein transport: (a) first level grid, (b) second
level grid, (c) third level grid
Table 1 The rate of concentration change due to protein interactions (the initial concentrations of the surface variables are $R_0 = 1.6 \times 10^4$ receptors/cell and $P_0 = 3.36 \times 10^5$ sites/cell)

$\dfrac{d[R]}{dt} = -k_f^R[L][R] + k_r^R[C] + k_r^T[T] - k_c[R][G] - k_{int}[R] + V_R$
Parameters: $k_f^R = 2.5 \times 10^8\ \mathrm{M^{-1}\,min^{-1}}$, $k_r^R = 0.048\ \mathrm{min^{-1}}$, $k_r^T = 0.001\ \mathrm{min^{-1}}$, $k_c = 0.001\ \mathrm{min^{-1}(\#/cell)^{-1}}$, $k_{int} = 0.005\ \mathrm{min^{-1}}$, $V_R = 80\ \mathrm{sites\,min^{-1}}$

$\dfrac{d[P]}{dt} = -k_f^P[L][P] + k_r^P[G] + k_r^T[T] - k_c[C][P] - k_{int}[P] + V_P$
Parameters: $k_f^P = 0.9 \times 10^8\ \mathrm{M^{-1}\,min^{-1}}$, $k_r^P = 0.068\ \mathrm{min^{-1}}$, $k_r^T = 0.001\ \mathrm{min^{-1}}$, $k_c = 0.001\ \mathrm{min^{-1}(\#/cell)^{-1}}$, $k_{int} = 0.005\ \mathrm{min^{-1}}$, $V_P = 1{,}680\ \mathrm{sites\,min^{-1}}$

$V\dfrac{d[L]}{dt} = -k_f^R[L][R] + k_r^R[C] + k_r^T[T] - k_f^P[L][P] + k_r^R[G]$
Parameters: $k_f^R = 2.5 \times 10^8\ \mathrm{M^{-1}\,min^{-1}}$, $k_f^P = 0.9 \times 10^8\ \mathrm{M^{-1}\,min^{-1}}$, $k_r^R = 0.048\ \mathrm{min^{-1}}$, $k_r^T = 0.001\ \mathrm{min^{-1}}$

$\dfrac{d[C]}{dt} = k_f^R[L][R] - k_r^R[C] - k_c[C][P] - k_c[C]^2 + 2k_{uc}[C_2] - k_{int}[C]$
Parameters: $k_f^R = 2.5 \times 10^8\ \mathrm{M^{-1}\,min^{-1}}$, $k_r^R = 0.048\ \mathrm{min^{-1}}$, $k_c = 0.001\ \mathrm{min^{-1}(\#/cell)^{-1}}$, $k_{uc} = 1\ \mathrm{min^{-1}}$, $k_{int} = 0.005\ \mathrm{min^{-1}}$

$\dfrac{d[C_2]}{dt} = \dfrac{k_c}{2}[C]^2 - k_{uc}[C_2] - k_{int}^D[C_2]$
Parameters: $k_c = 0.001\ \mathrm{min^{-1}(\#/cell)^{-1}}$, $k_{uc} = 1\ \mathrm{min^{-1}}$, $k_{int}^D = 0.078\ \mathrm{min^{-1}}$

$\dfrac{d[G]}{dt} = -k_f^R[L][R] + k_r^R[C] + k_r^T[T] - k_c[R][G] - k_{int}[R]$
Parameters: $k_f^R = 2.5 \times 10^8\ \mathrm{M^{-1}\,min^{-1}}$, $k_r^R = 0.048\ \mathrm{min^{-1}}$, $k_r^T = 0.001\ \mathrm{min^{-1}}$, $k_c = 0.001\ \mathrm{min^{-1}(\#/cell)^{-1}}$, $k_{int} = 0.005\ \mathrm{min^{-1}}$

$\dfrac{d[G_2]}{dt} = \dfrac{k_c}{2}[G]^2 - k_{uc}[G_2] - k_{int}^D[G_2]$
Parameters: $k_c = 0.001\ \mathrm{min^{-1}(\#/cell)^{-1}}$, $k_{uc} = 1\ \mathrm{min^{-1}}$, $k_{int}^D = 0.078\ \mathrm{min^{-1}}$

$\dfrac{d[T]}{dt} = k_c[R][G] + k_c[C][P] - k_r^T[T] - k_c[T]^2 + 2k_{uc}[T_2] - k_{int}[T]$
Parameters: $k_c = 0.001\ \mathrm{min^{-1}(\#/cell)^{-1}}$, $k_r^T = 0.001\ \mathrm{min^{-1}}$, $k_{uc} = 1\ \mathrm{min^{-1}}$, $k_{int} = 0.005\ \mathrm{min^{-1}}$

$\dfrac{d[T_2]}{dt} = \dfrac{k_c}{2}[T]^2 - k_{uc}[T_2] - k_{int}^D[T_2]$
Parameters: $k_c = 0.001\ \mathrm{min^{-1}(\#/cell)^{-1}}$, $k_{uc} = 1\ \mathrm{min^{-1}}$, $k_{int}^D = 0.078\ \mathrm{min^{-1}}$
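To illustrate how the surface kinetics can be integrated with a VODE-type solver, the sketch below sets up a deliberately reduced two-species version (receptor R and complex C only, with the free ligand held constant). The rate constants follow Table 1, but the constant ligand level and the overall reduction are assumptions for illustration, not the full nine-species model solved in the paper.

```python
from scipy.integrate import ode

# Reduced sketch of the surface kinetics: only FGF-2/FGFR binding
# (L + R <-> C) with receptor synthesis and internalization; the full
# model in Table 1 couples nine species.
kRf, kRr = 2.5e8, 0.048        # M^-1 min^-1, min^-1 (Table 1)
kint, VR = 0.005, 80.0         # min^-1, sites min^-1 (Table 1)
L0 = 1.0e-10                   # M, assumed constant free FGF-2 (illustrative)

def rhs(t, y):
    R, C = y
    dR = -kRf * L0 * R + kRr * C - kint * R + VR
    dC = kRf * L0 * R - kRr * C - kint * C
    return [dR, dC]

solver = ode(rhs).set_integrator("vode", method="bdf")  # stiff option, as in [1]
solver.set_initial_value([1.6e4, 0.0], 0.0)             # R0 receptors/cell, no complex
while solver.successful() and solver.t < 10.0:          # integrate to t = 10 min
    solver.integrate(solver.t + 1.0)
print(solver.t, solver.y)
```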
A comparison was then made between the single-grid and multigrid computa-
tions, shown in Fig. 5(b), where the number of iterations means the number of outer
iterations on the finest grid (160 20). Only the residual from the pressure equation
is shown as it had the largest value for the single-grid (Fig. 5(a)), but similar
differences were found for the other residuals (data not shown). In single-grid
computation, the pressure residual can only be reduced to the order of 103 while
in the case of the three-level multigrid computation, it can be reduced to the order
of 105 within less than 100 outer iterations. Table 2 lists the number of outer
iterations, CPU-time, and speed-up ratio for the capillary flow with various mesh sizes and different linear system solvers. In Table 2, the recorded CPU-time for
multigrid is the total computing time for all levels of grids that are involved. For all
three solvers studied (SIP, BiCGSTAB, and GMRES), a great amount of savings in
306 W. Shen et al.
[Fig. 4 Concentration distribution of FGF-2 inside the capillary on the finest grid: panels (a) and (b) show normalized concentration contours (0.0625–0.9375) over the radial coordinate R and the axial coordinate X]
Fig. 5 Comparison of convergence history between single-grid and multigrid computations. (a) Residuals of pressure and velocities in single-grid computation, where the dashed line is the U residual, the dashed-dotted-dotted line the V residual, and the solid line the P residual. (b) Comparison of pressure residuals between single-grid and multigrid computations with different linear system solvers, where the dashed-dotted line is for the single grid, the solid line for multigrid with SIP, the dashed line for multigrid with BiCGSTAB, and the dashed-dotted-dotted line for multigrid with GMRES
CPU time has been observed by using multigrid. For example, the CPU time needed
for a 160 × 20 grid using the SIP method was 176.45 seconds for the single-grid and 5.39 seconds for the multigrid. This difference in time was due to the significant difference
Table 2 Performance comparison of the multigrid method with different linear system solvers

Solver     Mesh       No. outer iterations          CPU-time (s)              Speed-up ratio
                      Single-grid   Multigrid       Single-grid   Multigrid
SIP        40 × 5     234           234             0.41          0.41        1.0
           80 × 10    898           28              6.87          0.64        10.7
           160 × 20   6,000         54              176.45        5.39        32.7
BiCGSTAB   40 × 5     114           114             0.35          0.35        1.0
           80 × 10    432           26              5.24          0.56        9.36
           160 × 20   6,000         50              174.66        5.43        32.16
GMRES      40 × 5     101           101             0.22          0.22        1.0
           80 × 10    389           23              5.43          0.55        9.87
           160 × 20   6,000         52              186.75        5.82        32.08
in the number of outer iterations required which was 6,000 for the single-grid and
only 54 for the multigrid. This can result in a speed-up ratio for the capillary flow of
over 30 for the finest grid. Three iterative solvers, SIP [19], BiCGSTAB [20], and GMRES [17], are used to solve the inner linear system and their performance is compared. In this particular case with a five-diagonal coefficient matrix,
BiCGSTAB and GMRES solvers do not have obvious advantages over SIP in
reducing the number of outer iterations of Navier–Stokes equations in the multigrid
computation.
4 Discussions
The binding kinetics and signaling pathways of FGF-2 are very complicated, and many mathematical models have been proposed to predict the behavior of FGF-2 [3, 7, 9–11]. These models include pure reaction models [7, 10, 11], in which a system of ordinary differential equations is provided and it is assumed that the fluid movement of FGF-2 does not affect the solution, and reaction-diffusion models [3, 9], which are relatively more complex and in which the movement of FGF-2 molecules from the fluid to the cell surface is modeled by diffusion. The diffusion model is valid only if the fluid is quiescent, which is not consistent with the actual biological environment of moving bio-fluids. Our model is unique in that a coupled convection-diffusion-reaction model is applied and the motion of the bio-fluid is fully considered.
We model the process of growth-factor binding and dissociation under flow as fluid flow in a capillary governed by the incompressible Navier–Stokes equations, protein transport by convection-diffusion, and local bioreaction on the capillary surface. The flow is
assumed to be laminar, and both the fluid density and viscosity are taken as con-
stants. This is due to the fact that the fluid in the bioreactor fibers is mainly water
with some additives (FiberCell Systems, Inc., Frederick, MD, USA). The impact of
fluid flow on FGF-2 binding is through the transport equations in the coupled non-
linear convection-diffusion-reaction model, where the flow velocity affects the
distribution of FGF-2 in the solution, and that of FGFR, HSPG, and their com-
pounds on capillary surface.
The solution of Navier–Stokes equations consists of two loops. The inner
iteration handles each of the individual equations of momentum, energy, and
turbulent kinetics if necessary, and the outer iteration deals with the coupling and
nonlinearity. For unsteady flow using implicit discretization, the linear equations
need not be solved very accurately at each outer iteration. Usually a few iterations of a linear system solver are enough; a more accurate solution will not reduce the number of outer iterations but may increase the computing time [21].
A colocated grid system is used to discretize the Navier–Stokes and transport
equations. A naive colocated arrangement makes the solution of the Navier–Stokes equations problematic due to the checkerboard pressure distribution; the staggered grid was therefore once considered the standard approach for the calculation of incompressible flow [13]. A remedy was proposed to deal with the pressure-
velocity coupling on colocated grids by eliminating the oscillations in the pressure
field [8]. Our work uses a colocated grid and adopts the remedy to filter out the
oscillations. A colocated arrangement is attractive when using a non-orthogonal
grid, complex geometry, and multigrid methods.
Developing efficient solvers for the nonlinear systems of equations arising from fluid flow and mass transfer problems is of practical interest. When nonlinear partial
differential equations are discretized in a particular spatial mesh, the result is a
set of ordinary differential equations, with time as the independent variable. The
temporal terms may be further discretized using either an explicit or an implicit
scheme. In an explicit method, a large number of independent equations, one for
each control volume, have to be solved. In the case of an implicit discretization, we
have to solve a large set of simultaneous equations.
$$\mathbf{F}(\vec{u}) = \frac{f(\vec{u}) - f(\vec{u})^{o}}{\Delta t} + F(\vec{u}) = 0 \qquad (13)$$

where $f(\vec{u})^{o}$ is the value at the previous time step. If the number of components is $m$ and the number of control volumes is $n$, the number of independent variables
becomes mn. Such large sets of nonlinear equations are generally solved by a
variant of Newton’s method, where a sequence of linear systems are solved [8].
The great advantage of the Newton iteration is its quadratic convergence. However,
Newton iteration converges only if the initial guess is close to the solution. Instead,
in this paper, the method of Picard iteration is applied, in which the nonlinear
convective term and source term are linearized by using values from the previous
outer iterations. This kind of linearization requires many more iterations than a
coupled technique such as Newton-like linearization, but an initial guess close to
the solution is not critical for the convergence. The number of outer iterations,
however, can be substantially reduced by using multigrid techniques, as shown
previously.
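The sketch below illustrates Picard linearization on a toy 1-D steady convection-diffusion problem, u·du/dx = ν·d²u/dx²: the convecting velocity is taken from the previous outer iteration so that each outer iteration solves a linear system. The problem, grid, and tolerances are illustrative, not the solver of the paper.

```python
import numpy as np

# Picard (successive-substitution) linearization: the convecting velocity is
# lagged from the previous outer iteration, so each outer iteration is linear.
n, nu = 41, 0.1
x = np.linspace(0.0, 1.0, n); h = x[1] - x[0]
u = np.linspace(1.0, 0.0, n)                 # initial guess with u(0)=1, u(1)=0

for outer in range(50):
    u_old = u.copy()
    A = np.zeros((n, n)); b = np.zeros(n)
    A[0, 0] = A[-1, -1] = 1.0; b[0], b[-1] = 1.0, 0.0
    for i in range(1, n - 1):
        conv = u_old[i] / (2.0 * h)          # lagged convective coefficient (Picard)
        diff = nu / h ** 2
        A[i, i - 1] = -conv - diff           # central scheme for u*du/dx - nu*d2u/dx2 = 0
        A[i, i] = 2.0 * diff
        A[i, i + 1] = conv - diff
    u = np.linalg.solve(A, b)
    if np.max(np.abs(u - u_old)) < 1e-8:     # outer-iteration convergence check
        break
print(outer, u[n // 2])
```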
5 Concluding Remarks
In conclusion, this paper presents a multigrid computation strategy for the numerical solution of FGF-2 transport and cellular binding within a capillary (i.e., a cylindrical geometry). The capillary flow is predicted from the incompressible Navier–Stokes equations by the finite volume method with a collocated mesh arrangement, under the assumption that the capillary is a circular pipe. To reduce the computational cost, a multigrid V-cycle technique, which uses restriction to transfer the fine-grid solution to the coarse grid and prolongation to interpolate the coarse-grid solution back to the fine grid, is applied to the nonlinear Navier–Stokes equations. The multigrid method is extended to solve the mass transport equation. Computational results indicate that the multigrid method can reduce CPU time substantially. The computed profile of the FGF-2 distribution in the capillary is presented for a set of conditions, and no advantage with regard to CPU time or the magnitude of the residual has been observed among the three linear system solvers used.
References
1. P. Brown, G. Byrne, A. Hindmarsh, VODE: a variable coefficient ODE solver. SIAM J. Sci.
Stat. Comput. 10, 1038–1051 (1989)
2. K. Chang, D. Hammer, The forward rate of binding of surface-tethered reactants: effect of
relative motion between two surfaces. Biophys. J. 76, 1280–1292 (1999)
3. C.J. Dowd, C.L. Cooney, M.A. Nugent, Heparan sulfate mediates bFGF transport through
basement membrane by diffusion with rapid reversible binding. J. Biol. Chem. 274,
5236–5244 (1999)
4. S. Elhadj, R.M. Akers, K. Forsten-Williams, Chronic pulsatile shear stress alters insulin-like
growth factor-I (IGF-I) bonding protein release in vitro. Annal. Biomed. Eng. 31, 163–170
(2003)
5. A. Ezekowitz, J. Mulliken, J. Folkman, Interferon alpha therapy of haemangiomas in
newborns and infants. Br. J. Haematol. 79(Suppl 1), 67–68 (1991)
6. M. Fannon, M.A. Nugent, Basic fibroblast growth factor binds its receptors, is internalized,
and stimulates DNA synthesis in Balb/c3T3 cells in the absence of heparan sulfate. J. Biol.
Chem. 271, 17949–17956 (1996)
7. M. Fannon, K.E. Forsten, M.A. Nugent, Potentiation and inhibition of bFGF binding by
heparin: a model for regulation of cellular response. Biochemistry, 39, 1434–1444 (2000)
8. J.H. Ferziger, M. Perić, Computational Methods for Fluid Dynamics (Springer,
Berlin, 1999)
9. R.J. Filion, A.S. Popel, A reaction-diffusion model of basic fibroblast growth factor interac-
tions with cell surface receptors. Annal. Biomed. Eng. 32, 645–663 (2004)
10. K. Forsten-Williams, C. Chua, M. Nugent, The kinetics of FGF-2 binding to heparan sulfate
proteoglycans and MAP kinase signaling. J. Theor. Biol. 233, 483–499 (2005)
11. F.M. Gabhann, M.T. Yang, A.S. Popel, Monte Carlo simulations of VEGF binding to cell
surface receptors in vitro. Biochim. Biophys. Acta 1746, 95–107 (2005)
12. W. Glaser, Antigen-antibody binding and mass transport by convection and diffusion to a
surface: a two-dimensional computer model of binding and dissociation kinetics. Analyt.
Biochem. 213, 152–161 (1993)
310 W. Shen et al.
13. F. Harlow, J. Welsh, Numerical calculation of time dependent viscous incompressible flow
with free surface. Phys. Fluid 8, 2182–2189 (1965)
14. N. Itoh, The Fgf families in humans, mice, and zebrafish: their evolutional processes and roles
in development, metabolism, and disease. Biol. Pharm. Bull. 30, 1819–1825 (2007)
15. D. Myszka, X. He, et al., Extending the range of rate constants available from BIACORE:
interpreting mass transport-influenced binding data. Biophys. J. 75, 583–594 (1998)
16. E. Schreck, M. Perić, Computation of fluid flow with a parallel multigrid solver. Int.
J. Numer. Meth. Fluid 16, 303–327 (1993)
17. Y. Saad, M. Schultz, GMRES: a generalized minimal residual algorithm for solving nonsym-
metric linear systems. SIAM J. Sci. Stat. Comput. 7, 856–869 (1986)
18. W. Shen, C. Zhang, M. Fannon et al., A computational model of FGF-2 binding and HSPG
regulation under flow condition. IEEE Trans. Biom. Eng. 56, 2147–2155 (2009)
19. H. Stone, Iterative solution of implicit approximations of multidimensional partial differential
equations. SIAM J. Numer. Anal. 5, 530–558 (1968)
20. A. Van der Vorst, Bi-CGSTAB: a fast and smoothly converging variant of Bi-CG for the
solution of nonsymmetric linear systems. SIAM J. Sci. Stat. Comput. 13, 631–644 (1992)
21. J. Zhang, H. Sun, J.J. Zhao, High order compact scheme with multigrid local mesh refinement
procedure for convection diffusion problems. Comput. Meth. Appl. Mech. Eng. 191,
4661–4674 (2002)
Chapter 24
Integrated Mining Fuzzy Association Rules
For Mineral Processing State Identification
1 Introduction
Data mining is the process of automatically extracting high-level knowledge from the information provided by sensors. A control-engineering method for fuzzy control was provided in [1]. In mineral processing, understanding how to modify the rheological characteristics of fine particle systems is key to process performance. These characteristics include particle settling, pH, bulk/carrier fluid viscosity, particulate flocculation or dispersion, attrition, pipe/fitting/impeller wear, degradation of flocculated or friable solids, and the pumpability of the slurry. Moreover, fine particle systems exhibit a range of rheological properties that influence processing and handling. The properties, settling behavior, and rheology of fine particulate slurries are determined by both the physical properties and the surface chemistry of the particles. Figure 1 depicts the two-stage structure considered for developing both useful fuzzy association rules and suitable membership functions from quantitative values: (1) an adaptive learning procedure to learn the membership functions, and (2) a method to mine fuzzy association rules.
A wet grinding plant, shown in Fig. 2, has been analyzed with the objective of evaluating the effects of many variables on particle size reduction in continuous grinding processes. A detailed phenomenological model that describes the charge behaviour has been developed and validated against real data [2]. Indeed, mineral
[Fig. 1 Two-stage structure: a learning process derives membership functions from the database, starting from predefined membership functions and guided by a fitness (evaluation) module; the defined membership functions are then used for fuzzy mining]
[Fig. 2 Ball mill grinding circuit: controller, motor current i(t), ball mill with disturbances, sensor, and parameter update law comparing the reference speed wr(t) with the measured speed w(t)]
Based on volumetric analysis, the fraction of the total mass broken within a tiny volume dV of the charge is assumed to be s(t), which is determined as follows:

$$s(t) = \iiint_V a\, r_c \, dV \qquad (3)$$
Fig. 3 Static characteristics of the ball mill: power p(l) [kW] and mass flow m(l) [ton/h] versus load l [%], showing the maxima Pmax and Mmax, an unstable region, and the load levels L1 and L2
where $g_{ji_j}$ is the performance rate (level) of element $j$ in state $i_j$, $i_j \in \{1, 2, \ldots, k_j\}$. The performance rate $G_j(t)$ of element $j$ at any instant $t \ge 0$ is a random variable that takes its values from $g_j$: $G_j(t) \in g_j$. Thus, the probabilities associated with the different states of element $j$ can be represented by the set

$$p_j = \{p_{j1}, \ldots, p_{jk_j}\}, \qquad 1 \le j \le n.$$

The mapping $g_{ji_j} \to p_{ji_j}$ is usually called the probability mass function, as defined for multi-state systems [3]. However, for some multi-state systems, evaluating precisely the state probability and performance rate of an element is difficult. Given the causes of deterioration of the grinding quality pointed out above, let us define the error $e_r$ and the change of error $e_c$ at sampled times $k$ as follows:

$$e_r(k) = \frac{p(k) - p(k-1)}{V_m(k) - V_m(k-1)} \qquad \text{and} \qquad e_c(k) = e_r(k) - e_r(k-1) \qquad (7)$$
Fig. 4 Decision regions for the rheology state in the (x1(t), x2(t)) plane, bounded by a1, a2, −a2, and a3
where p is the power, which can be measured via the electric current of the ball mill. The noise and modeling errors will cause different distortions of the probability assignments in the different models. Assume the probability distribution sd of performance rates for all of the system elements at any instant t ≥ 0 and the system structure function as follows:
[Figure: servo motor control hardware: servo-controller, D/A converter, amplifier, DC motor with armature (Ra, La, Va, ia, Vb), load (J, D), and a signal conditioner interface]
In this study, the proposed fuzzy logic controller has three inputs and three outputs. The tuned control surface is nonlinear, corresponding to the properties of the controlled plant. l, ωr, and np are the measured values of the ball mill load, the rotation speed, and the inlet negative pressure, respectively. In addition, p is the measured value of the ball mill driving motor power. el, eω, and enp, the input variables of the fuzzy logic controller, represent the errors of l, ωr, and np, respectively. ul, uω, and unp are the output variables of the fuzzy logic controller, which are usually used to control the raw ore feeder, the driving motor speed, and the recycle air damper, respectively. Therefore, the probability distribution sd of the system is:

$$\left\{\phi(g_{1i_1}, \ldots, g_{3i_3}),\ sd = \prod_{j=1}^{3} s_{ji_j}\right\} \qquad (11)$$
Furthermore, the max-min algorithm is used in fuzzy logic inference, and the
defuzzification is accomplished by the largest of maximum method.
[Figure: fuzzy controller structure: fuzzification, rule base, inference, defuzzification]
can differ from the operator of the polynomial product (unlike the ordinary generating function technique, in which only the product of polynomials is defined) [12]. Figure 7 idealizes a general flow transmission throughout the system (e.g., ore, particle size, fluid flow, energy). For instance, consider the flow transmission system shown in Fig. 7, which consists of three elements. As a result, the system performance rate, which is defined by its transmission capacity, can take several discrete values depending on the state of the control equipment.
Assume element 1 (the rheology of the slurry) has three states with performance rates g11 = 1.5, g12 = 1, and g13 = 0; the corresponding probabilities are sp11 = 0.8, sp12 = 0.1, and sp13 = 0.1. Element 2 (the pH) has three states with performance rates g21 = 2, g22 = 1.5, g23 = 0 and corresponding probabilities sp21 = 0.7, sp22 = 0.22, and sp23 = 0.08. Element 3 (the density) has two states with performance rates g31 = 4, g32 = 0 and corresponding probabilities sp31 = 0.98 and sp32 = 0.02. According to (9), the total number of possible combinations of the states of the elements is pp = 3 × 3 × 2 = 18.
In order to obtain the output performance distribution for the entire system with an arbitrary structure function φ(·), [9] used a general composition operator ∂φ over the individual universal z-transform representations of the n system elements:

$$u_j(z) = \sum_{i=1}^{k_j} sd_{ji}\, z^{g_{ji}} \quad \text{and} \quad U(z) = \partial_\phi\big(u_1(z), \ldots, u_n(z)\big) = \sum_{i_1=1}^{k_1}\sum_{i_2=1}^{k_2}\cdots\sum_{i_n=1}^{k_n}\left(\prod_{j=1}^{n} sd_{ji_j}\right) z^{\phi(g_{1i_1}, \ldots, g_{ni_n})} \qquad (12)$$

where U(z) is the z-transform representation of the output performance distribution for the entire system.
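The composition of Eq. (12) can be carried out by direct enumeration of the 18 state combinations of the example above. The Python sketch below does this; note that the structure function used, φ = min(g1 + g2, g3), is only an assumed example, since the actual structure function of the plant is not specified here.

```python
from itertools import product
from collections import defaultdict

# Universal generating function composition of Eq. (12) for the
# three-element example; (performance, probability) pairs as in the text.
elements = [
    [(1.5, 0.80), (1.0, 0.10), (0.0, 0.10)],   # element 1: slurry rheology
    [(2.0, 0.70), (1.5, 0.22), (0.0, 0.08)],   # element 2: pH
    [(4.0, 0.98), (0.0, 0.02)],                # element 3: density
]

def phi(g1, g2, g3):
    # Assumed structure function for illustration only
    return min(g1 + g2, g3)

U = defaultdict(float)                          # output performance distribution U(z)
for states in product(*elements):               # 3 * 3 * 2 = 18 combinations
    perf = phi(*(g for g, _ in states))
    prob = 1.0
    for _, p in states:
        prob *= p
    U[perf] += prob

for g in sorted(U):
    print(f"performance {g}: probability {U[g]:.4f}")
print("total probability:", sum(U.values()))    # should be 1.0
```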
Figure 6 illustrates the basic control structure. The scheme includes a classical PID control structure together with a fuzzy corrector. The fuzzy corrector uses the command input rc(t) and the plant output y to generate a command signal uc(t), described by the following equations:

$$\begin{cases} e(t) = r_c(t) - y(t) \\ \Delta e(k) = e(k) - e(k-1) \\ m(k) = F[e(k), \Delta e(k)] \end{cases} \qquad (13)$$
[Fig. 7 Flow transmission system of three elements with u-transforms U1(z), U2(z), and U3(z)]
[Fig. 8 Membership functions (BN, N, SN, Z, SP, P, BP) for the error e, the change of error Δe, and the output]
In the above, e(k) is the position error between the command input rc(t) and the process output y(k), and Δe(k) is the change in position error. The term F[e(k), Δe(k)] is a nonlinear mapping of e(k) and Δe(k) based on fuzzy logic, and m(k) represents a correction term. The control u(k) is applied to the input of the grinding circuit. The purpose of the fuzzy corrector is to modify the command signal to compensate for the overshoots and undershoots present in the output response when the load dynamics has unknown nonlinearities.
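A minimal sketch of the corrector loop of Eq. (13) is given below; a smooth saturating surface stands in for the fuzzy mapping F[e(k), Δe(k)], and the PID gains and first-order plant are purely illustrative.

```python
import numpy as np

def fuzzy_correction(e, de):
    """Stand-in for the fuzzy mapping F[e(k), De(k)] of Eq. (13): a smooth,
    saturating surface; the actual corrector uses the BN...BP rule base and
    the membership functions of Fig. 8."""
    return 0.5 * np.tanh(10.0 * e) * np.tanh(10.0 * de)

# PID + fuzzy corrector applied to a toy first-order plant (illustrative only)
kp, ki, kd, dt = 2.0, 1.0, 0.05, 0.05
y, integ, e_prev = 0.0, 0.0, 0.0
rc = 1.0                                        # command input r_c(t)
for k in range(200):
    e = rc - y                                  # e(k) = r_c(k) - y(k)
    de = e - e_prev                             # De(k) = e(k) - e(k-1)
    integ += e * dt
    u = kp * e + ki * integ + kd * de / dt      # classical PID term
    u += fuzzy_correction(e, de)                # m(k): corrector of Eq. (13)
    y += dt * (-y + u)                          # hypothetical plant  dy/dt = -y + u
    e_prev = e
print(round(y, 3))                              # settles near the command input
```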
Let x1, x2, x3, y1, y2, and y3 represent l, ωr, np, ul, uω, and unp, respectively. The expertise and knowledge method used to build the rule base and membership functions provides the description of e(k) and Δe(k) as inputs, and m(k) as the output. The unified fuzzy universe is given as [−0.1, 0.1] (Fig. 8). The fuzzy states of the inputs and the output are all chosen to be equal in number and use the same linguistic descriptors: Big Negative (BN), Negative (N), Small Negative (SN), Zero (Z), Small Positive (SP), Positive (P), and Big Positive (BP). Figure 8 illustrates the membership functions. Despite the operator expertise and knowledge at the level of the inference rules and the membership functions, some defects may appear. The degree of membership of each value of attribute ik in any of its fuzzy sets is obtained directly by evaluating the membership function of the particular fuzzy set with the value of ik as input.
To improve the conventional FLC, association rules mining algorithms are
used to find the optimal membership functions. This is achieved based on the
membership functions depicted in Fig. 9.
Specified fuzzy linguistic terms in fuzzy association rules can be given only when the properties of the attributes are estimated. In real life, the contents of columns (i.e., the values of attributes) may be unknown, and meaningful intervals are usually not concise and crisp enough.
[Fig. 9 Membership functions (BN, N, SN, Z, SP, P, BP) over the unified universe from −0.1 to 0.1]
In this paper, the target is to find out some interesting
and potentially useful regularities, i.e., fuzzy association rules with enough support
and high confidence. We use the following
form for fuzzy association rules.
Let $\{x_i^1, \ldots, x_i^k\}$ and $\{y_i^1, \ldots, y_i^k\}$, $1 \le i \le n$, be the antecedent set and the consequence set, respectively, in a database. A fuzzy association rule is expressed as: If $X = \{x_i^1, \ldots, x_i^k\}$ is $A = \{a_i^1, \ldots, a_i^k\}$, then $Y = \{y_i^1, \ldots, y_i^k\}$ is $B = \{d_i^1, \ldots, d_i^k\}$. Here, $X$ and $Y$ are disjoint sets of attributes called item-sets, i.e., $X \subseteq I$, $Y \subseteq I$ and $X \cap Y = \emptyset$. $A$ and $B$ contain the fuzzy sets associated with the corresponding attributes in $X$ and $Y$, respectively.
Let m(.) represent the membership value of each element of the antecedent and
the consequence set. Under fuzzy taxonomies, using the measurements could result
in some mistakes. Consider for instance the following conditions:
1. $m(x_i^k) \ge m(y_i^k)$ and $m(x_i^k) \ge m(y_i^m)$
2. $m(x_i^k) < m(y_i^k)$ and $m(x_i^k) < m(y_i^m)$
In the first condition, the confidence under fuzzy taxonomies of the two rules
is equal, while in the second, the coverage of the two rules is equal. This
situation gives rise to the following question: which rule can be judged as best
evidence rule?
In the proposed method the algorithm iterations alternate between the generation
of the candidate and frequent item-sets until large item-sets are identified. The
fuzzy support value of item-set Z is calculated as:
$$S(Z, F) = \frac{\sum_{t_i \in T} \prod_{z_j \in Z} m\big(a_j \in F,\ t_i(z_j)\big)}{n_T} \qquad (14)$$
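A direct transcription of the fuzzy support of Eq. (14) is sketched below; the transactions, attributes, and ramp membership functions are hypothetical examples.

```python
def fuzzy_support(transactions, itemset_memberships):
    """Fuzzy support of Eq. (14): for each transaction, take the product of the
    membership degrees of the attributes in the item-set, then average over
    the n_T transactions."""
    total = 0.0
    for t in transactions:
        prod = 1.0
        for attr, mu in itemset_memberships.items():
            prod *= mu(t[attr])        # membership of the fuzzy set for this attribute
        total += prod
    return total / len(transactions)

# Hypothetical example: item-set {load is High, speed is Low} over three records
transactions = [{"load": 0.82, "speed": 0.20},
                {"load": 0.40, "speed": 0.90},
                {"load": 0.95, "speed": 0.10}]
high = lambda v: max(0.0, min(1.0, (v - 0.5) / 0.4))   # 'High' ramp membership
low = lambda v: max(0.0, min(1.0, (0.5 - v) / 0.4))    # 'Low' ramp membership
print(fuzzy_support(transactions, {"load": high, "speed": low}))
```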
The graph of the function f(ζ) is presented in Fig. 10. It depicts a sigmoidal dependence on ζ. As a result, it may corrupt all slowly varying signals of the process. In order to suppress the noise and obtain the nonlinear control surface, the parameters of the input and output membership functions of the fuzzy rule base are tuned by nonlinear optimization. In this study a sequential quadratic programming (SQP) algorithm, as implemented in MATLAB, is used.
This approach represents the state of the art in nonlinear programming methods, because a nonlinearly constrained problem can often be solved in fewer iterations using SQP than an unconstrained problem. One of the reasons is that, due to the limits on the feasible area, the optimizer can make well-informed decisions regarding search directions and step length. Note that widely overlapping membership
functions give good numerical results. However, fuzzy membership functions
[Fig. 10 The function f(ζ) versus ζ for j = 1, 2, 4, 8]
lacking any physical interpretation and losing locality are possible. To avoid this, different kinds of constraints may be placed on the optimization: equality constraints, inequality constraints, and parameter bounds. SQP efficiently solves this constrained nonlinear optimization problem, in which the objective function and constraints may be nonlinear functions of the variables.
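As a rough analogue of the constrained tuning described above, the sketch below uses SciPy's SLSQP routine as a stand-in for the MATLAB SQP algorithm; the objective function, constraints, and bounds are illustrative placeholders rather than the plant model of the paper.

```python
import numpy as np
from scipy.optimize import minimize

def tri(x, a, b, c):
    """Triangular membership function with feet a, c and peak b."""
    return np.maximum(np.minimum((x - a) / (b - a), (c - x) / (c - b)), 0.0)

x = np.linspace(-0.1, 0.1, 201)                 # unified universe
target = np.exp(-(x / 0.03) ** 2)               # desired response shape (toy example)

def objective(p):
    a, b, c = p
    return np.sum((tri(x, a, b, c) - target) ** 2)

cons = ({"type": "ineq", "fun": lambda p: p[1] - p[0] - 1e-3},   # enforce a < b
        {"type": "ineq", "fun": lambda p: p[2] - p[1] - 1e-3})   # enforce b < c
bounds = [(-0.1, 0.1)] * 3                      # parameter bounds on the universe
res = minimize(objective, x0=[-0.05, 0.0, 0.05], method="SLSQP",
               bounds=bounds, constraints=cons)
print(res.x)                                    # tuned membership-function parameters
```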
4 Simulation Results
Through the available actuators it is possible to adjust the speed of the grinding circuit feeders, the fluid added to the mill and sump, the pump, and the driving motor speed. The level of the sump is controlled by adjusting the set-point of the pump velocity. The water flow to the flotation circuit is kept proportional to the load reference. As mentioned, the above process is subject to many disturbances, the hardness of the raw ore feed being the most significant one. This exerts a strong influence on the quality of the grinding product. Modeling of the process is further complicated by the fact that different operating points are defined by changes in mineral hardness. Based on a reasonably simple transfer function, the adjustable input ports of the driving motor and the entrance negative pressure of the breakage circuit are initialized with step signals. The strongly coupled characteristics of the variables imply that if one controlled variable is regulated with one manipulated variable, the
[Fig. 11 Simulated step response of the grinding circuit outputs: PID controller vs fuzzy logic controller, and the fuzzy logic controller response to an amplitude change after 25 s]
other controlled variables will, more likely than not, be influenced in an undesired fashion. Using the aforementioned fuzzy logic rules, the simulated step response of the grinding circuit is shown in Fig. 11. The amplitude changes are followed smoothly until the steady state is reached. To ensure both controller stability and performance, although guidelines exist, the control procedure will require a number of iterations until a solution satisfying all requirements is found.
The existence of large time delays, time-varying parameters, and nonlinearities is among the other difficulties generally encountered in mineral processing control.
As illustrated in Fig. 12, one of the major attractions of the proposed methodology is that it can readily be used to design robust multivariable controllers that take account of all control-loop interactions while still guaranteeing all user-specified stability and performance requirements. As shown in Fig. 12, in the presence of
disturbances the fuzzy logic controller model presents a response better than the
conventional PID model.
5 Conclusion
[Fig. 12 Grinding circuit outputs versus time in the presence of disturbances: fuzzy logic controller vs conventional PID]
than the conventional PID approach. Based on the simulation results it can be deduced that the fuzzy controller is faster than the conventional controller in the transient state, and also provides a much smoother signal with less fluctuation at steady state. The proposed method has strong adaptability and can overcome the nonlinear and strongly coupled features of mineral processing over a wide operating range.
References
1. S.C. Abou, M.D. Thien, Fuzzy logic controller based on association rules mining: application
to mineral processing. International Conference on Modeling, Simulation and Control
(ICMSC’09), San Francisco, USA, 20–22 Oct 2009
2. S.C. Abou, Contribution to ball mill modeling. Doctorate Thesis, Laval University, Quebec,
Canada, 1998
3. G. Levitin, Universal Generating Function and Its Applications (Springer, Berlin, 2005)
4. S. Morell, The prediction of powder draws in wet tumbling mills. Doctorate Thesis, University
of Queensland, 1993
5. E.W. Davis, Fine crushing in ball mills. AIME Trans. 61, 250–296 (1919)
6. A. Desbiens, K. Najim, A. Pomerleau, D. Hodouin, Adaptive control-practical aspects and
application to a grinding circuit. Optim. Control Appl. Methods 18, 29–47 (1997)
7. R.K. Rajamani, J.A. Herbst, Optimal control of a ball mill grinding circuit: II. Feedback and optimal control. Chem. Eng. Sci. 46(3), 871–879 (1991)
8. L. Zhai, T. Chai, Nonlinear decoupling PID control using neural networks and multiple models. J. Control Theory Appl. 4(1), 62–69 (2006)
9. W. Kuo, R. Wan, Recent advances in optimal reliability allocation. IEEE Trans. Syst. Man Cybern. Part A Syst. Hum. 37, 143–156 (2007)
10. H.J. Zimmermann, Fuzzy Set Theory and its Application, 2nd edn. (Kluwer Academic, Dordrecht, 1991)
11. R. Agrawal, T. Imielinski, A. Swami, Mining association rules between sets of items in
large databases. International conference on management of data (ACM SIGMOD’93),
Washington, DC, 1993
12. J. Guan, Y. Wu, Repairable consecutive-k-out-of-n: F system with fuzzy states. Fuzzy Sets
Syst. 157, 121–142 (2006)
13. J. Huang, M.J. Zuo, Y. Wu, Generalized multi-state k-out-of-n: G systems. IEEE Trans.
Reliab. 49(1), 105–111 (2000)
Chapter 25
A Combined Cycle Power Plant Simulator:
A Powerful, Competitive, and Useful Tool
for Operator’s Training
1 Introduction
E. Zabre (*)
Instituto de Investigaciones Eléctricas, Simulation Department, Reforma 113, Col. Palmira,
Cuernavaca, Morelos, Mexico 62490
e-mail: [email protected]
2 Antecedent
[Figure: combined cycle power plant layout: gas turbine with natural gas supply, heat recovery steam generator and chimney, high- and low-pressure steam turbine, electric generators, main transformers, substation and transmission towers, air cooling condenser, deaerator and vacuum system, demineralized water plant, condenser and condensate pump, circulating water pump, raw water tank and water well]
them in two stages. The first stage corresponds to the gas unit, which can be finished in a short time and immediately begin operation; the construction of the steam unit can be completed later, at which point the combined cycle is complete.
3 Architecture Configuration
In the CCPPS several configuration modules are identified, as shown in Fig. 2. All modules are strongly related to a real time system (RTS) that controls and sequences the different tasks running in the simulator. The modules are the following:
• Instructor Console® (IC). This is an important man-machine interface (MMI) for the instructor, which controls and coordinates the scenarios seen by the operator under training. These scenarios can be presented to the operator as normal plant operation or under a malfunction of a particular system or piece of equipment. A more detailed functional description of this interface is given below.
• CONINS. A module to retrieve all the static information during a simulation session, for example: malfunctions, remote functions, local instrumentation, external parameters, etc.
• Operator Console (OC). This system is a replica of the real operation station and allows the operator to monitor and control the plant parameters from three different sets of graphical interfaces (GI): gas turbine, steam turbine and
[Figure 2: Simulator configuration modules, all connected to the Real Time System: SABADAMAS, BDSIM, Instructor Console and Operator Console.]
an old one. The simulator takes an automatic snapshot every 15 s, and this interval may be configured up to 10 min.
3. Simulation speed. From the beginning of the simulation session the simulator executes in real time, but the instructor may run it up to two or three times faster than real time. For example, slow thermal processes such as turbine iron heating can be simulated to occur faster. Slowing the simulation speed (by up to ten times) may be useful for studying some fast events in detail.
4. Malfunctions. This option is used to introduce, modify, or remove a simulated failure of plant equipment, for example: pump trips, heat exchanger tube breaks, electrical switch openings, valve obstructions, etc. The instructor has the option to define the malfunction's initial time, its permanence time, and its evolution time.
5. External parameters. External conditions such as the atmospheric pressure and temperature, or the voltage and frequency of the external system, among others, can be modified by the instructor.
6. Repetition. Simulation sessions may be repeated as many times as the instructor considers necessary, including the trainee's actions.
7. Actions register. This is an automatic record of the actions carried out by the trainee, which can be re-played in exactly the same sequence and time in order to analyze what the trainee did during the simulation session.
8. Development tools. The simulator implements some other helpful tools for use during simulation session development, for example: monitoring and changing online any selected list of global variables, or tabulating any selected list of variables and plotting them.
The OC is a replica of the real operation station. It is formed by the interface communication system and consists of three main sets of plant interface displays: the gas turbine, combined cycle, and auxiliary electrical external board interfaces. From there it is possible to display any logical or analogical plant parameter and to operate and handle the manual or automatic controls related to the plant operation.
The gas turbine interface (GTI) is a reproduction of the dynamic screens on which it is possible to monitor and control the gas turbine parameters, including: ready to start/trips; vibration analysis; combustion flashback; lube oil system; trend overview; turbine cooling system; emissions; synchronization; the simplified gas turbine (SGT) model; etc. [7].
The combined cycle interface (CCI) is a translation of the real operator console, known as the engineering console. An example is shown in Fig. 4.
The auxiliary electrical external board interface consists of a couple of screens that present all the external instrumentation necessary to synchronize the electrical unit.
[Fig. 4: Example of a combined cycle interface (CCI) screen, showing drum instrumentation tags (e.g. 01 HAH41 AA093, 01 LCQ61 AA401) with level and pressure readings.]
The CCPPS has Windows XP™ as its operating system and was programmed using MS Visual Studio 2005: Fortran Intel™ for the mathematical models; Flash and VisSim™ for the gas turbine screens (the steam turbine screens were translated from the CCPP control); and C# for the modules of the simulation environment. The simulation environment (called MAS®, proprietary software of the IIE [8]) has three main parts: the real time executive, the operator module, and the console module. Each module runs on a different personal computer (PC), and all of them communicate by means of the TCP/IP protocol. The MAS modules are programmed in C# under the MS Visual Studio™ software development platform.
[Figure: Simulator hardware architecture: control room with operation stations OS1 and OS2, a LAN switch, the Instructor Console (IC) and the maintenance node (MN).]
The Instructor Console PC, also named the simulation node (SN), has two 20″ flat panel monitors. Each of the two operator station PCs (OS1 and OS2) also has 20″ flat panel monitors, but OS1 has two additional 42″ flat panel monitors as auxiliary monitors.
The maintenance node (MN) is used to make any modifications to the software or to the process or control models; these are tested and validated by an instructor before being installed in the simulator. Figure 6 shows the real, final simulator architecture as described above.
4 Modeled Systems
The control models were translated into C# by means of a graphical software tool,
while the system processes were programmed on Fortran Intel™.
Generally speaking, the distributed control system (DCS) is a set of PLCs in which the algorithms that control, in an automatic or semi-automatic way, all the systems of a power plant are allocated. These control algorithms are organized in components with a specific function or task, for example: PID controllers, high/low detectors, timers, set/reset memories, etc. This organization is represented by means of a network of these components, which communicate information through connections (see Fig. 7). These networks are organized in a hierarchical way: at the bottom levels are the basic elements such as AND, OR and NOT gates; at the middle level
Fig. 7 Control diagram (detail) for speed control on the digital electro-hydraulic control (EHC)
system
are diagrams; and finally, at the top level, there are modules. Then, with a set of modules, a DCS was built [9].
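To make the idea of such a hierarchical component network concrete, the following is a minimal Python sketch (the actual models were generated as C routines and C# code, as described below); the gate types, signal names and diagram grouping are illustrative only and do not reproduce the real control logic.

import numpy as np  # not strictly needed; kept for consistency with later sketches

class Gate:
    """One basic logic element of the bottom level (AND, OR or NOT)."""
    def __init__(self, kind, inputs):
        self.kind = kind          # "AND", "OR" or "NOT"
        self.inputs = inputs      # names of the signals it reads

    def evaluate(self, signals):
        values = [signals[name] for name in self.inputs]
        if self.kind == "AND":
            return all(values)
        if self.kind == "OR":
            return any(values)
        if self.kind == "NOT":
            return not values[0]
        raise ValueError(self.kind)

class Diagram:
    """A middle-level network of gates; each gate output becomes a new signal."""
    def __init__(self, gates):
        self.gates = gates        # dict: output signal name -> Gate

    def evaluate(self, signals):
        for out_name, gate in self.gates.items():
            signals[out_name] = gate.evaluate(signals)
        return signals

# Hypothetical fragment of a speed-control diagram (cf. Fig. 7):
speed_diagram = Diagram({
    "and_00003": Gate("AND", ["pcs1", "pcs2"]),
    "not_00002": Gate("NOT", ["pcs3"]),
    "or_00001":  Gate("OR",  ["hrsg1", "hrsg2", "das"]),
    "and_00004": Gate("AND", ["and_00003", "or_00001"]),
})

state = {"pcs1": True, "pcs2": True, "pcs3": False,
         "hrsg1": False, "hrsg2": True, "das": False}
print(speed_diagram.evaluate(state)["and_00004"])   # -> True

A set of such diagrams would then be grouped into a top-level module, mirroring the three-level organization described above.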
For the gas turbine, the control models were developed by drawing the diagrams (considering the original ones provided by the designer) in VisSim, which generates standard C language routines from the drawings (one per diagram).
For the control of the steam processes, the DCS model was developed in the C# programming language, taking into account the modularization of the reference power plant. The main control modules modeled were:
• pcs1 and pcs2, auxiliary control system, parts 1 and 2
• pcs3, electric network control
• hrsg1 and hrsg2, control of the heat recovery steam generator, parts 1 and 2
• deh, digital electro-hydraulic control system
• das, digital acquisition system
There are 14 different system processes modeled for the gas part plus 26 system processes modeled for the steam part, which together form the mathematical representation of the behavior of the CCPP. All models were developed using Fortran Intel™ following a methodology proposed by the IIE [3], which is summarized in a few steps: (a) obtain and classify process information directly from the power plant; (b) analyze the information and state a conceptual model to be simulated; (c) simplify the analyzed information to show only the simulated equipment and its nomenclature; (d) justify all possible assumptions; and (e) obtain the flow and pressure network at an operational point, namely at 100% of load. The plant systems were classified into groups as listed in Table 1.
5 Project Control
The project was divided into 10 main phases and 50 tasks. Time, work, and resources were loaded and controlled with Microsoft Project Manager™. Some overtime was needed in order to meet the scheduled dates. The main sources of delay were discrepancies that had to be resolved when the local tests were performed, and delays in the delivery of the plant information.
The project lasted seventeen months. The CFE participated actively in obtaining the real plant data and in defining, together with the SD, the data that were not available. In particular, CFE created the control diagrams that were not part of the DCS and helped operate the plant on the simulator to determine and adjust the models' behavior.
There was a project head, one person responsible for the software activities, one for the hardware, one for the process modelling, and one for the control models. A total of seventeen researchers participated during the peak stages of the development of the project. All activities were closely followed in order to ensure a punctual and precise completion. Weekly reviews were held with the heads of area. A summary report was delivered to the customer monthly, and a full review was performed every two months. The quality system implemented in the SD allowed timely corrective actions when needed.
6 Results
The simulator validation was carried out by testing its response against the very detailed operation acceptance test procedures prepared by specialized personnel from CFE.
As an example of the models' behavior, some results for the GTS are presented. The customer provided plant data for both an automatic start-up procedure and a load-decrease operation, and the simulator results were compared with these data. No data for other transients were available. Figure 8 presents a comparison between the expected and simulated speed.
In Fig. 9, the x-axis is the sum of the turbine speed (rad/s) and the power produced (MW). The pressure in the combustor, its temperature, and the temperature
[Figs. 8 and 9: Plant versus simulator comparison: turbine speed (RPM) and combustor pressure (kPa) plotted against velocity plus power (rad/s + MW).]
in the exhaust were chosen to exemplify the results. While the turbine is increasing its speed the power is zero, and once the plant produces power (up to 150 MW) the speed is constant at 376.8 rad/s. In the graph (which has a non-linear x-axis) it may be observed that the behavior of the simulator variables is similar to that of the real variables.
In addition, the simulator was tested over the whole normal operation range, from cold start conditions to full load, including its response under malfunctions and abnormal operation procedures. In all cases the response satisfied the ISA-S77.20-1993 Fossil-Fuel Power Plant Simulators Functional Requirements standard.
7 Conclusions
The CCPPS validation has been carried out by specialized CENAC personnel, highly experienced in the use of simulators, under rigorous simulator acceptance testing procedures, to ensure that the simulator fulfills the performance specified by the end user and provides one more useful tool to train the future generation of operators of combined cycle power plants. The results demonstrate that the simulator is a replica, high-fidelity, plant-specific simulator for operator training. Realism is provided by the emulation of the DCS screens.
The simulator underwent several adjustments during its development to ensure that its behavior and dynamics operate similarly to those observed in the real plant. An additional aspect that gives great confidence and robustness to the simulator behavior is that it uses the same DCS as the Chihuahua CCPP. Only minimal adjustments and changes in the controller parameters of the regulation loops were required, which means that the process models obtained by the SD engineers correctly reproduce the real behavior of the reference plant. Since combined cycle power plants have powerful features that include high thermal efficiency, low installed cost, a wide range of burnable fuels, a short installation cycle, and low operating costs compared to conventional thermal, nuclear, and steam plants, it is expected that such simulators will be in strong demand for training purposes in the near future.
8 Future Works
As future work, we have considered some projects that might improve and strengthen this development, with the purpose of more closely approximating the control room of the power plant. Among these are the following: (1) to standardize the simulator MMI to the original CCPP interface view, since that is the real equipment the Chihuahua combined cycle power plant has, which will be very valuable and useful to the plant's trainee operators; (2) to expand the simplified gas turbine model; (3) to include the thermal regime of a combined cycle power plant; and (4) to incorporate a touch-screen keyboard, or replace the virtual keyboard with a physical one with the same functionality as the one used in the control room.
Acknowledgment The simulators developed by the SD of the IIE have been carried out thanks to the enthusiastic and very professional hard work of IIE researchers and technicians, as well as the co-operation of operators, trainers and support personnel of the CFE training center. This project was carried out thanks to the financial support of CFE; R. Cruz was the technical manager as client counterpart, and A. Matías took part as a specialized operator during the simulator tests.
Regarding the SD personnel's work, it should be mentioned that all participating researchers contributed their valuable experience during the design, implementation and final testing of this simulator.
References
1. L. Bo, M. Shahidehpour, Short-term scheduling of combined cycle units. IEEE Trans. Power Syst. 19(3), 1616–1625 (Aug 2004)
2. J.Y. Shin et al., Analysis of the dynamic characteristics of a combined-cycle power plant. Energy Int. J. 27(12), 1085–1098 (Dec 2002). ISSN 0360-5442
3. E.J. Roldán-Villasana, M.J. Cardoso, Y. Mendoza-Alegría, Modeling methodology for operator's training full scope simulators applied in models of a gas-turbine power plant, in Memorias del 9o. Congreso Interamericano de Computación Aplicada a la Industria de Procesos, Montevideo, Uruguay, 25–28 Aug 2009
4. Mitsubishi Heavy Industries, Chihuahua Combined Cycle Power Plant Begins Commercial Operation in Mexico, http://www.mhi.co.jp/en/power/news/sec1/2001_sep_08.html, Sept 2001
5. CFE Comisión Federal de Electricidad, Una empresa de clase mundial, Centrales, http://www.cfe.gob.mx/QuienesSomos/queEsCFE/estadisticas/Paginas/Indicadoresdegeneración.aspx, 2009
6. D.L. Chase, GE Power Systems, Combined-Cycle Development Evolution and Future, Schenectady, NY, 1998, http://www.gepower.com/prod_serv/products/tech_docs/en/downloads/ger4206.pdf
7. E.J. Roldán-Villasana, Y. Mendoza-Alegría, J. Zorrilla-Arena, Ma.J. Cardoso, Development of a Gas Turbine Full Scope Simulator for Operator's Training, in Second UKSIM European Symposium on Computer Modeling and Simulation (EMS 2008), Liverpool, UK, 8–10 Sept 2008
8. L.A. Jiménez-Fraustro, Desarrollo del Medio Ambiente de Simulación para Simuladores en Tiempo Real basado en sistema operativo Windows XP, Reporte Interno, Gerencia de Simulación, IIE, México, 2005
9. G. Romero-Jiménez, V. Jiménez-Sánchez, E.J. Roldán-Villasana, Graphical Environment for Modeling Control Systems in Full Scope Training Simulators, in Proceedings of World Academy of Science, Engineering and Technology (WASET 2008), vol. 30, Paris, France, 5–7 July 2008, ISSN 1307-6884, pp. 792–797
Chapter 26
Texture Features Extraction in Mammograms
Using Non-Shannon Entropies
1 Introduction
Cancer remains one of the most frequently occurring fatal diseases affecting men and women throughout the world. Among the various cancers, breast cancer is of special concern in women. According to the statistics, breast cancer is one of the major causes of increased mortality among middle-aged women in both developed and developing countries. However, the etiologies of breast cancer are unknown and no single dominant cause has emerged. There is still no known way of preventing breast cancer, but early detection allows treatment before the disease spreads to other parts of the body. It is evident that the earlier breast cancer is found, the better the chance a woman has of a full recovery. Early detection of breast cancer can therefore play a very important role in reducing morbidity and mortality rates.
Mammography [1, 2] is the single most effective, reliable, low-cost and highly sensitive method for early detection of breast cancer. Mammography offers high-quality images at low radiation doses and is the only widely accepted imaging method for routine breast cancer screening. It is recommended that women aged 40 or above have a mammogram every 1–2 years. Although mammography is widely used around the world for breast cancer detection, there are difficulties when mammograms are searched for signs of abnormality by expert radiologists: mammograms generally have low contrast compared with the normal breast structure, and signs of early disease are often small or subtle. This is the main cause of many missed diagnoses, which can be mainly attributed to human factors such as subjective or varying decision criteria, distraction by other image features, or simple oversight. Since the consequences of errors in detection or classification are costly, and since mammography alone cannot prove whether a suspicious area is tumorous, malignant or benign, the tissue has to be removed for closer examination using breast biopsy techniques. A false-positive detection, however, causes an unnecessary biopsy, whereas in a false-negative detection an actual tumor remains undetected. Thus, there is a significant need for methods for the automatic classification of suspicious areas in mammograms, as a means of aiding radiologists to improve the efficacy of screening programs and avoid unnecessary biopsies.
A typical mammogram contains a vast amount of heterogeneous information depicting different tissues, vessels, ducts, chest skin, the breast edge, the film, and the x-ray machine characteristics [3–5]. In order to build a robust diagnostic system that correctly classifies abnormal and normal regions of mammograms, one approach is to present all the information available in the mammogram to the diagnostic system so that it can discriminate between abnormal and normal tissues. However, the use of all this heterogeneous information results in high-dimensional feature vectors that significantly degrade the diagnostic accuracy of the systems and increase their computational complexity. Therefore, reliable feature vectors that reduce the amount of irrelevant information should be considered, producing robust mammographic descriptors of compact size [6–8].
Texture is one of the important characteristics used in classifying abnormal and normal regions in mammograms. The texture of an image refers to the appearance, structure and arrangement of the parts of an object within the image [9, 10]. Images used for diagnostic purposes in clinical practice are digital. A two-dimensional digital image is made up of small rectangular blocks or pixels (picture elements), each represented by a set of coordinates in space and each having a value representing the gray-level intensity of that picture element. A feature value is a real number which encodes some discriminatory information about a property of an object. Generally speaking, texture feature extraction methods can be classified into three major categories, namely statistical, structural and spectral. In statistical approaches, texture statistics such as the moments of the gray level histogram, or statistics based on the gray level co-occurrence matrix, are computed to discriminate different textures [11]. In structural approaches, the texture primitive, the basic element of texture, is used to form more complex texture patterns by means of grammar rules which specify the generation of the texture pattern. Finally, in spectral approaches, the textured image is transformed into the frequency domain and the texture features are extracted by analyzing the power spectrum. Various texture descriptors have been proposed in the past. In addition to the aforementioned methods, Laws' texture energy measures, Markov random field models, the texture spectrum, etc. are some other texture descriptors [12–14].
This paper deals with statistical approaches to extracting texture features from digital mammograms. The gray level histogram moments method is normally used for this purpose. Entropy [15–17] is an important texture feature computed with this method, used to build a robust descriptor for correctly classifying abnormal and normal regions of mammograms. Entropy measures the randomness of the intensity distribution. In most feature descriptors, Shannon's measure is used to estimate entropy. In this paper, however, non-Shannon measures are used. Non-Shannon entropies have a higher dynamic range than Shannon entropy over a range of scattering conditions and are therefore more useful in estimating scatter density and regularity [18].
Regarding the statistical approach to describing texture, one of the simplest computational approaches is to use statistical moments of the gray level histogram of the image. The image histogram carries important information about the content of an image and can be used for discriminating the abnormal tissue from the local healthy background. Consider the gray level histogram {h_i, i = 0, 1, 2, ..., N_g − 1}, where N_g is the number of distinct gray levels in the ROI (region of interest). If n is the total number of pixels in the region, then the normalized histogram of the ROI is the set {H_i, i = 0, 1, 2, ..., N_g − 1}, where H_i = h_i/n. Generally, Shannon's measure is used to estimate entropy in most descriptors. Shannon's entropy (S) is a measure of randomness and is defined as
S = -\sum_{i=0}^{N_g-1} H_i \log_2 H_i

Renyi's entropy (R), Havrda & Charvat's entropy (HC) and Kapur's entropy (K_{\alpha,\beta}) are defined as

R = \frac{1}{1-\alpha}\log_2\left(\sum_{i=0}^{N_g-1} H_i^{\alpha}\right), \qquad \alpha \neq 1,\ \alpha > 0

HC = \frac{1}{1-\alpha}\left(\sum_{i=0}^{N_g-1} H_i^{\alpha} - 1\right), \qquad \alpha \neq 1,\ \alpha > 0

K_{\alpha,\beta} = \frac{1}{\beta-\alpha}\log_2\frac{\sum_{i=0}^{N_g-1} H_i^{\alpha}}{\sum_{i=0}^{N_g-1} H_i^{\beta}}, \qquad \alpha \neq \beta,\ \alpha > 0,\ \beta > 0
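As a minimal illustration of how these four measures can be computed from the normalized gray-level histogram of an ROI, the following Python/NumPy sketch is given; the chapter does not specify an implementation, and the α and β values shown are simply those used in Table 1.

import numpy as np

def entropies(roi, alpha=0.5, beta=0.7):
    # Normalized gray-level histogram H_i of the ROI (zero bins dropped,
    # since they contribute nothing to the sums for alpha, beta > 0).
    hist = np.bincount(roi.ravel(), minlength=256).astype(float)
    hist = hist[hist > 0]
    H = hist / hist.sum()

    shannon = -np.sum(H * np.log2(H))
    renyi   = np.log2(np.sum(H ** alpha)) / (1.0 - alpha)
    havrda  = (np.sum(H ** alpha) - 1.0) / (1.0 - alpha)
    kapur   = np.log2(np.sum(H ** alpha) / np.sum(H ** beta)) / (beta - alpha)
    return shannon, renyi, havrda, kapur

# Example with a random array standing in for a mammogram ROI.
roi = (np.random.rand(128, 128) * 256).astype(np.uint8)
S, R, HC, K = entropies(roi, alpha=0.5, beta=0.7)
print(f"S={S:.4f}  R={R:.4f}  HC={HC:.4f}  K={K:.4f}")

Note that the chapter evaluates the Havrda & Charvat measure at a different α than the Renyi measure (e.g. 0.7 versus 0.5 in Table 1); with this sketch that simply means calling the function twice with different parameters.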
3 Experimental Results
Table 1 Entropy measure of mammograms based on gray level histogram moments at α = 0.5 for Renyi, α = 0.7 for Havrda & Charvat, and α = 0.5, β = 0.7 for Kapur
Image samples    S    R (α = 0.5)    HC (α = 0.7)    K (α = 0.5, β = 0.7)
Mam1 4.7658 6.9242 8.7940 7.9945
Mam2 5.1761 6.9060 9.0540 7.7959
Mam3 5.2400 7.0073 9.3439 7.8821
Mam4 5.1644 6.7166 8.6331 7.5718
Mam5 4.8436 6.7857 8.5198 7.8131
Mam6 5.1044 6.7977 8.7193 7.7227
Mam7 5.5805 7.0348 9.7094 7.7459
Mam8 4.2316 6.6505 7.8027 7.9252
Mam9 2.9738 6.1737 6.0264 7.9867
Mam10 4.1991 6.4753 7.3892 7.7603
Mam11 4.5951 6.7681 8.2881 7.9115
Mam12 4.9022 6.8990 8.8035 7.9257
Mam13 4.9559 6.8047 8.6226 7.7983
Mam14 4.4958 6.7285 8.1625 7.8909
Mam15 4.9878 6.9086 8.9161 7.8832
Table 2 Entropy measure of mammograms based on gray level histogram moments at α = 0.1 for Renyi, α = 0.5 for Havrda & Charvat, and α = 0.1, β = 0.9 for Kapur
Image samples    S    R (α = 0.1)    HC (α = 0.5)    K (α = 0.1, β = 0.9)
Mam1 4.7658 7.7796 20.0410 8.0925
Mam2 5.1761 7.7505 19.9023 8.0219
Mam3 5.2400 7.7346 20.6845 7.9934
Mam4 5.1644 7.6608 18.5109 7.9306
Mam5 4.8436 7.7167 19.0077 8.0217
Mam6 5.1044 7.7035 19.0954 7.9825
Mam7 5.5805 7.6587 20.9020 7.8736
Mam8 4.2316 7.7460 18.0460 8.1189
Mam9 2.9738 7.6251 14.9927 8.1298
Mam10 4.1991 7.7053 16.8654 8.0857
Mam11 4.5951 7.7552 18.800 8.0900
Mam12 4.9022 7.7795 19.8491 8.0820
Mam13 4.9559 7.7529 19.1466 8.0516
Mam14 4.4958 7.7473 18.5954 8.0910
Mam15 4.9878 7.7747 19.9222 8.0668
[Figures: graphical comparison of the Shannon, Renyi, Havrda & Charvat and Kapur entropy values for normal, benign and malignant mammograms.]
From the above results, it can be inferred that the non-Shannon entropies (Renyi, Havrda and Charvat, and Kapur) are useful parameters for the classification of normal and abnormal mammogram images. In particular, the Havrda and Charvat entropy is the most suitable for this purpose. The experimental results indicate that, for the non-Shannon entropies, the values of the constants α and β play an important role in classification, so the selection of suitable values for these constants is necessary. The Havrda and Charvat entropy at α = 0.5 gives the most suitable results for classification. This in turn also helps in the diagnosis of palpable lesions, microcalcifications, or tumors in the breast. This work may be employed to develop a Computer Aided Decision (CAD) system for early detection of breast cancer.
Table 5 Classification of uncompressed fatty, fatty and non-uniform fatty breasts based on Shannon entropy, Renyi's entropy at α = 0.5, Havrda & Charvat's entropy at α = 0.7 and Kapur's entropy at α = 0.5, β = 0.7
Mammogram classification    Average S    Average R (α = 0.5)    Average HC (α = 0.7)    Average K (α = 0.5, β = 0.7)
Uncompressed fatty 4.9878 6.9086 8.9161 7.8832
Fatty breast 5.1761 6.9060 9.2540 7.7959
Non-uniform fatty breast 4.5951 6.7681 8.2881 7.9115
Table 6 Classification of uncompressed fatty, fatty and non-uniform fatty breasts based on Shannon entropy, Renyi's entropy at α = 0.1, Havrda & Charvat's entropy at α = 0.5 and Kapur's entropy at α = 0.1, β = 0.9
Mammogram classification    Average S    Average R (α = 0.1)    Average HC (α = 0.5)    Average K (α = 0.1, β = 0.9)
Uncompressed fatty 4.9878 7.7747 19.9222 8.0668
Fatty breast 5.1761 7.7505 20.2023 8.0219
Non-uniform fatty breast 4.5951 7.7552 18.8800 8.0900
In this paper, an attempt is made to develop a new feature descriptor based on non-Shannon entropies (Renyi, Havrda and Charvat, and Kapur) for classifying normal and abnormal mammogram images. Experimental results have demonstrated that the Havrda and Charvat entropy based feature works satisfactorily for classifying normal, benign and malignant digital mammograms, including uncompressed fatty, fatty and non-uniform fatty breast mammograms.
Fig. 9 Graphical representation of the classification of uncompressed fatty, fatty breast and non-uniform fatty breast based on Shannon entropy, Renyi's entropy at α = 0.5, Havrda & Charvat's entropy at α = 0.7 and Kapur's entropy at α = 0.5, β = 0.7
[Figure: graphical representation of the classification of uncompressed fatty, fatty breast and non-uniform fatty breast based on the Shannon, Renyi, Havrda & Charvat and Kapur entropies.]
In future work, the Havrda and Charvat entropy based feature will be combined with other existing statistical features to build a robust feature descriptor for developing a complete Computer Aided Diagnosis (CAD) system for early detection of breast cancer. Work is in progress in this direction.
References
3. B. Verma, P. Zhang, A novel neural-genetic algorithm to find the most significant combina-
tion of features in digital mammograms. Appl. Soft Comput. l(7), 612–625 (2007)
4. S.-K. Lee, C.-S. Lo, C.-M. Wang, P.-C. Chung, C.-I. Chang, C.-W. Yang, P.-C. Hsu, A
computer-aided design mammography screening system for detection and classification of
microcalcifications. Int. J. Med. Inf. 60, 29–57 (2000)
5. H.D. Cheng, X. Cai, X. Chen, L. Hu, X. Lou, Computer-aided detection and classification of
microcalcifications in mammograms: a survey. Pattern Recogn. 36, 2967–2991 (2003)
6. H.D. Cheng, M. Cui, Mass lesion detection with a fuzzy neural network. Pattern Recogn. 37,
1189–1200 (2004)
7. H.D. Cheng, X.J. Shi, R. Min, L.M. Hu, X. P. Cai, H.N. Du, Approaches for automated
detection and classification of masses in mammograms. Pattern Recogn. 39, 646–668 (2006)
8. M.E. Mavroforakis, H.V. Georgiou, N. Dimitropoulos, D. Cavouras, S. Theodoridis, Mam-
mographic masses characterization based on localized texture and dataset fractal analysis
using linear, neural and support vector machine classifiers. Artif. Int. Med. 37, 145–162
(2006)
9. B. Verma, J. Zakos, A computer-aided diagnosis system for digital mammograms based on
fuzzy-neural and feature extraction techniques. IEEE Trans. Inf. Technol. Biomed. 5(1),
(Mar 2001)
10. J.K. Kim, H.W. Park, Statistical textural features for detection of microcalcifications in
digitized mammograms. IEEE Trans. Med. Imaging 18(3), (Mar 1999)
11. J.K. Kim, J. Mi Park, K.S. Song, H.W. Park, Texture analysis and artificial neural network for
detection of clustered microcalcifications on mammograms, in Neural Networks for Signal
Processing [1997] VII, Proceedings of the 1997 IEEE Workshop, Amelia Island, Florida,
USA, pp. 199–206
12. A. Karahaliou, S. Skiadopoulos, I. Boniatis, P. Sakellaropoulos, E. Likaki, G. Panayiotakis, L. Costaridou, Texture analysis of microcalcifications on mammograms for breast cancer diagnosis. Brit. J. Radiol. 80, 648–656 (2007)
13. Y.-Y. Wan, J.-X. Du, D.-S. Huang, Z. Chi, Y.-M. Cheung, X.-F. Wang, G.-J. Zhang,
Bark texture feature extraction based on statistical texture analysis, in Proceedings of 2004
International Symposium on Intelligent Multimedia, Video and Speech Processing, Hong
Kong, Oct 20–22, 2004
14. M.R. Chandraratne, Comparison of three statistical texture measures for lamb grading, in 1st International Conference on Industrial and Information Systems (ICIIS 2006), Sri Lanka, 8–11 Aug 2006
15. J.N. Kapur, Measure of Information and Their Applications (Wiley Eastern Limited,
New Delhi, 1994)
16. L.I. Yan, F. Xiaoping, L. Gang, An application of Tsallis entropy minimum difference on
image segmentation, in Proceeding of the 6th World Congress on Intelligent Control and
Automation, Dalian, China, 21–23 June 2006
17. M. Portes de Albuquerque, I.A. Esquef, A.R. Gesualdi Mello, Image thresholding using Tsallis entropy. Pattern Recogn. Lett. 25, 1059–1065 (2004)
18. R. Smolikova, M.P. Wachowiak, G.D. Tourassi, A. Elmaghraby, J.M. Zurada, Characteriza-
tion of ultrasonic back scatter based on generalized entropy, in Proceeding of the 2nd Joint
EMBS/BMES Conference, Houston, TX, USA, 23–26 Oct 2002
Chapter 27
A Wideband DOA Estimation Method Based
on Arbitrary Group Delay
1 Introduction
DOA estimation is an important algorithm in array signal processing systems [1, 2].
Various estimation methods such as Maximum Likelihood (ML), Multiple Signal
Classification (MUSIC) [3], and Estimation of Signal Parameters via Rotational
Invariance Technique (ESPRIT) [4] have been proposed in the last two decades. However, most of these methods are based on narrow-band models. Several extensions
to the wideband sources are also proposed [5], for example, the Coherent Signal-
Subspace Processing (CSSP) [6], Wideband Cyclic MUSIC [7] and Wideband
ESPRIT [8]. A frequency domain algorithm based on ML criterion called AML
method has been proposed for source localization and DOA estimation [9, 10]. A
method proposed by Ward [11] performs beam-space processing using time-
domain frequency-invariant beamformers (TDFIB). In this paper, we propose a
X. Zhang (*)
School of Electronic Science and Engineering, Nanjing University, Nanjing, P. R. China
e-mail: [email protected]
new wideband DOA estimation approach based on arbitrary group delay. This method can operate on arbitrary waveforms and is suitable for linear frequency modulation signals, nonlinear frequency modulation signals, and even carrier-free signals.
When the object direction is far from the normal direction, the delay between array
elements is so large that signals need to be aligned to get a better distance resolution
and a higher power gain. The time delay compensation is usually performed in two steps: the integral sampling period part and the fractional sampling period part. The integral part can be carried out simply by register shifting. The subsample time delay is commonly compensated by interpolation, and the quality of the compensation depends on the interpolation function. Dense sampling, in which the sampling frequency is much higher than the Nyquist frequency, is also used in some situations to achieve higher accuracy. Unfortunately, this method introduces considerable data redundancy and places higher demands on the sampling device. In particular, when the bandwidth of an ultra-wideband signal is very high, for example 1 GHz, it is hard to sample at a frequency several times higher, and the expense is also very large.
Suppose that the wideband signal is s(t), and the delayed signal is s(t − τ). According to the time shifting property of the continuous time Fourier transform, we get:

s(t - \tau) = F^{-1}\{ F\{s(t)\}\, e^{-j\omega\tau} \}   (1)

If the sampling frequency is high enough that spectral aliasing can be ignored, the digital signal delay can be implemented by

s(nT_s - \tau) = \mathrm{IDFT}\{ \mathrm{DFT}\{ s(nT_s) \}\, e^{-j\frac{2\pi k\tau}{N T_s}} \}   (2)

where T_s is the sampling period and s(nT_s) is the digital signal. The time delay τ in Eq. (2) need not be an integral multiple of the sampling period. N is the length of the processed signal, consisting of the original signal and INT(τ/T_s) padded zeros, where INT denotes the rounding operation. Zeros are padded at the tail of the original signal to avoid cyclic shifting of the original signal.
A Gaussian-modulated sine signal is used to demonstrate the method of digital group delay. The analytical expression of the signal is

s(t) = e^{-\pi\left(\frac{t - T_0}{T_g}\right)^2} \cos[2\pi f (t - T_0)]   (3)
where T_g, T_0 and f are the time width parameter of the Gaussian window, the time center and the frequency of the sine signal, respectively. T_g and f were chosen to be 50 and 0.1, and T_0 was chosen to be 0 before the signal is delayed. The results of the simulation of the digital group delay with MATLAB are shown in Figs. 1 and 2. In the simulations, the signal delayed by 3.7 samples using the digital group delay method is compared with the theoretical waveform obtained by setting T_0 to 3.7 in Eq. (3).
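The chapter's simulations use MATLAB; as a language-neutral illustration, the following NumPy sketch implements the frequency-domain delay of Eq. (2) and repeats the 3.7-sample delay experiment with the pulse of Eq. (3). The signal length and padding are illustrative choices, not the authors' exact settings.

import numpy as np

def group_delay(x, delay_samples):
    """Delay x by an arbitrary (possibly fractional) number of samples, Eq. (2)."""
    xp = np.concatenate([x, np.zeros(int(np.ceil(abs(delay_samples))) + 1)])
    k = np.fft.fftfreq(len(xp))                    # digital frequency in cycles/sample
    X = np.fft.fft(xp)
    return np.fft.ifft(X * np.exp(-2j * np.pi * k * delay_samples)).real

# Gaussian-modulated sine of Eq. (3): Tg = 50 samples, f = 0.1 cycles/sample, T0 = 0.
n = np.arange(-150, 150)
Tg, f = 50.0, 0.1
s = np.exp(-np.pi * (n / Tg) ** 2) * np.cos(2 * np.pi * f * n)

delayed = group_delay(s, 3.7)                      # delay by 3.7 samples
theory = np.exp(-np.pi * ((n - 3.7) / Tg) ** 2) * np.cos(2 * np.pi * f * (n - 3.7))
print("max deviation from theoretical waveform:",
      np.max(np.abs(delayed[:len(n)] - theory)))

For an approximately band-limited pulse such as this one, the deviation is negligible, which is the behavior illustrated by Figs. 1 and 2.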
Fig. 1 Comparison between waveform of digital group delay and theoretical waveform
Fig. 2 Error between waveform of digital group delay and theoretical waveform
From Figs. 1 and 2 it can be seen that the digital group delay realizes the time delay precisely for approximately band-limited signals. In addition, the digital group delay handles the integral and fractional sampling period parts together and has no restriction on the delay step. Furthermore, this method places no requirement on the signal waveform and is suitable for linear frequency modulation signals, nonlinear frequency modulation signals, and even carrier-free signals.
A simple time delay manifests itself as the slope of an additive linear phase in the phase spectrum, which is an additional group delay

\tau_g(f) = -\frac{1}{2\pi}\frac{d\varphi}{df} = \tau   (4)

where τ_g is a constant.
Consider an equally spaced linear array composed of N_e identical elements. The distance between consecutive sensors is d. The relative delay of the ith element is

\tau_i = \frac{i\, d \sin\alpha}{c}   (5)
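The excerpt does not reproduce the full scanning procedure; as a hedged illustration of how the group delay of Eq. (2) and the element delays of Eq. (5) can be combined, the following NumPy sketch performs a simple delay-and-sum scan over candidate directions and selects the direction of maximum output power. The array geometry, sampling rate and test pulse are assumed values, not taken from the chapter.

import numpy as np

def frac_delay(x, delay):
    """Fractional-sample delay via the frequency-domain phase ramp of Eq. (2)."""
    xp = np.concatenate([x, np.zeros(int(np.ceil(abs(delay))) + 1)])
    k = np.fft.fftfreq(len(xp))
    return np.fft.ifft(np.fft.fft(xp) * np.exp(-2j * np.pi * k * delay)).real

c, d, fs, Ne = 3e8, 0.15, 2e9, 8            # assumed propagation speed, spacing, rate, elements
true_doa = np.deg2rad(20.0)

n = np.arange(-150, 150)
pulse = np.exp(-np.pi * (n / 50.0) ** 2) * np.cos(2 * np.pi * 0.1 * n)

# Element i receives the pulse delayed by tau_i = i*d*sin(alpha)/c (Eq. (5)).
elem = [frac_delay(pulse, i * d * np.sin(true_doa) / c * fs) for i in range(Ne)]

angles = np.deg2rad(np.arange(-90, 91))
power = []
for a in angles:
    # Advance each element by the delay hypothesized for direction a, then sum.
    aligned = [frac_delay(elem[i], -i * d * np.sin(a) / c * fs) for i in range(Ne)]
    L = min(map(len, aligned))
    beam = np.sum([s[:L] for s in aligned], axis=0)
    power.append(np.sum(beam ** 2))

print("estimated DOA:", np.rad2deg(angles[int(np.argmax(power))]), "deg")

The output power peaks when the hypothesized direction matches the true one, which is the behavior shown for two targets in Fig. 3.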
4 Simulation
A generalized Gaussian pulse (GGP) [12] was employed in the simulation (using MATLAB) to demonstrate the effectiveness of the proposed algorithm. One of its forms, based on experimentation with UWB impulse radiators and sensors, is:
Fig. 3 DOA estimation for two targets at 15° and 30°, SNR = 20 dB
f(t) = \frac{E_0}{1-a}\left[ e^{-4\pi\left(\frac{t-t_0}{\Delta T}\right)^2} - a\, e^{-4\pi a\left(\frac{t-t_0}{\Delta T}\right)^2} \right]   (6)
5 Conclusion
The digital group delay can realize time delays precisely for approximately band-limited signals. In this paper, we apply the digital group delay method to DOA estimation, and the simulations show the effectiveness of the proposed algorithm. The proposed algorithm places no requirement on the signal waveform and is suitable for wideband and ultra-wideband radars, sonar systems, and microphone arrays.
References
1 Introduction
The most important qualities of modern speech synthesis systems are their natural-
ness and intelligibility [1]. By naturalness we mean how closely the synthesized
speech resembles real human speech. Intelligibility, on the other hand, describes the
ease with which the speech is understood. The maximization of these two criteria is
the main research and development goal in the TTS field.
At the moment, numerous examples of TTS software can be found on the
market, for example stand-alone speech programs that convert all types of
text inserted through an input interface into high quality realistic speech output
J. Sodnik (*)
University of Ljubljana, Faculty of Electrical Engineering, Tržaška 25, 1000 Ljubljana, Slovenia,
Europe
e-mail: [email protected]
(TextAloud [2], VoiceMX STUDIO [3], etc.). Many of them can save the readings
as a media file that can later be played on demand. On the other hand, there are also
several speech synthesis libraries that can be included and used in various program
languages, for example FreeTTS, a TTS library written in Java programming
language [4].
In this article, we propose an extension to the FreeTTS software that enables
text-to-speech conversion in a 3D audio environment. Our solution enables the
positioning of the voice (i.e. the text reader) at any arbitrary spatial position in
relation to the listener. The relative spatial positions of the speaker and the listener
can be updated dynamically in real time while performing text-to-speech conver-
sion. For example, the text can be read from a specific spatial position in relation to
the listener or the position can be dynamically changed and updated.
We believe there are numerous potential applications of such technology, from
screen readers for visually impaired computer users to advanced auditory interfaces
for desktop and mobile devices. In the next chapter, we summarize some related
work which uses both spatial audio and TTS.
2 Related Work
In the past, spatial auditory interfaces were mostly used to enrich audio-only
applications for visually impaired computer users.
Mynatt proposed a general methodology for transforming the graphical inter-
faces into non-visual interfaces [5] by using 3D audio output techniques. Various
salient components of graphical interfaces were transformed into auditory com-
ponents with the so-called auditory icons. The final goal was to present object
hierarchies with sound.
The transformation of MS-Windows interface into spatial audio was proposed
by Crispien [6] and Sodnik [7], both transforming the hierarchical navigation
scheme into a “ring” metaphor. Sodnik used speech output for explaining various
commands to the user.
Nowadays, one of the most commonly used computer applications is a web
browser. Web browsers are used to present extensive and complex web content comprising text, pictures, movies, animations, etc. Visually impaired computer users use web browsers together with screen readers, which process pages sequentially and read the content.
tially and read the content. Several researchers proposed different and more
advanced auditory versions of web browsers.
The HearSay solution, for example, enabled flexible navigation and form-filling
and also context-directed browsing through extensible VoiceXML dialog interface
[8]. The Csurf application was intended to help with the identification of relevant
information on web pages [9]. When the user followed a web link, CSurf captured
the content and preprocessed it in order to identify the relevant information based
on statistical machine-learning models. The content of the web page was then read
to the user from the most to the least relevant section.
The aim of our research was the development of a spatial speaker (i.e. a synthesized voice) that can be placed at arbitrary positions in space and manipulated in real time. The solution can be used in any of the abovementioned applications. Since
the majority of auditory interfaces is designed to be used by a large group of users
(e.g. visually impaired computer users or users of various mobile devices), gener-
ally available hardware and software should preferably be used for audio rendering.
Our solution is based on Java platform, standard Creative X-Fi soundcard [16] and
some additional components which are also available off the shelf or even for free.
The major drawback of all general spatial audio rendering equipment is its poor
localization accuracy due to the use of simplified HRTF libraries. In our experi-
ment, we tried to improve the localization accuracy with some additional prepro-
cessing of the sound signals.
Our work is different from previous audio rendering techniques in a number of
ways:
• It combines software and hardware spatial audio rendering techniques.
• It improves the accuracy of elevation localization by adding additional HRTF processing.
• Due to its simplicity, the system can be used on various desktop or mobile platforms (hardware processing is optional and can be ignored if 3D sound hardware is not available).
• We propose some innovative ideas for the practical use of such a system, and we already give some results of preliminary evaluation studies and user feedback.
In the following chapter, we describe the system design. All major components
of the system are listed here along with the most important design problems and
solutions. We also propose some future work and additional improvements in this
field.
The system architecture consists of five major modules: four pieces of software
(libraries and Java classes) and a standard sound card with hardware support for 3D
sound. The software part is based on Java programming language with some
additional external plug-ins. The five modules are:
• FreeTTS: a speech synthesis system written entirely in Java
• JOAL: an implementation of the Java bindings for the OpenAL API [17, 18]
• The HRTF library from MIT Media Lab (measurements of a KEMAR dummy head) [19]
• A custom made signal processing module
• A Creative Sound Blaster X-Fi ExtremeGamer sound card
3.1 FreeTTS
The MIT HRTF library contains functions in PCM format consisting of 16-bit
samples at 44.1 kHz sampling frequency. In order to be used with the output
of FreeTTS, the samples had to be down-sampled to 16 kHz. In Java, the HRIRs
are defined as arrays of bytes.
The filtering of the synthesized samples inp with HRIR_el is done by calculating the convolution of the two arrays:

res_{el}[n] = \sum_{m=1}^{N-1} inp[n-m]\, HRIR_{el}[m]   (1)
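As an illustration only (the chapter's implementation is in Java with byte arrays), the following Python/NumPy sketch applies Eq. (1) with left and right HRIRs and includes a crude stand-in for the 44.1 kHz to 16 kHz down-sampling mentioned above. The placeholder impulse responses are synthetic, not real KEMAR measurements.

import numpy as np

def apply_hrir(speech_16k, hrir_left_44k, hrir_right_44k):
    # Down-sample the 44.1 kHz HRIRs to 16 kHz by linear interpolation
    # (a simple stand-in for a proper polyphase resampler).
    def resample(h, src=44100, dst=16000):
        t_dst = np.arange(int(len(h) * dst / src)) / dst
        return np.interp(t_dst, np.arange(len(h)) / src, h)

    hl = resample(np.asarray(hrir_left_44k, dtype=float))
    hr = resample(np.asarray(hrir_right_44k, dtype=float))
    left = np.convolve(speech_16k, hl)      # Eq. (1) for the left ear
    right = np.convolve(speech_16k, hr)     # Eq. (1) for the right ear
    return np.stack([left, right], axis=1)  # stereo signal for headphone playback

# Toy usage with synthetic data in place of FreeTTS output and KEMAR HRIRs.
speech = np.random.randn(16000)             # 1 s of "speech" at 16 kHz
hrir_l = np.exp(-np.arange(512) / 50.0)     # placeholder impulse responses
hrir_r = np.exp(-np.arange(512) / 60.0)
stereo = apply_hrir(speech, hrir_l, hrir_r)
print(stereo.shape)                         # (len(speech) + len(hl) - 1, 2)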
The additional filtering with the afore-mentioned MIT Media Lab HRTF library was an attempt to improve the elevation coding in the signals.
JOAL provides the functions for reading external wav files which are then
positioned and played. The samples from wav files are then written to special
buffers and the buffers are attached to the sources with specific spatial character-
istics. In our case, the input to JOAL is an array of samples (the output of FreeTTS
convoluted with HRIR), which is then written directly to buffers.
By using only the JOAL library for spatial positioning, the spatial arrangement of the sources and the listener can be changed at any time. However, in our case only changes of the horizontal position can be performed dynamically through JOAL; vertical changes, on the other hand, require preprocessing with a new HRIR.
3.5 Soundcard
Creative Sound Blaster X-Fi Extreme Gamer was used within the system. The
soundcard has a special DSP unit called CMSS-3D, which offers hardware support
for spatial sound generation. CMSS is another type of a generalized HRTF library
used for filtering input sounds. In general, the soundcard can be configured for
output to various speaker configurations (i.e. headphones, desktop speakers, 5.1
surround, etc.), but in our case the use of an additional HRTF library required the
playback through stereo headphones.
Creative Sound Card works well with JOAL (OpenAL) positioning library.
However, if no hardware support can be found for spatial sound, the latter is
performed by the library itself (with a certain degradation of quality).
of ambient noise. Since the primary focus of this paper is the design of the “Spatial
speaker” system, only a short summary of the results and user comments is
provided.
A typical screen reader, the most important tool for visually impaired users to
access and read web content or word processing files, reads the content of the file
sequentially from top left to bottom right corner. Some screen readers describe the
positions of the elements on the page with the aid of the coordinate system. Our
prototype application reads the content of the page as if coming from different
spatial positions relative to the listener. The spatial positions are defined according
to the original position of the element on the page. The reader starts reading the
page at the left top corner and then it moves from left to right and follows the
positions of the text on the page. It finishes at the bottom right corner of the page.
For example, a text appearing on the left side of the page can actually be heard on
the left, the text on the right can be heard on the right, etc. In this way, the
application enables the perception of the actual physical structure of the web
page (Fig. 3).
In a short experiment, the users were asked to describe the approximate structure
of the page by just listening to its content. The majority reported that the positions
of the individual texts could be perceived clearly, but the perception of the entire
structure remained blurry, since no information on images, banners and other web
page elements was given.
The speech synthesizer can also be used to read various messages and notes to
mobile users or to read fiction to visually impaired people. The normal output of
various speech synthesizers is usually very monotonous and dull. We developed a
story reader which separates different characters in the story by assigning them a
location in space. Each specific character in the story, as well as the narrator, speaks from a different spatial position relative to the listener. The characters with their
corresponding texts can therefore be easily separated from one another. In addition,
the pitch of synthesized speech is changed for each character, which makes the
separation even better.
The input for the speech synthesizer is a normal txt file. All characters or voices
which are to be manipulated and positioned in space need to be tagged with
appropriate coordinates and information about the pitch in order to be recognized
by the system (Fig. 4).
The users listened to five different stories. They were asked to subjectively
evaluate the system without focusing on the content of the stories. The majority
especially liked the additional separation of the characters by pitch. Several users
commented that the accompanying text explaining which of the characters said
something was in some cases almost redundant, since the designated spatial posi-
tions and the corresponding pitch of the speech clearly identified the individual
characters.
A human listener is unable to perceive and understand more than one speech source
at a time, unless the sources are coming from different spatial positions [22].
General text-to-speech converters can therefore be used with just one text input
and one spoken output. Our "Spatial speaker" can, however, produce multiple outputs that are separated by their locations in space. We developed a test application that reads an arbitrary number of text files and synthesizes speech flows at different spatial positions relative to the listener. We evaluated the intelligibility with two and three sources. The users were positioned at the origin of the coordinate system (i.e. 0°, 0°, 0°) and the speech sources were positioned as follows (Fig. 5):
• Two sources: (−30°, 0°, 0°) and (30°, 0°, 0°)
• Three sources: (−30°, 0°, 0°), (0°, 0°, 0°) and (30°, 0°, 0°)
The preliminary results show that a maximum of two simultaneous speech sources can be perceived and understood at the same time. The users reported that with three sources it was still possible to concentrate on one individual source and understand its content, but it was impossible to understand all of the speech sources simultaneously.
This paper describes the development and system design of a 3D speech synthe-
sizer. The system is based on the Java platform and comprises standard programming libraries, custom developed modules and a sound card, all of which are
reprogrammed and modified to fit together. An external MIT Media Lab HRTF
library is used in order to improve the localization accuracy of the system. A
personalized HRTF library can also be measured and used for a specific user,
thus enabling an extremely accurate localization performance and spatial awareness
if required. The current system is realized as a single Java class with some
additional libraries and files and can therefore be imported and used in any Java
application. The system works with any soundcard, since hardware support for 3D
sound is optional and is automatically replaced by the software library if not
present.
In addition to its use as a typical speech synthesizer or TTS converter, the
“Spatial speaker” could also be used as a key component of various auditory
interfaces. It can, for example, be used effectively by visually impaired people or
mobile users-on-the-go. Three different prototype applications are provided in this
paper along with some preliminary results of evaluation studies. Only users with
normal sight participated in the preliminary experiments in order to get some
feedback and ideas for improvements. All three prototypes proved to be interesting
and innovative and potentially useful for visually impaired people. Our next step
will be an extensive user study with visually impaired computer users. We intend to
establish some major advantages of a 3D speech synthesizer over the non-spatial
one.
Acknowledgements This work has been supported by the Slovenian Research Agency within the
research project: “Assisting visually impaired people in using the computer with the aid of spatial
sound”.
References
1. R.A. Cole, Survey of the State of the Art in Human Language Technology (1996)
2. TextAloud (5 Feb 2010). http://www.nextuptech.com
3. VoiceMX STUDIO (5 Feb 2010). http://www.tanseon.com/products/voicemx.htm
4. FreeTTS (5 Feb 2010). http://freetts.sourceforge.net/docs/index.php
5. E.D. Mynatt, Transforming graphical interfaces into auditory interfaces. Doctoral Disserta-
tion, Georgia Institute of Technology, 1995
Abstract This chapter presents a novel approach for automatic annotation and
content based video retrieval by making use of the features extracted during
the process of detecting commercial boundaries in a recorded Television (TV)
program. In this approach, commercial boundaries are primarily detected using
audio and the detected boundaries are validated and enhanced using splash screen
of a program in the video domain. Detected splash screen of a program at the
commercial boundaries is used for automatic annotation of recorded video which
helps in fast content based video retrieval. The performance and validity of our
approach is demonstrated using the videos recorded from different Indian Television
broadcasts.
1 Introduction
The Television (TV) remains, and has always been, one of the most powerful media of communication since its inception. Today we are in the next generation of television, where additional hardware and software have completely changed the way television is watched. Recent technologies such as the set-top box, coupled with the reduction in costs of digital media storage, have led to the introduction of personal video recorders (PVR), which are now a fairly established and fast-growing part of the TV landscape.
In this context, audiovisual analysis tools that help the user to manage the huge amount of data are very important for introducing novel recording devices into the highly competitive market. Among other analysis tools, the detection of TV advertisements is a topic with many practical applications. For instance, from the point of
N. Venkatesh (*)
Innovation Lab, Tata Consultancy Services, Plot no 96, Abhilash building, EPIP Industrial area,
white field main road, Bangalore 560066, India
e-mail: [email protected]
[Fig. 1: Block diagram of the overall scheme: the recorded TV program undergoes preprocessing and feature extraction (audio and video data), followed by audio based detection and video clue based detection, automatic annotation into a video database, and a retrieval process driven by a query image that returns the retrieved videos.]
Image matching of the splash screen frame appearing at the start and end of a commercial is performed using simple pixel intensity based matching.
Further, we make use of the information extracted during the commercial detection process for an automatic annotation and retrieval system, which tries to fill the gap between low-level media features and high-level concepts, yielding better performance with computationally simple procedures.
The overall scheme is depicted in the block diagram shown in Fig. 1. In the remaining sections of this chapter the requisite details of the proposed method and the relevant performance analysis results are provided.
In the following two subsections, audio and video feature extraction issues are discussed.
where i = 1, 2, ..., F and N is the frame length (i.e., the number of samples in the frame).
As evident from Eq. (1), the short term energy is computed per frame as the sum of squares of the signal samples normalized by the frame length. However, using only the short time energy would result in a large number of falsely detected commercial boundaries, and therefore we introduce a new feature termed the Energy Peak Rate (EPR), defined as the ratio of the total number of energy peaks exceeding the heuristically fixed threshold T to the total number of frames (F) in the evaluation time interval:

EPR = \frac{1}{F}\sum_{i=1}^{F} E_p(E_i)   (3)

where E_p(E_i) equals 1 if the frame energy E_i exceeds the threshold T and 0 otherwise.
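The following Python/NumPy sketch illustrates the two audio features just described, per-frame short-term energy and the Energy Peak Rate; the frame length and threshold are illustrative choices, not the authors' exact values.

import numpy as np

def short_term_energy(x, frame_len):
    # Eq. (1): sum of squared samples per frame, normalized by the frame length.
    n_frames = len(x) // frame_len
    frames = x[:n_frames * frame_len].reshape(n_frames, frame_len)
    return np.sum(frames ** 2, axis=1) / frame_len

def energy_peak_rate(x, frame_len, T):
    # Eq. (3): fraction of frames whose energy exceeds the threshold T.
    E = short_term_energy(x, frame_len)
    return np.count_nonzero(E > T) / len(E)

# Toy usage: a loud, commercial-like segment versus a quieter program segment.
fs = 44100
quiet = 0.1 * np.random.randn(5 * fs)
loud = 0.8 * np.random.randn(5 * fs)
T = 0.2                                      # assumed threshold
print(energy_peak_rate(quiet, fs // 100, T))  # close to 0
print(energy_peak_rate(loud, fs // 100, T))   # close to 1

Intervals whose EPR exceeds the decision threshold θ (see below) are then flagged as approximate commercial segments.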
boundary detected by the audio features, to arrive at the exact commercial boundaries using video processing. The methodology we have adopted is based on the extraction of Harris corner points, which is presented in the next section.
As a preprocessing step, we convert the colour image to a gray scale image on which the subsequent processing is carried out. For detecting the exact boundaries, splash screen (program name) frames are matched at the start and end of a commercial break. A typical splash screen, including the program name (from one of the recorded TV programs), is shown in Fig. 2. The matching is carried out using Harris corner points, which are simply the points in the image where there is a significant change in brightness in orthogonal directions [2].
The Harris corner detector is a popular interest point detector because of its strong
invariance to rotation, scale, illumination variation and image noise [9]. The
Harris corner detector essentially finds a small number of points useful for matching
or finding correspondence and tracking [10]. This detector is based on the local
auto-correlation function of the image, which measures the local changes of the image. The function c(x, y, Δx, Δy) quantifies how similar the image function I(x, y) at point (x, y) is to itself when shifted by (Δx, Δy) [10]:

c(x, y, \Delta x, \Delta y) = \sum_{(u,v)\in W(x,y)} w(u,v)\,[I(u,v) - I(u+\Delta x, v+\Delta y)]^2   (5)

where W(x, y) is a window centered at point (x, y) and w(u, v) is a (typically Gaussian) weighting function. Approximating the shifted function I(u + Δx, v + Δy) by its first-order Taylor expansion, the autocorrelation can in turn be approximated as

c(x, y, \Delta x, \Delta y) \simeq [\Delta x, \Delta y]\; Q(x, y) \begin{bmatrix} \Delta x \\ \Delta y \end{bmatrix}   (6)

where \sum_{(u,v)\in W(x,y)} w(u,v) is represented by its shorthand notation \sum_W. The requisite corner points are obtained as the local maxima of the corner response function H(x, y) given by

H(x, y) = \mathrm{Det}(Q(x,y)) - k\,[\mathrm{Trace}(Q(x,y))]^2

where Det and Trace denote the determinant and trace of a matrix, respectively, and k is a constant, usually set to the value 0.04 [10]. A local maximum in an image is defined as a point greater than its neighbors in a 3×3 or even 5×5 neighborhood.
Figure 3 shows the Harris corner points of the image depicted in Fig. 2.
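The following Python sketch illustrates the standard Harris corner response described above; it is not the authors' code, and it uses a simple box window in place of a Gaussian w(u, v). It requires NumPy and SciPy.

import numpy as np
from scipy.ndimage import convolve, maximum_filter

def harris_corners(img, k=0.04, win=5, n_best=200):
    img = img.astype(float)
    Iy, Ix = np.gradient(img)                          # image derivatives
    kernel = np.ones((win, win)) / (win * win)         # box window as w(u, v)
    Sxx = convolve(Ix * Ix, kernel, mode="nearest")
    Syy = convolve(Iy * Iy, kernel, mode="nearest")
    Sxy = convolve(Ix * Iy, kernel, mode="nearest")
    H = (Sxx * Syy - Sxy ** 2) - k * (Sxx + Syy) ** 2  # Det(Q) - k*Trace(Q)^2
    peaks = (H == maximum_filter(H, size=3)) & (H > 0)  # 3x3 local maxima
    ys, xs = np.nonzero(peaks)
    order = np.argsort(H[ys, xs])[::-1][:n_best]
    return list(zip(ys[order], xs[order]))

# Toy usage: a bright square has four corners that should be detected.
img = np.zeros((100, 100))
img[30:70, 30:70] = 1.0
print(harris_corners(img)[:4])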
The approximate commercial boundaries based on the audio feature are detected by
applying heuristics found by elaborate experimentation.
From Eq. (3) if Energy Peak Rate (EPR) exceeds the experimentally found thresh-
old y then we declare that frames of particular time interval lies in the commercial
break segment and hence we will be able to detect the approximate commercial
boundaries. The exact commercial boundaries will be detected by video processing
algorithm as outlined below.
Once the commercial boundaries have been detected, the next step is to remove the commercial part by retaining only the recorded TV program; during this process, the splash screen (program name) frame detected in the video-based boundary refinement is retained as the key frame for annotation.
There will be a set of query images, where each image is a splash screen (program name) representing a recorded TV program. It is to be noted that this set of query images can be incorporated into a modern-day set-top box, and the associated remote can be provided with a dedicated key to facilitate the display of the query images. Later, the query image is compared with the first frame of all the videos, and the best match is obtained through correlation matching, as explained below.
The correlation matching technique is simple and computationally fast, and hence it has been used to find the best match between the query image and the set of images in the database. The query image (A) and the extracted key frame (B) from the video are converted to binary images, and then the correlation coefficient between these two images is computed using Eq. (9).
r_i = \frac{\sum_m \sum_n \big(A_i(m,n) - \bar{A}_i\big)\big(B(m,n) - \bar{B}\big)}{\sqrt{\Big[\sum_m \sum_n \big(A_i(m,n) - \bar{A}_i\big)^2\Big]\Big[\sum_m \sum_n \big(B(m,n) - \bar{B}\big)^2\Big]}}    (9)
In order to examine the proposed methodology and to carry out the relevant
experimentation, we have recorded eleven videos from seven different Indian TV
stations.
The videos are captured at 25 frames per second with 288×352 resolution; the audio was extracted from these videos and stored in uncompressed WAV format at 44,100 Hz with 16 bits per sample for further processing. Commercial details for the recorded data are provided in Table 1. We have performed our experiments using MATLAB® on the Windows operating system. Videos with the same class label indicate that they are different episodes of the same TV program, which is helpful during the retrieval process.
From Table 3 we observe that, by using our simple video cue of matching splash screen video frames at the commercial boundaries to refine the boundaries detected by audio, we are able to increase the confidence level of the commercial boundary detection and to find the exact commercial boundaries.
One important observation to be made from Tables 2 and 3 is that the false events detected by audio processing are eliminated by the information obtained through the video processing step, since the splash screen will not be present at the start and end of commercial boundaries falsely detected using audio. A desired trade-off between the number of falsely detected events and the number of missed events can be achieved by varying the threshold in the audio-based feature given by Eq. (2).
Once the commercial detection process is completed, the video is automatically annotated using the key frame, with the additional information (see Section 4.1) appended at the beginning of the video. Currently, the experimentation has been carried out for seven classes (seven types of recorded TV programs). The content-based video retrieval is checked for all seven classes (query images), using the correlation classifier to match query and reference images and retrieve the closest matching automatically annotated videos. The retrieval is
comparatively fast and accurate, with no error detected in the limited data set
considered.
This chapter describes commercial detection for recorded TV programs using audiovisual features. Further, an automatic indexing and content-based retrieval algorithm that makes use of the information extracted during commercial detection is proposed. The system provides the user with a complete solution for recording TV programs without advertisements; more importantly, database management of the recorded TV programs is achieved through automatic annotation and fast content-based video retrieval. The proposed scheme is worked out keeping in mind the simplicity and time efficiency of the algorithms used.
Our future work will focus on testing the robustness of the methodology by applying it to a wide variety of TV programs such as news, serials, reality shows, etc.
References
Abstract In recent years, there has been an increasing demand for streaming high-quality multimedia content over wireless networks. Demand for triple-play services (voice, video, data) by a number of simultaneously communicating users often results in a lack of acceptable multimedia quality. In addition, the large transmission distance and the limited battery power of hand-held wireless devices are major bottlenecks when transmitting video directly from the network operator to the end-users. No successful mechanism has been developed to date that provides live video streaming over wireless networks. In this paper, ClusterDAM, a novel cluster-based architecture, is proposed for delivering adaptive multimedia content over two-hop wireless networks. An intermediate relay node serves as a proxy-client-server between the base station and the mobile users, which ensures that the multimedia delivery is adapted in real time according to the channel conditions and the quality of the multimedia content. An extensive simulation-based analysis with different kinds of network traffic and protocols indicates that the ClusterDAM architecture is significantly superior to the traditional single-hop design, not only in terms of the perceived video quality, but also in terms of the loss rate and the average bit rate.
1 Introduction
With the advent of third-generation (3G) mobile systems, video conferencing and the downloading of movies/documentaries, which require data transfers at rates of the order of several tens of megabits per second (Mbps), have been in great demand [1]. Multimedia transmission and VoD streaming require high data rates, which can be
H. Venkataraman (*)
S220, Performance Engineering Laboratory, School of Electronic Engineering, Dublin City
University, Glasnevin, Dublin 9, Ireland
e-mail: [email protected]
achieved either by increasing the bandwidth or by increasing the signal power at the receiver. However, the total bandwidth of the network is regulated by the government and cannot be increased arbitrarily. Similarly, wireless devices are energy-constrained units, and hence the transmission power of these devices cannot be increased indiscriminately. Significantly, the power required at the hand-held device for multimedia decoding and playing is quite high, and this itself drains the battery considerably. In this scenario, an efficient mechanism to achieve high-data-rate communication is to use multihop transmission between the source and destination nodes [2].
A hierarchical multihop design with a single central entity and several smaller devices that serve as intermediate relays is shown in Fig. 1. The relays reduce the distance between a transmission-reception (Tx-Rx) pair, which in turn reduces the power requirement. At the same time, this increases the achievable maximum data rate of a communicating link [3]. In addition, with the existence of different kinds of wireless networks in today's scenario (GPRS, CDMA2000, UMTS, WiMAX, WLAN, etc.), the relays could serve as switching points between the different networks. In this context, the multihop design can efficiently model the multiplicity in wireless networks. It has been shown [4] that time division multihop cellular networks (MCN) offer the potential to integrate various multihop infrastructure-based wireless networks. In fact, it has been shown in [4] that in a multihop design with up to five hops, the spectral efficiency is significantly increased as compared to single-hop transmission.
A novel cluster-based design for two-hop hierarchical networks has recently been proposed in [5], wherein it has been shown that the cluster-based two-hop design achieves significantly superior performance as compared with state-of-the-art resource allocation algorithms. In this paper, ClusterDAM, a novel cluster-based architecture, is proposed for the delivery of adaptive multimedia content in two-hop cellular networks. A double dumbbell topology that fits the two-hop cellular design is used to model the system in the simulation-based analysis.
Fig. 1 Hierarchical two-hop network design: the base station (BS) communicates with end-user devices (PDA, computer, in-home user) through intermediate relays
2 ClusterDAM Architecture
Fig. 2 Cluster-based hierarchical two-hop cellular networks: the base station (BS) at the centre of a cell of radius r, relays at distance r/2, and mobile stations (MS) grouped into clusters
A node located at the boundary between the inner and outer layers of the network region is selected as a cluster-head node, alternatively known as a relay. The server always communicates with the users in the outer layer through the relay. Hence, the maximum transmission distance of a communicating pair in the network is r/2 [7].
The relay node serves as a client during its communication with the BS (server), and as a server while communicating with the end-users. Hence, the relay node can be described as a proxy-client-server (PCS). In adaptive multimedia delivery, there are several instances where multiple clients request the same video. In the case of a single-hop design, this requires multiple encodings of a single video stream, thereby creating unnecessary load on the BS. However, in the two-hop design, the video stream can be stored at the PCS, after which the adapted stream can be transmitted individually to different clients. Hence, in this regard, the PCS can also be considered as a database server. A two-hop design between the BS and the end-users is shown in Fig. 3, wherein the PCS serves as the video proxy. An adaptation framework incorporated at the PCS divides the incoming multimedia stream from the server into several chunks of multimedia packets. The size and length of each chunk depend upon the total size, length and type of video to be streamed.
In order to reduce the additional interference arising from the simultaneous communication of multiple communicating pairs, a Protocol Model is considered [8] for interference avoidance in the two-hop design. The reusability of the available spectrum resource (a time slot in a time division multiple access system and a frequency band in a frequency division multiple access system) is increased in the cluster-based design by allowing two multihop clusters in the network to utilize the same spectrum resource. It should be noted that the number of clusters in the cluster-based design need not always be six [9], but it should be an even number, due to the basic principle of simultaneous transmission of communication pairs located in diametrically opposite clusters.
The primary aim of integrating QOAS with the cluster-based design in the IEEE
802.16j wireless model is to maintain a high end-user perceived quality even with
an increase in the number of wireless devices in the network. QOAS relies on the fact that the impact on the end-user perceived quality is greater in the case of random losses than in that of a controlled reduction in quality [10]. The system architecture of the feedback-based QOAS includes multiple instances of the end-to-end adaptive client and server applications [11]. Following the ITU-T P.910 recommendation [12], a five-state model is defined for the multimedia streaming process. The QOAS client continuously monitors the transmission parameters and estimates the end-user perceived quality. The Quality of Delivery Grading Scheme (QoDGS) regularly computes Quality of Delivery (QoD) scores that reflect the multimedia streaming quality under the current delivery conditions. These grades are then sent as feedback to the server arbitration scheme (SAS). The SAS assesses the values of a number of consecutive QoD scores received as feedback in order to reduce the effect of noise in the adaptive decision process [13]. Based on these scores, the SAS suggests adjustments in the data rate and other parameters.
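To make the feedback loop concrete, the following is a highly simplified sketch of a QoDGS-style grading of the delivery conditions and a SAS-style smoothing of consecutive QoD scores. The 1-5 grading thresholds, the averaging window and the rate adjustment step are hypothetical assumptions and are not taken from the QOAS literature.

    from collections import deque

    def qod_score(loss_rate, delay_ms):
        """Map current loss and delay to a QoD grade on the five-state (1-5) scale."""
        if loss_rate < 0.001 and delay_ms < 50:
            return 5
        if loss_rate < 0.01 and delay_ms < 100:
            return 4
        if loss_rate < 0.05:
            return 3
        return 2 if loss_rate < 0.1 else 1

    class ServerArbitrationScheme:
        """Averages the last few QoD scores before suggesting a rate adjustment."""
        def __init__(self, window=5, step_kbps=64):
            self.scores = deque(maxlen=window)
            self.step = step_kbps

        def update(self, score, current_rate_kbps):
            self.scores.append(score)
            avg = sum(self.scores) / len(self.scores)
            if avg >= 4:                       # good conditions: increase quality
                return current_rate_kbps + self.step
            if avg <= 2:                       # poor conditions: back off
                return max(self.step, current_rate_kbps - self.step)
            return current_rate_kbps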
With the increase in demand for multimedia streaming in wireless networks, several significant approaches have been investigated in the recent past. TFRCP is a unicast transport-layer protocol, designed for multimedia streaming, that provides nearly the same throughput as TCP on wired networks. TFRCP controls the rate based on network conditions expressed in terms of RTT and packet loss probability [14]. Similar to TFRCP, LDA+ (enhanced loss-delay adaptation) also aims to regulate the transmission behavior of multimedia transmitters in accordance with the network congestion state [15]. LDA+ uses the RTP protocol for calculating loss and delay and uses them for regulating the transmission rates of the senders. LDA+ adapts the streams in a manner similar to that of TCP connections. In comparison, RBAR is a receiver-based auto-rate mechanism. It is a MAC-layer protocol and is based on the RTS/CTS mechanism [16]. The main feature of RBAR is that both the channel quality estimation and the rate selection mechanism are on the receiver side. This allows the channel quality estimation mechanism to directly access all of the information made available to it by the receiver (number of multipath components, symbol error rate, received signal strength, etc.) for more accurate rate selection.
The end-users are the web clients which receive the multimedia information. Figure 4 shows a dumbbell topology for achieving the single-hop communication, wherein B1-B2 forms the bottleneck link. B1 is the multimedia source and transmits information to the n clients C1, C2, ..., Cn. In the case of two-hop communication, there is an intermediate relay between the web source and the end-user client. This can be represented by a double dumbbell topology, as shown in Fig. 5. The multimedia server is represented by B0, whereas the diametrically opposite relays are represented by B1 and B2. The end-users S1, S2, ..., Sn on one end and C1, C2, ..., Cn on the diametrically opposite cluster are the multimedia clients. The major advantage of the double dumbbell topology (cluster-based two-hop design) over the dumbbell topology (single-hop wireless network) is the hierarchical formation of the network, and the provision for peer-to-peer communication among the wireless nodes.
Fig. 4 Dumbbell topology for single-hop communication: senders and receivers (S1, ..., Sn and C1, ..., Cn) with background traffic sharing the bottleneck link B1-B2
Fig. 5 Double dumbbell topology for the cluster-based two-hop design: server B0, relays B1 and B2, and the multimedia clients S1, ..., Sn and C1, ..., Cn in diametrically opposite clusters
In the simulation environment, the B1-B2 link in the dumbbell topology and the B0-B1 and B0-B2 links in the double dumbbell topology are the bottleneck links, with 2 Mbps bandwidth and 100 ms latency. The other communicating links connected to the end-users in the network are over-provisioned. Hence, the congestion in the traffic, packet loss and delays occur mainly because of the bottleneck links. The buffering at the ends of the bottleneck link uses a drop-tail queue of size proportional to the product of the round-trip time (RTT) and the bottleneck link bandwidth.
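As a quick sanity check of this queue dimensioning, the following few lines compute the bandwidth-delay product of the 2 Mbps / 100 ms bottleneck link described above; the assumed average packet size is an illustrative value only.

    bottleneck_bps = 2e6          # 2 Mbps bottleneck link (from the text)
    rtt_s = 2 * 0.100             # 100 ms one-way latency -> ~200 ms RTT
    packet_bytes = 1000           # assumed average packet size

    bdp_bits = bottleneck_bps * rtt_s
    queue_packets = bdp_bits / (8 * packet_bytes)
    print(f"drop-tail queue ~ {queue_packets:.0f} packets")   # ~50 packets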
The simulation setup consists of a number of mobile nodes distributed in the given coverage area. There is a centrally located server which covers the whole area. In order to increase the coverage area and the data rate provided to each node, two fixed access points (APs), located equidistantly from the server and diametrically opposite to each other, are considered in the system design. The nodes located between the server and the AP communicate with the server directly. However, the mobile nodes that are located farther from the server and relatively closer to the AP communicate with the server through the AP. The APs therefore act as proxy-client-servers between the actual client and server, as shown in Fig. 4. Hence, a hierarchical structure exists between the server, the relays and the mobile nodes.
The system is simulated using the server and client instances built into the network simulator NS-2. The length of all NS-2 simulations is 250 s. The video streams are modeled using Transform Expand Sample (TES) [17] and then encoded using MPEG-4. The primary reason for using MPEG-4 is that it supports media streaming and is suitable for home networking applications, with its low bit rate as well as its interoperability of audio and video signals [18]. The proposed topology assumes a bandwidth of 5 MHz and 100 ms latency. In the above topology, the B0-B1 and B0-B2 links are the bottleneck links. The other links connected to the mobile nodes are over-provisioned. Hence, the congestion in the traffic, packet loss and delays occur mainly because of the bottleneck links. The buffering at the ends of the bottleneck link uses a drop-tail queue of size proportional to the product of the round-trip time (RTT) and the bottleneck link bandwidth. In each of the envisaged scenarios, 95-99% of the total available bandwidth is used for video and multimedia communication and the remaining 1-5% is reserved for feedback purposes. A constant PLR of 10^-7 is assumed throughout the analysis. Similarly, a constant transmission delay of 10 ns is assumed between the PCS and the mobile nodes.
A binary phase shift keying (BPSK) modulation technique is used at the physical layer. A slowly varying flat fading channel is assumed throughout the simulations. In addition, a two-ray model with lognormal shadowing of 4 dB standard deviation is considered [19]. At the MAC layer, an IEEE 802.11g based distributed coordination function (DCF) is used. The simulation is done at the packet level, and the performance is evaluated in terms of the average bit rate, the loss rate and the estimated user-perceived quality. The end-user perceived quality is measured by developing
a relationship between the coding bit rate, the packet loss ratio and the user-level quality. Traffic sources with different sizes and shapes are considered, as in [20], so as to emulate the real-life scenario of a variety of traffic sources with different average bit rates. The network performance is analyzed in terms of the perceived quality, the loss rate and the average bit rate, not only for the QOAS technique but also in comparison with other protocols, i.e., LDA+, TFRCP and RBAR.
4 Results
Tables 1 and 2 present the performance results for the UDP-CBR periodic traffic case, using the 'dumbbell' and 'double dumbbell' topologies, respectively. The traffic patterns are kept identical for both the single-hop and the clustered two-hop scenarios. It can be
Table 1 Performance of dumbbell topology - UDP CBR periodic traffic

  Traffic (Mbps)             1 × 0.6    1 × 0.6    1 × 0.8    1 × 0.8    1 × 1.0    1 × 1.0
                             20 s on-   30 s on-   20 s on-   30 s on-   20 s on-   30 s on-
                             40 s off   60 s off   40 s off   60 s off   40 s off   60 s off
  Perc. video     QOAS       3.908      3.809      3.942      3.608      3.809      3.565
  quality (1-5)   LDA+       3.722      3.624      3.754      3.586      3.609      3.405
                  TFRCP      3.608      3.493      3.571      3.216      3.282      3.165
                  RBAR       3.621      3.467      3.279      3.011      2.945      2.829
  Loss rate (%)   QOAS       0.18       0.18       0.14       0.14       0.10       0.10
                  LDA+       0.42       0.42       0.40       0.38       0.24       0.24
                  TFRCP      0.20       0.18       0.16       0.16       0.14       0.14
                  RBAR       0.22       0.20       0.18       0.16       0.14       0.14
  Avg. bit-rate   QOAS       0.726      0.734      0.776      0.763      0.802      0.802
  (Mbps)          LDA+       0.713      0.726      0.736      0.734      0.772      0.772
                  TFRCP      0.708      0.718      0.741      0.740      0.762      0.772
                  RBAR       0.702      0.721      0.731      0.745      0.758      0.762
observed that, for all three kinds of periodic traffic considered, the perceived quality obtained for the cluster-based two-hop design (double dumbbell topology) is significantly superior to that obtained using the single-hop design (dumbbell topology).
For example, in the case of 1 × 0.6 Mbps traffic with 20 s on-40 s off, the perceived quality obtained with QOAS in the single-hop model is 3.908, whereas that obtained using the two-hop model is 4.209, an increase of 7.71%. In addition, for the same traffic model, the perceived quality of QOAS in the single-hop design is better than that of any other scheme (here LDA+) by 5.25% (QOAS has a Q of 3.908 whereas LDA+ has a Q of 3.7025). However, in the case of the cluster-based two-hop design, the improvement in the perceived quality between QOAS and LDA+ is at least 9.97%, almost twice the benefit obtained from the 'dumbbell' topology. As seen from Tables 1 and 2, the improvement of the double dumbbell topology over the dumbbell topology is observed consistently across the different UDP-CBR traffic patterns.
Similarly, in the case of UDP-CBR staircase traffic, the perceived quality obtained from the 'double dumbbell' topology scores significantly over the 'dumbbell' scheme, as can be seen from Tables 3 and 4. For example, in the case of 4 × 0.4 Mbps (up 40 s steps), the perceived quality of QOAS using the 'dumbbell' scheme is 3.808, whereas that using the 'double dumbbell' scheme is 4.298, an increase of 12.87%. It can be observed from Tables 3 and 4 that the improvement in the average bit rate and the loss rate is also notably high in the case of the double dumbbell topology, as compared to the dumbbell scheme. Similarly, the improvement in the loss rate for QOAS is over six times (0.04 using 'double dumbbell' and 0.24 using 'dumbbell'). Importantly, this performance improvement remains consistent over the different scenarios of staircase traffic, as shown in Tables 3 and 4.
In the Internet environment, traffic is generally bursty, as in the case of FTP traffic. Tables 5 and 6 show the performance for different FTP and WWW traffic models for a duration of 250 s. Similar to the results obtained in the case of UDP transmission, the performance of the 'double dumbbell' topology is superior to that obtained from the 'dumbbell' topology for both FTP and WWW traffic.
5 Conclusions
Acknowledgements The authors would like to acknowledge Dr. Sinan Sinanovic and Prof. Harald Haas from the University of Edinburgh for their input during the initial stages of this work. The authors would also like to thank the Irish Research Council for Science, Engineering and Technology (IRCSET) for supporting this research work.
References
1. B. Li, H. Yin, Peer-to-peer Live Video Streaming on the Internet: Issues, Existing Approaches
and Challenges. IEEE Commun. Mag. 45(6), 94–99 (2007)
2. S. Toumpis, A.J. Goldsmith, Capacity regions for wireless ad hoc networks. IEEE Trans.
Wireless Commun. 2(4), 736–748 (2003)
3. H. Venkataraman, G.M. Muntean, Analysis of random data hopping in distributed multihop
wireless networks, in Proceedings of IEEE Region Ten Conference (TENCON), Hyderabad,
India, 18–21 Nov 2008
4. Y. Liu, R. Hoshyar, X. Yang, R. Tafazolli, Integrated radio resource allocation for multihop
cellular networks with fixed relay stations. IEEE J. Selected Areas Commun. 24(11),
2137–2146 (Nov 2006)
5. H. Venkataraman, S. Sinanovic, H. Haas, Cluster-based design for two-hop cellular networks.
Int. J. Commun. Netw. Syst. (IJCNS) 1(4), Nov 2008
6. H. Venkataraman, P. Kalyampudi, A. Krishnamurthy, J. McManis, G.M. Muntean, Clustered
architecture for adaptive multimedia streaming in WiMAX based cellular networks, in
Proceedings of World Congress on Engineering and Computer Science (WCECS), Berkeley,
San Francisco, CA, Oct 2009
Luis González
1 Introduction
In many different scientific, technical or social areas, one can find phenomena
depending on an arbitrarily large number n of basic random Boolean variables. In
other words, the n basic variables of the system are assumed to be stochastic and
they take only two possible values: either 0 or 1. We call such a system a complex
L. González
Research Institute IUSIANI, Department of Mathematics, University of Las Palmas de Gran
Canaria, Campus Universitario de Tafira, 35017 Las Palmas de Gran Canaria, Spain
e-mail: [email protected]
stochastic Boolean system (CSBS). Each one of the 2^n possible elementary states associated with a CSBS is given by a binary n-tuple u = (u_1, ..., u_n) ∈ {0,1}^n of 0s and 1s, and it has its own occurrence probability Pr{(u_1, ..., u_n)}.
Using the statistical terminology, a CSBS on n variables x_1, ..., x_n can be modeled by the n-dimensional Bernoulli distribution with parameters p_1, ..., p_n defined by

Pr{x_i = 1} = p_i,   Pr{x_i = 0} = 1 − p_i.
Throughout this paper we assume that the n Bernoulli variables x_i are mutually statistically independent, so that the occurrence probability of a given binary string u = (u_1, ..., u_n) ∈ {0,1}^n of length n can be easily computed as

Pr{u} = \prod_{i=1}^{n} p_i^{u_i} (1 - p_i)^{1-u_i},    (1)

that is, Pr{u} is the product of factors p_i if u_i = 1, and 1 − p_i if u_i = 0 [6].
Example 1. Let n = 4 and u = (0,1,0,1) ∈ {0,1}^4. Let p_1 = 0.1, p_2 = 0.2, p_3 = 0.3, p_4 = 0.4. Then, using Eq. 1, we have Pr{(0,1,0,1)} = (1 − 0.1) · 0.2 · (1 − 0.3) · 0.4 = 0.0504.
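A direct evaluation of Eq. (1) for Example 1, as a small sketch (not the author's code):

    def pr(u, p):
        """Occurrence probability of the binary n-tuple u under independent
        Bernoulli parameters p (Eq. 1)."""
        prob = 1.0
        for ui, pi in zip(u, p):
            prob *= pi if ui == 1 else (1.0 - pi)
        return prob

    print(pr((0, 1, 0, 1), (0.1, 0.2, 0.3, 0.4)))   # 0.9 * 0.2 * 0.7 * 0.4 = 0.0504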
main result: We exactly obtain the ranking interval for each given binary string u,
and we illustrate our propositions with a simple, academic example. Finally, in
Section 5, we present our conclusions.
(u_1, \dots, u_n) \equiv u_{(10} = \sum_{i=1}^{n} 2^{\,n-i} u_i, \qquad w_H(u) = \sum_{i=1}^{n} u_i,
According to Eq. 1, the ordering between two given binary string probabilities Pr{u} and Pr{v} depends, in general, on the parameters p_i, as the following simple example shows.
Example 2. Let n = 3, u = (0,1,1) and v = (1,0,0). Using Eq. 1 we have Pr{u} = (1 − p_1) p_2 p_3 and Pr{v} = p_1 (1 − p_2)(1 − p_3); for p_1 = p_2 = p_3 = 0.1 this gives Pr{u} = 0.009 < Pr{v} = 0.081, whereas for p_1 = 0.1, p_2 = p_3 = 0.5 it gives Pr{u} = 0.225 > Pr{v} = 0.025.
Then the probability of the binary n-tuple v = (v_1, ..., v_n) is intrinsically less than or equal to the probability of the binary n-tuple u = (u_1, ..., u_n) (that is, for all sets {p_i}_{i=1}^{n} satisfying (2)) if and only if the matrix
M_{v}^{u} := \begin{pmatrix} u_1 & \cdots & u_n \\ v_1 & \cdots & v_n \end{pmatrix}

either has no \binom{1}{0} columns, or for each \binom{1}{0} column in M_{v}^{u} there exists (at least) one corresponding preceding \binom{0}{1} column (IOC).
Remark 1. In the following, we assume that the parameters p_i always satisfy condition (2). Note that this hypothesis is not restrictive for practical applications because, if p_i > 1/2 for some i, then we only need to consider the variable x̄_i = 1 − x_i instead of x_i. Next, we order the n Bernoulli variables by increasing order of their probabilities.
Remark 2. The \binom{0}{1} column preceding each \binom{1}{0} column is not required to be placed at the immediately previous position, but just at some previous position.
Remark 3. The term corresponding, used in Theorem 1, has the following meaning: for each two \binom{1}{0} columns in the matrix M_{v}^{u}, there must exist (at least) two different preceding \binom{0}{1} columns, one for each of them. In other words, for each \binom{1}{0} column in the matrix M_{v}^{u}, the number of preceding \binom{0}{1} columns must be strictly greater than the number of preceding \binom{1}{0} columns.
The matrix condition IOC, stated by Theorem 1, is called the intrinsic order criterion, because it is independent of the basic probabilities p_i and depends intrinsically only on the relative positions of the 0s and 1s in the binary n-tuples u, v. Theorem 1 naturally leads to the following partial order relation on the set {0,1}^n [1]. The so-called intrinsic order will be denoted by "⪯", and we shall write v ⪯ u (u ⪰ v) to indicate that v (u) is intrinsically less (greater) than or equal to u (v).
Definition 1. For all u, v ∈ {0,1}^n

v ⪯ u  iff  Pr{v} ≤ Pr{u} for all sets {p_i}_{i=1}^{n} s.t. (2).

From now on, the partially ordered set (poset, for short) ({0,1}^n, ⪯) will be denoted by I_n.
do not satisfy IOC (Remark 3). Thus, (0,1,1) and (1,0,0) are incomparable by intrinsic order, i.e., the ordering between Pr{(0,1,1)} and Pr{(1,0,0)} depends on the parameters {p_i}_{i=1}^{3}, as Example 2 has shown.
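A small sketch of the intrinsic order criterion, using the counting reformulation of Remark 3; this is an illustration, not the author's implementation.

    def intrinsically_leq(v, u):
        """True iff v is intrinsically less than or equal to u, i.e. the matrix
        with row u on top of row v satisfies IOC (Theorem 1 / Remark 3)."""
        ones_01 = ones_10 = 0          # counts of preceding (0/1) and (1/0) columns
        for ui, vi in zip(u, v):
            if ui == 1 and vi == 0:    # a (1/0) column needs a spare preceding (0/1)
                if ones_01 <= ones_10:
                    return False
                ones_10 += 1
            elif ui == 0 and vi == 1:
                ones_01 += 1
        return True

    u, v = (0, 1, 1), (1, 0, 0)
    print(intrinsically_leq(v, u), intrinsically_leq(u, v))   # False False -> incomparable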
The binary n-tuples (0, ..., 0) ≡ 0 and (1, ..., 1) ≡ 2^n − 1 are the maximum and minimum elements, respectively, in the poset I_n. Indeed, both matrices

\begin{pmatrix} 0 & \cdots & 0 \\ u_1 & \cdots & u_n \end{pmatrix} \quad \text{and} \quad \begin{pmatrix} u_1 & \cdots & u_n \\ 1 & \cdots & 1 \end{pmatrix}

satisfy the intrinsic order criterion, since they have no \binom{1}{0} columns. Thus, for all u ∈ {0,1}^n and for all {p_i}_{i=1}^{n} s.t. (2)

Pr{(1, ..., 1)} ≤ Pr{(u_1, ..., u_n)} ≤ Pr{(0, ..., 0)}.
Many different properties of the intrinsic order relation can be derived from its simple matrix description IOC (see, e.g., [1-3]). For the purpose of this paper, we need to recall here the following necessary (but not sufficient) condition for intrinsic order; see [2] for the proof.
Now we present, in Fig. 1, the graphical representation of the poset I_n. The usual representation of a poset is its Hasse diagram (see, e.g., [7] for more details about posets and Hasse diagrams). This is a directed graph (digraph, for short) whose vertices are the binary n-tuples of 0s and 1s, and whose edges go downward from u to v whenever u covers v (denoted by u ▷ v), that is, whenever u is intrinsically greater than v with no other elements between them, i.e., u ⪰ v, u ≠ v, and there is no w ∈ {0,1}^n, w ≠ u, v, such that u ⪰ w ⪰ v.
Fig. 1 The intrinsic order graphs (Hasse diagrams) of I_1, I_2, I_3 and I_4, with nodes labeled by the decimal equivalents 0, 1, ..., 2^n − 1 of their binary n-tuples
The Hasse diagram of the poset I_n will also be called the intrinsic order graph for n variables. For all n ≥ 2, in [3] we have developed an algorithm for iteratively building up the digraph of I_n from the digraph of I_1. Basically, I_n is obtained by adding to I_{n−1} its isomorphic copy 2^{n−1} + I_{n−1}: a nice fractal property of I_n!
In Fig. 1, the intrinsic order graphs of I_1, I_2, I_3 and I_4 are shown, using the decimal numbering instead of the binary representation of their 2^n nodes, for a more comfortable and simpler notation. Each pair (u, v) of vertices connected in the digraph of I_n, either by one edge or by a longer descending path (consisting of more than one edge) from u to v, means that u is intrinsically greater than v, i.e., u ⪰ v. On the contrary, each pair (u, v) of vertices not connected, either by one edge or by a longer descending path, means that u and v are incomparable by intrinsic order, i.e., neither u ⪰ v nor v ⪰ u.
Looking at any of the four graphs in Fig. 1, we can confirm the properties of the intrinsic order stated by both Example 5 and Corollary 1.
In the following two definitions, we present three subsets of {0,1}^n related to each binary n-tuple u ∈ {0,1}^n.
Definition 2. For every binary n-tuple u ∈ {0,1}^n, the set C_u (the set C^u, respectively) is the set of all binary n-tuples v whose occurrence probabilities Pr{v} are always less (greater, respectively) than or equal to Pr{u}, i.e., according to Definition 1, those n-tuples v intrinsically less (greater, respectively) than or equal to u, i.e.,

C_u = { v ∈ {0,1}^n | v ⪯ u },   C^u = { v ∈ {0,1}^n | v ⪰ u }.
Definition 3. For every binary n-tuple u ∈ {0,1}^n, Inc(u) is the set of all binary n-tuples v intrinsically incomparable with u, i.e., Inc(u) = { v ∈ {0,1}^n | v ⋠ u and u ⋠ v }. In particular,

C_{(0,...,0)} = {0,1}^n = C^{(1,...,1)},   Inc{(0, ..., 0)} = ∅ = Inc{(1, ..., 1)}.
C_u ∩ C^u = {u}.    (6)
The sets C_u and C^u are closely related via complementary n-tuples, as described with precision by the following definition and theorem (see [4] for the proof). The complementary n-tuple of u is

u^c = (u_1, ..., u_n)^c = (1 − u_1, ..., 1 − u_n),

and the complementary set of a given subset S ⊆ {0,1}^n of binary n-tuples is the set of the complementary n-tuples of all the n-tuples of S,

S^c = { u^c | u ∈ S }.

Theorem 2 (duality).

C^u = [C_{u^c}]^c,   C_u = [C^{u^c}]^c.    (7)
(ii) The vector of positions of 0s of u is the vector of the positions of its (n − m) 0-bits, displayed in increasing order from the left-most to the right-most position, and it will be denoted by

u^0 = [ i_1^0, ..., i_{n-m}^0 ]_n,   1 ≤ i_1^0 < ... < i_{n-m}^0 ≤ n,   0 ≤ m < n.
In [4], the author presents an algorithm for obtaining the set C_u for each given binary n-tuple u. Basically, this algorithm determines C_u by expressing the set difference {0,1}^n − C_u as a union of certain half-closed intervals of natural numbers (decimal representations of bitstrings). The next two theorems present new, more efficient algorithms for rapidly determining C_u and C^u, using the binary representation of their elements. Moreover, our algorithms allow us not only to generate, but also to count, the elements of C_u and C^u.
Theorem 3. Let u ∈ {0,1}^n, u ≠ 0, with (nonzero) Hamming weight w_H(u) = m (0 < m ≤ n). Let u^1 = [ i_1^1, ..., i_m^1 ]_n be the vector of positions of 1s of u. Then C_u is the set of binary n-tuples v generated by the following algorithm.
The Algorithm for C_u
(i) Step 1 (Generation of the sequences {j_1^1, ..., j_m^1}):
For j_1^1 = 1 to i_1^1 Do:
  For j_2^1 = j_1^1 + 1 to i_2^1 Do:
    For j_m^1 = j_{m-1}^1 + 1 to i_m^1 Do:
      Write {j_1^1, j_2^1, ..., j_m^1}
    EndDo
  EndDo
EndDo
|C_u| = \sum_{j_1^1 = 1}^{i_1^1} \; \sum_{j_2^1 = j_1^1 + 1}^{i_2^1} \cdots \sum_{j_m^1 = j_{m-1}^1 + 1}^{i_m^1} 2^{\,n - j_m^1}.    (9)

v ∈ C_u  ⇔  u ⪰ v  ⇒  m = w_H(u) ≤ w_H(v),

that is, the Hamming weight of every n-tuple v ∈ C_u is necessarily greater than or equal to the Hamming weight m of u. So, all n-tuples v of C_u must contain at least m 1-bits.
First, note that, according to Definition 1, u ⪰ v if and only if the matrix M_{v}^{u} satisfies IOC; that is, either M_{v}^{u} has no \binom{1}{0} columns, or for each \binom{1}{0} column in M_{v}^{u} there exists (at least) one corresponding preceding \binom{0}{1} column. But, obviously, IOC can be equivalently reformulated as follows: for each 1-bit in u there exists at least one corresponding 1-bit in v, placed at the same or at a previous position. This reduces the characterization of the elements v ∈ C_u to the vectors of positions of the 1-bits in u and v, and thus, according to the notation used in Definition 5(i), we derive that v ∈ C_u with w_H(v) = m = w_H(u) if and only if
Theorem 4. Let u ∈ {0,1}^n, u ≠ 2^n − 1, with Hamming weight w_H(u) = m (0 ≤ m < n). Let u^0 = [ i_1^0, ..., i_{n-m}^0 ]_n be the vector of positions of 0s of u. Then C^u is the set of binary n-tuples v generated by the following algorithm.
The Algorithm for C^u
(i) Step 1 (Generation of the sequences {j_1^0, ..., j_{n-m}^0}):
For j_1^0 = 1 to i_1^0 Do:
  For j_2^0 = j_1^0 + 1 to i_2^0 Do:
    For j_{n-m}^0 = j_{n-m-1}^0 + 1 to i_{n-m}^0 Do:
      Write {j_1^0, j_2^0, ..., j_{n-m}^0}
    EndDo
  EndDo
EndDo
(ii) Step 2 (Generation of the n-tuples v ∈ C^u):
For every sequence {j_1^0, ..., j_{n-m}^0} generated by Step 1, define the binary n-tuple v = (v_1, ..., v_n) as follows

v_i = 0      if i ∈ {j_1^0, ..., j_{n-m}^0},
v_i = 1      if i ∉ {j_1^0, ..., j_{n-m}^0} and i < j_{n-m}^0,    (11)
v_i = 0, 1   if i ∉ {j_1^0, ..., j_{n-m}^0} and i > j_{n-m}^0.

Moreover,

|C^u| = \sum_{j_1^0 = 1}^{i_1^0} \; \sum_{j_2^0 = j_1^0 + 1}^{i_2^0} \cdots \sum_{j_{n-m}^0 = j_{n-m-1}^0 + 1}^{i_{n-m}^0} 2^{\,n - j_{n-m}^0}.    (12)
Proof. For proving this theorem, it is enough to use Theorems 2 and 3. More precisely, due to the duality property stated by Theorem 2, we get that v ∈ C^u iff v ∈ [C_{u^c}]^c iff v^c ∈ C_{u^c}. Then the proof is concluded using the obvious fact that the 0-bits and 1-bits in u and v become the 1-bits and 0-bits, respectively, in u^c and v^c. □
Remark 5. Note the strong duality relation between Theorems 3 and 4. The statement of each theorem is exactly the statement of the other one after interchanging the 1s and 0s, and C_u and C^u. Indeed, due to Theorem 2, one can determine the set C_u (C^u, respectively) by determining the set [C^{u^c}]^c ([C_{u^c}]^c, respectively) using Theorem 4 (Theorem 3, respectively).
4 Ranking Intervals
For each given binary n-tuple u ∈ {0,1}^n, the next theorem provides us with the closed interval covering all possible values of the rank (position) of u in the list of all binary n-tuples arranged by decreasing order of their occurrence probabilities.
We call this interval “the ranking interval of u”. The following is the main result of
this paper.
Theorem 5 (The ranking interval). Let n ≥ 1 and u ∈ {0,1}^n. Then the ranking interval of u is

[ |C^u| ,  2^n + 1 − |C_u| ],

where |C^u| and |C_u| are respectively given by Eqs. 12 and 9. Moreover, the length of the ranking interval of u coincides with the number of binary n-tuples incomparable by intrinsic order with u, i.e.,

2^n + 1 − |C_u| − |C^u| = |Inc(u)|.
Proof. On one hand, using Eq. 6 we have that u ∈ C_u. Then, in the list of all binary n-tuples arranged by decreasing order of their occurrence probabilities, at least |C_u| − 1 binary strings always lie below u. Thus, in that list, u is always between position 1 (the first possible position) and position 2^n − (|C_u| − 1) (the last possible position), i.e.,

u ∈ [ 1 ,  2^n + 1 − |C_u| ].    (15)

On the other hand, using again Eq. 6 we have that u ∈ C^u. Then, in the list of all binary n-tuples arranged by decreasing order of their occurrence probabilities, at least |C^u| − 1 binary strings always lie above u. Thus, in that list, u is always between position |C^u| (the first possible position) and position 2^n (the last possible position), i.e.,

u ∈ [ |C^u| ,  2^n ].    (16)
First, the left-most inequality in Eq. 18, i.e., 1 ≤ |C^u|, is obvious taking into account that u ∈ C^u (see Eq. 6). Second, the right-most inequality in Eq. 18, i.e., 2^n + 1 − |C_u| ≤ 2^n, that is, 1 ≤ |C_u|, is obvious taking into account that u ∈ C_u (see Eq. 6). Third, the central inequality in Eq. 18, i.e., |C^u| ≤ 2^n + 1 − |C_u|, that is, |C_u| + |C^u| ≤ 2^n + 1, immediately follows from Eq. (6): using the inclusion-exclusion formula, we have

|C_u ∪ C^u| = |C_u| + |C^u| − |C_u ∩ C^u| = |C_u| + |C^u| − 1 ≤ 2^n,

as was to be shown. □
The following example illustrates Theorems 3, 4 and 5.
Example 7. Let n = 5 and u = 21 ≡ (1,0,1,0,1). Then we have m = w_H(u) = 3, n − m = 2, and

u^1 = [ i_1^1, i_2^1, i_3^1 ]_5 = [1, 3, 5]_5,   u^0 = [ i_1^0, i_2^0 ]_5 = [2, 4]_5.
Using Theorems 3 and 4 we can generate all the 5-tuples v ∈ C_u and all the 5-tuples v ∈ C^u, respectively. The results are depicted in Tables 1 and 2, respectively. The first column of Tables 1 and 2 shows the auxiliary sequences generated by Step 1 of Theorems 3 and 4, respectively. The second column shows all the binary 5-tuples v ∈ C_u and v ∈ C^u generated by Step 2 of Theorems 3 and 4, respectively. The symbol "∗" means that both binary digits, 0 and 1, must be chosen, as described by the last line of (8) and (11). According to (9) and (12), the sums of all the quantities in the third column of Tables 1 and 2 give the number of elements of C_u and C^u, respectively, using just the first columns, so that we do not need to obtain these elements via the second columns.
So, on one hand, regarding the bitstrings intrinsically comparable with u, from Tables 1 and 2 we have

C_u = {21, 22, 23, 25, 26, 27, 28, 29, 30, 31},  |C_u| = 10,
C^u = {0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 16, 17, 18, 19, 20, 21},  |C^u| = 20,

so that, on the other hand, the ranking interval of u is

[ |C^u| ,  2^n + 1 − |C_u| ] = [ 20 , 23 ].
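The numbers of Example 7 can be reproduced by evaluating the nested sums of Eqs. (9) and (12) over the sequences generated by Step 1 of Theorems 3 and 4; the following sketch is an illustration only, not the author's code.

    def sequences(limits, start=1):
        """Yield strictly increasing sequences (j_1,...,j_k) with j_l <= limits[l-1]."""
        if not limits:
            yield ()
            return
        for j in range(start, limits[0] + 1):
            for rest in sequences(limits[1:], j + 1):
                yield (j,) + rest

    def cardinality(limits, n):
        """Eqs. (9)/(12): sum of 2**(n - j_last) over the generated sequences."""
        return sum(2 ** (n - seq[-1]) for seq in sequences(limits))

    n = 5
    u1 = [1, 3, 5]                    # positions of 1s of u = 21 = (1,0,1,0,1)
    u0 = [2, 4]                       # positions of 0s of u
    C_u_size = cardinality(u1, n)     # |C_u|  = 10  (Eq. 9)
    C_sup_size = cardinality(u0, n)   # |C^u|  = 20  (Eq. 12)
    print(C_u_size, C_sup_size, [C_sup_size, 2 ** n + 1 - C_u_size])   # 10 20 [20, 23]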
5 Conclusions
For any fixed binary n-tuple u, two dual algorithms have been presented for rapidly generating the set C_u (C^u, respectively) of binary n-tuples that are always (i.e., intrinsically) less (more, respectively) probable than u. These algorithms, based exclusively on the positions of the 1-bits (0-bits, respectively) of u, also allow one to count the elements of C_u (C^u, respectively) with no need to obtain them beforehand. From the cardinalities of the sets C_u and C^u, we have exactly obtained the ranking interval for the binary n-tuple u, i.e., the interval of all possible positions (ranks) of u in the list of all the 2^n binary n-tuples arranged by decreasing order of their occurrence probabilities. In addition, the length of the ranking interval for u gives us the exact number of binary n-tuples v intrinsically incomparable with u. Applications can be found in many different scientific and engineering areas modeled by a CSBS.
References
1. L. González, A new method for ordering binary states probabilities in reliability and risk analysis. Lect. Notes Comput. Sci. 2329, 137–146 (2002)
2. L. González, N-tuples of 0s and 1s: necessary and sufficient conditions for intrinsic order. Lect. Notes Comput. Sci. 2667, 937–946 (2003)
3. L. González, A picture for complex stochastic Boolean systems: the intrinsic order graph. Lect. Notes Comput. Sci. 3993, 305–312 (2006)
4. L. González, Algorithm comparing binary string probabilities in complex stochastic Boolean systems using intrinsic order graph. Adv. Complex Syst. 10(Suppl. 1), 111–143 (2007)
5. L. González, D. García, B. Galván, An intrinsic order criterion to evaluate large, complex fault trees. IEEE Trans. Reliab. 53, 297–305 (2004)
6. W.G. Schneeweiss, Boolean Functions with Engineering Applications and Computer Pro-
grams (Springer, New York, 1989)
7. R.P. Stanley, Enumerative Combinatorics, vol. 1 (Cambridge University Press, Cambridge,
1997)
Chapter 32
Predicting Memory Phases
Michael Zwick
Abstract In recent years, phase classification has frequently been discussed as a method to guide scheduling, compiler optimizations and program simulations. In this chapter, I introduce a new classification method called Setvectors. I show that the new method outperforms the classification accuracy of state-of-the-art methods by approximately 6–25%, while it has about the same computational complexity as the fastest known methods. Additionally, I introduce a new metric called PoEC (Percentage of Equal Clustering) to objectively compare phase classification techniques.
1 Introduction
It is well known that the execution of computer programs shows cyclic, recurring behavior over time. In one time interval, a program may get stuck waiting for I/O; in another time interval, it may stall on branch mispredictions or wait for the ALU (Arithmetic Logic Unit) to complete its calculations. These intervals of specific behavior are called phases. Phase classification is the method that analyzes programs and groups program intervals of similar behavior into equal phases [5, 7, 8, 12].
To identify phases, programs are split into intervals of a fixed number of instructions. Then, these intervals are analyzed by some method to predict their behavior. Intervals showing similar behavior are grouped together to form a specific class, a phase. Therefore, a phase is a set of intervals that show similar behavior while discarding temporal adjacency. The phase information of a program can be used to guide scheduling, compiler optimizations, program simulations, etc.
Several phase classification techniques have been proposed in recent years, many of which rely on code metrics such as Basic Block Vectors [12] and Dynamic
M. Zwick
Lehrstuhl für Datenverarbeitung, Technische Universität München, Arcisstr. 21, 80333 München, Germany
e-mail: [email protected]
Branch Counts [3]. Since code-related methods have only low correlation with memory hierarchy behavior, several memory-related phase classification methods have been proposed to predict L1 and L2 processor cache misses, such as wavelet-based phase classification [5], Activity Vectors [11] and Stack Reuse Distances [4].
In this chapter, I introduce a new phase classification technique called the Setvector method to predict L2 cache performance. On the basis of ten SPEC2006 benchmarks, I show that the mean accuracy of the Setvector method outperforms the Activity Vector method by about 6%, the Stack Reuse Distance method by about 18% and the wavelet-based method by about 25%. Further, I introduce a new metric called PoEC (Percentage of Equal Clustering) to objectively evaluate different phase classification methods and make them comparable to one another.
The remainder of this chapter is organized as follows: Section 2 describes state-
of-the-art phase classification techniques, Section 3 presents the Setvector method,
Section 4 introduces PoEC as a methodology to evaluate phase classification
accuracy, Section 5 presents the results and Section 6 concludes this chapter.
Huffmire and Sherwood [5] use Haar wavelets [13] to perform phase classification. First, they create 16×16 matrices for each interval of 1 million instructions. To this end, they split each set of 10^6 instructions into 20 subsets of 10^6/20 = 50,000 instructions, forming 20 column vectors. They determine the elements of every such column vector by calculating m = ((address % M) · (400/M)) for each address in the corresponding subset, where '%' is the modulo operator and M = 16k is the modulo size, matched to the L1 cache size. By iterating over each address of the 50k instructions, Huffmire and Sherwood fill up the rows of the column vectors by summing up the occurrences of each m (0 ≤ m < 400) in a histogram manner. After having calculated each of the 20 400×1 column vectors, they scale the resulting 400×20 matrix to a 16×16 matrix using methods from image scaling.
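A small sketch of this address-to-histogram mapping, assuming the bin index is obtained as m = (address % M) · (400/M); the trace addresses here are synthetic, not taken from a real benchmark.

    import numpy as np

    M = 16 * 1024                       # modulo size matched to the L1 cache size
    def column_vector(addresses, bins=400):
        m = ((addresses % M) * (bins / M)).astype(int)   # bin index, 0 <= m < 400
        return np.bincount(m, minlength=bins)            # histogram of occurrences

    rng = np.random.default_rng(0)
    addresses = rng.integers(0, 2**32, size=50_000, dtype=np.uint64)
    col = column_vector(addresses)
    print(col.shape, col.sum())          # (400,) 50000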
In a second step, they calculate the Haar wavelet transform [13] of each 16×16 matrix and weight the coefficients according to [6].
In a third step, Huffmire and Sherwood apply the k-means clustering algorithm [10] to the scaled wavelet coefficients and compare the clusters with the hitrates of the corresponding intervals, obtained by a cache simulator.
In my implementation of the wavelet technique, I followed Huffmire's and Sherwood's description except for the following: I split each 10^6 instructions into 16 intervals of 64k instructions each, to omit the scaling from 20 to 16 columns. Additionally, I utilized the MCCCSim simulator [14], which is based on Huffmire's and Sherwood's cache simulator anyway. I implemented everything else as it has been presented in [5].
Since the top-left element of the coefficient matrix corresponds to the mean value of the original matrix, I also clustered all top-left elements of the coefficient matrices and compared the results of the clustering process to the L2 hitrates of the corresponding intervals.
As the wavelet transform does not seem an obvious choice of algorithm for this problem, yet it achieved the good results shown by Huffmire and Sherwood, I decided to replace the wavelet transform by an SVD (Singular Value Decomposition) M = USV^T in another experiment, to probe whether a more general method could find more information in the data matrix. I clustered both the columns and rows of U and V, respectively, but could not find any impressive results, as I will show in Section 5.
Settle et al. [11] propose to use Activity Vectors for enhanced SMT (simultaneous
multi-threaded) job scheduling. The Activity Vectors are used like a phase classifi-
cation technique to classify program behavior with respect to memory hierarchy
performance. The Activity Vector method has been proposed as an online classifi-
cation method that relies on a set of on-chip event counters that count memory
accesses to so-called Super Sets. Note that Settle et al. include both access count
and cache miss information in their Activity Vectors. In my evaluation however,
I exclusively incorporate access count information.
I implemented the Activity Vector method in software and applied the same
tracefile data that I applied to all the other simulations mentioned in this chapter. To
use the Activity Vector method as a phase classification method, I clustered both the
vectors and the length of each vector and compared the clustering results with the
L2 cache performance of the corresponding intervals.
In Section 5, I show that the Activity Vector method on average achieves better
results than the wavelet based method.
Beyls and D’Hollander [1] propose to use the so-called Stack Reuse Distance as a
metric for cache behavior. They define the Stack Reuse Distance of a memory
access as the number of unique memory addresses that have been referenced
since the last access to the requested data. In Section 5, I show that on average,
the classification performance of the Stack Reuse Distance method lies between the
wavelet and the Activity Vector method, whereas its calculation takes a huge
amount of time.
Although many other techniques for phase classification have been proposed, such as Basic Block Vectors [12], Local/Global Stride [8], Working Set Signatures [2], etc., I did not compare the Setvector technique to those methods, as it has been shown that they are outperformed by other methods, for example the wavelet-based method [5].
In this section, I describe the Setvector-based phase classification method. Setvectors are as easily derived as they are effective: for all addresses of an interval and an n-way set-associative cache, determine for each cache set the number of addresses with different keys that are mapped to that set. That means: given an L2 cache with 32-bit address length that uses b bits to code the byte selection, s bits to code the selection of the cache set and k = 32 − s − b bits to code the key that has to be compared to the tags stored in the tag RAM, do the following:
• Extract the set number from the address, e.g. by shifting the address k bits to the left and then unsigned-shifting the result k+b bits to the right.
• Extract the key from the address, e.g. by unsigned-shifting the address s+b bits to the right.
• In the list for the given set, determine whether the given key is already present.
• If the key is already present, do nothing and process the next address.
• If the key is not in the list yet, add the key and increase the counter that corresponds to that set.
More formally written: starting with a tracefile

T = { a_i | 1 ≤ i ≤ t }    (1)

that contains t memory addresses a_i, 1 ≤ i ≤ t, grouped into intervals of a fixed number of instructions, split the tracefile into a set of access vectors a_i, each representing an interval of several a_i:

T = [ a_1 ⋯ a_c ].    (2)
Now, for each a_i and caches with a row size of r bytes and a way size of w bytes, derive the set vector

s_i = [ s_1 ⋯ s_S ]^T    (3)

by

S_i = { a/r  |  a ∈ a_i,  ((a % w) − ((a % w) % r)) / r = i,  a/r ∉ S_i }    (4)

s_i = \min\Big( 2^n - 1, \; \sum_{a/r \in S_i} 1 \Big)    (5)
with '%' representing the modulo operator and n the maximum number of bits allowed to code the set vector elements, if there is such a size constraint. This way, each element of a set vector contains, for the corresponding interval, the number of addresses that belong to the same cache set but have a different cache key – saturated at 2^n − 1, where n is the number of bits at most to be spent for each vector element. Due to this composition, the Setvectors directly represent cache set saturation, a measure that is highly correlated with cache misses.
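A small sketch of the Setvector extraction for a single interval, following Eqs. (4) and (5); the cache parameters (row size, way size, saturation width) and the example addresses are illustrative values, not the configuration evaluated in this chapter.

    def setvector(addresses, r=64, w=64 * 1024, sat_bits=8):
        """Per-set count of distinct line addresses (i.e. distinct keys) in one interval."""
        num_sets = w // r
        seen = [set() for _ in range(num_sets)]
        for a in addresses:
            line = a // r                    # block address: key and set together
            s = (a % w) // r                 # cache set index
            seen[s].add(line)                # distinct keys only, as in Eq. (4)
        cap = 2 ** sat_bits - 1              # saturation of Eq. (5)
        return [min(cap, len(keys)) for keys in seen]

    # Example: Setvector of a small synthetic address interval.
    addrs = [0x1000, 0x1040, 0x2000, 0x11000, 0x1000]
    print(sum(setvector(addrs)))             # total number of distinct (set, key) pairs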
In Section 5, I show that on average, the Setvector method outperforms all
methods mentioned in Section 2.
Lau et al. [8] define the Coefficient of Variation (CoV) as a metric to measure the effectiveness of phase classification techniques. CoV measures the standard deviation as a percentage of the average and can be calculated as

CoV = \sum_{i=1}^{phases} \frac{\sigma_i}{\mathrm{average}_i} \cdot \frac{\mathrm{intervals}_i}{\mathrm{total\ intervals}}.    (6)
Huffmire and Sherwood adapt this metric by omitting the division by the average, resulting in the weighted standard deviation

\sigma_{weighted} = \sum_{i=1}^{phases} \sigma_i \cdot \frac{\mathrm{intervals}_i}{\mathrm{total\ intervals}}.    (7)
Being derived from the standard deviation, both CoV and σ_weighted denote better clustering performance by smaller values. However, the CoV metric (as well as σ_weighted) may describe the standard deviation of the L2 cache performance in each phase, but not the correlation between the L2 cache performance and the different phases, which should be the key evaluation criterion for phase classification. Therefore, I developed the PoEC (Percentage of Equal Clustering) metric, which can be calculated as follows:
Consider the cluster vector g as a vector that holds, for each index, the phase of the corresponding 16×16 scaled matrix. In a first step, sort the elements of the cluster vector g according to its phases, such that g_i ≤ g_{i+1} for all i ∈ 1..indeces. Then, in a second step, calculate the percentage of equal clustering as

PoEC = 2 \cdot \left( \max\!\left( \frac{\sum_{i=1}^{indeces} (g_{h,i} == g_{x,i})}{indeces}, \; 0.5 \right) - 0.5 \right).    (8)
This way, a high correlation between the L2 cache performance and the cluster vectors results in PoEC values near 1, and a low correlation corresponds to values near 0, with 0 ≤ PoEC ≤ 1.
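A sketch of the PoEC computation under the reading of Eq. (8) given above, where chance agreement (0.5 for two phases) maps to 0 and full agreement maps to 1; the cluster labels in the example are made up and serve only to show the scaling.

    import numpy as np

    def poec(g_h, g_x):
        """Percentage of Equal Clustering between two sorted cluster vectors."""
        g_h = np.sort(np.asarray(g_h))
        g_x = np.sort(np.asarray(g_x))
        frac = np.mean(g_h == g_x)            # fraction of equal entries
        return 2.0 * (max(frac, 0.5) - 0.5)   # rescale [0.5, 1] -> [0, 1]

    print(poec([0, 0, 1, 1, 1, 1], [0, 0, 0, 1, 1, 1]))   # 5/6 agree -> ~0.67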
Figures 1 (CoV) and 2 (PoEC) show the difference between these metrics by clustering some SPEC2006 benchmarks into two phases ("good ones" and "bad ones") using the wavelet method and plotting the phases (each ring corresponds to one interval) against the L2 hitrate of the corresponding intervals. As L2 cache hitrates of the same magnitude should be classified into the same class, a good clustering is achieved if one class contains the higher hitrates and the other contains the lower hitrates, as is the case for the "milc", "soplex", "lbm" and "bzip2" benchmarks.
Figure 1 shows the clustering of the SPEC2006 benchmark programs sorted by their CoV value (calculated according to formula 6). When analyzing Figure 1, one can observe the following: there are benchmarks that achieve good clustering, such as "soplex", "milc", "bzip2" and "lbm", and there are benchmarks that do not cluster well at all, such as "hmmer", "libquantum" and "gobmk". But the point is: the clustering quality does not fit the CoV value the plots are arranged by. Although not plotted here, the σ_weighted metric shows similar behavior.
In Figure 2, the programs are sorted according to their PoEC value (calculated according to formula 8). Although the clustering is the same, this time the clustering quality does fit the PoEC value the plots are arranged by.
As the PoEC metric is obviously more applicable for evaluating clustering performance, I decided to omit both σ_weighted and CoV and to perform the evaluation of the phase classification techniques using the PoEC metric.
5 Results
Figure 3 shows the PoEC values for each of the mentioned methods for ten
SPEC2006 benchmarks. For the Activity Vectors, I clustered the vector as well
as the magnitude of the vector |Activity Vector|. For the Haar wavelet method, I
Fig. 1 Two-phase clustering of the SPEC2006 benchmarks (wavelet method) plotted against the L2 hitrate of the corresponding intervals, arranged by CoV value
clustered both the scaled matrix and the left-top matrix element (Haar wavelet[0]
[0]). For the Setvectors, I clustered the magnitude of the Setvectors; for the Stack
Reuse Distances, I clustered the Stack Distance Vector. The results shown for the
Fig. 2 Two-phase clustering of the SPEC2006 benchmarks (wavelet method) plotted against the L2 hitrate of the corresponding intervals, arranged by PoEC value
SVD originate from the column space of U, which has been calculated from the scaled matrices described above. The column/row space of V did not achieve any better results. PoEC values near 1 indicate good classification performance, PoEC values near
0 indicate poor classification performance. The benchmark mcf for example shows
superior performance for the Activity Vector method, the wavelet method and the
Setvector approach and poor classification performance for the Stack Distance and
SVD method.
Figure 4 depicts the mean classification performance of each method, averaged over the ten SPEC2006 benchmarks. The Setvector approach outperforms all other methods and achieves about 6% better classification performance than the next best method, the Activity Vector method, while the Stack Reuse Distance method shows about 18% inferior performance compared to the Setvector method. While the wavelet-based method still shows fair results (about 75% of the performance of the Setvector method), the Singular Value Decomposition of the scaled matrix has apparently not been a good idea at all.
Fig. 3 PoEC values of each phase classification method for the ten SPEC2006 benchmarks
Fig. 4 Mean classification performance (PoEC) of each method averaged over the ten SPEC2006 benchmarks, relative to the Setvector method (|Setvectors| 100%, ..., SVD 9%)
6 Conclusion
Within this chapter, I introduced a new method for phase classification called Setvectors. The method is similar to the Activity Vector method proposed in [11], but it differs in the way the vectors are obtained: while Settle et al. just count accesses to Super Sets, the Setvector method calculates the number of accesses to a set that reference a different key. I showed that the proposed Setvector method outperforms state-of-the-art methods with respect to classification accuracy by approximately 6–25%, while having about the same computational complexity. As a second contribution, I introduced the PoEC metric, which can be used to objectively evaluate phase classification methods in a more intuitive way than the known metrics CoV and σ_weighted. Although I proved the better performance of the PoEC
References
1. K. Beyls, E.H. D’Hollander, Reuse distance as a metric for cache behavior, in Proceedings of
the IASTED Conference on Parallel and Distributed Computing and Systems, 2001
2. A.S. Dhodapkar, J.E. Smith, Managing multi-configuration hardware via dynamic working set
analysis, in International Symposium on Computer Architecture (ISCA’02), 2002
3. E. Duesterwald, C. Cascaval, S. Dwarkadas, Characterizing and predicting program behavior
and its variability, in 12th International Conference on Parallel Architectures and Compila-
tion Techniques (PACT’03), 2003
4. C. Fang, S. Carr, S. Önder, Z. Wang, Reuse-distance-based miss-rate prediction on a per
instruction basis, in Proceedings of the 2004 Workshop on Memory System Performance, 2004
5. T. Huffmire, T. Sherwood, Wavelet-based phase classification, in Parallel Architectures and
Compilation Techniques (PACT’06), 2006
6. C. Jacobs, A. Finkelstein, D. Salesin, Fast multiresolution image querying, in Proceedings of
the 22nd Annual Conference on Computer Graphics and Interactive Techniques, 1995
7. J. Lau, J. Sampson, E. Perelman, G. Hamerly, B. Calder, The strong correlation between code
signatures and performance. International Symposium on Performance Analysis of Systems
and Software, 2005
8. J. Lau, S. Schoenmackers, B. Calder, Structures for phase classification. International Sym-
posium on Performance Analysis of Systems and Software, 2004
9. C.-K. Luk, R. Cohn, R. Muth, H. Patil, A. Klausner, G. Lowney, S. Wallace, V.J. Reddi,
K. Hazelwood, Pin: building customized program analysis tools with dynamic instrumenta-
tion. Programming Language Design and Implementation, 2005
10. J. MacQueen, Some methods for classification and analysis of multivariate observations, in
Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability,
vol. 1, pp. 281–297, 1967
11. A. Settle, J.L. Kihm, A. Janiszewski, D.A. Connors, Architectural support for enhanced SMT job scheduling, in Proceedings of the 13th International Conference on Parallel Architectures and Compilation Techniques, 2004
12. T. Sherwood, E. Perelman, G. Hamerly, B. Calder, Automatically characterizing large scale
program behavior, in ASPLOS-X: Proceedings of the 10th International Conference on
Architectural Support for Programming Languages and Operating Systems, 2002
13. E. Stollnitz, T. DeRose, D. Salesin, Wavelets for computer graphics: a primer. IEEE Com-
puter Graphics and Applications, 1995
14. M. Zwick, M. Durkovic, F. Obermeier, W. Bamberger, K. Diepold, MCCCSim – a highly configurable multi core cache contention simulator. Technical Report – Technische Universität München, https://mediatum2.ub.tum.de/doc/802638/802638.pdf, 2009
Chapter 33
Information Security Enhancement to
Public–Key Cryptosystem Through Magic
Squares
1 Introduction
Today security is an important thing that we need to send data from one location to
another safely. As there is strong security, there is a great hacker and spire at the
other side. Therefore, many models and systems are looking for optimization of the
electronic connected-world. Cryptography accepts the challenges and plays the
vital role of the modern secure communication world. It is the study of mathemati-
cal techniques related to aspects of information security such as confidentiality,
G. Ganapathy (*)
Department of Computer Science, Bharathidasan University, 620 024, India
e-mail: [email protected]
2 Methodology
• To encrypt a character, use the ASCII value of the character to determine the numeral in the magic square by considering its position in it. Let NP and NC denote the numeral of the plaintext and cipher text respectively. Based on the NP and NC values, all plaintext and cipher text characters are encrypted and decrypted respectively using the RSA algorithm.
• To speed up the operations, perform them in parallel in a simulated environment. For that, use the Maui scheduler with a backfilling philosophy.
The methodology of the Add-on Security Model is shown in Fig. 1.
Fig. 1 Add-on wrapper model: (a) encryption process – the ASCII value of the clear text determines the position of a numeral in the doubly even magic square (generated from a starting number, seed number and magic square sum), and that numeral is fed to the encryption algorithm to produce the cipher text; (b) decryption process – the decryption algorithm recovers the numeral, and the position of that numeral in the magic square gives back the ASCII value of the clear text
called the magic constant or magic sum, M [5]. The magic constant of a normal
magic square depends only on n and has the value
M(n) = n(n^2 + 1) / 2    (1)
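As a quick check of Eq. (1), a small Python helper (purely illustrative) computes this magic constant:

    def magic_constant(n):
        # M(n) = n * (n^2 + 1) / 2 for a normal n x n magic square
        return n * (n * n + 1) // 2

    print(magic_constant(4))    # 34
    print(magic_constant(16))   # 2056

Note that the squares generated later in this chapter use an arbitrary starting number, seed number and magic sum (12,345 in Table 6), so their row sums deliberately differ from this value.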
MS4_prop_sub i: ith 4th-order property-based MS obtained from MSB4 by using the fundamental property of the MS4 magic square based on Spi, where pi = 1, 2, 3, and 4
Prop: properties of the fundamental MS4 with four character values
||: concatenation
In this work, to generate the doubly even magic square, any seed number, starting number, and magic sum may be used, and the numbers generated will not be in consecutive order. The algorithm 2.2.1 proposed by Gopinath Ganapathy and Mani [8] starts with building a 4 × 4 magic square. Incrementally, 8 × 8 and 16 × 16 magic squares are built using 4 × 4 magic squares as building blocks.
2.2.1 Algorithm
2.2.2 Example
Based on Algorithm 2.2.1, the first four MS4_subi, i = 1, 2, 3 and 4, are generated and shown in Tables 2–5.
The other magic squares MS8_sub2, MS8_sub3 and MS8_sub4 are generated in this manner, and they are concatenated so that they form MS16, which is shown in Table 6.
Similarly, other MS16 magic squares are generated by using suitable transformations of the seed number. In this paper, only four MS16 magic squares are generated.
square is one of a set of eight that are equivalent under rotation or reflection. In order to generate different views of a magic square, the general format shown in Table 7 is used.
• Normal
A magic square is said to be normal when every row, column, and the main and secondary diagonals sum to the magic sum. That is, B + C + N + O = E + H + I + L = F + G + J + K = A + D + M + P = MST4sum.
Table 6 First MS16 using starting no. 4, seed no. 0100, and magic sum 12,345
1535 4 1531 16 26 28 1513 1519 74 76 1465 1471 1455 85 1451 96
1529 18 1533 6 1521 1511 34 20 1473 1463 82 68 1449 98 1454 86
12 1527 8 1539 30 24 1517 1515 78 72 1469 1467 93 1447 88 1459
10 1537 14 1525 1509 1523 22 32 1461 1475 70 80 90 1457 94 1446
1503 36 1499 48 1487 52 1483 64 1439 101 1435 112 1423 116 1419 128
1497 50 1501 38 1481 66 1485 54 1433 114 1438 102 1417 130 1421 118
44 1495 40 1507 60 1479 56 1491 109 1431 104 1443 124 1415 120 1427
42 1505 46 1493 58 1489 62 1477 106 1441 110 1430 122 1425 126 1413
1407 132 1403 144 1391 149 1387 160 1343 196 1339 208 1327 212 1323 224
1401 146 1405 134 1385 162 1390 150 1337 210 1341 198 1321 226 1325 214
140 1399 136 1411 157 1383 152 1395 204 1335 200 1347 220 1319 216 1331
138 1409 142 1397 154 1393 158 1382 202 1345 206 1333 218 1329 222 1317
1375 165 1371 176 186 188 1353 1359 1311 228 1307 240 250 252 1289 1295
1369 178 1374 166 1361 1351 194 180 1305 242 1309 230 1297 1287 258 244
173 1367 168 1379 190 184 1357 1355 236 1303 232 1315 254 248 1293 1291
170 1377 174 1366 1349 1363 182 192 234 1313 238 1301 1285 1299 246 256
• Pan-Magic
It has the additional property that the broken diagonals also sum to the magic sum. That is, B + G + L + M = D + E + J + O = C + F + I + P = A + H + K + N = C + H + I + N = B + E + L + O = MST4sum. The magic squares depicted in Tables 2 through 5 are normal and pan-magic.
• Associative
A magic square is associative when it has the additional property that the eight pairs of cells symmetrically opposite the centre of the square shown in Table 2, namely AP, BO, CN, DM, EL, FK, GJ and HI, have the magic sum MST4sum.
• Quadrant Associative
It has the property that all diagonally opposite pairs of numbers within the four main quadrants sum to MST4sum. That is, the diagonally opposite pairs of numbers within the four quadrants, AF, BE, CH, DG, IN, JM, KP and LO, have the magic sum MST4sum.
• Distributive
It has the property that each of the four integers in the sets (1, 2, 3, 4), (5, 6, 7, 8), (9, 10, 11, 12), and (13, 14, 15, 16) is located in a row and column where none of the other three numbers of its set is located [9, 10].
In this work, to generate the doubly even magic square based on these properties, algorithm 2.3.3 is used, which is similar to algorithm 2.2.1 with a slight modification. Apart from an arbitrary starting number and magic sum, the fundamental magic square seed number 0000 and the four character values for the Prop variable are given as input. The algorithm starts by constructing 4 × 4 fundamental magic squares, and sixteen 4 × 4 fundamental magic squares are generated. In each magic square, the properties of the fundamental magic square are applied based on a cyclic left shift of the value of the Prop variable. The algorithm is shown in 2.3.3.
2.3.3 Algorithm
Input: 4-digit seed number, starting number, magic square property, and magic square sum
Output: Doubly even magic square of order 16.
To generate the 4 × 4 fundamental magic squares, the algorithm uses steps 1 to 5.b.1.3 of Algorithm 2.2.1. To generate further, the following steps are used.
5.b.1.4 Case Spi in 'P', 'A', 'Q', 'D'
    'P': MS4_prop_sub i ← Pan-magic of MS4_base i
    'A': MS4_prop_sub i ← Associative of MS4_base i
    'Q': MS4_prop_sub i ← Quadrant Associative of MS4_base i
    'D': MS4_prop_sub i ← Distributive of MS4_base i
    end Case
5.b.1.5 prev ← prev + 1
    end for i
5.b.2 MS8_prop_sub j ← MS4_prop_sub 1 || MS4_prop_sub 2 || MS4_prop_sub 3 || MS4_prop_sub 4
5.b.3 case j in 2, 3, 4
    2: Sp1 ← Sp2; Sp2 ← Sp3; Sp3 ← Sp4; Sp4 ← Sp1
    3: Sp1 ← Sp3; Sp2 ← Sp4; Sp3 ← Sp1; Sp4 ← Sp2
    4: Sp1 ← Sp4; Sp2 ← Sp1; Sp3 ← Sp2; Sp4 ← Sp3
    end case j
5.b.4 Spi ← Sp1 || Sp2 || Sp3 || Sp4, i ← 1, 2, 3, 4
    end for j
5.c MSP16_count ← MS8_prop_sub1 || MS8_prop_sub2 || MS8_prop_sub3 || MS8_prop_sub4
5.d Si ← (Si + rnd(10)) mod 8, i = 1, 2, 3, 4
5.e count ← count + 1
end while count
Since the third character is 'Q', i.e. quadrant associative, MS4_prop_sub3 is generated from MS4_sub3 and is shown in Table 10.
Since the fourth letter is 'D', i.e. distributive, MS4_prop_sub4 is generated from MS4_sub4 and is shown in Table 11. Thus, to form MS8_prop_sub1, all four MS4_prop_subi, i = 1, 2, 3 and 4 (shown in Tables 8 through 11), are concatenated, and the resulting first quadrant 8 × 8 magic square is shown in Table 12.
The second quadrant MS8_prop_sub2 magic square is generated by transforming the seed number as Sp1 ← Sp2; Sp2 ← Sp3; Sp3 ← Sp4; Sp4 ← Sp1 and changing the 'Prop' value to 'AQDP'; this second quadrant 8 × 8 magic square is generated from the next four fundamental 4 × 4 magic squares, as shown in Table 13.
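The quadrant-to-quadrant transformation just described is a simultaneous cyclic left rotation of the seed parts Sp1, Sp2, Sp3, Sp4 and of the property string. A small Python sketch (the helper name is mine) makes the intended simultaneous assignment explicit for the second quadrant:

    def rotate_left(parts, k=1):
        # cyclic left rotation, e.g. rotate_left(list("PAQD")) -> ['A', 'Q', 'D', 'P']
        k %= len(parts)
        return parts[k:] + parts[:k]

    prop = list("PAQD")   # property order used for the first quadrant
    seed = list("0100")   # a 4-digit seed number split into Sp1..Sp4

    # Sp1 <- Sp2, Sp2 <- Sp3, Sp3 <- Sp4, Sp4 <- Sp1 applied simultaneously,
    # while the Prop value changes from 'PAQD' to 'AQDP'
    prop2, seed2 = rotate_left(prop), rotate_left(seed)
    print("".join(prop2), "".join(seed2))   # AQDP 1000

The assignments in step 5.b.3 are meant to be read in this simultaneous way; applying them one after another would overwrite Sp1 before it is copied into Sp4.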
In order to get a proper understanding of the subject matter of this paper, let p = 11, q = 17 and e = 7; then n = 11(17) = 187 and (p − 1)(q − 1) = 10(16) = 160. Now d = 23. To encrypt, C = M^7 mod 187, and to decrypt, M = C^23 mod 187. Suppose the message to be encrypted is “BABA”. The ASCII values for A and B are 65 and 66 respectively. To encrypt B, the numerals that occur at the 66th position in the first (Table 6) and third MS16 (not shown here) are taken, because B occurs in the first and third positions of the clear text. Similarly, to encrypt A, the numerals at the 65th position in the second and fourth MS16 (not shown here) are taken. Thus NP(A) = 42 and 48, and NP(B) = 36 and 44. Hence NC(B) = 36^7 mod 187 = 9, NC(A) = 42^7 mod 187 = 15, NC(B) = 44^7 mod 187 = 22, and NC(A) = 48^7 mod 187 = 159. Thus, for the same A and B, which occur more than once, different cipher texts are produced. Similar encryption and decryption operations can be performed by considering the numerals that occur at positions 66 and 65 for the characters B and A respectively in the MS16_prop magic square.
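The numbers in this example can be reproduced with a few lines of Python; this is only a toy check of the textbook RSA arithmetic on the looked-up numerals, not the parallel implementation of Section 4 (pow(e, -1, phi) needs Python 3.8 or later):

    p, q, e = 11, 17, 7
    n, phi = p * q, (p - 1) * (q - 1)   # n = 187, phi = 160
    d = pow(e, -1, phi)                 # 23, since 7 * 23 = 161 = 1 (mod 160)

    # numerals looked up in the four MS16 squares for the message "BABA"
    numerals = {"B1": 36, "A1": 42, "B2": 44, "A2": 48}
    cipher = {k: pow(m, e, n) for k, m in numerals.items()}
    print(cipher)                       # {'B1': 9, 'A1': 15, 'B2': 22, 'A2': 159}

    # decryption with d recovers the original numerals
    assert all(pow(c, d, n) == numerals[k] for k, c in cipher.items())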
4 Parallel Cryptography
Table 17 Encryption and decryption time using RSA on a Pentium processor, based on properties of magic squares

File size      1 processor          2 processors         4 processors         8 processors
(MB)          E     D     T        E     D     T        E     D     T        E     D     T
1            362   405   767      560   607 1,167      502   515 1,017      450   447   897
2            700   707 1,407      737   756 1,493      590   599 1,189      514   530 1,044
3          1,332 1,370 2,702    1,073 1,097 2,170      753   672 1,425      610   605 1,215
4          2,601 2,618 5,219    1,700 1,741 3,441    1,084 1,100 2,184      765   776 1,541

E – encryption time, D – decryption time, T – total (E + D); all times in milliseconds (ms)
5 Experimental Result
The proposed methodology is implemented in Visual C++ version 6.0. The time taken for the encryption and decryption of messages of various file sizes in a simulated parallel environment, using the RSA public-key cryptosystem based on the properties of magic squares, is shown in Table 17. The simulation scenario configured in our implementation consists of 2, 4, and 8 processors.
From Table 17 we observe that, as the file size doubles, the encryption and decryption times on a single processor also roughly double. Moreover, the time taken for encryption is almost the same as for decryption. Parallel encryption and decryption have more effect as the file size increases.
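The same observation can be quantified as a speedup figure computed directly from the transcribed values of Table 17; for the 4 MB file, for example:

    # total (E + D) times in ms for the 4 MB file, per number of processors (Table 17)
    total_ms = {1: 5219, 2: 3441, 4: 2184, 8: 1541}
    for procs, t in total_ms.items():
        print(f"{procs} processor(s): speedup = {total_ms[1] / t:.2f}")
    # prints 1.00, 1.52, 2.39, 3.39

Repeating this for the 1 MB row gives a speedup of only about 0.86 on 8 processors (767/897 ms), which is the effect noted above: parallel encryption and decryption pay off mainly for larger files.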
6 Conclusion
References
1 Introduction
G. Murugesan (*)
Department of Computer Science and Engineering, Anna University, Chennai, 600 025,
Tamil Nadu, India
e-mail: [email protected]
The major classifications of Grids are computational grid, scavenged grid and data
grid. The first one is mainly concerned with computing power, the second classifi-
cation focuses on how the unused resources can be utilized and the last one is
concerned with data access capability.
The most challenging task in grid computing is the allocation of resources with
respect to the user application requirement; i.e. mapping of jobs to various
resources. This is a known NP-complete problem. For example mapping of 10
jobs on to 10 resources produce 1010 possible mappings. This is because every job
can be mapped to any of the resources. Another complexity in the resource alloca-
tion process is the lack of availability of information about the status of resources.
Much of the work was done on finding an optimal allocation of resources. Some of
the work used conventional strategies that were concerned with the overall system
performance but did not consider economics (prices) for allocating jobs to the
resources. On the other hand, a large number of projects use economic strategies
for allocating jobs. This work also follows the economic strategies for allocating
jobs to the resources. In our scenario, there are three classes of entities: Users,
Scheduler, and Resources. There are n users (also called sources) S1, S2, . . ., Sn, a Scheduler and m Resources (also called Processors) R1, R2, . . ., Rm.
Each user i has Wi jobs and all the jobs are divisible jobs.
The users have jobs that need to be executed. The users send the jobs to the
scheduler which in turn identifies the resources with respect to the job requirement.
The resources represent the computing power of the grid and are where the users’
jobs get executed. Resource providers make a profit by selling computing power.
Users are the entities who can only send the job to the scheduler. Scheduler acts as a
mediator between the user and the resources. Finally resources are the entities that
just receive the jobs and process them.
Comparing the grid system with the systems around us, it is found that the Grid system has a high similarity with the market. The market is a decentralized, competitive and dynamic system where the consumer and the producer have their own objectives. Based on this observation, economic models have been introduced to the Grid in order to optimize the resource management and scheduling problems. The basic components of the market model are the producers, consumers and commodities, which are analogous to the resource owners, resource users and various computing resources in the Grid environment. The decision-making parameters in the market models are the prices and quality of commodities. For example, a consumer usually wants to get better services (e.g., a smaller makespan) with a budget as small as possible, while a provider usually wants to get a higher resource utilization to raise its profits.
The use of economic methods in Grid scheduling involves interacting processes between resource providers and users, analogous to various market behaviors such as bargaining, bidding, auctions and so on. Buyya et al. discuss some economic models that can be applied to the Grid world, including the Commodity Market Model, the Tender/Contract-Net Model, and the Auction Model.
Due to the introduction of economic models in Grid computing, new research opportunities have arisen. Because economic cost and profit are considered by Grid users and resource providers respectively, new objective functions and scheduling algorithms optimizing them have been proposed. Several deadline and budget constrained (DBC) scheduling algorithms have been presented which consider the cost and makespan of a job simultaneously. These algorithms implement different strategies, for example guaranteeing the deadline while minimizing the cost, or guaranteeing the budget while minimizing the completion time. The difficulty of optimizing these two parameters in one algorithm lies in the fact that the units for cost and time are different, and that these two goals usually conflict (for example, high performance resources are usually expensive). Economic models are not only useful when economic cost/profit is explicitly involved, but also for constructing new scheduling methods with traditional objectives.
Economic methods for scheduling problems are very attractive because of their success in our daily lives. Most of the models can only support relatively simple Grid scheduling problems such as independent tasks. For more complex applications, such as those consisting of dependent tasks and requiring cross-site cooperation, more sophisticated economic models might be needed. To our knowledge, most of the economic models focus on feasible solutions with respect to the deadline and budget constraints. In our work, we initially focus on an independent workload model and later on a dependent workload model, to achieve both feasible and optimal allocation with a linear programming approach.
matrix and common fabric management policy. Therefore, hierarchical and decen-
tralized approaches are suitable for Grid resource and operational management.
Within these approaches, there exist different economic models for management
and regulation of supply-and-demand for resources.
[Figure: sources S1, S2, S3 connected through the scheduler to processors P1, P2, P3]
(processing elements) to execute the jobs assigned by the scheduler. On the other side of the scheduler is a set of sources, the grid users. The scheduler collects the load from all the sources and distributes it to a set of processors p1, p2, . . ., pn. Initially, all the loads are held by the scheduler. All communications start from the scheduler, and there is no direct communication between the processors. For simplicity, we assume here that the scheduler only communicates with the processors to distribute the loads and collect the results.
Here we consider the communication delay (transmission time) needed to send a portion of load to a processor, and assume that the time to return the result is negligible. A processor starts processing only after receiving the entire workload assigned to it. We also assume that the processors have independent communication hardware, which allows simultaneous communication and computation on previously received loads. Additional constraints may be imposed on the processors: due to the existence of other, more urgent computations or maintenance periods, the availability of processor pi may be restricted to some interval [ri, di], where ri is the release time and di the deadline of processor pi. By such a restriction we mean that computations may take place only in the interval [ri, di]. A message with the load may arrive or start arriving before ri. We assume that computations start immediately after the later of the two events: ri or the load arrival. The computation must fit between the later of these two events and di. The loads are received from different sources with different capacities, and each load is divisible without any restriction. Each load received from a source is divided into a set of different tasks (portions of the workload). At most one task may be assigned to a processor. The portion of load assigned depends on the capacity of the processor. Suppose there are m sources {S1, S2, . . ., Sm} and n processors {P1, P2, . . ., Pn}, and the workloads from the sources are L = {L1, L2, . . ., Lm}, where L1 is the total workload received from source S1, and so on. Each workload Li may be divided into T1, T2, . . .. The load L can be reordered by the scheduler to achieve good performance of the computation. The scheduler splits the workload into tasks and sends them to processors to perform the specified process. Only a subset of the processors may be used to process the workload of a source.
Let aij denote the size of the task assigned to processor pj, expressed in load units (e.g. in bytes); aij = 0 implies that pj is not in the set of processors selected to process the workload of the ith source. The total workload from a source is the sum of the sizes of its workload parts, i.e. Σj aij = Li. The scheduler selects not only pj, but also the sequence of activating the processors and the division of load Li into chunks aij. Here we consider the processors as unrelated processors; their communication links and start-up times are specific to the task. Similarly, the processor computing rates depend on the processor and the task.
Let zj be the communication rate of the link to processor pj perceived by task Tj. Transferring a portion of workload aij to pj takes zj·aij time units. Let tj be the processing rate of processor pj perceived by task Tj. Processing the portion aij of load on processor pj takes tj·aij time units. Let cj be the processing cost of processor pj for the portion of workload Tj. The total processing cost to process a portion of workload aij on processor pj is then cj·aij cost units.
In this work we analyze the complexity of scheduling the divisible loads L1, L2, . . ., Lm of sizes T1, T2, . . . on n parallel processors P1, P2, . . ., Pn which are interconnected. We assume that the processors have sufficient memory buffers to store the received portions of the workloads and computations. Every processor starts processing immediately after receiving its entire portion of workload. One processor can process more than one source's portion of workload.
By constructing a schedule, the scheduler decides on the sequence of the tasks, the set of processors assigned to each portion of workload, the sequence of processor activation and the size of the load parts. Our objective is to minimize the grid user's cost. We assume that there is no separate start-up time for individual processors and no fixed cost to utilize the processors, and that all processors are dedicated. In practice this is not always the case, but these assumptions simplify our model and reduce the number of variables and constraints. The following notations are used to formulate the mathematical model.
Minimize
    Σi Σj cj aij xij

Subject to

1. Σi Σj zj aij xij + Σi Σj tj aij xij ≤ di
2. Σi Σj zj aij xij + Σi Σj tj aij xij ≤ si
3. Σi Σj cj aij xij ≤ bi
4. Σj aij = oi
5. Σi xij = 1
6. aij ≥ 0, integer
7. xij ∈ {0, 1}
8. sj ≥ 0
9. zj = 0
The objective is to minimize the total cost of the grid users who assign jobs to the grid system. The objective function represents the cost of all jobs that are assigned to the resources. Constraints (1) through (9) specify the constraints used in the mathematical model. Constraint (1) represents the deadline associated with each source. Constraint (2) matches the workload to the availability of the resources. Constraint (3) enforces the budget of each source's workload. Constraint (4) represents the total workload of each source involved in the scheduling process. Constraint (5) makes sure that a portion of workload is assigned to only one resource, i.e., there is no overlap between the processing of workloads. Constraint (6) makes sure that each portion divided from the total workload is a whole number. Constraint (7) sets the binary value to either 0 or 1. Constraint (8) is a non-negativity constraint, and with constraint (9) we assume that there is no communication delay. In practice this is not the case, but for simplicity of the model we assume zj = 0.
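A small Python sketch (with hypothetical data; the chapter itself solves the model with the LINGO package) shows how a candidate allocation aij with assignment xij can be checked against the objective and the deadline, budget, workload and assignment constraints above, reading the per-source constraints as holding for each source i:

    def evaluate(a, x, z, t, c, o, d, b):
        """a[i][j]: load of source i on processor j; x[i][j]: 0/1 assignment.
        z[j], t[j], c[j]: communication rate, processing rate and unit cost of pj.
        o[i], d[i], b[i]: total workload, deadline and budget of source i."""
        m, n = len(a), len(a[0])
        cost = sum(c[j] * a[i][j] * x[i][j] for i in range(m) for j in range(n))
        ok = all(
            sum((z[j] + t[j]) * a[i][j] * x[i][j] for j in range(n)) <= d[i]  # (1) deadline
            and sum(c[j] * a[i][j] * x[i][j] for j in range(n)) <= b[i]       # (3) budget
            and sum(a[i][j] for j in range(n)) == o[i]                        # (4) workload
            for i in range(m)
        ) and all(sum(x[i][j] for i in range(m)) == 1 for j in range(n))      # (5) one source per processor
        return cost, ok

    # hypothetical data: 2 sources, 3 processors (z = 0 as in constraint (9))
    z, t, c = [0, 0, 0], [2, 3, 1], [5, 4, 6]
    o, d, b = [10, 8], [30, 30], [70, 60]
    a = [[6, 4, 0], [0, 0, 8]]
    x = [[1, 1, 0], [0, 0, 1]]
    print(evaluate(a, x, z, t, c, o, d, b))   # (94, True)

An optimizer (such as LINGO in Section 6) then searches over aij and xij for the feasible allocation with the smallest cost.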
6 Experimental Results
Let us assume that the Grid system consists of five processors (resources), namely P1, P2, P3, P4, and P5, and that three sources S1, S2, and S3 are trying to utilize the grid system to execute their workloads.
Table 1 shows the processors involved in the Grid system, the processing
capacity of each processor per unit workloads, the processing cost of each processor
to execute a unit workload and the available time of each processor. Table 2 shows
the details of the different sources which are trying to utilize the grid system, total
workload of each source and the budget allotted to each of the sources to complete
their workloads and the expected time to complete the process of each workload.
Using the details given in the tables, we have formed the mathematical model and solved it using the LINGO package. After execution of the mathematical model, the total cost for processing all three source workloads is Rs. 351. Table 3 shows the details of the workload allotment to the processors involved in the process. From the table it is clear that the total workload of S1 is divided into three parts, the total workload of S2 is divided into three parts, and the total workload of S3 is divided into four parts, allotted to the processors. It also shows the completion time of each source's workload.
7 Related Works
To date, several grid scheduling algorithms have been proposed to optimize the overall grid system performance. The study of managing resources in the Grid environment started in the 1960s. The economic problem [1] results from having different ways of using the available resources, and thus from deciding what the best way to use them is. Utilization of the Grid must be cheaper for the Grid user than purchasing their own resources [2] and must satisfy their requirements. On the other hand, resource providers must know whether it is worth providing their resources for usage by the Grid users. Also, the pricing of resources should not be per time unit or slot (e.g. cost per minute) [3], because of the big differences in processor speeds: the price per unit time of a processor might be cheaper, but the user may have to pay a large amount of money due to slow processing resources. Moreover, the users would have to know how many time units they need to finish their jobs. Thus, the cost for the Grid user must be determined based on the tasks the resource is processing.
The deadline scheduling algorithm [4] is one of the algorithms that follow the economic strategy. The aim of this algorithm is to decrease the number of jobs that do not meet their deadlines. The resources are priced according to their performance. This algorithm also has a fallback mechanism, which means that it can ask the grid user to resubmit the jobs that did not meet the deadline of the available resources. Nimrod/G [5, 6] includes four scheduling algorithms: cost, time, conservative time and cost-time. The cost scheduling algorithm tries to decrease the amount of money paid for executing the jobs with respect to the deadline. The time scheduling algorithm attempts to minimize the time required to complete the jobs with respect to their budget allotment. The conservative time scheduling algorithm aims to execute the jobs within the stipulated budget and the deadlines. The cost-time scheduling algorithm works like the cost scheduling algorithm, except that when there are two or more resources with the same price, it employs the time scheduling algorithm. It does not deal with co-allocation.
An effective workload allocation model with a single source has been proposed [7] for data grid systems. Market-based resource allocation for grid computing (Gomaluch and Schroeder [17]) supports time and space shared allocations, and furthermore supports co-allocation. It is supposed that resources have a background load which changes with time and has the highest priority for execution, so they are not fully dedicated to the grid. Gridway [8] is an agent-based scheduling system. It aims to minimize the execution time, total cost and the performance-cost ratio of the submitted job. In market economy based resource allocation in Grids [9], auction-based user scheduling policies for selecting resources were proposed. GridIs [10] is a P2P decentralized framework for economic scheduling using the tender model. The authors try to perform the process without considering the deadline, and the algorithm is implemented with the help of a variable called the conservative degree, whose value lies between 0 and 1.
The time and cost trade-off has been addressed [11] with two meta-scheduling heuristic algorithms that minimize and manage the execution cost and time of user applications; the authors also present a cost metric to manage the trade-off between execution cost and time. The Compute Power Market [12] is an architecture responsible for managing grid resources and mapping jobs to suitable resources according to the utility functions used. The Parallel Virtual Machine [13, 14] enables computational resources to be used as if they were a single high performance machine; it supports execution both on single and on multiple resources by splitting a task into subtasks. Grid scheduling using a mathematical model was proposed [15] with an equal portion of load for all the processors; i.e., the entire workload received from a source is equally divided, with the help of random numbers, and a portion of the load is assigned to each processor. The work closest to ours is [6], where the authors proposed an algorithm that claims to meet the budget and deadline constraints of jobs. However, the proposed algorithm is ad hoc and does not have an explicit objective.
8 Conclusion
the resources are static. This work can be extended to a dynamic setting as well; we are working on the dynamic arrival of loads and resources. We have also assumed that sufficient buffer space is available in all the resources. The experimental results demonstrate the usefulness of this strategy. We need a system to find out the execution time of a task as well as the cost of usage of a processor/resource. The execution time depends entirely on the processor speed.
References
1. J. Nakai, Pricing computing resources: reading between the lines and beyond. Technical
Report, National Aeronautics and Space Administration, 2002
2. L. He, X. Sun, G.V. Laszewski, A qos guided scheduling algorithm for grid scheduling, 2003
3. L. He, Designing economic-based distributed resource and task allocation mechanism for self-
interested agents in computational grids, in Proceeding of GRACEHOPPER, 2004
4. A. Takefusa, S. Matsuoka, H. Casanova, F. Berman, A study of deadline scheduling for client-
server system on the computational grid, in HPDC’01: Proceedings of the 10th IEEE
International Symposium on High Performance Distributed Computing, pp. 406–415, 2001
5. R. Buyya, Economic-based distributed resource management and scheduling for grid computing.
PhD Thesis, Monash University, Melbourne, Australia, 2002
6. R. Buyya, D. Abramson, J. Giddy, Nimrod/g: An architecture for a resource management and
scheduling system in a global computational grid, HPC ASIA’2000, 2000
7. M. Abdullah, M. Othman, H. Ibrahim, S. Subramaniam, Load allocation model for scheduling
divisible data grid applications. J. Comput. Sci. 5(10), 760–763 (2009)
8. R.A. Moreno, A.B. Alonso-Conde, Job scheduling and resource management technique in
economic grid environments. European Across Grids Conference, pp. 25–32, 2003
9. S.R. Reddy, Market economy based resource allocation in grids. Master's Thesis, Indian Institute of Technology, Kharagpur, India, 2006
10. L. Xiao, Y. Zhu, L.M. Ni and Z. Xu, Gridis: An incentive-based grid scheduling, in IPDPS’05:
Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium,
Washington, DC, 2005, IEEE Computer Society
11. S.K. Garg, R. Buyya, H.J. Siegel, Scheduling parallel applications on utility grids: time and
cost trade-off management, in Proceedings of the Thirty-Second Australasian Computer
Science Conference (ACSC2009), Wellington, Australia, Conferences in Research and Prac-
tice in Information Technology (CRPIT), vol. 91
12. R. Buyya, S. Vazhkudai, Computer power market: towards a market oriented grid. CCGRID,
pp. 574–581, 2001
13. A. Beguelin, J. Dongarra, A. Geist, V. Sunderam, Visualization and debugging in a heteroge-
neous environment. Computer 26(6), 88–95 (1993)
14. V.P.V.M. Sunderam, A framework for parallel distributed computing. Concurrency Prac. Exp.
2(4), 315–340 (1990)
15. G. Murugesan, C. Chellappan, An optimal allocation of loads from multiple sources for Grid
Scheduling, ICETIC’2009, International Conference on Emerging Trends in Computing,
2009
16. R. Buyya, M. Murshed, D. Abramson, A deadline and budget constrained cost-time optimi-
zation algorithm for scheduling task farming applications on global grids, Technical Report
CSSE-2002/109, Monash University, Melbourne, Australia, 2002
17. J. Gomaluch, M. Schroeder, Market-based resource allocation for grid computing: a model and simulation, in Middleware Workshops, pp. 211–218
18. G. Murugesan, C. Chellappan, An economic allocation of resources for multiple grid applica-
tions, in WCECS2009, Proceedings of the World Congress on Engineering and Computer
Science, vol I, San Francisco, CA, 2009
19. M. Othman, M. Abdullah, H. Ibrahim, S. Subramaniam, Adaptive divisible load model for
scheduling data-intensive grid applications: computational science, LNCS, 4487, 246–253
(2007)
20. S. Viswanathan, B. Veeravalli, T.G. Robertazzi, Resource-aware distributed scheduling
strategies for large-scale computational cluster/grid systems. IEEE Trans. Parallel Distributed
Syst.18: 1450–1461, 2007
21. Q. Xiao, Design and analysis of a load balancing strategy in data grids, Future Generation
Comput. Syst. 16, 132–137, 2007
Chapter 35
A Free and Didactic Implementation
of the SEND Protocol for IPv6
Abstract IPv6 adds many improvements to IPv4 in areas such as address space,
built-in security, quality of service, routing and network auto-configuration. IPv6
nodes use the Neighbor Discovery (ND) protocol to discover other nodes on the link, to determine their link-layer addresses, to find routers, to detect duplicate addresses, and to maintain reachability information about the paths to active neighbors. ND is vulnerable to various attacks when it is not secured. The original
specifications of ND called for the use of IPsec as a security mechanism to protect
ND messages. However, its use is impractical due to the very large number of
manually configured security associations needed for protecting ND. For this
reason, the Secure Neighbor Discovery Protocol (SEND) was proposed. In this
work, we present Easy-SEND, an open source implementation of SEND that can be
used in a production environment or as a didactic application for the teaching and
learning of the SEND protocol. Easy-SEND is easy to install and use, and it has an
event logger that can help network administrators to troubleshoot problems or
students in their studies. It also includes a tool to generate and verify Cryptographi-
cally Generated Addresses (CGA) that are used with SEND.
1 Introduction
IPv6 [1–3] (Internet Protocol version 6) is a solution to the problem of the shortage of public IPv4 addresses that the Internet faces. IPv6 adds many improvements to IPv4
in areas such as quality of service, routing and network auto-configuration. Even if
IPv6 has been around for more than 10 years now, there is a lack of IPv6 network
S. Chiu (*)
Universidad Central de Venezuela, Facultad de Ciencias, Escuela de Computación, Paseo Los
Ilustres, Urb. Valle Abajo, Caracas 1020, Venezuela
e-mail: [email protected]
specialists in Venezuela and around the world. Therefore, the training of IPv6
specialists has become an important issue. In the undergraduate program of Com-
puter Science at Central University of Venezuela (in Spanish: Universidad Central
de Venezuela), some courses have been upgraded or added to the curriculum to face
the problem. For example, Advanced Network Protocols (in Spanish: Protocolos
Avanzados en Redes) is a new course that was introduced to the curriculum of the
undergraduate program of Computer Science in 2005. Its objectives include the
understanding of IPv6 standards, such as the ND [4] (Neighbor Discovery) protocol
and the SEND [5] (Secure Neighbor Discovery) protocol.
Since ND [4] is supported by all modern operating systems, a variety of laboratory exercises are done in the course (Advanced Network Protocols) to strengthen students' knowledge of this important protocol. However, we have been facing a lack of support for SEND [5] (as stated in Section 5) from manufacturers, and it has been almost impossible for us to run laboratories that clarify the complex procedures involved in SEND. Therefore, we decided to develop a new application (Easy-SEND) from scratch that implements the SEND protocol, with good support for the teaching and learning process. Its main goal is to be used as a didactic application in advanced courses related to networks at Central University of Venezuela. Additionally, Easy-SEND can be used in production networks where security is important, as a replacement for the ND protocol.
The rest of this research is organized as follows: An overview of ND is presented
in Section 2. Vulnerability issues for ND are discussed in Section 3. SEND is
presented in Section 4. Related works are viewed in Section 5. Easy-SEND is
introduced and justified in Section 6. Conclusions and future work are discussed in
Section 7.
The ND [4] (Neighbor Discovery) protocol solves a set of problems related to the interaction between nodes attached to the same link. It defines mechanisms to solve each of the following problems:
• Router discovery: During router discovery, a host discovers the local routers on an attached link. This process is equivalent to ICMPv4 router discovery [6].
• Prefix discovery: How hosts discover the set of network prefixes that define which destinations are on-link for an attached link.
• Parameter discovery: Through this process, nodes learn some operating parameters such as the link MTU (Maximum Transmission Unit) or the default hop-limit value to place in outgoing packets.
• Address auto-configuration: How nodes automatically configure addresses for an interface in either the presence or absence of an address configuration server, such as a DHCPv6 [7] (Dynamic Host Configuration Protocol for IPv6) server.
[Figure: Neighbor Discovery message header]
ND can suffer different attacks that can be classified into three types as follows:
• Impersonation/Spoofing: a class of attacks in which a malicious node successfully masquerades as another by falsifying data and thereby gaining an illegitimate advantage. This type of attack is easy to carry out, since there is no control over MAC addresses and they can be changed with little effort.
• DoS (Denial of Service): a type of attack in which a malicious node prevents communication between the node under attack and all other nodes.
• Redirection: a class of attacks in which a malicious node redirects packets away from the legitimate receiver to another node on the link.
According to [10], the ND protocol suffers frequent attacks that include, but are not limited to: Neighbor Solicitation Spoofing, Neighbor Advertisement Spoofing, Neighbor Unreachability Detection Failure, Duplicate Address Detection DoS Attack, Malicious Last Hop Router, Spoofed Redirect Message, Bogus On-Link Prefix, Bogus Address Configuration Prefix, Parameter Spoofing, Replay Attacks, and Neighbor Discovery DoS Attack.
with no human intervention. With IPsec, the key exchange can be done automati-
cally when hosts already have a valid IPv6 address. Without an initial IPv6 address,
the usage of IPsec can be quite tedious and impracticable. For these reasons, the
IETF (Internet Engineering Task Force) developed the SEND [5] (Secure Neighbor
Discovery) protocol.
SEND is a security extension of the ND protocol. It introduces new messages (CPS and CPA) and options (CGA, Timestamp, Nonce, RSA Signature, Trust Anchor and Certificate) to the ND protocol, as well as defense mechanisms against attacks on integrity and identity. The main components used in the SEND protocol are:
• Certification paths: anchored on trusted parties, they certify the authority of routers. A host must be configured with a trust anchor to which the router has a certification path before the host can adopt the router as its default router. CPS (Certification Path Solicitation) and CPA (Certification Path Advertisement) messages are used to discover a certification path to the trust anchor.
• Cryptographically Generated Addresses (CGA): are used to make sure that the sender of a Neighbor Discovery message is the owner of the claimed address. A public-private key pair is generated by all nodes before they can claim an address. A new ND option, the CGA option, is used to carry the public key and associated parameters.
• RSA Signature option: is used to protect all messages relating to Neighbor and Router Discovery.
• Timestamp option: provides replay protection against unsolicited advertisements and redirects.
• Nonce option: ensures that an advertisement is a fresh response to a solicitation sent earlier by the node.
Similarly to the ND protocol, SEND uses ICMPv6 [8] messages. Two new
messages were added for the ADD (Authorization Delegation Discovery) process.
Table 2 shows the value of the Type field from the ICMPv6 message and a brief
description.
SEND defines two mechanisms (CGA and ADD) presented in the following
sections for securing the ND protocol.
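As a rough illustration of the CGA component described above, the following Python sketch derives an interface identifier from a hash over the owner's public key, the subnet prefix and auxiliary parameters. It is a simplification of RFC 3972 (the Sec/hash2 brute-force step and the DER encoding of a real RSA key are omitted), so the names and parameters are illustrative and this is not Easy-SEND's actual code:

    import hashlib, os

    def cga_interface_id(subnet_prefix: bytes, public_key: bytes,
                         sec: int = 0, collision_count: int = 0) -> bytes:
        # Simplified CGA derivation: interface ID = leftmost 64 bits of
        # SHA-1(modifier | prefix | collision count | public key), with the
        # u/g bits cleared and Sec written into the three leftmost bits.
        modifier = os.urandom(16)                             # random 128-bit modifier
        params = modifier + subnet_prefix + bytes([collision_count]) + public_key
        iid = bytearray(hashlib.sha1(params).digest()[:8])    # hash1, 64 bits
        iid[0] = (sec << 5) | (iid[0] & 0b00011100)           # set Sec, clear u/g bits
        return bytes(iid)

    prefix = bytes.fromhex("fe80000000000000")   # link-local /64 prefix
    pubkey = os.urandom(140)                     # placeholder for a DER-encoded public key
    print("CGA:", (prefix + cga_interface_id(prefix, pubkey)).hex(":", 2))

A verifier that receives the CGA Parameters in the CGA option can recompute the same hash and compare it with the interface identifier of the claimed address, which is how a receiver checks that the sender owns the address.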
5 Related Works
Easy-SEND is a free and open source application developed in Java that implements
the SEND [5] (Secure Neighbor Discovery) protocol. It also includes a tool (CGA-
Gen) to generate and verify CGAs [13] (Cryptographically Generated Addresses).
We developed the application in Linux user-space, so it does not require any modification at the kernel level. Easy-SEND works as a firewall between the
network interface card and the IPv6 stack, by using the ip6tables filtering rules.
The actual version is limited to the creation of a secure environment for IPv6 hosts;
that is, hosts are not able to participate in the Router Discovery process.
The architecture and the functionality of Easy-SEND are summarized in Fig. 3.
libipq3 provides a mechanism for passing packets out of the stack for queuing to
user-space, then receiving these packets back into the kernel with a verdict
1 http://www.docomolabs-usa.com/lab_opensource.html
2 http://sourceforge.net/projects/jsend
3 http://www.netfilter.org/projects
[Fig. 3: Easy-SEND architecture – ip6tables/Netfilter rules in the kernel send ND packets to ip_queue, libipq passes them to the user-space process, and the returned verdict determines whether each packet is accepted or dropped]
Initially, ND messages are selected based on ip6tables rules (see Fig. 4) and sent to ip_queue, a queue managed by libipq. From the first to the fifth line of Fig. 4, the ip6tables rules capture RS (type 133), RA (type 134), NS (type 135), NA (type 136), and Redirect (type 137) messages, respectively. The libipq library allows the manipulation of ND packets (aggregation and verification of options) based on the SEND specifications before the determination of the verdict (acceptance or rejection of the packets). Messages marked for rejection are dropped; messages marked for acceptance are processed further.
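The queue-and-verdict loop can be mirrored in a few lines of Python. The sketch below is not Easy-SEND's libipq/Java code; it uses the newer NFQUEUE interface through the third-party netfilterqueue package, and it assumes an ip6tables rule that directs the relevant ICMPv6 messages to queue number 1. The validity check is a named placeholder only:

    # pip install netfilterqueue; requires root and an ip6tables rule sending
    # ND messages (e.g. ICMPv6 type 135) to NFQUEUE number 1
    from netfilterqueue import NetfilterQueue

    def looks_like_valid_send(payload: bytes) -> bool:
        # placeholder: a real implementation would parse the ND message and
        # verify the SEND options (CGA, Timestamp, Nonce, RSA Signature)
        return len(payload) > 40

    def verdict(packet):
        payload = packet.get_payload()   # raw IPv6 packet bytes
        if looks_like_valid_send(payload):
            packet.accept()              # hand the packet back to the kernel
        else:
            packet.drop()                # reject the message

    nfq = NetfilterQueue()
    nfq.bind(1, verdict)                 # queue number 1
    try:
        nfq.run()                        # blocks, passing each packet to verdict()
    finally:
        nfq.unbind()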
Since library libipq was written in C, we used a wrapper, called Virtual Services
IPQ4 or VServ IPQ for short, which allows Java users to call the libipq native
functions from Java applications.
4 http://www.savarese.org/software/vserv-ipq
Class SENDOPT is an abstract class where get and set methods are defined to
access and modify SEND options. Class CGA allows the generation and mani-
pulation of CGA addresses, as well as the validation of CGA options included
in ND messages. Methods of the RSASignature class permit the creation and
verification of digital signatures. TimeStamp and Nonce are classes with methods
to produce and handle timestamps for the protection against replay attacks. Class
Monitor is for the capture and manipulation of packets that arrive at the ip6tables
queue. The rules for the reception and transmission of packets are also defined in
class Monitor. NDMessage is a class for the aggregation (outgoing packets) and
removal (incoming packets) of SEND options in messages. Its methods automati-
cally update the checksum field. Class KeyManagement allows users to generate,
load and save RSA keys.
To validate our implementation of the SEND protocol, we made several test experiments. For reasons of space, in this work we only briefly describe two of our experiments. In the first experiment, we set up a test-bed (see Fig. 6) where we
connected two PCs in an Ethernet LAN. Each PC had 512 MB of RAM, and we
installed Ubuntu 8.04 LTS (a community developed, Linux-based operating sys-
tem), Java 2SE 1.6.0_07 and Easy-SEND.
Fig. 7 shows the ND messages sent by PC1 and PC2 during the execution of a
ping command from PC1 to PC2. We can observe that Easy-SEND modifies the
original messages (NS and NA) to add SEND options (CGA, Timestamp, Nonce,
and RSA Signature). We used Wireshark [17] (a popular network analyzer formerly
known as Ethereal) to analyze the SEND protocol.
In the second experiment, we made interoperability tests between Easy-SEND and
the SEND implementation of DoCoMo USA labs. We made a test-bed similar to the
one shown in Fig. 6, but we installed the SEND implementation of DoCoMo USA
Labs in PC1 and Easy-SEND in PC2. We made different communication tests (FTP,
SSH, PING, TFTP, etc.) between the two PCs and had a perfect interoperability.
Easy-SEND has a powerful event logger to record all the SEND events for
troubleshooting and learning purposes. It is based on Apache log4j5, a Java-based
logging framework. Our event logger has three message levels (INFO, DEBUG,
and ERROR) that can be activated. INFO messages show information related to the
states and processes that occur in the application. DEBUG messages allow users to
get a hexadecimal dump of the messages sent or received by Easy-SEND. ERROR
messages are shown when abnormal situations occur.
Typically, logs can be displayed in real time in a Unix shell. It is also possible to redirect the logs to text files for later analysis. Our event logger can show a great deal of information to users, helping them in the troubleshooting or learning process. Fig. 8 shows the trace generated by packet 9 (a Neighbor Solicitation message) of the capture of Fig. 7.
[Fig. 8: Hexadecimal dump of packet 9 (a Neighbor Solicitation message) produced by the Easy-SEND event logger]
5 http://logging.apache.org/log4j/1.2/index.html
At the present time, Easy-SEND is the only free and open source implementation of the SEND protocol that is still active, since DoCoMo USA Labs is no longer supporting its implementation. The source code of Easy-SEND can be downloaded from SourceForge (a well-known web-based source code repository) at http://sourceforge.net/projects/easy-send. To facilitate the learning and teaching process, a virtual appliance is also available. A virtual appliance is a minimalist virtual machine image designed to run under some sort of virtualization technology, aimed at eliminating installation, configuration and maintenance costs. Our virtual appliance can be run with VMWare Player.6
Security aspects of IPv6 networks are not fundamentally different from those of IPv4 networks; therefore, many of the possible attacks on IPv4 protocols can also be carried out against IPv6 protocols, and the ND protocol is no exception. SEND has been developed to minimize the attacks that can be launched against IPv6 nodes during their auto-configuration process. Unlike IPsec, SEND is easier and more practical to implement.
Note that [18] proposes a new protocol called TRDP (Trusted Router Discovery
Protocol) to secure the router discovery process. Compared with the ADD (Autho-
rization Delegation Discovery) process introduced in the SEND protocol, TRDP
obviates the burdensome work for a host to parse the lengthy certification path,
improves efficiency of network communication between the router and hosts during
the router authentication process, and also reduces the exposure to attacks on both
hosts and the access router.
At Central University of Venezuela, SEND is an important topic in the syllabus
of the Advanced Network Protocols course. Instructors had a hard time teaching
SEND, so some laboratories were introduced to clarify the complex processes
associated to SEND. Initially, the SEND implementation of DoCoMo USA Labs
was used, but this implementation has been discarded since its debugging system
has some weaknesses and DoCoMo USA Labs announced that it will no longer
maintain the project. Easy-SEND was developed to face this situation.
Easy-SEND is a free and open source implementation of the SEND protocol
developed under the GNU General Public License in Java. It has a powerful event
logger with three levels of messages. Its main goal is to be used for teaching and learning purposes, but it can also be used in a production environment. Since it is open
source, its source code can be enhanced or adapted by other researchers for specific
needs.
6 http://www.vmware.com/products/player
At the present time, Easy-SEND is the only active open source project that implements the SEND protocol. We plan to develop it further. Our goal is to offer the application to other national and international universities to be used in their curricula for teaching the SEND protocol. As future work, we plan to integrate into our application the Router Discovery process with the ADD trust mechanism, and extension fields to the CGA parameter data structure to support different hash functions [19]. Also, the implementation of Easy-SEND in Linux user-space cannot be considered a final solution, so we will adapt our application to the kernel to optimize the aggregation and verification of SEND message options. Finally, we want to port our application to Microsoft Windows operating systems.
References
Abstract Nowadays, there are a wide variety of network benchmark tools, giving
researchers and network administrators many options to work with. However, this
variety tends to hinder the selection of the appropriate tool. Furthermore, users are sometimes forced to try several tools in order to find one that measures a desired metric, so they have to learn how to manipulate different tools and how
to interpret the obtained results. This research offers a compilation of network
benchmark tools currently used, with the purpose of guiding the selection of one
tool over the others, by outlining their main features, strengths and weaknesses.
1 Introduction
K. Velásquez (*)
Universidad Central de Venezuela, Facultad de Ciencias, Escuela de Computación, Paseo Los
Ilustres, Urb. Valle Abajo, Caracas 1020, Venezuela
e-mail: [email protected]
2 Related Works
Interest in network benchmark tools has been very strong in the Internet community, to the point that Parker and Schmechel [2] wrote an RFC (Request For Comments) to present some of them, even though RFCs are usually used to specify protocols. Botta et al. [3] present an exhaustive state-of-the-art survey on available
bandwidth estimation tools. They divided the tools in three categories: (1) end-to-
end capacity estimation tools, (2) available bandwidth estimation tools, and (3)
TCP throughput and bulk transfer capacity measurement tools. They also introduce
a tool called BET (Bandwidth Estimation Tool) and they compare it with other
performance tools, in terms of accuracy and total time used to complete the
measurement process. Ubik and Král [4] summarize their experience with band-
width estimation tools; they focus on finding the size and location of bottlenecks.
They present a classification of end-to-end bandwidth estimation tools based on a
number of properties, which include determining bandwidth (installed bandwidth)
vs. throughput (available bandwidth), sender only software vs. sender and receiver
software, etc. They also describe properties for a few selected tools, present the
results of measurement with one tool (pathload) on a high-speed scenario and
results of combined measurements with several tools over a real fast long-distance
network. Gupta et al. [5] perform an experimental comparison study of both passive
and active bandwidth estimation tools for 802.11-based wireless networks, and
conclude that for wireless networks a passive technique provides greater accuracy.
Strauss et al. [6] describe Spruce, a simple tool for measuring available bandwidth,
and then compare it with other existing tools over many different Internet paths.
Their study is based on accuracy, failure patterns, probe overhead, and implemen-
tation issues. Montesino [7] presents a comparative analysis of state-of-the-art
active bandwidth estimation techniques and tools. He offers a short overview of a
set of tests performed over different conditions and scenarios, which were done to
assess the performance of active bandwidth estimation tools. Mingzhe et al. [8]
present WBest, a wireless bandwidth estimation tool designed to accurately esti-
mate the available bandwidth in IEEE 802.11 networks, explaining that most of the
existing tools were not designed for wireless networks. They define the algorithm
employed by WBest and present a set of experiments and analysis of the results.
They also compare their tool with other tools like pathchirp [9] and pathload [10].
For our study, we selected seven different throughput and bulk capacity measure-
ment tools: Netperf, D-ITG, NetStress, MGEN, LANforge, WlanTV, and TTCP. We
chose these tools for their popularity and also because they are active projects.
Many performance tools available in Internet have no updated versions and seem to
be dead. These seven tools also offer different features, so they provided us with a
much wider range for our analysis.
Our goal is to present a clear description of the available tools that can help users
choose one that is best suited for a specific experiment. Additionally, we gathered
some desirable features that a throughput and bulk transfer capacity measurement
tool should have since our goal is to develop a new network benchmark tool from
scratch.
During our analysis, each necessary step, questions that arose, the efforts it took
to find the answers to them, features that were helpful or confusing, and the required
time were carefully recorded. Progress and experiences were discussed among the
authors.
According to [11], most of the tools included in our study belong to the group of
Throughput and Bulk Transfer Capacity (BTC) measurement tools, which are
benchmark tools that use large TCP and/or UDP transfers to measure the available
throughput in an end-to-end path. The experiences with each of the tools are
described below. For tools that require a Unix environment, tests were conducted
under Linux Debian 5.0 (Lenny) with kernel version 2.6.26. For Windows, tests
were performed under Windows XP Professional SP3.
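To make the notion of a bulk transfer measurement concrete, here is a minimal Python sketch of the principle these tools share: one side sends a large TCP stream while the other side times the reception and reports the achieved throughput. It is only an illustration of the technique, not one of the surveyed tools:

    import socket, time

    def receiver(port=5001):
        srv = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
        srv.bind(("", port))
        srv.listen(1)
        conn, _ = srv.accept()
        total, start = 0, time.time()
        while True:
            chunk = conn.recv(65536)
            if not chunk:                       # sender closed the connection
                break
            total += len(chunk)
        elapsed = time.time() - start
        print(f"{total * 8 / elapsed / 1e6:.1f} Mbit/s over {elapsed:.1f} s")
        conn.close()
        srv.close()

    def sender(host, megabytes=100, port=5001):
        sock = socket.create_connection((host, port))
        block = b"\x00" * 65536
        for _ in range(megabytes * 16):         # 16 x 64 KiB = 1 MiB
            sock.sendall(block)
        sock.close()

Real benchmark tools add to this skeleton: configurable socket buffers and message sizes, UDP tests, bidirectional flows, and far more detailed statistics.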
3.1 Netperf
Netperf1 is an open source benchmark tool that can be used to measure various
aspects of networking performance. Its primary focus is on bulk data transfer and
request/response performance using either TCP or UDP and the BSD socket
interface.
Netperf is designed around the basic client-server model. There are two execu-
tables (netperf and netserver). The netserver program can be invoked by inetd (the
system daemon), or can be run as a standalone daemon. In the first method, users
must have administrator privileges; the second method implies that users must
remember to run the program explicitly. Unless users want to change the default
port (option -p) or want to work with IPv6 (option -6) instead of IPv4, the server is
invoked without options.
1 http://www.netperf.org
Netperf is only supported by Unix platforms. It is possible to use IPv4 or IPv6 as the
network layer protocol, just by specifying the option -4 or -6 respectively. It allows
the usage of both TCP and UDP as the transport layer protocol.
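As a rough illustration of how such a measurement could be scripted, the hedged Python sketch below launches a bulk-transfer test against a running netserver and extracts the reported throughput; the -H (remote host) and -l (test length) options are standard, but the server address and the assumption that the throughput is the last numeric field of the final output line are ours and may not hold for every Netperf version.

```python
import subprocess

def run_netperf(server="192.0.2.10", seconds=10):
    """Run a default (TCP_STREAM) netperf test and return the throughput figure.
    The parsing below assumes the throughput is the last field of the last
    non-empty line of the summary table, which is an assumption on our part."""
    out = subprocess.run(
        ["netperf", "-H", server, "-l", str(seconds)],
        capture_output=True, text=True, check=True
    ).stdout
    last = [line for line in out.splitlines() if line.strip()][-1]
    return float(last.split()[-1])

if __name__ == "__main__":
    print("Reported throughput (10^6 bits/s, assumed):", run_netperf())
```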
3.2 D-ITG
2 http://www.grid.unina.it/software/ITG/index.php
3 http://www.octave.org
This tool can work in two modes: single mode and script mode. The single mode
allows users to generate a single flow of packets from a client (ITGSend) to the
server (ITGRecv). The script mode enables ITGSend to simultaneously generate
several flows from a single client. Each flow is managed by a single thread, and an
additional thread acts as master and coordinates the other threads. To generate n
flows, the script file (input file) must contain n lines, each of which is used to specify
the characteristics of one flow.
Execution is started via console (no GUI is offered) using different options
provided by the tool, which can be consulted with the -h option. Some configurable
parameters include the TTL (Time to Live), inter-departure time, payload size and
protocol type.
D-ITG is available for Unix platforms as well as Windows platforms. IPv4 and IPv6
are both supported as network layer protocols; in Unix, to select one or the other
users must specify the right type of IP address (IPv4 address or IPv6 address) with
the -a option in ITGSend. In Windows there are two different binary files, one for
IPv4 support and another one for IPv6 support.
By default, UDP is used as the transport layer protocol, but it is possible to use
TCP and even ICMP. Upper layer protocols supported include Telnet, DNS, and
RTP for VoIP applications.
3.3 NetStress
NetStress installation is very easy. Once the setup file was downloaded, we only had
to execute it and follow the installation assistant’s instructions.
On the official website there is a link to the NetStress help file that contains a
brief description of the tool and its purpose, system requirements, and descriptions
on how to run the server and the client. Additionally, it includes a section
4 http://www.performancewifi.net/performance-wifi/main/netstress.htm
explaining how to interpret the results. This help can also be accessed within the
application, by choosing the Contents option in the Help menu.
NetStress was developed for Windows platforms. It employs IPv4 at the network
layer and TCP at the transport layer. It does not offer support for other protocols.
3.4 MGEN
Two packages are available for Linux platforms: source code and a precompiled
version. We installed the source files to guarantee system compatibility. We started
by uncompressing the tarball and used the command make -f Makefile.linux, since
there are several makefiles for Unix systems and one must be chosen according to
5 http://cs.itd.nrl.navy.mil/work/mgen
the target system. For Windows platforms there is no need for installation, since the
ZIP file includes an executable file ready to use.
The distribution contains an HTML file with extensive documentation. It
explains the tool’s usage and the different options. It also offers several examples
on how to use the tool to evaluate performance.
Input files (configuration files) can be used to drive the generated loading patterns
over time. A command-line option is also available.
Input files can be used to specify the traffic patterns of unicast and/or multicast
UDP/IP applications. Defined data flows can follow periodic (CBR), Poisson, and
burst patterns. These data flows can be modified during the experiment, since the
input file allows users to change a given data flow at specific times. Some fields of
the IP header can be set. When a multicast flow is defined, users can specify the
TTL (Time to Live) value. For both unicast and multicast flows, the ToS (Type of
Service) value can also be specified. For IPv6 flows, a Flow Label value can be
defined. For multicast flows, users can control when to join or leave the multicast
group, indicating the IP address of the multicast group to join or leave and the time
to do so.
Results are shown on standard output, or they can be redirected to a logfile for
later analysis. These results are only the report of the packets exchanged; no
additional information is given.
To obtain statistics, users must record the results in a logfile, which can be later
used as an input for the trpr6 (Trace Plot Real-time) program. trpr analyzes the
output of MGEN and creates an output that is suitable for plotting. It also supports a
range of functionalities for specific uses of the gnuplot7 graphing program. Impor-
tant results, such as the throughput, delivery latency, loss rate, message reordering,
and multicast join/leave latency can be calculated from the information in the
output logfile. However, it is left to users to do this calculation.
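Since this post-processing is left to users, the hedged Python sketch below shows one way the throughput and loss rate could be derived from a logfile. The record layout assumed here (RECV lines with a leading timestamp in seconds and seq>/size> fields for a single flow) is a simplification on our part and may not match a particular MGEN version; trpr remains the documented route.

```python
import re

# Assumed record format: "<seconds> RECV ... seq><n> ... size><bytes>"
RECV = re.compile(r"^(?P<t>\d+\.\d+)\s+RECV\b.*?seq>(?P<seq>\d+).*?size>(?P<size>\d+)")

def summarize(logfile):
    """Compute throughput (bit/s) and loss rate from a simplified MGEN-style log."""
    times, seqs, nbytes = [], [], 0
    with open(logfile) as fh:
        for line in fh:
            m = RECV.search(line)
            if m:
                times.append(float(m.group("t")))
                seqs.append(int(m.group("seq")))
                nbytes += int(m.group("size"))
    if len(times) < 2:
        return None
    duration = times[-1] - times[0]
    expected = max(seqs) - min(seqs) + 1      # assumes one flow, no sequence wrap
    return {"throughput_bps": 8 * nbytes / duration,
            "loss_rate": 1.0 - len(seqs) / expected}
```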
The updated version of the MGEN toolset, MGEN 4.0, provides support for Win32
platforms in addition to a broad base of Unix-based platforms, including MacOS X.
Several enhancements are planned, including support for TCP, since MGEN
currently only supports UDP. IPv4 and IPv6 can be used as the network layer
protocol.
6 http://pf.itd.nrl.navy.mil/protools/trpr.html
7 http://www.gnuplot.info
3.5 LANforge
LANforge is a tool that allows users to simulate networks and perform tests over
them. It includes a GUI that makes the testing process easier. There are several tests
that can be performed with this tool, including configuration of virtual routers (only
for the Linux version). Some of the results that can be obtained with this tool are:
bytes received and transmitted, packets received and transmitted, bits per second
(bps) received and transmitted, collisions and errors; these results are shown in text
and graphically, in real time.
8 http://www.candelatech.com/lanforge_v3/datasheet.html
Users can configure many aspects of the test, including the number of packets to
send, interface type of the endpoint to simulate (ISDN, T1, modem, etc), and even
custom payloads can be defined. LANforge also incorporates a VoIP Call Generator
which currently supports H.323 and SIP (Session Initiation Protocol). The voice
payload is transmitted with RTP (Real-time Transport Protocol) which runs over
UDP. RTCP (Real-time Transport Control Protocol) is used for latency and other
accounting. Jitter buffers are used to smooth out network jitter inherent in RTP
traffic.
WAN links can also be simulated and tested. Various characteristics can be
added to the traffic flowing through them, including maximum bandwidth, latency,
jitter, dropped packets, duplicated packets, bit and byte errors, etc. Results are
reported as text and graphically, using a vertical bar graph.
The LANforge GUI has many tabs grouping different tests, and it can be
confusing to use because of the many options offered. However, there is
a detailed description of each tab in the GUI User Guide, which can be found on the
website. This guide shows several examples and screenshots that clarify the
process.
LANforge is available for Linux, Windows, and Solaris platforms (the Solaris
platform was not included in this research). Supported protocols include, but are
not limited to, raw Ethernet, MPLS, IPv4, IPv6, ICMP, OSPF, BGP, UDP, TCP,
FTP, HTTP, and HTTPS.
3.6 WlanTV
WLAN Traffic Visualizer9 (WlanTV for short) provides measurements of traffic load
and visualization of sequence of frames in IEEE 802.11 WLANs [4]. WlanTV is
published under the GPL terms. It is developed in Java and both source and JAR
distributions are available for download at the website.
WlanTV requires Wireshark10 (Windows) or TShark (Linux), and JRE 1.6 (Java
Runtime Environment) or later to build and run the program. Only standard Java
packages are needed. It uses TShark to parse the log files and to perform live captures.
9 http://sourceforge.net/projects/wlantv
10 http://www.wireshark.org
We installed WlanTV from the JAR distribution and used the command java -jar
wlantv.jar, which opens a GUI from which users can run the experiments.
Documentation is not very extensive. The distribution includes a very small
README file with a few instructions about system prerequisites and not many
instructions about installation and execution. However, an example of capture is
available for download at the website.
This tool does not use the client-server model. Only one computer is needed to
conduct the experiments. It sniffs the traffic of an IEEE 802.11 network and reports
the results of that capture. For the experiments, users can employ a file from a
previous capture, or they can start a new live capture. In the latter option, users must
stop the live capture in order to observe the results, which include the frame count,
byte count, capture duration, transmission rate during the capture, detailed infor-
mation for each packet (protocols, length, addresses, etc.), and bandwidth distribu-
tion. Results are shown in text mode and graphically.
WlanTV is available for Linux and Windows platforms. This tool reports statistics
for 802.11 protocols.
3.7 TTCP
To install TTCP, gcc12 is needed to compile the file ttcp.c and generate the
executable ttcp, which includes both the sender and the receiver. Documentation
is offered via man pages, and also by invoking the tool without options.
11 ftp://ftp.eenet.ee/pub/FreeBSD/distfiles/ttcp
12 http://gcc.gnu.org
Users must start the receiver and then start the transmitter, both via command-line.
The transmitting side sends a specified number of packets (option -n, 2048 by
default) to the receiving side. At the end of the test, the two sides display the
number of bytes transmitted and the time elapsed for the packets to travel from one
end to the other.
Results are shown on standard output. These results include the throughput and
an average of system calls. It is possible to change the units for the output, but they
have to be set at the beginning of the test.
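The throughput figure reported here boils down to the transferred byte count divided by the elapsed time; the small sketch below reproduces that arithmetic with a unit conversion in the spirit of the selectable output units. The unit names and the 8192-byte buffer size in the example are our assumptions; only the 2048-buffer default comes from the text.

```python
def throughput(bytes_transferred, elapsed_s, unit="Mbit"):
    """Bytes over elapsed time, expressed per second in a chosen unit.
    Unit names here are illustrative, not TTCP's own option values."""
    factors = {"bit": 1, "Kbit": 1e3, "Mbit": 1e6, "KB": 8 * 1024, "MB": 8 * 1024 ** 2}
    return (bytes_transferred * 8) / (elapsed_s * factors[unit])

# e.g. 2048 buffers of 8192 bytes transferred in 1.4 s (buffer size assumed)
print(throughput(2048 * 8192, 1.4))   # roughly 95.9 Mbit/s
```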
TTCP is only supported on Unix platforms. However, some ports for Windows are
available (e.g. PCATTCP). IPv4 is used as the network layer protocol. It allows the
usage of both TCP and UDP as the transport layer protocol.
4 Comparative Analysis
The main characteristics of the seven tools are summarized below, one entry per tool:

Netperf: supported platforms: Linux; network protocols: IPv4, IPv6; transport protocols: TCP, UDP; reported results: throughput; user interface: console; synchronization required: no.
D-ITG: supported platforms: Linux, Windows; network protocols: IPv4, IPv6, ICMPv4, ICMPv6; transport protocols: TCP, UDP; reported results: delay, jitter and throughput; user interface: console; synchronization required: no.
NetStress: supported platforms: Windows; network protocols: IPv4; transport protocols: TCP; reported results: throughput; user interface: GUI; synchronization required: no.
MGEN: supported platforms: Linux, Windows; network protocols: IPv4, IPv6; transport protocols: UDP; reported results: delay and throughput; user interface: console; synchronization required: no.
LANforge: supported platforms: Linux, Windows; network protocols: IPv4, IPv6, ICMPv4, ICMPv6; transport protocols: TCP, UDP; reported results: delay, jitter and throughput; user interface: GUI; synchronization required: no.
WlanTV: supported platforms: Linux, Windows; network protocols: IPv4, IPv6; transport protocols: TCP, UDP; reported results: throughput; user interface: GUI; synchronization required: not applicable.
TTCP: supported platforms: Linux; network protocols: IPv4; transport protocols: TCP, UDP; reported results: throughput; user interface: console; synchronization required: no.
Only WlanTV does not introduce additional overhead to the network by injecting
traffic; however, this also implies that this tool is not able to measure the throughput,
and it only reports the transmission rate of the packets captured during
the test.
We also noticed that WlanTV users must have root privileges for execution since
the tool requires the network interface card to be set in promiscuous mode. Also, we
noticed that most of the studied tools already have IPv6 support. Network throughput
is the most common reported result, followed by delay and jitter.
In relation to the user interface, only a few tools have a GUI (NetStress,
LANforge, and WlanTV). To ease the testing process, some tools (D-ITG and
MGEN) use an input file that contains the parameters and the streams description
for the tests, which is more flexible than the traditional command line arguments.
One last issue of great importance is clock synchronization, especially when
measuring delays. Manual clock synchronization is not recommended due to its
poor accuracy. One solution is to use NTP (Network Time Protocol), although this
requires additional knowledge and configuration.
parameters to customize the experiment (buffer size and number) as well as poor
protocol support.
For future work, we plan to design and implement our own network benchmark
tool that will incorporate the strengths of the evaluated tools but eliminate their
weaknesses, following the client-server model. Our tool will be compliant with
RFCs 1242 [13], 2544 [14] and 5180 [15], which suggest a set of recommendations
for developing benchmarks. One of our main interests is to offer support for
the IPv6 protocol. Also, it will be distributed under the GNU General Public License,
allowing users to modify the source code for any particular requirement.
Among the features we plan to include in our tool is the ability to measure the
packet loss rate, throughput and RTT for different protocols (e.g. TCP, UDP, ICMP,
and IP). Tests may also be parameterized, allowing users to define custom traffic
generation that follows different models (random variables), including CBR, burst
traffic, Poisson, Exponential and Pareto. Also, users should be able to specify
values for some header fields, such as the TTL and ToS in IPv4, and the Traffic
Class, Hop Limit and Flow Label in IPv6.
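To illustrate the traffic models we intend to support, the hedged sketch below generates inter-departure times for CBR, Poisson/Exponential and Pareto sources. It is a design sketch for the planned tool, not existing code, and the Pareto shape parameter is an arbitrary assumption.

```python
import random

def interdeparture_times(model, rate_pps, n, pareto_shape=1.5):
    """Generate n inter-departure times (seconds) for a mean rate of rate_pps
    packets per second, following the named traffic model."""
    mean = 1.0 / rate_pps
    if model == "CBR":
        return [mean] * n
    if model in ("Poisson", "Exponential"):
        # Poisson packet arrivals correspond to exponential inter-arrival times.
        return [random.expovariate(rate_pps) for _ in range(n)]
    if model == "Pareto":
        # random.paretovariate() has a minimum of 1; rescale so the mean matches.
        scale = mean * (pareto_shape - 1) / pareto_shape
        return [scale * random.paretovariate(pareto_shape) for _ in range(n)]
    raise ValueError(model)

print(sum(interdeparture_times("Poisson", 100, 1000)))   # roughly 10 seconds
```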
As shown in our analysis, most of the tools do not have specific support for WiFi.
For IEEE 802.11, only end-to-end performance evaluation is offered. Users have no
way to obtain performance results (bandwidth, delay, packet loss rate) between a
mobile station and an access point (or wireless router). So we also plan to adapt the
tool that we will develop for use in access points. Since the shipped firmware of
access points (or wireless routers) does not allow users to customize them, several
free Linux-based firmwares (DD-WRT13, OpenWrt14, HyperWRT15, Tomato16)
were developed by the Internet community. We evaluated them and we decided
to use DD-WRT mega version 2.24, for its flexibility and because we were able to
successfully install it on an ASUS WL-500W wireless router. A cross compiler suite
(also called toolchain) is a set of tools that includes a compiler capable of creating
executable code for a platform other than the one on which the compiler is run.
Cross compiler tools are used to generate executables for embedded systems or small
devices upon which it is not feasible to do the compiling process, because of their
limited resources (memory, disk space, computational power, etc.). From the source
code of different GNU packages (binutils, gcc, g++, glibc and uClibc), we created
the cross compiler suite that generates the appropriate executable code for our
ASUS WL-500W wireless router (MIPS ELF based wireless router). uClibc17 is a C
library for developing applications for embedded Linux systems. It is much smaller
than the GNU C Library (glibc), but nearly all applications supported by glibc also
work with uClibc. To test our cross compiler suite, we developed
several small applications for our wireless router and we observed that the code
13 http://www.dd-wrt.com
14 http://www.openwrt.org
15 http://sourceforge.net/projects/hyperwrt
16 http://www.polarcloud.com/tomato
17 http://www.uclibc.org
generated by the cross compiler is much smaller when uClibc is used rather than
when glibc is used. This result is important due to the limited capacity of flash
memory of the wireless router, so we will use uClibc when possible, and glibc when
using advanced library functions not supported by uClibc.
Acknowledgment We would like to thank Candela Technologies for the trial version of their tool
(LANforge), which allowed us to include it in our study.
References
Chapter 37
Hybrid Stock Investment Strategy Decision Support System

Abstract This chapter discusses the continuous effort to explore stock price and
trend prediction from finance perspective as well as from the integration of three
major research areas namely data mining, artificial intelligence and decision support.
These areas have been explored to design a hybrid stock price prediction model that
integrates relevant techniques into stock price analysis and prediction activities.
1 Introduction
Throughout the centuries and in various countries, many people have tried to predict
the movement of share prices and beat the market, but no one can really predict
accurately the movement of the share price of a particular company listed on a stock
exchange. No one knows the future, and such uncertainty makes the movement of the
share price unpredictable. The advances in computing have important implications for
finance and economics today. A new area of research known as financial engineering,
or computational finance, is a highly mathematical and cross-disciplinary field which
relies on finance, mathematics and programming, and it enables financial analysts
to analyze data efficiently. Much work can be done with mathematical models in a
spreadsheet program, but the more demanding work requires writing a computer
program. This chapter presents a study on designing a decision model for investment
strategy by utilizing financial methods and computational techniques.
The movement of the share price is unpredictable. Nevertheless, the practice of
estimating and predicting the movement of share prices by using statistical, mathematical
and forecasting methods on historical share price data is known as
Technical Analysis (TA). According to Rockefeller [1], TA has been around for
100 years, and before the existence of computers it was done only by professionals
who had access to real-time prices on the stock exchange.
There are tools and software created to automatically illustrate and generate
various graphs and perform the complex calculations used in TA. These tools
eliminate the burden financial analysts once faced when performing TA manually
with paper and pen. With these computing capabilities, investors now also perform TA
themselves with the help of appropriate software that supports their
trading and investment decisions.
Besides making the TA process more efficient and effective, the
IT field has also brought a totally new perspective into stock analysis
through research on the application of AI and DM concepts. For example,
studies have been conducted on combining Candlestick analysis (one form of TA)
with Genetic Algorithms in stock analysis [2]. In addition, Quah [3] stated that with
the significant advancement in the field of Artificial Neural Networks (ANN), the
generalization ability of the ANN makes it an effective and efficient tool for stock
selection.
Investors who invest in the stock market face higher risks than in other forms of
financial investment such as bonds, Treasury bills and fixed deposits. Stock prices
fluctuate almost every second, and most people see share price movement as
unpredictable, or in other terms a “random walk”. Some investors engage in this
activity mostly on the basis of speculation, aiming to obtain capital gains rather than
dividends as their investment return. Investing based on speculation has caused many
individual investors to get their ‘hands burnt’ and lose their initial investment.
Kamich [4] defined the stock market as a private or public market for the trading of
company stocks at an agreed price. One of the main objectives of a company listing
on the stock market is to raise capital to fund future expansion.
If investors are able to purchase the right share when the price is at its lowest
range and sell it at a better price later, they can earn a large amount of
money from the stock market. Of course, this is not easy to do, as no one is able to
predict the future, or the movement of the share price, accurately.
Attempts to analyze stock performance using financial theories can be
traced back to the late 1800s, when the oldest approach to common stock selection
and price prediction, TA, was created and used [5]. Edward and Magee [6] stated that
throughout the years of stock market study, two distinct schools of thought have
arisen, each with radically different methods of obtaining answers on what share to
buy and when to buy and sell it. These two schools of thought in
stock analysis are Fundamental Analysis (FA) and TA.
Jones [5] defined FA as a method of security valuation which involves analyzing
basic financial variables such as sales, profit margin, depreciation, tax rate, sources
of financing, asset utilization and other factors. Edward and Magee [6] noted that
FA relies heavily on statistics, and people performing FA will look
through the auditor’s reports, the profit-and-loss statement, balance sheet, dividend
records, and policies of the companies whose shares are under their observation.
They will also analyze business activities and daily news to estimate the company’s
future business condition. Investors who use FA will purchase stocks that are
viewed as underpriced by analysts, believing that in the future the share price
will increase to the level it should be as computed in FA.
The other theory, TA, is defined as the study of how securities prices behave
and how to exploit that information to make money while avoiding losses [1]. TA
focuses on the share price to assess and evaluate the demand and the supply for the
shares based on the market price itself, and does not listen to chatter about securities.
Jones [5] claimed that technical analysts believe that the market itself is its own best
source of data. It is believed that all the investors’ reactions towards all the
information regarding a security are already embedded in its share price.
According to Rockefeller [1], TA works because people constantly repeat
behavior under similar circumstances, and TA is a forecasting method that
uses past and current behavior (price) to predict future behavior (price). In line
with this behavior-repetition concept, Edward and Magee [6] noted that share prices
move in trends which tend to continue until something happens that changes
the supply and demand balance. In conclusion, TA charts and analyzes
historical share prices to reveal patterns, formations and investor behavior, and
interprets them to predict the possible future share price trend.
discover the non-linear relationship between financial variables and stock price
which cannot be done by the computerized rule-based expert system. In stock
prediction using ANN, the input will be historical data such as financial variables
and the output will be the performance of the stock.
TA is about finding patterns in historical data to predict future price
movements; therefore, researchers found DM to be a suitable technique to reveal the
hidden patterns in historical data and predict the price trend. Y.Y. Shi and
Z.K. Shi [9] conducted a study on a clustering technique (one of the DM techniques)
for stock prediction. In their study, they identified that clustering is one
of the efficient methods for stock investment analysis, and they also noticed a
weakness of this technique in stock prediction: the constructed clusters might not
completely reflect the characteristics of the stock. In the study, the idea of rule
mining of the stock market using statistical pattern recognition is proposed in order
to overcome this weakness of clustering techniques. From the techno-index in TA, Shi
et al. [9] developed a fuzzy kern clustering algorithm and tested the idea on
Shanghai and Shenzhen Stock Market historical data since 1997. The result of the
study indicates that statistical rules in the essential trends of the stock market and
implied rules can be recognized to a certain degree through the construction of a
proper clustering algorithm [9].
The authors’ proposed model for stock price prediction is based on the Decision
Support System (DSS) model by Turban et al. [10]. AI and DM are included in one
of its subsystems, the knowledge-based subsystem.
The authors have designed a DSS model and architecture, with some minor modifications,
for stock investment strategy. This model adapts AI and DM
elements into the system and utilizes TA and FA methods. The flow of the system
is shown in Fig. 1.
Based on the architecture shown in Fig. 2, the flow of the system activities can
be divided into two parts that reflect the two main processes of the system:
analyzing the raw financial data from the TA perspective (TA process) and analyzing
the financial data from the FA perspective (AI process), with both processes having
the same aim of predicting stock prices and trends. The first flow of activities starts
from the database, which provides raw financial data to the DM component in the
knowledge-based subsystem; the output from the DM component,
together with the raw financial data from the database, is sent to the TA component
in the model management subsystem for further processing, and the output of the TA
component is the input for the Recommendation Component. The second flow
of activities goes from the raw financial data to the AI component in the knowledge-based
subsystem; finally, the output from the AI component is sent to the Recommendation
Component in the model management subsystem.
The TA component focuses on illustrating historical stock price data in the form of
graphs as well as other TA indicators to interpret investor and market sentiment
in an effort to predict the stock price movement and trend. Among the most
popular charting techniques used in TA are the Line Graph, the Open-High-
Low-Close chart and the Japanese Candlestick chart. The mechanism proposed here is
a system that is able to identify and recognize patterns directly from the raw
financial data using rules generated from the underlying reasoning and interpretation
of various TA techniques. To achieve this, the TA component of the system
uses a rule-based TA model that is constructed from the underlying rules of various
TA charting techniques.
5.1 DM Component
Rockefeller [1] highlighted that the same TA method will not work in the same
stock analysis all the time. A TA method proven to work on a
particular stock does not necessarily work on other stocks. Therefore, instead of
identifying and interpreting patterns in the raw financial data based on the rules
generated from one TA method only, the system identifies patterns based on the
underlying rules of various TA methods. This reduces the dependency on a single
TA method in analyzing a particular stock and provides better reasoning and analysis
capability in estimating the future trend of the stock, since the same pattern is
compared using different rules from different techniques, which gives higher
prediction accuracy. The system uses association rule mining to determine the
accuracy and effectiveness of TA methods in predicting the price movement and
trend for a particular stock. Therefore, the flow of activities and information in
analyzing the stock from the TA perspective starts from the database and ends at the
recommendation component, as illustrated in Fig. 3.
The first phase of the TA process is the mining of the raw financial data using
association rule mining. With association rule mining, all the underlying rules
that reflect the interpretation of TA in stock price and trend prediction are
mined by calculating their respective support and confidence values for a particular
stock using its historical prices. In this way, the system is able to identify which
association rules actually work on that particular stock, based on the historical data
stored in the database and the resulting confidence and support. Those confidence
and support values, together with the raw financial data, are the input for the
TA component.
[Fig. 3: flow from the Rule-based TA Model inside the TA Component to the Recommendation Component.]
In the rule-based TA model, the recent historical stock price movement and its
related information are analyzed by all association rules to identify patterns. If the
recent historical price information triggers more than one association rule, the
outputs from the association rules are taken into consideration based on weights
determined from the support and confidence levels. If a triggered association rule
has a confidence level above 0.50 and it is the highest among the other triggered
rules, that rule receives the highest weight. On the other hand, if an association rule
has a confidence level below 0.50, it means that although the rule has been triggered
in the past, the actual result differed from the expected result; the rule has predicted
wrongly more than 50% of the time based on the historical data. This indicates that
the association rule does not work well with the stock, and hence it receives a lower
weight. Besides that, if a triggered association rule has a support value of 0, the rule
has never been triggered in the past, and the system assigns it a default confidence
value of 0.5, since a rule that has never been triggered is assumed to have a 50%
chance of predicting correctly. If all the triggered association rules have confidence
levels below 0.50, the system notifies the user that the prediction has a higher risk of
being incorrect, but still provides the stock trend prediction. If the trends predicted by
the triggered association rules are a mixture of up and down, the system computes the
likelihood of the expected trend based on the trend with the highest accumulated
weight; this is done by the Inference Engine in the TA Component.
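A minimal sketch of how this weighting could be realized is shown below. The rule representation and the choice of using the (adjusted) confidence directly as the weight are our assumptions, since the chapter does not fix an exact formula.

```python
def weight_rules(triggered_rules):
    """triggered_rules: list of dicts with 'trend' ('up'/'down'), 'support' and
    'confidence' as mined from historical data.  Returns the trend with the
    highest accumulated weight and a flag for the high-risk case."""
    totals = {}
    for rule in triggered_rules:
        conf = rule["confidence"]
        if rule["support"] == 0:      # never triggered before: assume a 50% chance
            conf = 0.5
        totals[rule["trend"]] = totals.get(rule["trend"], 0.0) + conf
    best = max(totals, key=totals.get)
    risky = all((r["confidence"] if r["support"] else 0.5) < 0.5
                for r in triggered_rules)
    return best, totals, risky

trend, weights, high_risk = weight_rules([
    {"trend": "up",   "support": 0.12, "confidence": 0.72},
    {"trend": "down", "support": 0.05, "confidence": 0.41},
])
print(trend, weights, high_risk)
```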
5.2 TA Component
[Fig. 4: internal structure of the TA Component; its inputs are the Raw Financial Data and the output from the Data Mining Component, and its sub-components feed a Back-test Component and an Inference Engine.]
The first process takes place in the Stock Trend Evaluation Component, where the
latest two weeks of historical data are run through the underlying rules derived
from the Open-High-Low-Close chart and the Japanese Candlestick chart. This process
detects patterns, gauges investor sentiment on the stock and discovers the
current trend of a particular stock. This component has the most association rules
compared to the other components. The output of this process is the near-future trend
of the particular stock, which can be uptrend, downtrend, indecision or
no trend. An example of an association rule from this component is illustrated in Fig. 5.
This component evaluates whether the stock is overvalued or undervalued
using the Williams %R indicator and the Relative Strength Index. The Williams %R
indicator was developed to assess whether a particular stock is
overbought or oversold. There are two outputs from the two indicators, and the
results may contradict each other; this contradiction is sorted out in the back-test
component. The output of this process is the evaluation of whether the stock is
overvalued or undervalued.
Fig. 5 example. Rule 1: IF the "high" is higher than the previous day’s "high" for 3 consecutive days AND
the "low" is higher than the previous day’s "low" for 3 consecutive days AND
the "close" is higher than the previous day’s "close" for 3 consecutive days THEN
the stock is in an uptrend.
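Read literally, Rule 1 amounts to checking three consecutive higher highs, higher lows and higher closes; the sketch below is one possible encoding of that rule (the bar representation and field names are ours).

```python
def rule1_uptrend(bars):
    """bars: chronological list of dicts with 'high', 'low', 'close'.
    True when each of the last three bars made a higher high, higher low
    and higher close than the bar before it."""
    if len(bars) < 4:
        return False
    last4 = bars[-4:]
    return all(
        last4[i]["high"]  > last4[i - 1]["high"]  and
        last4[i]["low"]   > last4[i - 1]["low"]   and
        last4[i]["close"] > last4[i - 1]["close"]
        for i in range(1, 4)
    )
```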
Another component is used to identify any buying or selling signal generated, based
on TA, for that particular stock. Six TA methods are used to trace the
trading signals: Weighted Multiple Moving Average, Momentum, Engulfing,
Doji, Long Shadow and Gap. Unlike in the ‘Stock Price Evaluation Component’,
the association rules in the ‘Stock Trading Signal Component’ are not triggered
all the time. There is a high tendency that none of the association rules in this
component is triggered most of the time, because trend reversals themselves do
not happen all the time, and this component focuses on tracing trend reversals
that lead to the generation of trading signals. The outputs of this component are
the trading signal and the forecasted stock trend. A trading signal is not generated
every day for a particular stock, and the same goes for the forecasted stock trend.
Some of the association rules from the various indicators used in this component are
illustrated in Fig. 7.
The fourth component in the TA component is used to evaluate the risk and
noise levels of a particular stock. Volatility is a measure of price variation which is
directly linked to the risk level and the noise level. High volatility means trading
is riskier but has more profit potential, while low volatility means less immediate
risk. Therefore, this component evaluates the stock’s volatility level and
provides a warning if the volatility is high, because high volatility makes
the system’s forecasts and predictions less accurate due to the high noise level in the
stock movement.
There are two indicators embedded in this component: the Average True
Range and the Average Volume Indicator. The Average True Range is a useful measure
of volatility. An increase in the Average True Range signifies that highs and lows
are getting farther apart, which means volatility is rising and risk has increased. The
Average Volume Indicator is basically an indicator that shows the average of the
stock’s volume over a period. Volume is an important confirming indicator for any
forecast, especially trading signals, where any trading signal generated without an
accompanying significant increase in volume is normally a false trading signal.
The outputs from this component are the percentage change in today’s
Average True Range compared to the previous day and the
percentage change in volume compared to the 10-day average volume.
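A hedged sketch of the two outputs follows. The true-range formula used is the standard textbook definition, which the chapter does not spell out, and the 14-day ATR period is our assumption; only the 10-day average volume comes from the text.

```python
def true_range(high, low, prev_close):
    # Standard true-range definition (assumed; not given in the chapter).
    return max(high - low, abs(high - prev_close), abs(low - prev_close))

def volatility_outputs(bars, atr_period=14):
    """bars: chronological list of dicts with 'high', 'low', 'close', 'volume'.
    Needs at least atr_period + 2 bars (and 12 for the volume average).
    Returns (ATR change vs. previous day, volume change vs. 10-day average), in %."""
    trs = [true_range(b["high"], b["low"], p["close"])
           for p, b in zip(bars, bars[1:])]
    atr_today = sum(trs[-atr_period:]) / atr_period
    atr_prev  = sum(trs[-atr_period - 1:-1]) / atr_period
    avg_vol_10 = sum(b["volume"] for b in bars[-11:-1]) / 10
    return (100 * (atr_today - atr_prev) / atr_prev,
            100 * (bars[-1]["volume"] - avg_vol_10) / avg_vol_10)
```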
The back-test component is one of the main parts of the TA component. The outputs from
the various components explained earlier might contain contradictory forecasts, even
within a particular component itself. The back-test component
weights each output from all of the TA indicators based on the past performance of
those indicators for that particular stock, where the indicators that performed
best in the past are given the highest weight. This component receives an
important input from the DM component: the support and confidence values
of each of the association rules used in the three components explained earlier,
computed from the historical stock prices. All the outputs from the Stock Trend
Evaluation Component, Stock Price Evaluation Component and Stock Trading Signal
Component, together with their respective weights, are sent to the Inference Engine.
The Inference Engine produces the final decision from the various forecasts
and predictions generated by the other components. The Inference Engine works
based on all the forecasts and their respective weights. If contradictory
forecasts occur, the forecasts with the higher weight are chosen as the
decision. All of the outputs from the four other components are included because
the indicators in all the components are complementary to each other.
Verification of the forecasts made by the indicators is very important to reduce
unprofitable trades resulting from false trading signals generated by a particular indicator. A
final recommendation is generated by the Inference Engine that includes the Trading
Signal (a trading signal is not generated every day), the Forecasted Stock Trend, the Stock
Price Evaluation and the risk level. Provided together with these is the possibility level for the
generated trading signal, forecasted stock trend and stock price evaluation, which is
derived from the confidence level of each indicator.
5.2.7 AI Component
The outputs from both the TA component and the AI component are the inputs for the
Recommendation component. The Forecasted Stock Trend, Stock Price Evaluation, risk level
and Trading Signal are the inputs obtained from the TA component, while the 4-month
target stock price is the input obtained from the AI component. The main reason
that the output from the AI component does not undergo the back-test component, as in the TA
component, is that the ANN itself has been trained on historical financial
data before producing the 4-month target stock price prediction. The task of this component
is to assess whether the 4-month target stock price is achievable or not, based on
the predicted near-future stock trend, the evaluation of the current stock price, and the
trading signals. Hence, it actually integrates both the long-term and the short-term forecasts
for the stock and presents them to the user. The final outputs from this component are:
the predicted near-future stock trend, to let users know the current and possible
trend of the stock, together with the historical accuracy of that prediction; the stock
price evaluation, indicating whether the stock price is reasonable, undervalued or overvalued,
with its historical accuracy; the risk level, to alert users to the possibility that the
system’s prediction is wrong due to a high level of volatility; the trading signal (buy or
sell), if one is generated for that stock, with its historical
accuracy; and finally the 4-month target stock price.
The recommendation is provided together with all the outputs given by
the various TA indicators and their historical accuracy, presenting fine-grained
information to users for use in their trading decision making, in keeping
with the role of the system as a Decision Support System. The system does not
dictate, but recommends; any final decision is based on the users’ own judgment and
evaluation.
6 Conclusion
A study of FA and TA has been carried out in order to understand how stock price
prediction works in the finance world, as well as to create the TA model and an ANN
based on FA. A further study of DSS models and architectures was done to ensure
that the Stock Investment Strategy Decision Support System follows the basic
framework and model of a DSS. The combination of AI and DM is applied in
this hybrid model, particularly in the knowledge-based subsystem, to generate better
analysis of the historical data. The outputs of the system are the analysis results
and a recommendation to the user. The analysis is generated by the TA component
and the AI component and presented to users to aid them in their
trading decision making through this fine-grained information.
However, it only presents the information and outputs derived from the various
TA and FA techniques used in the system. The recommendation component
judges, evaluates and provides a trading recommendation based on the same outputs that
are presented to the user in the analysis.
References
7. A. Abraham, B. Nath, P.K. Mahanti, Hybrid Intelligent Systems for Stock Market Analysis
(Springer-Verlag, Berlin, Heidelberg, 2001)
8. J. Kamruzzaman, R. Begg, R. Sarker, Artificial Neural Networks in Finance and
Manufacturing (Idea Group Publishing, Hershey, PA, 2006)
9. Y. Shi, Z. Shi, Clustering Based Stocks Recognition (ACM Digital Library, 2006)
10. E. Turban, J.E. Aronson, T.P. Liang, R. Sharda, Decision Support Systems and Intelligent
Systems (Pearson Education, Upper Saddle River, NJ, 2007)
Chapter 38
Towards Performance Analysis of Ad hoc
Multimedia Network
1 Introduction
During the last years, the evolutionary development of in-car electronic systems has
led to a significant increase in the number of connecting cables within a car. To
reduce the amount of cabling and to simplify the networking of dedicated devices,
appropriate wired bus systems are currently being considered. These systems involve
high costs and effort regarding the installation of cables and accessory
components. Thus, wireless systems are a flexible and very attractive alternative to
wired connections. However, the realization of a fully wireless car bus system is
still far away, although cost-effective wireless subsystems which could extend or
partly replace wired bus systems are already conceivable today. A promising
technology in this context is specified by the latest Bluetooth standard [1].
In wireless networks, Bluetooth is one of the communication technologies
emerging as the de facto standard for “last-meter” connectivity to the traditional
fixed network infrastructure [2]. Bluetooth-enabled portable devices can interconnect
to form a particular incarnation of Personal Area Networks (PAN) called piconet,
which consists of one master and up to seven slaves. The master device has direct
K. Sharma (*)
Institute of Computer Science and Information Technology, Devi Ahilya University, Khandwa
Road, Indore, India
e-mail: [email protected]
visibility of all slaves in its piconet and can handle three types of communication
services, which in Bluetooth terminology are called logical transports: unicast packet-
oriented Asynchronous Connection-oriented Logical transports (ACL), broadcast
packet-oriented Active Slave Broadcast (ASB), and circuit-oriented Synchronous
Connection Oriented (SCO). Bluetooth logical transports have very different quality
characteristics, e.g., in terms of reliability and maximum throughput. It is also vital to
predict actual performance of Bluetooth systems so that it can be applied to the design
of a system.
The use of Bluetooth has been restricted to (safety) uncritical applications in
and around the car, e.g. communication and infotainment applications or applica-
tions regarding the exchange of pure car-specific information (e.g. control and
status data of the body electronics). Inside a car, Bluetooth allows the wireless
connection of a variety of car-embedded electronic devices such as control panels,
small displays or headsets with other electronic in-car systems. But beyond repla-
cing wires between in-car devices, the scope of the application of Bluetooth in
automotive environments is to ensure wireless interoperability between mobile
devices (e.g. PDA, notebook, mobile phone) and car-embedded devices (e.g.
navigation system, car radio, car-embedded phone). The main principle behind this is
to allow devices to co-operate and share resources, and to control mobile devices
by using the car-embedded user interface, or the other way round, to access car-
embedded systems by mobile devices. In this way, the functionality of an in-car
electronics environment can be greatly extended, and beyond that, the personaliza-
tion of the car and its services can be realized.
In addition, current implementations of the Bluetooth software stack do not
allow applications to exploit the limited performance functions included in the
specification in a portable way. The result is that the development of Bluetooth
operations in multimedia ad hoc applications currently depends on specific imple-
mentation details of the target Bluetooth hardware/software platform. This
considerably complicates service design and implementation, limits the portability of
developed applications, and calls for adequate modeling of the performance
parameters corresponding to potential ad hoc applications and services.
This chapter discusses the model and analysis for common ad hoc network
performance attributes for pervasive in-vehicle multimedia services for packet
based data communication.
[Fig. 1: in-car piconet consisting of a central unit and several rear-seat units.]
system. The piconet shall enable sharing of iPhone contents or access to contents
from the Internet, if an appropriate application framework is available at the infotainment
system.
The network contains a pure multimedia part, represented by Multimedia Bus,
and a control and external communication part, represented by Networking for
Automotive. Multimedia Bus represents the wired multimedia resources. External
multimedia resources can form an ad hoc network via the communication interfaces
of the vehicle network.
Bluetooth technology is based on a master-slave concept where the master
device controls data transmissions through a polling procedure. The master is
defined as the device that initiates the connection. A collection of slave devices
associated with a single master device is referred to as a piconet.
The master dictates packet transmissions within a piconet according to a time-
slot process. The channel is divided into time slots that are numbered according to
an internal clock running on the master. A time division duplex (TDD) scheme is
used where the master and slaves alternately transmit packets: even-numbered
time slots are reserved for master-to-slave transmissions, while odd-numbered
time slots are reserved for slave-to-master transmissions.
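The slot assignment can be stated in one line of code; the helper below simply mirrors the even/odd rule described above and is included only as an illustration.

```python
def slot_owner(slot_number):
    """Bluetooth TDD scheme: even-numbered slots carry master-to-slave packets,
    odd-numbered slots carry the polled slave's reply."""
    return "master->slave" if slot_number % 2 == 0 else "slave->master"

print([slot_owner(n) for n in range(4)])
# ['master->slave', 'slave->master', 'master->slave', 'slave->master']
```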
Figure 2 provides an overview of the multimedia PAN inside the car with the
role of different nodes. The piconet here consists of a master, the infotainment
device, and two slaves, the smart phone (iPhone) and rear-seat entertainment
device. As soon as the infotainment system is up and running, its Bluetooth module
is ready to pair with other available devices. Pairing with the iPhone or with the rear-seat
entertainment unit, or with both, establishes the network.
[Fig. 2: piconet roles inside the car, with the Infotainment System as master and the iPhone and rear-seat entertainment device as slaves 1 and 2. A further figure shows the layered service architecture: an Application Layer (media stream content, synchronization information, video/MP3, service interface) on top of a Transport Layer (node discovery, application transport services, media stream packets, broadcast/multicast communication, paired connections, PDU size).]
The ad hoc multimedia network consisting of the Infotainment system and the slave
devices shall provide the following services:
• Access to audio contents from the iPhone at the Infotainment system, so that they
can be played and heard on the vehicle’s sound system
• Access to video contents from the iPhone at the rear-seat entertainment unit via the
piconet master
• Access to the Internet from the rear-seat unit using the iPhone via the piconet master
• Applications based on information contents received by the iPhone to help safe
driving, such as weather information or traffic information
2 Performance Modelling
The main parameters that can influence the performance of the network described
in the previous section are, basically:
• The number of application channels
• The offered traffic per application
• The application profile, defined by the objective parameters (thresholds) and
weights
The performance indicators that should be considered for this network are the
packet delay, the packet error rate, the latency, and the throughput. Packet
transmission and reception are synchronized by the system clock (625 µs slot
interval). This chapter focuses on throughput and delay performance in this point-
to-multipoint communication environment. We try to simplify and abstract the
system as much as possible while preserving the essence of the network which
affects the actual performance of the system.
The model considered here is that of a wireless ad hoc network with nodes assumed
either fixed or mobile. The network consists of a normalized unit area torus contain-
ing n nodes [9]. For the case of fixed nodes, the position of node i is given by Xi. A
node i is capable of transmitting at a given transmission rate of W bits/s to a node j if,
for every other simultaneously transmitting node k,

|X_k - X_j| \ge (1 + \Delta) |X_i - X_j|,

where \Delta > 0, so that node X_k will not impede the communication between X_i and X_j.
This is called the protocol model [9].
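As a small illustration, the following sketch evaluates the protocol-model condition for a transmitter, a receiver and a set of simultaneously transmitting nodes; the 2-D positions and the example values are our own.

```python
import math

def can_transmit(xi, xj, others, delta):
    """Protocol-model check: node i at xi may transmit to node j at xj if every
    other simultaneously transmitting node k at xk satisfies
    |xk - xj| >= (1 + delta) * |xi - xj|.  Positions are 2-D tuples."""
    d_ij = math.dist(xi, xj)
    return all(math.dist(xk, xj) >= (1 + delta) * d_ij for xk in others)

# An interferer 0.5 away from the receiver blocks a link of length 1 when delta = 0.4.
print(can_transmit((0, 0), (1, 0), [(1.5, 0)], delta=0.4))   # False
```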
For the case of mobile nodes, the position of node i at any time is now a function
of time. A successful transmission between nodes i and j is again governed by the above
equation, where the positions of the nodes are time dependent. Time is slotted to
simplify the analysis. Also, at each time step, a scheduler decides which nodes are
sources, relays, or destinations, in such a manner that the association pair (source-
destination) does not change with time.
The multimedia network presented in the previous section resembles a multihop
wireless ad hoc network and can therefore be modeled as a queuing network, as
shown in Fig. 4a. The stations of the queuing network correspond to the nodes of the
ad hoc network.
[Fig. 4a: queuing-network model of the ad hoc network; packets are generated at rate λ at the nodes, routed between stations i and j with probabilities p_ij(n), and absorbed at their destination nodes.]
In order to model the delay, we consider one to five ad hoc multimedia applications
with ACL connections in the piconet. In this scenario, packet types are adaptively
selected among DM1, DM3 and DM5 according to the amount of data waiting for
transmission in the buffers. We consider only DM packets, assuming that data packets
are vulnerable to bit errors due to the error-prone wireless channel characteristics. To
ensure reliable data transmission, FEC (Forward Error Correction) and ARQ
(Automatic Repeat Request) [1] are employed. We simulated a simple wireless
channel model, where the BER (Bit Error Rate) is constant within a simulation.
The evaluation of the probability of the queuing delay exceeding a threshold
D_th, the delay threshold, starts from the expression of the delay

D = \sum_{i=0}^{P_q} \sum_{j=0}^{N_{\mathrm{active}}(i)-1} L_{ij}

where P_q denotes the number of packets in the queue of the desired application,
N_{\mathrm{active}}(i) is the number of active applications during the transmission of packet
i, and L_{ij} is the length of the packet transmitted by application j during the i-th round.
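Computationally, the delay expression is a plain double sum over the queued packets; the sketch below evaluates it for given packet transmission times. The example values, loosely based on 1-, 3- and 5-slot DM packet durations, are our assumptions.

```python
def queuing_delay(packet_lengths):
    """packet_lengths[i][j]: transmission time of the packet sent by application j
    during round i, for the P_q packets queued ahead of the desired application.
    Evaluates D = sum_i sum_j L_ij from the text."""
    return sum(l_ij for round_i in packet_lengths for l_ij in round_i)

# Two queued rounds with 3 and 2 active applications respectively (times in seconds).
print(queuing_delay([[0.625e-3, 1.875e-3, 0.625e-3], [3.125e-3, 0.625e-3]]))
```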
The limiting effect on the delay, for very high values, is determined by the
control imposed on the incoming traffic when the buffer occupancy gets close to
its limits.
Taking into account that the main purpose of this ad hoc network is to provide
access to multimedia contents, the throughput model has to reflect the character-
istics of the network traffic. Throughput performance is evaluated when there are
one or more slave device applications, for which an ACL or SCO connection is
established.
It is assumed that packets are always transmitted and received in the cor-
responding slots. In other words, the master device polls a slave device with an
\rho = \frac{N_{\mathrm{active}} \, \lambda}{\mu}
3 Performance Evaluation
The performance of the network has been evaluated with the applications on the piconet
nodes transferring audio or data packets. We present here the analysis based on
data packets.
The number of applications in the modelled network is relatively small and they are
assumed to transfer data traffic between the nodes in the piconet. For the data traffic,
this combination is expected to result in bursty traffic streams in the network. The
voice traffic was simulated using a measured trace of a PCM audio stream, but is not
considered here.
Transmitted packets may be lost due to bit errors; this is modeled with a constant
loss probability. All lost ACL packets are resent according to the Bluetooth ARQ
scheme. In all the simulations presented herein, the packet loss probability was set
to 10^{-3}.
The average packet delay, the time between the arrival of a new (data) packet into a buffer for
transmission and the reception of the packet at a receiver, is measured for each traffic
load as the number of ACL connections changes.
Figure 5 presents the packet latency as a function of the offered traffic per
application for the network, where the number of applications is limited to 50,
the objective thresholds are the packet error threshold ERR_th = 10^{-3} and the delay
threshold D_th = 10 ms, and the average duration of the vertical handover procedure
is assumed to be 100 ms.
The weights reflect the relative importance of each objective, according to the
type of service. The objective values are used as thresholds to determine a performance
dissatisfaction value U on the basis of the weights assigned to the profile
parameters.
For low traffic and a profile where the packet error rate (EER) has higher priority, the delay
can even exceed that of the abstract Bluetooth system; otherwise, the delay lies
between the performance of the ad hoc network and that of the abstract Bluetooth network.
[Fig. 5: packet delay D (s), on a logarithmic scale, versus offered packet traffic (bit/s), comparing the abstract Bluetooth system with profiles (U_th = 0.2, W_D = 0.3, W_EER = 0.3), (U_th = 0.5, W_D = 0.3, W_EER = 0.3) and (U_th = 0.3, W_D = 0.5, W_EER = 0.5); EER_th = 10^{-3}, D_th = 10 ms.]
[Figure: throughput (bit/s) versus offered packet traffic (bit/s) for profiles (U_th = 0.2, W_D = 0.3, W_PER = 0.3), (U_th = 0.5, W_D = 0.3, W_PER = 0.3) and (U_th = 0.3, W_D = 0.5, W_PER = 0.5); EER_th = 10^{-3}, D_th = 10 ms.]
4 Summary
This chapter presented the network model, delay model and throughput model for
an ad hoc in-car multimedia network. We have modelled and evaluated the performance
of a Bluetooth multimedia piconet supporting point-to-multipoint
communications. Our performance models provide performance metrics when
ACL slave applications are supported. An approximation for the packet error
probabilities has been used, and the influence of the main parameters on the network
performance has been studied. In order to guarantee low delay requirements, the
number of applications running simultaneously may need to be limited.
These results can serve as valuable inputs to the optimal design of Bluetooth
based in-car multimedia applications or interfaces with special throughput and
delay requirements.
References
Chapter 39
Towards the Performance Optimization of Public-key Algorithms

1 Introduction
Data security and cryptographic techniques are essential for safety relevant appli-
cations. Cryptography keeps the message communication secure so that eavesdrop-
pers cannot decipher the transmitted message. It provides the various security
services like confidentiality, integrity, authentication, and non-repudiation. There
are several cryptographic algorithms available for both symmetric and public-key
cryptosystems [1]. Slow-running cryptographic algorithms cause customer dissatisfaction
and inconvenience. On the other hand, fast-running encryption algorithms
G. Ganapathy (*)
Professor and Head, Department of Computer Science, University of Bharthidasan, Trichy,
620 024, India
e-mail: [email protected]
lead to high speed. To achieve this, fast exponentiation methods and high-speed
custom hardware devices are needed.
Public-key cryptosystems like RSA and ElGamal often involve raising large
elements of some group or field to large powers. The performance and practicality
of such cryptosystems is primarily determined by the implementation efficiency of
the modular exponentiation. Fast exponentiation is becoming increasingly important
with the widening use of encryption. In order to improve the time requirements
of the encryption process, minimizing the number of modular multiplications is
essential. For example, computing C = x^e mod n with the paper-and-pencil
method requires (e - 1) modular multiplications of x, i.e. x, x^2, x^3, ..., x^{e-1}, x^e.
Several authors have proposed fast modular exponentiation methods such as left-to-right,
right-to-left, multi-exponentiation by interleaved exponentiation, sliding window
and window NAF exponentiation [2]. In this chapter, in order to reduce a larger
exponent into smaller ones, an alternative approach based on sums of squares is
considered, because an integer can be represented in several forms as a sum of squares;
then, an addition chain is used to reduce the number of multiplications.
ECC was proposed in 1985 by Neal Koblitz and Victor Miller. It is an alternative
to the established cryptosystems like RSA, ElGamal, and Rabin etc. It guarantees all
the security services with shorter keys. The use of shorter keys implies less
space for key storage, lower arithmetic cost, and time savings when keys are transmitted.
These characteristics make ECC the best choice to provide security in wireless
networks. The acceptance of ECC has increased, as evidenced by its inclusion in standards by accredited
standards organizations such as the American National Standards Institute (ANSI),
the Institute of Electrical and Electronics Engineers (IEEE), the International Standards
Organization (ISO), and the National Institute of Standards and Technology (NIST).
The security of ECC is based on the discrete logarithm problem over the points on an
elliptic curve, which is a more difficult problem than the prime factorization problem of
the RSA algorithm [3]. These two problems are closely related to the key length of
cryptosystems. If the security problem is more difficult, then smaller key length can
be used with sufficient security. This smaller key length makes ECC suitable for
practical applications such as embedded systems and wireless applications [4].
A vast amount of research has been conducted on secure and efficient imple-
mentation since then. Most of it focuses on scalar point multiplication, that is, kP, where k is a
scalar and P is a point on the elliptic curve. Efficient hardware and software implemen-
tation of scalar point multiplication has been the main research topic in ECC in recent
years. To encrypt and decrypt messages using any public-key cryptosystem
such as ECC, modular arithmetic plays a vital role and is the key operation in modern
cryptology. Even though many algorithms have been developed for faster imple-
mentation, none of this work exploits fuzzy modular arithmetic to speed up
the encryption and decryption operations of ECC.
Fuzzy modular arithmetic was proposed by Wael Adi. The key idea of
fuzzy modular arithmetic is not to compute the result exactly as in traditional
modular arithmetic, which uses a division operation for reduction modulo m.
Instead of the full reduction, a pseudo-randomized, fuzzy partial reduction is performed.
In this section, we present the representation of some integers based on the sum of
two, three, and four squares; the concept of the addition chain, which is used to find the
minimal number of multiplications required for an exponent; the mathematical
preliminaries of elliptic curves; the addition of points on an elliptic curve; and Fermat's
theorem to find the modular inverse of an integer.
In this section, we explore integers that may (or may not) be expressed as a sum of
two, three and four squares [5–7].
Theorems 1 and 2 and Corollaries 1 and 2 are useful for representing an integer in
the form of a sum of two squares.
• Theorem 1 (Fermat)
An odd prime p is expressible as a sum of two squares if and only if p ≡ 1 (mod 4).
• Corollary 1
Any prime p of the form 4k + 1 can be represented uniquely (aside from the
order of the summands) as a sum of two squares.
• Theorem 2
Let the positive integer n be written as n = N²m, where m is square-free. Then
n can be represented as the sum of two squares if and only if m contains no prime
factor of the form 4k + 3.
• Corollary 2
A positive integer n is representable as a sum of two squares if and only if each
of its prime factors of the form 4k + 3 occurs to an even power.
• Theorem 3
Any positive integer n can be represented as the difference of two squares if and
only if n is not of the form 4k + 2.
• Example
13 = 2² + 3² (we do not consider 13 = 2² + 3² as distinct from 13 = 3² + 2²).
In this case, the total number of pairs (x, y) of integers such that x² + y² = n is 8: for
n = 13, we get the pairs (±2, ±3) and (±3, ±2). This result is due to Fermat.
• Theorem 4
The Diophantine equation x1² + x2² + x3² = n, where n is a positive integer, has
integral solutions if and only if n is not of the form 4^k(8m + 7).
• Theorem 5
No positive integer of the form 4^n(8m + 7) can be represented as the sum of
three squares.
• Example
The integers 14, 33, and 67 can be represented as sums of three squares:
14 = 3² + 2² + 1²
33 = 5² + 2² + 2²
67 = 7² + 3² + 3²
• Theorem 6 (Lagrange)
Any positive integer n can be written as the sum of four squares, some of which may
be zero.
• Theorem 7
Any prime p of the form 4m + 3 can be expressed as the sum of four squares.
• Lemma 1 (Euler)
If the integers m and n are each the sum of four squares, then mn is likewise
so representable.
• Example
To represent the integer 459 as a sum of four squares, we have (using Euler's identity)
459 = 3² · 3 · 17
= 3²(1² + 1² + 1² + 0²)(4² + 1² + 0² + 0²)
= 3²[(4 + 1 + 0 + 0)² + (1 − 4 + 0 − 0)² + (0 − 0 − 4 + 0)² + (0 + 0 − 1 − 0)²]
= 3²[5² + 3² + 4² + 1²]
459 = 15² + 9² + 12² + 3²
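Lemma 1 is constructive: Euler's four-square identity combines representations of two factors into a representation of their product, exactly as in the 459 example. The following short Python sketch (purely illustrative; the function name is ours, not part of the original work) reproduces that computation.

def euler_four_square(a, b):
    # Euler's identity: combine four-square representations of two integers
    # (given as 4-tuples) into a four-square representation of their product.
    a1, a2, a3, a4 = a
    b1, b2, b3, b4 = b
    return (a1*b1 + a2*b2 + a3*b3 + a4*b4,
            a1*b2 - a2*b1 + a3*b4 - a4*b3,
            a1*b3 - a2*b4 - a3*b1 + a4*b2,
            a1*b4 + a2*b3 - a3*b2 - a4*b1)

# Reproduce the 459 example: 459 = 3^2 * (3 * 17)
c = euler_four_square((1, 1, 1, 0), (4, 1, 0, 0))   # four-square form of 51 = 3 * 17
rep = tuple(3 * v for v in c)                       # absorb the remaining factor 3^2
print(rep, sum(v * v for v in rep))                 # (15, -9, -12, -3) 459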
• If n ≡ 3 (mod 4), then n cannot be written as a sum of two squares. More generally,
if n contains a prime factor p ≡ 3 (mod 4) raised to an odd power, then n cannot
be written as a sum of two squares.
• Example
78 = 2 · 3 · 13 is such a number.
For the general case, when n is an arbitrary positive integer, a formula for the
number of pairs (x, y) of integers such that x² + y² = n was first found by Jacobi. It
is expressed in terms of the number of divisors of n, thus: let the number of divisors
of n of the type 1 (mod 4) be d1(n), and let the number of divisors of n of the type 3
(mod 4) be d3(n). Then the number of pairs (x, y) of integers such that x² + y² = n is
equal to 4[d1(n) − d3(n)].
• Example
Take n = 65. The divisors of 65 are 1, 5, 13, 65. These are all of the form 1
(mod 4), so d1(n) = 4, d3(n) = 0, and 4[d1(n) − d3(n)] = 16. The integer pairs
(x, y) for which x² + y² = 65 are (±1, ±8), (±8, ±1), (±4, ±7), (±7, ±4),
and these are indeed 16 in number. Or take n = 39. The divisors of 39 are 1, 3,
13, 39, so that d1(n) = 2, d3(n) = 2, and d1(n) − d3(n) = 0. And indeed there are no
pairs (x, y) of integers for which x² + y² = 39.
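As a quick check of Jacobi's divisor formula, the following Python sketch (illustrative only, not part of the original work) computes 4[d1(n) − d3(n)] and compares it with a brute-force count of the integer pairs (x, y).

def two_square_count(n):
    # Number of integer pairs (x, y) with x^2 + y^2 = n, via Jacobi's formula.
    d1 = sum(1 for d in range(1, n + 1) if n % d == 0 and d % 4 == 1)
    d3 = sum(1 for d in range(1, n + 1) if n % d == 0 and d % 4 == 3)
    return 4 * (d1 - d3)

def two_square_count_brute(n):
    # Brute-force count of pairs (x, y), including signs and order.
    r = int(n ** 0.5) + 1
    return sum(1 for x in range(-r, r + 1) for y in range(-r, r + 1)
               if x * x + y * y == n)

for n in (65, 39, 13):
    print(n, two_square_count(n), two_square_count_brute(n))   # 16/16, 0/0, 8/8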
1 = a0, a1, a2, ..., an = e (1)
ai = aj + ak, for some k ≤ j < i (2)
For if any two a's are equal, one of them may be dropped; and we may arrange the
sequence (1) into ascending order and remove terms > e without destroying the
addition chain property (2). The shortest length r for which there exists an addition
chain for e is denoted by l(e). Clearly, the problem of finding the shortest chains
becomes more and more complicated as e grows larger [8].
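The following Python sketch (ours, not the authors' implementation) evaluates x^e mod n along a given addition chain, using exactly one modular multiplication per chain element after the first; the chain 1, 2, 3, 6, 12, 15 for e = 15 is used purely as an illustration.

def exp_with_chain(x, chain, n):
    # Evaluate x^e mod n along an addition chain 1 = a_0, ..., a_r = e.
    # Each new exponent must be the sum of two earlier chain members.
    powers = {1: x % n}
    for i in range(1, len(chain)):
        ai = chain[i]
        for j in range(i):
            if ai - chain[j] in powers:                 # found ai = chain[j] + a_k
                powers[ai] = (powers[chain[j]] * powers[ai - chain[j]]) % n
                break
    return powers[chain[-1]]

# e = 15 via the length-5 chain 1, 2, 3, 6, 12, 15 (five multiplications)
print(exp_with_chain(7, [1, 2, 3, 6, 12, 15], 101), pow(7, 15, 101))   # 39 39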
This section focuses on the concept of the elliptic curve, the addition of two points on
an elliptic curve, and scalar point multiplication.
Let p be a prime greater than 3 and let a and b be two integers such that 4a³ + 27b² ≠ 0
(mod p). An elliptic curve E over the finite field Fp is the set of points (x, y) ∈ Fp × Fp
satisfying the Weierstrass equation
E : y² = x³ + ax + b (3)
together with the point at infinity O. The inverse of the point P = (x1, y1) is −P = (x1,
−y1). The sum P + Q of the points P = (x1, y1) and Q = (x2, y2) (where P, Q ≠ O and
Q ≠ −P) is the point R = (x3, y3), where, if P ≠ Q,
λ = (y2 − y1) / (x2 − x1);  x3 = λ² − x1 − x2;  y3 = (x1 − x3)λ − y1. (4)
and, if P = Q (point doubling),
λ = (3x1² + a) / (2y1);  x3 = λ² − 2x1;  y3 = (x1 − x3)λ − y1. (5)
The point at infinity O plays a role similar to that of the number 0 in normal
addition. Thus, P + O = P and P + (−P) = O for all points P. The points on E
together with the operation of "addition" form an abelian group [9]. Since the point
1. Q ← P
2. for j from l − 1 down to 0 do:
3.   Q ← [2]Q
4.   if kj = 1 then
       Q ← P + Q
5.   end if
6. end for
The inversion is done according to Fermat's theorem, a⁻¹ = a^(p−2) mod p, if gcd(a, p) = 1. In this
paper, we are interested in E defined over GF(p), where p is a prime. Thus, Fermat's
theorem is used to find the multiplicative inverse modulo p. The multiplicative inver-
sion can be performed by modular exponentiation of a by p − 2, and it can be
realized by using the square-and-multiply algorithm [11].
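As a concrete illustration (not the authors' implementation, and using ordinary modular reduction rather than the fuzzy reduction discussed in the next section), the following Python sketch combines the addition and doubling formulas (4) and (5), Fermat's theorem for the inverse, and the left-to-right double-and-add loop above. The toy curve and base point are chosen only for demonstration.

def inv_mod(a, p):
    # multiplicative inverse via Fermat's little theorem: a^(p-2) mod p
    return pow(a, p - 2, p)

def ec_add(P, Q, a, p):
    # Add two affine points on y^2 = x^3 + a*x + b over F_p (None = point at infinity O).
    if P is None:
        return Q
    if Q is None:
        return P
    x1, y1 = P
    x2, y2 = Q
    if x1 == x2 and (y1 + y2) % p == 0:
        return None                                            # P + (-P) = O
    if P == Q:
        lam = (3 * x1 * x1 + a) * inv_mod(2 * y1, p) % p       # doubling, Eq. (5)
    else:
        lam = (y2 - y1) * inv_mod(x2 - x1, p) % p              # addition, Eq. (4)
    x3 = (lam * lam - x1 - x2) % p
    y3 = ((x1 - x3) * lam - y1) % p
    return (x3, y3)

def scalar_mult(k, P, a, p):
    # left-to-right double-and-add computation of kP
    Q = None
    for bit in bin(k)[2:]:
        Q = ec_add(Q, Q, a, p)        # Q <- [2]Q
        if bit == '1':
            Q = ec_add(Q, P, a, p)    # Q <- Q + P
    return Q

# toy curve y^2 = x^3 + 2x + 2 over F_17, base point (5, 1)
print(scalar_mult(7, (5, 1), 2, 17))  # -> (0, 6)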
The concept of fuzzy modular arithmetic for cryptographic schemes was proposed
by Wael Adi. As illustrated by Wael Adi [12, 13], in partial modulo reduction
the division is replaced by repeated subtraction; for that, some random multiple of m is
used. Since the operation is not exact but subtracts some random multiples of m from
the value to be reduced modulo m, this approach is called fuzzy. The result of the
process is not the remainder usually computed by division, but another value, congruent
to it, which can be used for further calculation. A numerical example of serial fuzzy
modular multiplication is shown in Fig. 2. The modulus is m = 13, and a multiple of m,
i.e., m′ = m · t = 13 · 2 = 26, is subtracted each time. The 2's complement of m′ = 26 is ...1100110.
In Fig. 1, the conventional multiplication of two binary integers, A = 35 and
B = 33, with m = 13 is shown; the result is 1155 ≡ 11 (mod 13). Since fuzzy modular
multiplication mainly focuses on repeated subtraction instead of division for reduction
modulo m, in this work m′ = 2 · 13 = 26 is considered, and the process is shown in
Fig. 2. Note that the subtraction is performed whenever the bit position in B is zero.
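The partial-reduction idea can be illustrated with the following Python sketch. It is only a toy model of the scheme: Adi's hardware realization subtracts a fixed multiple m′ = t·m in 2's-complement form during a serial multiplication, whereas here a random multiple of m is subtracted whenever the accumulator grows, so the result is congruent to, but not equal to, the canonical remainder.

import random

def fuzzy_reduce(v, m, max_t=4):
    # Partial ("fuzzy") reduction: subtract a random multiple of m instead of
    # dividing, so the result stays congruent to v (mod m) but is not the
    # canonical remainder.
    while v >= max_t * m:
        v -= random.randint(1, max_t) * m
    return v

def fuzzy_mod_mul(a, b, m):
    # Shift-and-add multiplication with fuzzy reduction after each step.
    acc = 0
    for bit in bin(b)[2:]:                 # scan multiplier bits, MSB first
        acc = fuzzy_reduce(acc << 1, m)
        if bit == '1':
            acc = fuzzy_reduce(acc + a, m)
    return acc

a, b, m = 35, 33, 13
r = fuzzy_mod_mul(a, b, m)
print(r, r % m, (a * b) % m)               # r is congruent to 35*33 = 1155 = 11 (mod 13)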
Fig. 1 Conventional multiplication of A = 35 (binary 100011) by B = 33 (binary 100001): the product is 1155 = 11 (mod 13)
4.1 Pseudocode
4.2 Example
In order to understand the pseudocode, the examples illustrated in the following cases are
useful.
• Case 1: when e is a prime and e ≡ 1 (mod 4)
Suppose e is 29; then e ≡ 1 (mod 4). Using the sum of two squares, say
e = 5² + 2², we have x^29 = x^(25+4) = (x^4)^7 · x.
We first evaluate y = x^4.
Then, we form y1 = y² = (x^4)²; y2 = y1².
x^29 = y · y1 · y2 · x
Thus, the whole process takes 7 multiplications, which is equal to the shortest addition
chain as illustrated by Nareli Cruz-Cortes et al.
• Case 2: when e is a prime and e ≡ 3 (mod 4)
Suppose e is 11; e is of the form 4k + 3, where k = 2. It cannot be
expressed as a sum of two squares.
Thus, we form x^11 = x(x^5)².
We first evaluate y = x(x^4) = x(x^2)².
Then, we form x^11 = y² · x, which requires 5 multiplications.
• Case 3: when e is not a prime and one of its prime factors is of the form p ≡ 3 (mod 4)
Suppose e is 78. The factors of 78 are 2, 3, and 13. One of the prime factors,
say p = 3, is of the form p ≡ 3 (mod 4). Thus, 78 cannot be expressed as a sum of
two squares. In this case, we have to consider all the factors of 78.
Thus, 78 = 2 · 3 · 13 and
x^78 = ((x²)³)^13.
We first evaluate y1 = x · x,
y2 = y1 · y1 · y1.
Then x^78 = y2^13 = y2^12 · y2, with
y3 = y2 · y2,
y4 = y3^6,
x^78 = y4 · y2.
Thus, the exponent 78 requires 8 multiplications.
• Case 4: when e is expressed as a sum of three squares
Suppose e = 27; then 27 = 1² + 1² + 5² and
x^27 = x²(x^5)^5.
Thus, the exponent 27 requires 6 multiplications.
• Case 5: when e is expressed as a sum of four squares
Suppose e = 15; then 15 = 1² + 1² + 2² + 3² and
x^15 = x² · x^4 · x^9
= x²(x^4)³x
= x · x²(x^4)³.
Thus, the exponent 15 requires 5 multiplications.
Similar calculations can be performed for other exponents as well. The addition chains
for the exponents 11, 15, 27, 29, and 78 are shown in Table 2.
Table 3 Addition of two points in EC using fuzzy modular arithmetic and Fermat's theorem
Algorithm 2 EC point addition and doubling using fuzzy modular arithmetic

Point addition
Input: p1 = (x1, y1), p2 = (x2, y2) and m
Output: p1 + p2 = p3 = (x3, y3)
1. Tλ1 ← y2 − y1
2. Tλ2 ← x2 − x1
3. Tλm2 ← Tλ2^(m−2) mod m (Fermat's theorem)
4. T1λm2 ← Tλm2 − z1·m (z1·m ≤ Tλm2)
5. λ ← Tλ1 · T1λm2
6. Tx3 ← λ² − x1 − x2
7. if Tx3 > m then
8.   x3 ← Tx3 − z2·m (z2·m ≤ Tx3)
   else
9.   x3 ← Tx3
10. end if
11. Ty3 ← (x1 − x3)·λ − y1
12. if Ty3 > m then
13.   y3 ← Ty3 − z3·m
    else
14.   y3 ← Ty3
15. end if

Point doubling
Input: p1 = (x1, y1), a and m
Output: 2p1 = p3 = (x3, y3)
1. Tλ1 ← 3x1² + a
2. Tλ2 ← 2y1
3. Tλm2 ← Tλ2^(m−2) mod m (Fermat's theorem)
4. T1λm2 ← Tλm2 − z1·m (z1·m ≤ Tλm2)
5. λ ← Tλ1 · T1λm2
6. Tx3 ← λ² − 2x1
Steps 7–15 are identical to those of point addition.
Fig. 3 Proposed encryption/decryption methodology: the clear text M is imbedded in Ep as a point P = (x, y); the cipher text is C = P^e mod n (= e × P mod n), decryption recovers P = C^d mod n (= d × C mod n), and the imbedding of P is then reversed to obtain M
multiplier, etc. [16]. The main objective of this work is to perform modular opera-
tions without trial division, because the hardware implementation of trial division is
expensive. Thus, this work emphasizes fuzzy modular arithmetic as illustrated
by W. Adi, in which no division is required; division is replaced by repeated subtraction,
and hence the hardware implementation is relatively easy. Moreover, to perform
kP, repeated addition is used. To add two points of an EC, both formulas involve a modular
multiplicative inversion, which takes a considerable amount of time. To
avoid it, Fermat's theorem is used in this paper, as illustrated in Section 2.3.2.
The algorithm describing EC point addition and doubling using fuzzy
modular arithmetic and Fermat's theorem is shown in Table 3 [17].
The proposed methodology to speed up the modular arithmetic is shown in
Fig. 3.
6 Conclusion
References
9. N. Koblitz, Elliptic curve cryptosystems. Math. Comput. 48(177), 203–209 (Nov 1982)
10. J. Lutz, A. Hasan, High performance FPGA based elliptic curve cryptographic co-processor.
ITCC 04, international conference on information technology coding and computing, vol. 2,
2004, p. 486
11. I. Blake, G. Seroussi, N.P. Smart, Elliptic Curves in Cryptography, (Ser. London Math. Soc.
Lecture Note Series, Cambridge University Press, New York, 1999)
12. W. Adi, Fuzzy modular arithmetic for cryptographic schemes with applications for
mobile security, IEEE, Institute of Technology Technical University of Braunschweig,
Braunschweig, 2000, p. 263–265
13. A. Hanoun, W. Adi, F. Mayer-Lindenberg, B. Soundan, Fuzzy modular multiplication archi-
tecture and low complexity IPR-protection for FPGA technology. IEEE, 2006, p. 325–326
14. N. Cruz-Cortes et al., Finding Optimal Addition Chain Using a Genetic Algorithm Approach,
LNCS, vol. 3801 (Springer, Berlin/Heidelberg, 2005), pp. 208–215
15. N. Koblitz, Elliptic curve cryptosystems. Math. Comput. 48(177), 203–209 (Nov 1982)
16. Y.-J. Choi, M.-S. Kim, H.-R. Lee, H.-W. Kim, Implementation and analysis of Elliptic Curve
Cryptosystems over Polynomial basis and ONB. PWAEST, 10, 130–134 (Dec 2005)
17. G. Ganapathy, K. Mani, Maximization of Speed in Elliptic Curve Cryptography Using Fuzzy
Modular Arithmetic over a Microcontroller based Environment, Lecture Notes in Engineering
and Computer Science WCECS (San Francisco, CA, 20–22 Oct 2009), pp. 328–332
Chapter 40
RBDT-1 Method: Combining Rules and Decision
Tree Capabilities
Abstract Most of the methods that generate decision trees for a specific problem
use examples of data instances in the decision tree generation process. This chapter
proposes a method called “RBDT-1” - rule based decision tree - for learning a
decision tree from a set of decision rules that cover the data instances rather than
from the data instances themselves. RBDT-1 method uses a set of declarative rules
as an input for generating a decision tree. The method’s goal is to create on-demand
a short and accurate decision tree from a stable or dynamically changing set of
rules. We conduct a comparative study of RBDT-1 with existing decision tree
methods based on different problems. The outcome of the study shows that in
terms of tree complexity (number of nodes and leaves in the decision tree) RBDT-1
compares favorably to AQDT-1 and AQDT-2, which are methods that create decision
trees from rules. RBDT-1 also compares favorably to ID3, which is a well-known method
that generates decision trees from data examples. Experiments show that the
classification accuracies of the different decision trees produced by the different
methods under comparison are equal.
1 Introduction
The decision tree is one of the most popular classification algorithms used in data
mining and machine learning to create knowledge structures that guide the decision-
making process.
The most common methods for creating decision trees are those that create
decision trees from a set of examples (data records). We refer to these methods as
data-based decision tree methods.
A. Abdelhalim (*)
Department of Electrical and Computer Engineering, University of Victoria, 3055 STN CSC,
Victoria, B.C.,V8W 3P6, Canada
e-mail: [email protected]
On the other hand, to our knowledge only a few approaches that create
decision trees from rules have been proposed in the literature; we refer to these as rule-based
decision tree methods.
A decision tree can be an effective tool for guiding a decision process as long as
no changes occur in the dataset used to create the decision tree. Thus, for the data-
based decision tree methods once there is a significant change in the data, restruc-
turing the decision tree becomes a desirable task. However, it is difficult to
manipulate or restructure decision trees. This is because a decision tree is a
procedural knowledge representation, which imposes an evaluation order on the
attributes. In contrast, rule-based decision tree methods handle manipulations in the
data through the rules induced from the data not the decision tree itself. A declara-
tive representation, such as a set of decision rules is much easier to modify and
adapt to different situations than a procedural one. This easiness is due to the
absence of constraints on the order of evaluating the rules [1].
On the other hand, in order to be able to make a decision for some situation we
need to decide the best order in which tests should be evaluated in those rules. In
that case a decision structure (e.g. decision tree) will be created from the rules. So,
rule-based decision tree methods combine the best of both worlds. On one hand
they easily allow changes to the data (when needed) by modifying the rules rather
than the decision tree itself. On the other hand they take advantage of the structure
of the decision tree to organize the rules in a concise and efficient way required to
take the best decision. So knowledge can be stored in a declarative rule form and
then be transformed (on the fly) into a decision tree only when needed for a decision
making situation [1].
In addition to that, generating a decision structure from decision rules can
potentially be performed faster than generating it from training examples because
the number of decision rules per decision class is usually much smaller than the
number of training examples per class. Thus, this process could be done on
demand without any noticeable delay [2, 3]. Data-based decision tree methods
require examining the complete tree to extract information about any single
classification. In contrast, with rule-based decision tree methods, extracting infor-
mation about any single classification can be done directly from the declarative
rules themselves [4].
Although rule-based decision tree methods create decision trees from rules, they
could be used also to create decision trees from examples by considering each
example as a rule. Data-based decision tree methods create decision trees from data
only. Thus, to generate a decision tree for problems where rules are provided, e.g.
by an expert, and no data is available, rule-based decision tree methods are the only
applicable solution.
This chapter presents a new rule-based decision tree method called RBDT-1.
To generate a decision tree, the RBDT-1 method uses in sequence three differ-
ent criteria to determine the fit (best) attribute for each node of the tree, referred
to as the attribute effectiveness (AE), the attribute autonomy (AA), and the
minimum value distribution (MVD). In this chapter, the RBDT-1 method is
compared to the AQDT-1 and AQDT-2 methods which are rule-based decision
tree methods, along with ID3 which is one of the most famous data-based
decision tree methods.
2 Related Work
There are few published works on creating decision structures from declarative
rules.
The AQDT-1 method introduced in [1] is the first approach proposed in the
literature to create a decision tree from decision rules. The AQDT-1 method uses
four criteria to select the fit attribute that will be placed at each node of the tree.
Those criteria are the cost1, the disjointness, the dominance, and the extent, which
are applied in the same specified order in the method’s default setting.
The AQDT-2 method introduced in [4] is a variant of AQDT-1. AQDT-2 uses five
criteria in selecting the fit attribute for each node of the tree. Those criteria are the
cost1, disjointness, information importance, value distribution, and dominance,
which are applied in the same specified order in the method’s default setting. In
both the AQDT-1 & 2 methods, the order of each criterion expresses its level of
importance in deciding which attribute will be selected for a node in the decision
tree. Another point is that the calculation of the second criterion – the information
importance – in AQDT-2 method depends on the training examples as well as the
rules, which contradicts the method’s fundamental idea of being a rule-based decision
tree method. AQDT-2 requires both the examples and the rules to calculate the
information importance at certain nodes where the first criterion – Disjointness – is
not enough in choosing the fit attribute. The dependence of AQDT-2 on both the
examples and the rules increases the running time of the algorithm remark-
ably on large datasets, especially those with a large number of attributes.
In contrast the RBDT-1 method, proposed in this work, depends only on the rules
induced from the examples, and does not require the presence of the examples
themselves.
Akiba et al. [5] proposed a rule-based decision tree method for learning a single
decision tree that approximates the classification decision of a majority voting
classifier. In their proposed method, if-then rules are generated from each classifier
(a C4.5 based decision tree) and then a single decision tree is learned from these
rules. Since the final learning result is represented as a single decision tree,
problems of intelligibility and classification speed and storage consumption are
improved. The procedure that they follow in selecting the best attribute at each node
of the tree is based on the C4.5 method which is a data-based decision tree method.
The input to the proposed method requires both the real examples used to create the
¹ In the default setting, the cost equals 1 for all the attributes. Thus, the disjointness criterion is
treated as the first criterion of the AQDT-1 and AQDT-2 methods in the decision tree building
experiments throughout this paper.
classifiers (decision trees) and the rules extracted from the classifiers, which are
used to create a set of training examples to be used in the method.
In [6], the authors proposed a method called Associative Classification Tree
(ACT) for building a decision tree from association rules rather than from data.
They proposed two splitting algorithms for choosing attributes in the ACT: one
based on the confidence gain criterion and one based on the entropy gain criterion.
The attribute selection process at each node in both splitting algorithms relies on
both the existence of rules and the data itself as well. Unlike our proposed method
RBDT-1, ACT is not capable of building a decision tree from the rules in the absence
of data, or from data (considering them as rules) in the absence of rules.
In this section, we present the notations used to describe the rules used by our
method. We also present the methods used to generate the rules that will serve as
input to the rule-based decision tree methods in our experiments.
3.1 Notations
In order for our proposed method to be capable of generating a decision tree for a
given dataset, it has to be presented with a set of rules that cover the dataset. The
rules will be used as input to RBDT-1 which will produce a decision tree as an
output. The rules can either be provided up front, for instance, by an expert or can
be generated algorithmically.
Let a1, ..., an denote the attributes characterizing the data under consideration,
and let D1, ..., Dn denote the corresponding domains, respectively (i.e., Di represents
the set of values for attribute ai). Let c1, ..., cm represent the decision classes
associated with the dataset.
The RBDT-1 takes rules as input and produces a decision tree as output. Thus, to
illustrate our approach and compare it to existing similar approaches, we use in
this chapter two options for generating a set of disjoint rules for each dataset. The
first option is based on extracting rules from a decision tree generated by the ID3
method, where we convert each branch – from the root to a leaf – of the decision
tree to an if–then rule whose condition part is a pure conjunction. This scenario
will ensure that we will have a collection of disjoint rules. We refer to these rules
as ID3-based rules.
4 RBDT-1 Method
In this section, we describe the RBDT-1 method by first outlining the format of the
input rules used by the method and the attribute selection criteria of the method.
Then we summarize the main steps of the underlying decision tree building process
and present the technique used to prune the rules.
The RBDT-1 method applies three criteria on the attributes to select the fittest attribute
that will be assigned to each node of the decision tree. These criteria are the Attribute
Effectiveness, the Attribute Autonomy, and the Minimum Value Distribution.
AE(aj) = [m − Σi=1..m Cij(DC)] / m (2)
(where m is the total number of different classes in the set of rules).
The attribute with the highest AE is selected as the fit attribute.
Thus, the number of ADS_Lists that will be created for each attribute aj, as well as
the number of MaxADS values that are calculated, will be equal to pj. The MaxADSji
value, as defined by Wojtusiak [8], is 3m(m − 1), where m is the total number
of classes in Rji. We introduce the AA as a new criterion for attribute aj as given
in (5):
AA(aj) = 1 / [Σi=1..pj AA(aj, i)] (5)
The AA for each of the attributes is calculated using the above formula, and the
attribute with the highest AA score is selected as the fit attribute. According to the
above formula, AA(aj, i) equals zero when the class decision for the rule subset
examined corresponds to one class; in that case MaxADS = 0, which indicates that
a leaf node is reached (the best case for a branch). AA(aj, i) equals 1 when s equals
2 or when one of the attributes in the ADS_list has an ADS score equal to the MaxADS
value (the second best case). The second best case indicates that only one extra node
will be required to reach a leaf node. Otherwise, AA(aj, i) will be equal to 1 + (the
difference between the ADS scores of the attributes in the ADS_list and the
MaxADS value), which indicates that more than one node will be required before
reaching a leaf node.
The MVD criterion is concerned with the number of values that an attribute has in
the current rules. When the highest AA score is obtained by more than one attribute,
this criterion selects the attribute with the minimum number of values in the current
rules. MVD criterion minimizes the size of the tree because the fewer the number of
values of the attributes the fewer the number of branches involved and consequently
the smaller the tree will become [1]. For the sake of simplicity, let us assume that
the set of attributes that achieved the highest AA score is a1, ..., aq, where 2 ≤ q ≤ s.
Given an attribute aj (where 1 ≤ j ≤ q), we compute the corresponding MVD value as
shown in (7).
MVD(aj) = ⋃1≤i≤m Vij (7)
In the decision tree building process, we select the fit attribute that will be assigned
to each node from the current set of rules CR based on the attribute selection criteria
outlined in the previous section. CR is a subset of the decision rules that satisfy the
combination of attribute values assigned to the path from the root to the current
node. CR will correspond to the whole set of rules at the root node.
From each node a number of branches are pulled out according to the total
number of values available for the corresponding attribute in CR.
Each branch is associated with a reduced set of rules RR which is a subset of CR
that satisfies the value of the corresponding attribute. If RR is empty, then a single
node will be returned with the value of the most frequent class found in the whole
set of rules. Otherwise, if all the rules in RR assigned to the branch belong to the
same decision class, a leaf node will be created and assigned a value of that decision
class. The process continues until each branch from the root node is terminated with
a leaf node and no further branching is required.
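The following Python sketch outlines this recursive construction. It is a simplified illustration, not the authors' implementation: the AE/AA/MVD criteria are abstracted behind a user-supplied select_fit function, rules are represented as (condition dictionary, decision) pairs with absent attributes treated as "don't care", and the toy rule set at the end is invented for demonstration only.

from collections import Counter

def build_tree(rules, attributes, select_fit, default_class):
    # Recursive decision-tree construction from a set of rules (CR).
    classes = {dec for _, dec in rules}
    if len(classes) == 1:                       # all rules agree: create a leaf node
        return classes.pop()
    if not attributes:
        return Counter(dec for _, dec in rules).most_common(1)[0][0]

    fit = select_fit(rules, attributes)         # stand-in for the AE/AA/MVD criteria
    values = {cond[fit] for cond, _ in rules if fit in cond}
    node = {}
    for v in values:                            # one branch per value of 'fit' in CR
        # RR: rules whose condition matches this value ("don't care" matches anything)
        reduced = [(c, d) for c, d in rules if c.get(fit, v) == v]
        if not reduced:                         # empty RR: most frequent class overall
            node[v] = default_class
        else:
            node[v] = build_tree(reduced, [a for a in attributes if a != fit],
                                 select_fit, default_class)
    return {fit: node}

# toy usage with an invented rule set (not the weekend rules of Table 1)
toy_rules = [({'weather': 'sunny'}, 'tennis'),
             ({'weather': 'rainy', 'money': 'rich'}, 'cinema'),
             ({'weather': 'rainy', 'money': 'poor'}, 'stay_in')]
most_common = lambda rs, ats: max(ats, key=lambda a: sum(a in c for c, _ in rs))
print(build_tree(toy_rules, ['weather', 'money'], most_common, 'cinema'))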
In this section, the RBDT-1 method is illustrated using a publicly available dataset
called the weekend problem.
5.1 Illustration
The Weekend problem is a dataset that consists of ten data records obtained from
[12]. For this example, we used the AQ19 rule induction program to induce the rule
set shown in Table 1 which will serve as the input to our proposed method RBDT-1.
AQ19 was used with the mode of generating disjoint rules and with producing
a complete set of rules without truncating any of the rules. The corresponding
decision tree created by the proposed RBDT-1 method for the weekend problem is
shown in Fig. 1. It consists of three nodes and five leaves with 100% classification
accuracy for the data.
Figure 2 depicts the tree created by AQDT-1 & 2 and the ID3 methods consisting
of five nodes and seven leaves, which is bigger than the decision tree created by the
RBDT-1 method. Both trees have the same classification accuracy as RBDT-1. Due
to space limitations, we are unable to include here all of our work. We refer the
reader to our extended technical report on RBDT-1 [13] for more details on the
decision tree generation and discussion on tree sizes.
6 Experiments
Table 2 Comparison of tree complexities of the RBDT-1, AQDT-1, AQDT-2 & ID3 methods using ID3-based rules
Dataset       Method              Dataset       Method
Weekend       RBDT-1              MONK's 1      RBDT-1
Lenses        RBDT-1, ID3         MONK's 2      RBDT-1, AQDT-1 & 2
Chess         =                   MONK's 3      =
Car           RBDT-1, ID3         Zoo           =
Shuttle-L-C   =                   Nursery       RBDT-1
Connect-4     RBDT-1              Balance       RBDT-1, ID3
Table 3 Comparison of tree complexities of the RBDT-1, AQDT-1, AQDT-2 & ID3 methods using AQ-based rules
Dataset       Method              Dataset       Method
Weekend       RBDT-1              Monk's 2      RBDT-1
Lenses        RBDT-1, ID3         Monk's 3      =
Zoo           RBDT-1              Chess         =
Car           RBDT-1              Balance       RBDT-1, ID3
Monk's 1      RBDT-1              Shuttle-L-C   RBDT-1, AQDT-1 & 2
results in Table 2 are based on comparing the methods using ID3-based rules as
input for the rule-based methods. The ID3-based rules were extracted from the
decision tree generated by the ID3 method from the whole set of examples of each
dataset used. Thus, they cover the whole set of examples (100% coverage). The size
of the extracted ID3-based rules is equal to the number of leaves in the ID3 tree.
7 Conclusions
The RBDT-1 method proposed in this work allows generating a decision tree from a
set of rules rather than from the whole set of examples. Following this methodo-
logy, knowledge can be stored in a declarative rule form and transformed into a
decision structure when it is needed for decision making. Generating a decision
structure from decision rules can potentially be performed much faster than by
generating it from training examples.
In our experiments, our proposed method performed better than the other three
methods under comparison in most cases in terms of tree complexity, while achieving
at least the same level of accuracy.
References
1. I.F. Imam, R.S. Michalski, Should decision trees be learned from examples or from decision
rules? Lecture Notes in Computer Science, in Proceedings of the 7th International
Symposium on Methodologies, vol. 689 (Springer, Berlin/Heidelberg, 1993), pp. 395–404
2. J.R. Quinlan, Discovering rules by induction from large collections of examples, in Expert
Systems in the Microelectronic Age, ed. by D. Michie (Edinburgh University Press, Scotland,
1979), pp. 168–201
3. I. H. Witten, B.A. MacDonald, Using concept learning for knowledge acquisition, Int. J. Man
Mach. Stud. 27(4), 349–370 (1988)
4. R.S. Michalski, I.F. Imam, Learning problem-oriented decision structures from decision rules:
the AQDT-2 system, Lecture Notes in Artificial Intelligence, in Proceedings of 8th Interna-
tional Symposium Methodologies for Intelligent Systems, vol. 869 (Springer Verlag, Heidel-
berg, 1994), pp. 416–426
5. Y. Akiba, S. Kaneda, H. Almuallim, Turning majority voting classifiers into a single decision
tree, in Proceedings of the 10th IEEE International Conference on Tools with Artificial
Intelligence, 1998, pp. 224–230
6. Y. Chen, L.T. Hung, Using decision trees to summarize associative classification rules. Expert
Syst. Appl. Pergamon Press, Inc. Publisher, 2009, 36(2), 2338–2351
7. R.S. Michalski, K. Kaufman, The aq19 system for machine learning and pattern discovery: a
general description and user’s guide, Reports of the Machine Learning and Inference Labora-
tory, MLI 01-2, George Mason University, Fairfax, VA, 2001
8. J. Wojtusiak, AQ21 user’s guide. Reports of the Machine Learning and Inference Laboratory,
MLI 04-5, George Mason University, 2004
9. R.S. Michalski, I.F. Imam, On learning decision structures. Fundamenta Informaticae 31(1),
49–64 (1997)
10. R.S. Michalski, I. Mozetic, J. Hong, N. Lavrac, The multi-purpose incremental learning
system AQ15 and its testing application to three medical domains, in Proceedings of AAAI-
86, Philadelphia, PA, 1986, pp. 1041–1045
11. F. Bergadano, S. Matwin, R.S. Michalski, J. Zhang, Learning two-tiered descriptions of
flexible concepts: the POSEIDON system. Mach. Learning 8(1), pp. 5–43 (1992)
12. A. Abdelhalim, I. Traore, The RBDT-1 method for rule-based decision tree generation,
Technical report #ECE-09-1, ECE Department, University of Victoria, PO Box 3055, STN
CSC, Victoria, BC, Canada, July 2009
13. A. Asuncion, D.J. Newman, UCI machine learning repository [http://www.ics.uci.edu/mlearn/
MLRepository.html]. University of California, School of Information and Computer Science,
Irvine, CA, 2007
Chapter 41
Computational and Theoretical Concepts
for Regulating Stem Cells Using Viral and
Physical Methods
1 Introduction
The transfer of genes into cells has played a decisive role in our knowledge of
modern cell biology and associated biotechnology especially in the field of regen-
erative medicine. It is an exciting new area of medicine that has attracted much
interest since the first submission of clinical trials. Preliminary results were very
encouraging and prompted many investigators and researchers. However, the
ability of stem cells to differentiate into specific cell types holds immense potential
for therapeutic use in gene therapy. Realization of this potential depends on
efficient and optimized protocols for genetic manipulation of stem cells. It is widely
recognized that gain/loss of function approaches using gene therapy are essential
and valuable for realizing and understanding specific functions in studies involving
stem cells.
Integrating all these aspects into a successful therapy is an exceedingly complex
process that requires expertise from many disciplines, including molecular and cell
biology, genetics and virology, in addition to bioinformatics, bioprocess
manufacturing capability and clinical laboratory infrastructure. Gene therapy deli-
vers genetic material to the cell to generate a therapeutic effect by correcting an
existing abnormality or providing cells with a new function. Researchers have
successfully induced differentiated cells that can behave like pluripotent stem
cells. These cells are artificially derived from non-pluripotent cells, typically
adult somatic cells, by inserting certain genes. Induced Pluripotent Stem Cells are
believed to be identical to natural pluripotent stem cells in many respects, but the
full extent of their relation is still being assessed [1–4]. This new field holds the
realistic promise of regenerating damaged tissues, and empowers scientists to grow
them in vitro. This is a completely new concept compared to the traditional
medicine [5–7]. Whereas modeling of gene delivery systems for regulating stem
cell lineage systems is critical for the development of an integrated attempt to
develop ideas in a systematic manner, it has not been a research area that has
received the desired mount of attention. Over the last few years or so, there has been
a noticeable change in this respect, and there is now a growing awareness of
the need to use models and computer simulation to understand its processes and
behaviors.
Vectors are agents used to carry foreign genes into host cells that will produce the
protein encoded by those genes. They contain an origin of replication, which
enables the vector, together with the foreign DNA fragment inserted into it, to
replicate. The development of efficient models for vector systems is crucial in
supporting the success of gene therapy [8–10]. The ideal vector for gene therapy
would introduce a transcriptionally regulated gene into all organs or tissues relevant
to the genetic disorder by a single safe application. However, it rapidly became
obvious that there is no universally applicable ideal vector for gene transfer; each
has its own advantages and disadvantages, see Table 1 [11–14]. In general, a vector
used for gene delivery would have at least the following characteristics: (i) speci-
ficity for the targeted cells; (ii) resistance to metabolic degradation and/or attack by
the immune system; (iii) safety, i.e., minimal side effects; and (iv) an ability to
3 Proposed Model
Models are used particularly in the natural sciences and engineering disciplines
such as physics, biology, and electrical engineering, and can be understood as a
representation of the essential aspects of a certain system and a description of its
behavior, in order to present its knowledge in a usable form. This work applies the
idea of intelligent agents, whose structure is defined by a specific environment in
which the agents are situated. The capabilities of agents include both the ability to adapt
and the ability to learn.
Adaptation implies sensing the environment and reconfiguring in response. This
can be achieved through the choice of alternative problem solving rules or algorithms,
or through the discovery of problem solving strategies. Learning may proceed through
observing actual data, and then it implies a capability of introspection and analysis of
behavior and success. Alternatively, learning may proceed by example and
Fig. 1 Block diagram of a simple reflex agent. This type acts only on the basis of the current percept. The agent function is based on the condition-action rule
Fig. 2 Block diagram of a model-based reflex agent. This type can handle partially observable environments by keeping track of the current state of its world
Fig. 3 Block diagram of a goal-based agent. These agents are model-based agents, which store information regarding desirable situations
Fig. 4 Block diagram of a utility-based agent. It is possible to define a measure of how desirable a particular state is. This measure can be obtained through the use of a utility function which maps a state to a measure of the utility of the state
The motion for all agents is performed in a stepwise fashion over the space
defined by a regular 2-dimensional grid. Within the grid each object or molecule
occupies a square. The system development will be considered at discrete time
Fig. 5 Block diagram of a learning agent. Learning has the advantage that it allows the agent to initially operate in unknown environments and to become more competent than its initial knowledge alone might allow
• A virus has glycoprotein spikes on its surface that help it to enter or exit the
cell. The H spikes help a virus enter a cell, and the N spikes help it leave.
• After virus transcription, viral proteins are made in the cytoplasm using the host
cell's ribosomes. Some of these viral proteins move to the nucleus to aid in viral
replication; others move to aid in virus assembly.
• The probability of infection is highly dependent on both viral propagation and
concentration.
• Applying electric pulses to a cell, a process called electroporation, creates
transient pores in the plasma membrane that allow transport of traditionally
nonpermeant molecules into cells via both diffusion and electrophoresis.
• In order to successfully deliver non-permeant molecules into cells, they must be
located within a critical distance during electroporation so that they reach the
plasma membrane before the pores are closed.
• Delivering non-permeant molecules into cells using electric pulsation depends on
several factors, including the field magnitude, pulse duration, pulse interval, pore
size, molecule concentration, cell size, cell type, and the cellular response.
• Cell differentiation and regeneration are dynamic phenomena that are highly
time-dependent, interaction-dependent, and appear to be stochastic.
• Cell differentiation follows a hierarchical organization.
• Cells with oscillatory internal chemical reactions can differentiate through
interaction with other cells.
• The differentiated state of a cell is preserved through cell division and transmitted to
its offspring.
• During cell division, each offspring has two choices, either to preserve its parent's
state or to change it, and the probability of each choice seems to depend on
both its stability and its differentiability.
• A kind of memory is formed through the transfer of initial conditions (e.g.,
chemical compositions) to offspring.
• Cells with a high level of stemness can lead to several distinct types of tissues;
these abilities decrease at lower levels.
• Stem cells normally reside in a specific micro-environmental niche. These cells
can migrate from one crypt to another within their system.
• Maturity is the capability level of a cell for accomplishing tasks and
functions; usually, when it increases, the stemness level decreases.
• The cell society as a whole is stabilized through cell-to-cell interactions, e.g.,
when the number of one type of cell is decreased by external removal, the
distribution is recovered by further differentiation, replication, renewal, and
regeneration.
• Each cell has its own internal dynamics and genetic switching according to its
state and condition.
• Interactions among cells are given by the diffusive transport of chemicals
between each cell and its neighbors, according to their environment.
• Within each cell, a network of biochemical reactions also includes reactions
associated with genetic expression, signaling pathways, chemical concentrations,
and so on.
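Purely as an illustration of the kind of grid-based, stochastic agent stepping implied by the assumptions above, the Python sketch below moves cell agents over a regular 2-D grid at discrete time steps and lets them divide, with offspring that may keep or lower their parent's stemness. All class names, probabilities, and update rules here are invented placeholders, not the authors' model.

import random

GRID = 20          # illustrative grid size (agents occupy one square each)
P_DIVIDE = 0.05    # illustrative per-step division probability
P_SWITCH = 0.3     # illustrative probability that an offspring changes state

class CellAgent:
    def __init__(self, x, y, stemness=1.0):
        self.x, self.y, self.stemness = x, y, stemness

    def step(self, cells, occupied):
        # stepwise random motion over the regular 2-D grid
        dx, dy = random.choice([(0, 1), (0, -1), (1, 0), (-1, 0)])
        nx, ny = (self.x + dx) % GRID, (self.y + dy) % GRID
        if (nx, ny) not in occupied:
            occupied.discard((self.x, self.y))
            self.x, self.y = nx, ny
            occupied.add((nx, ny))
        # stochastic division; offspring may preserve or change the parent state
        if random.random() < P_DIVIDE:
            child_stem = self.stemness
            if random.random() < P_SWITCH:
                child_stem = max(0.0, self.stemness - 0.25)   # maturity up, stemness down
            cells.append(CellAgent(self.x, self.y, child_stem))

cells = [CellAgent(random.randrange(GRID), random.randrange(GRID)) for _ in range(10)]
occupied = {(c.x, c.y) for c in cells}
for t in range(100):                        # discrete time steps
    for c in list(cells):
        c.step(cells, occupied)
print(len(cells), sum(c.stemness for c in cells) / len(cells))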
4 Simulation Results
Stem cell lines vary in their characteristics and behavior not only because they are
derived from genetically outbred populations, but also because they may undergo
progressive adaptation upon long-term culture in vitro.
Hence, the task is to model the dynamic processes that drive and control the
cellular attributes. Clearly, it is presently impossible to describe these processes in
any reasonable detail. Therefore, it will be necessary to propose a simplified basic
scheme of the cellular dynamics.
Therefore, different simulation scenarios were observed, each with different
results, and different error rates lies in the range of acceptable values in this domain
of application. We present the results for some of these simulations along with
comparisons to experimental results.
To validate utility and performance we attempt to simulate and observe 16
fundamentally different but important cellular phenomena for which experimental
measurements were available or for which results are well characterized. These
included: (1) Intracellular Signaling; (2) Signaling Ligands; (3) Transcription
Factors; (4) Cell Surface Receptors; (5) Cell Cycle; (6) Apoptosis; (7) DNA Repair;
(8) Protein Synthesis; (9) Protein Folding; (10) Protein Processing; (11) RNA
Binding; (12) Metabolism; (13) Transport; (14) Chromatin Regulators; (15) Cyto-
skeleton; (16) Adhesion. These processes span the range of most kinds of cellular
behavior, from simple chemical kinetics to more complex metabolism and genetic
circuits. For a number of these phenomena we investigate its utility in monitoring or
simulating the effects of finite numbers and stochasticity on certain processes.
For observing the non-viral transfection effect, we attempt to simulate the process of
electroporation of Chinese hamster ovary (CHO) cells, which are spherical in
suspension with a mean diameter of 13.5 ± 1.4 μm, not significantly different
from control cells (P = 0.01) [21, 22]. After the aphidicolin (a reversible inhibitor of
nuclear DNA replication, used for blocking the cell cycle at early S phase) treatment,
cells synchronized in G2/M phase significantly increased in size, their diameter
reaching 17.1 ± 1.55 μm. Indeed, in aphidicolin-synchronized cells, synthesis of
proteins and lipids has been shown to be still present while the cell cycle was blocked.
These syntheses led to an increase of cell volume, which is in agreement with
previous experimental values investigated by Golzio et al. [21] and
Sukhorukov et al. [22]. However, cells were pulsed under optimum conditions for
gene transfer. Ten pulses, lasting 5 ms, were applied. Under those conditions, 63% of
control cells were permeabilized at 0.8 kV/cm and 81% at 0.9 kV/cm (Fig. 6a). The
associated fluorescence intensity, related to the number of molecules incorporated
into the electropermeabilized cells (Fig. 6b), increased with an increase in the
electric field intensity. The amount of electroloaded molecules in permeabilized
cell increased with the electric field intensity, in agreement with previous experi-
mental results [21, 23]. The percentage of permeabilized cells and the permeabiliza-
tion efficiency for cells synchronized in G1 phase were slightly smaller than for
control cells (Fig. 6a, b). Cells synchronized in G2/M phase exhibited a one and a
half fold increase in the percentage of permeabilization at 0.8 kV/cm and a slight
increase at 0.9 kV/cm (Fig. 6a). This synchronization effect was more pronounced
when fluorescence intensities were compared. Electropermeabilized cells in G2/M
phase exhibited a two-fold increase in fluorescence intensity at 0.8 and 0.9 kV/cm
(Fig. 6b). However, this increase had to be associated with the fact that in G2/M
phase, cells had 4N instead of 2N chromosomes and therefore two times more sites
Fig. 6 Simulation results: (a) Percentage of permeabilized cells. Cells were submitted to 10 pulses of 5 ms duration in pulsing buffer containing propidium iodide. Permeabilization was determined based on the total number of cells. (b) Associated mean fluorescence level of the permeabilized cells. Values represent means. (Curves: control cells, aphidicolin-synchronized cells, and sodium butyrate-synchronized cells at E = 0.8 and 0.9 kV/cm)
Fig. 7 (a) Effect of transduction cycle number on efficiency of transduction, error rate = 11.64%. (b) Static transduction versus centrifugational transduction, error rate = 5.22%
for PI interaction [21, 24]. These simulation values were in agreement with previous
experimental values investigated by Golzio et al. [21].
For observing the viral transduction effect, we attempt to simulate the process of self-
renewal and differentiation of Human Fetal Mesenchymal Stem Cells (hfMSCs)
after transduction with onco-retroviral and lentiviral vectors. Retroviral and lenti-
viral approaches offered the initial methodology that launched the field, and estab-
lished the technological basis of nuclear reprogramming, with rapid confirmation
across integrating vector systems [25, 26]. However, a static transduction of three
samples of first-trimester fetal blood-derived hfMSCs from 9+1, 9+6, and 10+6 weeks
of gestation revealed a mean efficiency of 20.3% ± 7.2%, 38.6% ± 6.4%, and 47% ±
3.2% after 1, 2, and 3 cycles of transduction, respectively, with an error rate of
11.64% (Fig. 6c, 7, 8 and 9). Simulating transduction under centrifugational forces
Fig. 8 (a) Percentage of lentiviral-transduced cells with varying MOI, error = 5.4%. (b) Mean fluorescence intensity with varying MOI after lentiviral transduction, error = 9.8%
Fig. 9 A pie chart depicting the simulated average results of distribution within the HSC profile for gene products with known or putative functions
References
1. A.L. David, D. Peebles, Gene therapy for the fetus: is there a future? Best Pract. Res. Clin.
Obstet. Gynaecol. 22(1), 203–218 (2008)
2. M. Baker, (2007-12-06). Adult cells reprogrammed to pluripotency, without tumors. Nature
Reports Stem Cells. Retrieved on 2007-12-11
3. R.S. Oliveri. Epigenetic dedifferentiation of somatic cells into pluripotency: cellular alchemy
in the age of regenerative medicine? Regen. Med. 2(5), 795–816 (Sept 2007)
4. P. Collas, C.K. Taranger, Toward reprogramming cells to pluripotency. Ernst Schering Res
Found Workshop. 2006;(60):47–67
5. J. Yu, J. Domen, D. Panchision, T.P. Zwaka, M.L. Rohrbaugh, et al. Regenerative medicine,
NIH Report, 2006
6. A. Atala, Recent developments in tissue engineering and regenerative medicine. Curr. Opin.
Pediatr. 18(2), 167–171, 2006
7. Y. Junying, J.A. Thomson, Regenerative Medicine, 2006
8. M. Colavito, M.A. Palladino, Gene Therapy, 1st edn. (Benjamin Cummings, Aug 2006)
ISBN-10: 0805338195
9. D.V. Schaffer, W. Zhou, Gene Therapy and Gene Delivery Systems (Springer, Feb 2006)
ISBN: 3540284044
10. A. Battler, J. Leor, Stem Cell and Gene-Based Therapy: Frontiers in Regenerative Medicine
(Springer, June 2007) ISBN: 184628676X
11. J. Phillips, C. Gersbach, A. Garcia, Virus-based gene therapy strategies for bone regeneration.
Biomaterials 28, 211–229 (2007)
12. Y. Yi, S.H. Hahm, KH. Lee, Retroviral gene therapy: safety issues and possible solutions.
Curr. Gene Ther.5, 25–35 (2005)
13. L.F. Maxfield, C.D. Fraize, J.M. Coffin, Relationship between retroviral DNA-integration-site
selection and host cell transcription. Proc. Natl. Acad. Sci. USA 102, 1436–41 (2005)
14. W. Weber, M. Fussenegger Pharmacologic transgene control systems for gene therapy. J Gene
Med 8(5), 535–556 (2006)
15. S. Mehier-Humbert, R.H. Guy, Physical methods for gene transfer: improving the kinetics of
gene delivery into cells. Adv. Drug Deliv. Rev. 57 733–753 (2005)
16. C. Capaccio, N.S. Stoykov, R. Sundararajan, D.A. Dean, Quantifying induced electric field
strengths during gene transfer to the intact rat vasculature. J. Electrostat. 67 652–662, (2009)
17. J.P. Roberts, Gene therapy’s fall and rise (again). Scientist 18(18), 22–24 (2004)
18. U. Bastolla, M. Porto, H.E. Roman, M. Vendruscolo, Structural Approaches to Sequence
Evolution: Molecules, Networks, Populations, 1 edn. (Springer, Jan 2007) ISBN-10:
3540353054
19. A.J. Nair, Introduction to Biotechnology and Genetic Engineering. Firewall/isp (May 2008)
ISBN: 8131803392
20. S.J. Russell, P. Norvig, Artificial Intelligence: A Modern Approach, 2nd edn. (Prentice Hall,
Upper Saddle River, NJ, 2003) ISBN: 0-13-790395-2
21. M. Golzio, J. Teissie, M.-P. Rols, Cell synchronization effect on mammalian cell permeabi-
lization and gene delivery by electric field. Biochimica et Biophysica Acta 1563(2002),
23–28, (2002)
22. V. Sukhorukov, C. Djuzenova, W. Arnold, U. Zimmermann, J. Membr. Biol. 142 77–92
(1994)
23. M.P. Rols, J. Teissie, Biophys. J. 58 1089–1098 (1990)
24. H.A. Crissman, J.A. Steinkamp, Eur. J. Histochem. 37, 129–138 (1993)
25. T.J. Nelson, A. Terzic Induced pluripotent stem cells: reprogrammed without a trace. Regen.
Med. (2009). doi:10.2217/rme.09.16
26. C.A. Sommer, M. Stadtfeld, G.J. Murphy et al. Induced pluripotent stem cell generation using
a single lentiviral stem cell cassette. Stem Cells (2009). doi:10.1634/stemcells.2008-1075
27. J. Chan, K. O’Donoghue, J. de la Fuente et al. Human fetal mesenchymal stem cells as
vehicles for gene delivery. Stem Cells 23, 93–102 (2005)
28. N.B. Ivanova, J.T. Dimos, C. Schaniel et al. A stem cell molecular signature (Sept 2002).
doi:10.1126/science.1073823
Chapter 42
DFA, a Biomedical Checking Tool for the Heart
Control System
1 Introduction
Healthy heart muscle can contract periodically and rhythmically, so the heart
can supply blood to the entire body through the cardiovascular system. Since a
significant decrease or complete stop of the blood flow caused by heart disease
can be immediately lethal, sick hearts evidently face death at every turn
more than healthy hearts do. Regardless of the advancement of medical technology,
T. Yazawa (*)
Department of Biological Science, Tokyo Metropolitan University, 1-1 Minami-Ohsawa,
Hachioji, 192-0397 Tokyo, Japan
e-mail: [email protected]
cardiovascular failure, especially life-threatening attacks, still takes place
unpredictably in normal-looking subjects. Gradually changing activity of cellu-
lar and tissue systems must be involved. Therefore, detection of "hidden" early
symptoms in normal-looking subjects is one of the main goals. For understand-
ing hidden information in the control system, nonlinear analysis has been pro-
posed as an applicable method for decades. Indeed, DFA is a method
that is potentially useful for diagnosing heart condition [1]. However, it has not
yet appeared as a practical tool for use in biomedical engineering and
clinical practice. We have made our own DFA program (by K. Tanaka) [2].
We here demonstrate that our method can be used to calculate the scaling
exponent for persons who have extra-systolic heartbeats, for persons who
have alternans heartbeats, and for other types of heartbeats, including normal,
healthy hearts.
2 Methods
We made our own programs for measuring beat-to-beat intervals (by Tanaka) and
for calculating the approximate scaling exponent. The DFA program (by Tanaka)
can instantly calculate the scaling exponents if a time series data exist.
After recording EKG or finger pressure pulses, we first made the beat-to-beat
interval time series. Preferable data length could be from some hundreds to about
2,000 heartbeats. From the time series we calculated the approximate scaling
exponent. For preparing the figures in this report, we present (1) heartbeat data,
plotted as heart rate against time, which is the representation of the original data
used for the DFA, (2) the approximate scaling exponents calculated by the DFA, and
(3) a graph with one or sometimes three least-mean-square regression lines. From the
graph, one can read an approximate slope, to which the exponent corresponds
directly. While beating hearts show a fluctuating pattern, we can hardly predict the
scaling exponent by eye-observation of such dynamic patterns visualized as the
42 DFA, a Biomedical Checking Tool for the Heart Control System 549
time series. Only a DFA program can do that. The scaling exponent reflects hidden
information carried by quantitative measures in a dynamically changing interval of
heartbeat and velocity of blood flow under the functional control of the autonomic
nervous system. An EKG-chart data cannot tell us what the scaling exponent is.
Mathematical procedures of DFA and explanation of biomedical meaning have
been documented elsewhere [1, 3].
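For readers who wish to reproduce this kind of analysis, the following is a minimal sketch of a DFA computation on a beat-to-beat interval series. It is illustrative Python code written for this text, not the authors' program by K. Tanaka, and it uses only first-order (linear) detrending rather than the quadratic, cubic, and bi-quadratic fits mentioned in the next section.

```python
import numpy as np

def dfa_exponent(intervals, box_sizes):
    """Approximate DFA scaling exponent of a beat-to-beat interval series."""
    x = np.asarray(intervals, dtype=float)
    y = np.cumsum(x - x.mean())                      # integrated (profile) series
    fluctuations = []
    for n in box_sizes:
        f2 = []
        for b in range(len(y) // n):
            seg = y[b * n:(b + 1) * n]
            t = np.arange(n)
            trend = np.polyval(np.polyfit(t, seg, 1), t)   # linear detrend per box
            f2.append(np.mean((seg - trend) ** 2))
        fluctuations.append(np.sqrt(np.mean(f2)))
    # The slope of log F(n) versus log n is the scaling exponent.
    slope, _ = np.polyfit(np.log(list(box_sizes)), np.log(fluctuations), 1)
    return slope

# Uncorrelated intervals give an exponent near 0.5; a healthy 1/f-like heart
# gives a value near 1.0.
rri = np.random.normal(0.8, 0.05, 2000)              # synthetic R-R intervals (s)
print(dfa_exponent(rri, box_sizes=range(100, 1001, 100)))
```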
3 Results
3.1 Extra-Systole
The subject, whose heart had an arrhythmia due to ventricular extra-systole, exhibited an abnormal record such as the one shown in Fig. 1. These events are so-called premature ventricular contractions (PVCs). Here we show one example obtained from a female subject aged 58. We have over 20 subjects who exhibited ventricular extra-systole. All of them, with no exceptions, showed the same result, namely a lower exponent.
In ventricular extra-systole, the ventricular muscle tries to generate its own rhythm before the normal rhythmic excitation arrives at the ventricle. Therefore a lower, smaller pulse, compared with a normal pulse, is generated, like the small pulse in Fig. 1. Physicians consider a PVC a not-so-serious problem of the heart if it happens infrequently; it is thus regarded as a "benign" arrhythmia, although we cannot agree with this idea because of the DFA results shown below. If it happens often, say more than 10 times per minute, physicians will finally take great care of it.
As shown in Fig. 2, this person's extra-systoles appeared infrequently: the count was 12 within the span of the record. The recording covered about 5,800 heartbeats, which corresponds to about 1.5 h. Therefore, this person's heart is "normal" in terms of a physician's guideline. However, the DFA showed that the heart of the subject in Figs. 1 and 2 exhibited a low exponent, about 0.7. The following is a brief description of how we calculate the scaling exponents.
For the computation of the scaling exponents, at the final step we need to calculate the slope in the graph of box size versus variance, as shown in Fig. 3. Here we used box sizes across the range 100–1,000. Before this final step there are procedures that are much more complex. In the middle of the DFA, least-mean-square methods are involved, namely linear, quadratic, cubic, and bi-quadratic fitting; details of those "mid-way" procedures are not shown here. The graph in Fig. 3 shows only the final step. The scaling exponents obtained after fitting each of these least-mean-square functions are indicated in the inset. When we repeat the linear, quadratic, cubic, and bi-quadratic fitting, we finally obtain a steady result; for example, there is no big difference between B and C in Fig. 3. Consequently, we take result B (and C) as the scaling exponent. From this calculation, an approximate scaling exponent was finalized for this subject. The figure indicates that the scaling exponent of this person is far below the normal value (i.e., 1.0). We determined that the scaling exponent for this heart is about 0.7.
This value (0.7) is not normal. We were intrigued by her heartbeats, and she kindly allowed us to measure her finger pulses multiple times over the course of 6 years. Figure 4 shows the result of our "automatic calculation" program. The data length was about 7,000 beats. The scaling exponents shown here are 0.91 and 0.65; the former corresponds to box sizes 30–90 and the latter to box sizes 70–240. At the short window sizes (30–90) her heart looks fine, but at the long window sizes (70–240), a range similar to that shown in Fig. 3, we obtained a low scaling exponent, 0.65. Her body system is not perfect; indeed, we never see a perfect 1/f rhythm in her data.
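To illustrate how a single recording can yield different exponents over different box-size windows, as in Fig. 4, one can simply evaluate the log-log fit over two ranges. The snippet below reuses the illustrative dfa_exponent sketch given in Section 2; the ranges follow the figure, but the code and synthetic data are ours.

```python
# Exponents over two box-size windows of the same interval series.
short_alpha = dfa_exponent(rri, box_sizes=range(30, 91, 10))    # short windows
long_alpha = dfa_exponent(rri, box_sizes=range(70, 241, 10))    # long windows
print(short_alpha, long_alpha)   # the subject in Fig. 4 showed 0.91 and 0.65
```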
The 'skipping heartbeat' makes the subject feel very uncomfortable. She reported that she was not feeling very well, even though her physician said it was okay; psychologically, this may not be good. Needless to say, ventricular tachycardia is a serious cause of life-threatening cardiac dysfunction. Importantly, this tachycardia derives from randomly repetitive, very fast beatings, in
other words, a series of extra-systoles. Although the causes of the extra-systole have
Fig. 4 An example of automatic computation of the DFA, for the same subject as in Figs. 1 through 3. (a) Finger-pulse test; an asterisk (*) marks a PVC. (b) About 7,000 heartbeats; sporadically appearing extra-systoles can be seen as downward swings. (c) DFA results at different box sizes. (d) Slopes corresponding to the scaling exponents in (c). 58-year-old female
3.2 Alternans with Low Exponent
We have examined the DFA of a person with alternans (Fig. 5). Alternans was documented in 1872 by Traube. However, alternans did not receive particular attention from doctors until recently, when it was recognized as a harbinger of death. We found that alternans heartbeats always exhibited a low scaling exponent when we measured the heartbeats of animal models (crustaceans such as crabs and lobsters, n = 13) and humans (n = 8, five Japanese and three Americans).
We have also met a person who had a high scaling exponent, 1.5 at box sizes 30–60 (Fig. 6). This man was tall but not overweight, and apparently healthy. The time series looks normal (Fig. 9b). However, his heart rate at rest was relatively high (HR = 70 per min, Fig. 9b) compared with that of a normal man (HR = 60 per min, Fig. 9a). We wondered why. We asked him for a possible explanation and finally found out that he had two brothers, both of whom had already died of heart problems whose details are unknown. This high exponent might be a sign of some problem in the heart or its control system, although we do not know the reason. We consider that cardiac physicians and geneticists might identify why the family has a potential risk if they study his family.
Fig. 6 A DFA example for a person who has a high scaling exponent. 59-year-old male
The DFA revealed that middle-aged people often show a normal exponent or a slightly lower one (Fig. 7). As shown in Fig. 7, the scaling exponents were unstable across different box sizes. We would say that this person's heart could be normal but that he leads a stressful life, because the DFA gave roughly 0.77 when measured over the entire range of box sizes. If a person were in perfect health, the exponent would be 1.0 over the entire range of box sizes.
Fig. 9 Time series data of two persons. (a) Normal-sized healthy male, age 53. This heartbeat looks normal, and this person's EKG during the health check (Fig. 8) showed no problem with his heart, but his scaling exponent was 0.8, slightly below normal (1.0). A detailed examination of his pulse record finally revealed that he had abnormal heartbeats. (b) A tall (close to 2 m), healthy male, age 59. The time series again looks rather normal, but this person had a scaling exponent of 1.5 (Fig. 6). Both (a) and (b) were recorded simultaneously, at rest. Note that the heart rate of (b) is on average greater than that of (a), even though (b) is much taller than (a); generally, a large body mass lowers the heart rate, but that is not the case here
For a medical EKG examination, usually only five or six pulses are recorded, as shown in Fig. 8. In this record each waveform looks normal, and there is no Q-T elongation and no S-T elevation. However, the DFA revealed that this heart exhibits a low scaling exponent, 0.8 (see Fig. 9a). We therefore performed a detailed examination of his pulse record over a period of 2 h and finally discovered that he had abnormal heartbeats, such as extra-systoles and even tachycardia, although those events occurred rarely (data not shown). These findings imply that his cardiovascular control system must have some problem. Even though we cannot yet tell what the physiological mechanism is, the DFA can detect such physiological problems in the cardiovascular system.
4 Discussion
To perform DFA we normally need a fairly long recording of heartbeats [1]. In contrast, an ordinary EKG done for a hospital health check takes a very short time; indeed, a typical EKG needs only six beats or so, as shown in Fig. 8.
We might conclude that the DFA has benefits. As a tool for diagnosing the condition of the cardiovascular system, the DFA can find out something more than such tests, something that cannot be checked by a simple traditional EKG.
Acknowledgment We are very grateful to all volunteers, for allowing us to test their heartbeats by
finger pulse tests. This research was supported by the grants 2008-JST04-065, H20DG407 NOMS,
and the grant TMU-San-Gaku-Ko-H20Project-D56I0106.
References
1. C.-K. Peng, S. Havlin, H.E. Stanley, A.L. Goldberger, Quantification of scaling exponents and
crossover phenomena in nonstationary heartbeat time series. Chaos 5, 82–87 (1995)
2. T. Yazawa, K. Tanaka, T. Katsuyama, Neurodynamical control of the heart of healthy and
dying crustacean animals. Proceedings CCCT05, vol. 1 (Austin, TX, USA, July 24–27 2005),
pp. 367–372
3. T. Katsuyama, T. Yazawa, K. Kiyono, K. Tanaka, M. Otokawa, Scaling analysis of heart-
interval fluctuation in the in-situ and in-vivo heart of spiny lobster. Panulirus japonicus, Bull.
Univ. Housei Tama 18, 97–108 (2003) (Japanese)
4. J.S. Alexandrowicz, The innervation of the heart of the crustacea. I. Decapoda. Quart. J.
Microsc. Sci. 75, 181–249 (1932)
5. D.M. Maynard, Circulation and heart function. The Physiology of Crustacea, vol. 1
(Academic, New York, 1961), pp. 161–226
6. T. Yazawa, K. Kuwasawa, Intrinsic and extrinsic neural and neurohumoral control of the
decapod heart. Experientia 48, 834–840 (1992)
7. T. Yazawa, T. Katsuyama, Spontaneous and repetitive cardiac slowdown in the freely moving
spiny lobster, Panulirus japonicus. J. Comp. Physiol. A 87, 817–824 (2001)
8. J.P. Saul, Beat-to-beat variations of heart rate reflect modulation of cardiac autonomic
outflow. News Physiol. Sci. 5, 32–37 (1990)
9. W.J. Gehring, Master Control Genes in Development and Evolution: The Homeobox Story
(Yale University Press, New Haven, 1998)
10. I. Sabirzhanova, B. Sabirzhanov, J. Bjordahl, J. Brandt, P.Y. Jay, T.G. Clark, Activation of
tolloid-like 1 gene expression by the cardiac specific homeobox gene Nkx2-5. Dev. Growth
Differ. 51, 403–410 (2009)
11. J.L. Wilkens, Evolution of the cardiovascular system in Crustacea. Am. Zool. 39, 199–214
(1999)
12. M. Kobayashi, T. Musha, 1/f fluctuation of heartbeat period. IEEE Trans. Biomed. Eng. 29,
456–457 (1982)
Chapter 43
Generalizations in Mathematical Epidemiology
Using Computer Algebra and Intuitive Mechanized
Reasoning
1 Introduction
At the moment, we may be at the edge of a biological problem. It is often said that the nineteenth century was the century of chemistry, the twentieth was the century of physics, and the twenty-first will be the century of biology. The advances in biology in recent years have been incredible, and, just as physics produced the atomic bomb, biology could give rise to global epidemic diseases. Climate change could also produce a new virus more successful than existing ones, creating an atmosphere of panic. For these and other reasons, we look for a solution using mathematical models together with computer algebra and mechanized reasoning. Specifically, we consider the SIR (Susceptible-Infective-Removed) model with differential susceptibility and multiple kinds of infected individuals. The objective is to derive three epidemic threshold
D.C. Cano
Logic and Computation Group, Engineering Physics Program, EAFIT University, Carrera 49 N
7 Sur, 50, Medellín, Suramérica, Colombia
e-mail: [email protected]
theorems by using the MKNW algorithm given in [1] and a little mechanized reasoning.
Briefly, the MKNW algorithm runs as follows. Initially we have a system S of ordinary non-linear differential equations whose coefficients are polynomial. We start by setting all derivatives to zero to find the equilibria and solve the system to find the equilibrium point T. We then compute the Jacobian Jb of the system S and substitute T into it. We compute the eigenvalues of Jb; from the eigenvalues we obtain the stability conditions, requiring each eigenvalue to be negative. Finally we obtain the reproductive number for the system S in the particular cases. Using deductive reasoning we then obtain theorems based on the particular cases.
The MKNW algorithm alone is not sufficient to prove the threshold theorems considered here, and for this reason it is necessary to use some form of mechanized reasoning, specifically some strategy of mechanized induction.
The threshold theorem that we prove in Section 2 was originally presented in [2] using only pen, paper, and human intelligence. A first contribution of this paper is a mechanized derivation of that theorem using a CAS.
The threshold theorem to be proved in Section 3 is original; some particular cases of this theorem were previously considered via CAS in [3, 4] and without CAS in [5].
The threshold theorem to be proved in Section 4 is original, and similar models were previously considered without CAS in [6].
We introduce the system for the SnIR epidemic model, which has n groups of susceptible individuals and is described by the following system of equations [2]:
\frac{d}{dt}X_i(t) = \mu\,(p_i X_0 - X_i(t)) - \lambda_i X_i(t)

\frac{d}{dt}Y(t) = \sum_{k=1}^{n} \lambda_k X_k(t) - (\mu + \gamma + \delta)\,Y(t)

\frac{d}{dt}Z(t) = \gamma Y(t) - (\mu + \varepsilon)\,Z(t)    (1)

\lambda_i = \alpha_i \beta Z\, Y(t)    (2)
This is a system with (n + 2) equations, and each constant is defined as follows:
μ is the natural death rate.
γ is the rate at which infectives are removed or become immune.
δ is the disease-induced mortality rate for the infectives.
ε is the disease-induced mortality rate for removed individuals.
α_i is the susceptibility of susceptible individuals.
β is the infectious rate of infected individuals.
Z is the average number of contacts per individual.
Each function or group is defined as follows:
X_i(t) are the n groups of susceptibles at time t.
Y(t) is the group of infectives at time t.
Z(t) is the group of removed individuals at time t.
As a particular case we analyze the standard SIR model [7], which has just one group of susceptibles and is described by the following system of equations:
\frac{d}{dt}X_1(t) = \mu\,(p_1 X_0 - X_1(t)) - \lambda_1 X_1(t)

\frac{d}{dt}Y(t) = \lambda_1 X_1(t) - (\mu + \gamma + \delta)\,Y(t)

\frac{d}{dt}Z(t) = \gamma Y(t) - (\mu + \varepsilon)\,Z(t)    (4)
X_1 = p_1 X_0, \quad Y = 0    (5)
Following the MKNW steps, the stability condition for this equilibrium is:

\frac{\alpha_1 Z p_1 X_0 \beta}{\mu + \gamma + \delta} < 1    (10)

R_0 < 1    (11)
Finally we find the basic reproduction number, which represents the equilibrium condition:

R_0 = \frac{\alpha_1 Z p_1 X_0 \beta}{\mu + \gamma + \delta}    (12)
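The equilibrium/Jacobian/eigenvalue steps described above for the MKNW procedure can be reproduced symbolically. The following is a minimal SymPy sketch for the SIR particular case (4), written for this text (the authors used Maple); the compartment Z(t) is renamed Zc in the code to avoid a clash with the contact constant Z.

```python
import sympy as sp

X1, Y, Zc = sp.symbols('X_1 Y Z_c')                       # compartments
mu, gam, delta, eps, a1, beta, Z, p1, X0 = sp.symbols(
    'mu gamma delta epsilon alpha_1 beta Z p_1 X_0', positive=True)

lam1 = a1 * beta * Z * Y                                   # lambda_1 from (2)
rhs = sp.Matrix([mu * (p1 * X0 - X1) - lam1 * X1,          # dX1/dt
                 lam1 * X1 - (mu + gam + delta) * Y,       # dY/dt
                 gam * Y - (mu + eps) * Zc])               # dZ/dt

# Infection-free equilibrium (5): derivatives zero, Y = 0.
dfe = {X1: p1 * X0, Y: 0, Zc: 0}
assert all(sp.simplify(e.subs(dfe)) == 0 for e in rhs)

# Jacobian evaluated at the equilibrium; its eigenvalues give the stability conditions.
J = rhs.jacobian([X1, Y, Zc]).subs(dfe)
print(list(J.eigenvals()))
# eigenvalues: -mu, -(mu + epsilon), and alpha_1*beta*Z*p_1*X_0 - (mu + gamma + delta);
# the last one is negative exactly when (10) holds, i.e. when R0 < 1.
```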
As another particular case we analyze the S2IR model, where there are two groups of susceptibles; the equations for this system are:

\frac{d}{dt}X_1(t) = \mu\,(p_1 X_0 - X_1(t)) - \lambda_1 X_1(t)

\frac{d}{dt}X_2(t) = \mu\,(p_2 X_0 - X_2(t)) - \lambda_2 X_2(t)

\frac{d}{dt}Y(t) = \lambda_1 X_1(t) + \lambda_2 X_2(t) - \mu Y(t) - \gamma Y(t) - \delta Y(t)

\frac{d}{dt}Z(t) = \gamma Y(t) - (\mu + \varepsilon)\,Z(t)    (13)
Initially we find the infection-free equilibrium solution for the previous system:
X_1 = p_1 X_0, \quad X_2 = p_2 X_0, \quad Y = 0    (14)
Similarly, we generate the Jacobian matrix of the system of equations and substitute the infection-free equilibrium point into it:

\begin{bmatrix}
-\mu & 0 & -\alpha_1 Z p_1 X_0 \beta \\
0 & -\mu & -\alpha_2 Z p_2 X_0 \beta \\
0 & 0 & \alpha_1 Z p_1 X_0 \beta + \alpha_2 Z p_2 X_0 \beta - \mu - \gamma - \delta
\end{bmatrix}    (15)
Here we show the S3IR, the S4IR and S5IR models where there are three, four and
five groups of susceptibles, respectively. With these models we do the same
process, and we only show the basic reproductive number.
R_0 = \frac{X_0 \beta Z\,(\alpha_1 p_1 + \alpha_2 p_2 + \alpha_3 p_3 + \alpha_4 p_4 + \alpha_5 p_5)}{\mu + \gamma + \delta}    (22)
Theorem. For the system of equations given by (1), the infection-free equilibrium is locally stable if R_0 < 1 and unstable if R_0 > 1, where:

R_0 = \frac{X_0 \beta Z \sum_{i=1}^{n} \alpha_i p_i}{\mu + \gamma + \delta}    (23)

\frac{X_0 \beta Z \sum_{i=1}^{n} \alpha_i p_i}{\mu + \gamma + \delta} < 1    (24)
The SnImR epidemic model is made up of a system of equations with n groups of susceptible individuals and m groups of infected individuals; this system is given by the following equations:

\frac{d}{dt}X_i(t) = \mu\,(p_i X_0 - X_i(t)) - \alpha_i Z \left(\sum_{j=1}^{m} \beta_j Y_j(t)\right) X_i(t)

\frac{d}{dt}Y_j(t) = Z \beta_j Y_j(t) \left(\sum_{i=1}^{n} \alpha_i X_i(t)\right) - (\mu + \gamma + \delta)\,Y_j(t)

\frac{d}{dt}Z(t) = \gamma Y_j(t) - (\mu + \varepsilon)\,Z(t)    (25)
As in all the cases, we analyze a particular case with one group of susceptibles and two groups of infectives. The following equations describe this case:

\frac{d}{dt}X_1(t) = \mu\,(p_1 X_0 - X_1(t)) - \alpha_1 Z \left(\sum_{j=1}^{2} \beta_j Y_j(t)\right) X_1(t)

\frac{d}{dt}Y_1(t) = Z \beta_1 Y_1(t) \left(\sum_{i=1}^{1} \alpha_i X_i(t)\right) - (\mu + \gamma + \delta)\,Y_1(t)

\frac{d}{dt}Y_2(t) = Z \beta_2 Y_2(t) \left(\sum_{i=1}^{1} \alpha_i X_i(t)\right) - (\mu + \gamma + \delta)\,Y_2(t)

\frac{d}{dt}Z(t) = \gamma Y_j(t) - (\mu + \varepsilon)\,Z(t)    (26)

Y_1 = 0, \quad Y_2 = 0, \quad X_1 = \frac{\mu p_1 X_0}{\mu + \alpha_1 Z \sum_{j=1}^{2} \beta_j Y_j}    (27)
Proceeding as in the previous case, the stability conditions are:

\alpha_1 Z p_1 X_0 \beta_2 - \mu - \gamma - \delta < 0

\alpha_1 Z p_1 X_0 \beta_1 - \mu - \gamma - \delta < 0    (29)

Moreover, the inequalities in (29) can be written, as in the other cases, as:

R_{0,2} = \frac{\alpha_1 Z p_1 X_0 \beta_2}{\mu + \gamma + \delta}

R_{0,1} = \frac{\alpha_1 Z p_1 X_0 \beta_1}{\mu + \gamma + \delta}    (30)
Here we have the two basic reproductive numbers for the SI2R model.
We now analyze a particular case with two groups of susceptibles and two groups of infectives; solving the system for the infection-free equilibrium, we find:

Y_1 = 0, \quad Y_2 = 0, \quad X_1 = \frac{\mu p_1 X_0}{\mu + \alpha_1 Z \sum_{j=1}^{2} \beta_j Y_j}, \quad X_2 = \frac{\mu p_2 X_0}{\mu + \alpha_2 Z \sum_{j=1}^{2} \beta_j Y_j}    (31)
In the same manner in which we obtained the previous reproductive numbers, we obtain the two basic reproductive numbers for the S2I2R model:

R_{0,2} = \frac{\alpha_1 Z p_1 X_0 \beta_2 + \alpha_2 Z p_2 X_0 \beta_2}{\mu + \gamma + \delta}

R_{0,1} = \frac{\alpha_1 Z p_1 X_0 \beta_1 + \alpha_2 Z p_2 X_0 \beta_1}{\mu + \gamma + \delta}    (33)
Theorem. For the system of equations given by (25), the infection-free equilibrium is locally stable if the reproductive numbers of infection satisfy R_{0,j} < 1 for all j from 1 to m, and unstable if some R_{0,j} > 1, where:

R_{0,j} = \frac{X_0 Z \beta_j \sum_{i=1}^{n} \alpha_i p_i}{\mu + \gamma + \delta}    (34)
To prove the theorem, we used mechanized induction starting from the particular results previously obtained in (30) and (33). Finally, we find the general solution for the basic reproductive numbers of the SnImR model, as summarized in Fig. 2.
We now turn to the analysis of the staged progressive SImR epidemic model. It has m groups of infected individuals; in this case the infection is staged and progressive, and it is described by the following system:

\frac{d}{dt}X(t) = \mu\,(x_0 - X(t)) - \lambda X(t)

\frac{d}{dt}Y_1(t) = \lambda X(t) - (\mu + \gamma_1 + \delta_1)\,Y_1(t)

\frac{d}{dt}Y_j(t) = \gamma_{j-1} Y_{j-1}(t) - (\mu + \gamma_j + \delta_j)\,Y_j(t)

\frac{d}{dt}Z(t) = \gamma_m Y_m(t) - (\mu + \varepsilon)\,Z(t)    (35)

with the restriction 2 \le j \le m, and where the rate of infection is:

\lambda = \alpha \left(\sum_{j=1}^{m} \beta_j Y_j(t)\right) Z    (36)

This is a system with (m + 2) equations, and all the constants were described before.
Now we analyze a particular case with one group of susceptibles and two groups of infectives. The following equations describe this case:

\frac{d}{dt}X(t) = \mu\,(x_0 - X(t)) - \lambda X(t)

\frac{d}{dt}Y_1(t) = \lambda X(t) - (\mu + \gamma_1 + \delta_1)\,Y_1(t)

\frac{d}{dt}Y_2(t) = \gamma_1 Y_1(t) - (\mu + \gamma_2 + \delta_2)\,Y_2(t)

\frac{d}{dt}Z(t) = \gamma_2 Y_2(t) - (\mu + \varepsilon)\,Z(t)    (37)

X = \frac{\mu x_0}{\mu + \alpha Z \beta_1 Y_1 + \alpha Z \beta_2 Y_2}, \quad Y_1 = 0, \quad Y_2 = 0    (38)
We generate the Jacobian of the system of equations and substitute the infection-free equilibrium point into it:

\begin{bmatrix}
-\mu & -\alpha Z x_0 \beta_1 & -\alpha Z x_0 \beta_2 \\
0 & \alpha Z x_0 \beta_1 - \mu - \gamma_1 - \delta_1 & \alpha Z x_0 \beta_2 \\
0 & \gamma_1 & -\mu - \gamma_2 - \delta_2
\end{bmatrix}    (39)
From the characteristic polynomial of this matrix, we obtain the basic reproduction number using the Routh-Hurwitz theorem:

R_0 = \frac{x_0 \alpha Z\,(\beta_1 \delta_2 + \beta_1 \mu + \beta_1 \gamma_2 + \beta_2 \gamma_1)}{(\mu + \gamma_2 + \delta_2)(\mu + \gamma_1 + \delta_1)}    (41)

We also analyze a particular case with three groups of infectives. We solve this system in the same way as in Section 4.1. For this model, the basic reproduction number we find is:

R_0 = \frac{x_0 \alpha Z\,(\beta_1 \delta_2 \mu + \beta_1 \delta_2 \gamma_3 + \beta_1 \mu \gamma_3 + \beta_1 \gamma_2 \mu + \beta_1 \gamma_2 \gamma_3 + \beta_1 \gamma_2 \delta_3 + \beta_1 \mu^2 + \beta_1 \delta_2 \delta_3 + \gamma_1 \beta_2 \mu + \gamma_1 \beta_2 \gamma_3 + \gamma_1 \beta_2 \delta_3 + \gamma_1 \beta_3 \gamma_2 + \beta_1 \mu \delta_3)}{(\mu + \gamma_3 + \delta_3)(\mu + \gamma_2 + \delta_2)(\mu + \gamma_1 + \delta_1)}    (42)
Theorem. For the system of equations given by (35), the infection-free equilibrium is locally stable if the reproductive number of infection satisfies R_0 < 1, and unstable if R_0 > 1, where:

R_0 = \frac{x_0 \alpha Z \sum_{j=1}^{m} \left(\beta_j \prod_{k=1}^{j-1} \gamma_k \prod_{l=j+1}^{m} (\mu + \gamma_l + \delta_l)\right)}{\prod_{l=1}^{m} (\mu + \gamma_l + \delta_l)}    (43)
To prove the theorem, we used mechanized induction starting from the particular results previously obtained. To conclude, we find the general solution for the basic reproductive number of the staged progressive SImR model, as summarized in Fig. 3.
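As a quick sanity check of the general expression (43), one can verify symbolically that it reduces to (41) for m = 2 and to (42) for m = 3. The following SymPy snippet is our own check, not part of the authors' Maple derivation.

```python
import sympy as sp

def R0_general(m):
    """Basic reproduction number (43) for the staged progressive SImR model."""
    x0, alpha, Z, mu = sp.symbols('x_0 alpha Z mu', positive=True)
    beta = sp.symbols(f'beta1:{m + 1}', positive=True)
    gamma = sp.symbols(f'gamma1:{m + 1}', positive=True)
    delta = sp.symbols(f'delta1:{m + 1}', positive=True)
    num = sum(beta[j]
              * sp.Mul(*[gamma[k] for k in range(j)])                       # gamma_1 ... gamma_{j-1}
              * sp.Mul(*[mu + gamma[l] + delta[l] for l in range(j + 1, m)])
              for j in range(m))
    den = sp.Mul(*[mu + gamma[l] + delta[l] for l in range(m)])
    return x0 * alpha * Z * num / den

print(sp.factor(R0_general(2)))   # compare with (41)
print(sp.expand(R0_general(3)))   # compare with (42)
```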
Fig. 3 Inductive mechanized reasoning for the staged progressive SImR Model
5 Conclusions
We finally obtain three theorems, which help to demonstrate that CAS+MR (computer algebra systems plus mechanized reasoning) are important tools for solving problems in any situation that mathematics can model. The theorems are useful for devising strategies to fight epidemic diseases in future biological emergencies.
Using a CAS, in our case Maple 11, we can solve the mathematical problem and obtain results very quickly; without it, the work would take far too much time.
The use of CAS+MR can also help in teaching and learning mathematics for engineers, who often have little time and need to produce solutions quickly; it can be incorporated into engineering programs.
Since MR allowed us to find the general forms of the infection-free equilibrium thresholds, we can see the importance of developing software that performs such mechanized reasoning automatically.
Acknowledgments The authors would like to thank Prof. Dr. Eng. Andrés Sicard and Prof. Mario Elkin Vélez for their company in this work in the Logic and Computation group, and Prof. Eng. Maurio Arroyave for supporting this work from the Engineering Physics program. They are also very grateful to Prof. Dr. Eng. Félix Londoño for his help with the World Congress on Engineering and Computer Science, and to the Student Organization and Ms. Angela Echeverri for the financial assistance for the trip to the congress. This research was supported by the Logic and Computation group at EAFIT University.
References
1. C.W. Brown, M.E. Kahoui, D. Novotni, A. Weber, Algorithmic methods for investigating
equilibria in epidemic modeling. J. Symb. Comput. 41, 1157–1173 (2006)
2. J.M. Hyman, J. Li, Differential susceptibility epidemic models. J. Math. Biol. 50, 626–644
(2005)
3. J.A.M. Taborda, Epidemic thresholds via computer algebra, MSV 2008, pp. 178–181, http://
dblp.uni-trier.de/rec/bibtex/conf/msv/Taborda08
4. D. Hincapié et al., Epidemic thresholds in SIR and SIIR models applying an algorithmic method. Lecture Notes in Bioinformatics 5354, 119–130 (2008)
5. S.B. Hsu, Y.H. Hsieh, On the role of asymptomatic infection in transmission dynamics of infectious diseases. Bull. Math. Biol. 70, 134–155 (2008)
6. J.M. Hyman, J. Li, An intuitive formulation for the reproductive number for spread of diseases
in heterogeneous populations. Math. Biosci. 167, 65–86 (2000)
7. N.T.J. Bailey, The Mathematical Theory of Epidemics, 1957
Chapter 44
Review of Daily Physical Activity Monitoring
System Based on Single Triaxial Accelerometer
and Portable Data Measurement Unit
Mihee Lee, Jungchae Kim, Sun Ha Jee, and Sun Kook Yoo
Abstract The main objective of this pilot study was to present a method for convenient monitoring of detailed ambulatory movements in daily life, using a portable measurement device employing a single tri-axial accelerometer. In addition, this review article aims to provide researchers with a guide to some commonly used accelerometers in physical activity assessment. Specifically, we implemented a small wearable real-time data-storage system that uses a Micro SD memory card for convenient, long-term monitoring of habitual physical activity during daily life. Activity recognition based on the extracted features was performed with a fuzzy c-means classification algorithm, which recognized standing, sitting, lying, walking and running with 99.5% accuracy. This study was a pilot test of the feasibility of our developed system. Further application of the present technique may be helpful in health promotion for both young and elderly people.
1 Introduction
Over the past two decades, a striking increase in the number of people with the
metabolic syndrome worldwide has taken place. This increase is associated with the
global epidemic of obesity and diabetes. With the elevated risk not only of diabetes
but also of cardiovascular disease from the metabolic syndrome, there is urgent
need for strategies to prevent the emerging global epidemic [1, 2]. Although the
metabolic syndrome appears to be more common in people who are genetically susceptible, acquired underlying risk factors (being overweight or obese, physical inactivity, and an atherogenic diet) commonly elicit clinical manifestations [3].
M. Lee (*)
Graduate Programs of Biomedical Engineering, Yonsei University, 262 Seongsanno,
Seodaemoon-Ku, Seoul 120-749, Korea
e-mail: [email protected]
1.3 Pedometers
The concept of a pedometer was first introduced by Leonardo Da Vinci 500 years
ago. Pedometers are inexpensive tools that measure the number of steps taken by
responding to the vertical acceleration of the trunk. Mechanical pedometers have a lever arm that is triggered when a step is taken and a ratchet within the pedometer that rotates to record the movement [13]. Electronic pedometers have a spring-sus-
pended pendulum arm that moves up and down thereby opening and closing an
electrical circuit [7]. Output from the pedometer consists of number of steps taken,
distance walked (with stride information), and estimated number of calories
expended (with body weight considered) [13]. However, it is assumed that a person expends a constant amount of energy per step regardless of speed (e.g. the Yamax pedometer assumes 0.55 kcal·kg⁻¹·step⁻¹) [7].
Vertical acceleration of the hip during walking varies from 0.5 to 8 m·s⁻²; however, some pedometers do not register movement below 2.5 m·s⁻² or distinguish vertical accelerations above a certain threshold, and therefore are inaccurate at slow or very fast walking speeds and during running [7, 14].
Limitations of the pedometer include insensitivity to: (1) static or sedentary acti-
vity, (2) isometric activity, (3) movement occurring with the arms, and (4) slow or fast
walking velocities [7, 13, 14]. In addition, they lack internal clocks and data storage
capability which makes an analysis of overall physical activity patterns difficult [7].
Pedometers are primarily utilized in health promotion and maintenance and may
provide valuable feedback to a participant concerning their level of activity. Advantages of utilizing a pedometer include low cost, small size, and good subject compliance.
1.4 Accelerometers
Accelerometry is based on the assumption that limb movement and body accelera-
tion are theoretically proportional to the muscular forces responsible for the accel-
erations, and therefore energy expenditure may be estimated by quantifying these
accelerations [11, 13]. Interest in monitoring the acceleration of the body started in
the 1950s when Broubha and Smith (1958) suggested an association between the
integral of vertical acceleration versus time and energy expenditure [15].
Acceleration is the change in velocity over time, and the degree of acceleration
detected provides an index of movement intensity. Accelerations increase from the
head to the feet and are generally the greatest in the vertical direction. Running
produces the greatest vertical direction accelerations (8.1–12 g) at the ankle and up
to 5 g at the back [16]. Bouten et al. (1996) recommends that accelerometers should
be able to measure accelerations up to 12 g for measurements occurring in daily
living activities and exercise. Processing and filtering is required to identify and
include accelerations that are outside the normal acceleration range of human
movement.
Low pass filters remove high frequency signals that may occur as a result
of external vibrations [17]. Factors which can influence accelerometer output
include the acceleration due to body movement and external vibrations (loose
straps, transportation) [18]. Most accelerometers are oriented to monitor the vertical
plane such that linear acceleration with movement is not recorded [19].
Accelerometers are worn while the participant performs different activities and the
counts from the monitor are regressed against a criterion measure of energy
expenditure (e.g. whole room calorimetry, indirect calorimetry, doubly labeled
water, or heart rate). Output from the monitors can be utilized to assess frequency,
duration and intensity of physical activity. Energy expenditure is predicted by
converting monitor counts into a unit of energy expenditure (MET, kcal·min⁻¹, or kcal·kg⁻¹·min⁻¹).
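As a concrete illustration of such a count-to-energy calibration, the following sketch regresses monitor counts against criterion METs and converts the predicted METs to kcal·min⁻¹ using the standard 1 MET = 3.5 mL O₂·kg⁻¹·min⁻¹ convention. The numbers are synthetic and the code is ours, not taken from any of the cited studies.

```python
import numpy as np

counts = np.array([120, 850, 2300, 4100, 6000], dtype=float)   # counts/min (synthetic)
mets = np.array([1.2, 2.0, 3.4, 5.1, 6.8])                      # criterion METs (synthetic)

slope, intercept = np.polyfit(counts, mets, 1)                  # calibration equation
body_mass_kg = 70.0

new_counts = np.array([1500.0, 3000.0])
predicted_mets = slope * new_counts + intercept
# kcal/min = METs * 3.5 mL O2 per kg per min * body mass / 200 (about 5 kcal per litre O2)
kcal_per_min = predicted_mets * 3.5 * body_mass_kg / 200.0
print(predicted_mets, kcal_per_min)
```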
The characteristics of the population being studied and/or the specific activity may
affect the accuracy of the motion sensors [23]. The output of the accelerometer will
depend on the placement site (e.g. hip, wrist, ankle), the orientation relative to the
subject, the posture of the subject, and the activity being performed. Accelerometer
output is influenced by the placement of the monitors as body parts are differentially
active depending upon the activity. Placing the accelerometer on the limbs while
monitoring sedentary and daily living activities may produce more accurate results
than hip placement as the limbs are likely to reflect activities which occur in these
circumstances. The primary acceleration which occurs during walking or running is
in the vertical direction, therefore placing the accelerometer at a site which reflects
acceleration at the center of the body mass (e.g. hip) is likely to be more accurate than
the limbs. The velocity of movement can also have an effect on the relationship
between accelerometer counts and energy expenditure.
movement, its relatively low cost, ability to store large amounts of data over
extended periods of time, small size and minimal participant discomfort [27].
These findings indicate that differences between uniaxial, triaxial, and omnidi-
rectional accelerometry devices are minimal and all three provide reasonable
measures of physical activity during treadmill walking. Furthermore, the inclusion
of stride length in accelerometry based energy expenditure prediction equations
significantly improved energy expenditure estimates.
The purpose of this study was to present a method for convenient monitoring of detailed ambulatory movements in daily life, using a portable measurement device employing a single tri-axial accelerometer. Specifically, we implemented a small wearable real-time data-storage system that uses a Micro SD memory card (Secure Digital memory card) for convenient, long-term monitoring of habitual physical activity during daily life. In this work, the performance of activity recognition algorithms under conditions akin to those found in real-world settings is assessed. The activity recognition results are based on acceleration data collected from a single tri-axial accelerometer placed on the subjects' waist under semi-naturalistic conditions for the pilot test.
[Block diagram of the portable measurement unit: Li-ion battery with charge controller, AU6337 SD-card controller, Micro SD memory card, +3.3 V supply (Vcc), and the subject's PC]
The data for these experiments were gathered in an unsupervised pilot study in which healthy young subjects (age 24–33) performed a variety of activities three times under outdoor conditions. Characteristics of the participants are presented in Table 1.
We put the acceleration measuring sensor system on the left waist of the sub-
jects, and measured change in acceleration signal according to change in ambula-
tory movement and physical activities. In the experiment performed in this
research, the change of acceleration was measured while the subject was repeating
postures such as standing, sitting, lying, walking and running. A representative set of routine data is shown in Fig. 2.
Fig. 2 Representative data from the daily routine for each of the three axes of the tri-axial device
While the movements and postures contained within the routine are by no means a
complete set of all possible activities that a given person might perform, they do form
a basic set of simple activities which form an underlying structure to a person’s daily
life, and are likely to provide a great deal of information in terms of the person’s
balance, gait and activity levels if they can be accurately identified.
Features were computed on 512-sample windows of acceleration data with 256 samples overlapping between consecutive windows. At a sampling frequency of 100 Hz, each window represents 5.12 s. The maximum acceleration and the mean and standard deviation of the acceleration channels were computed over the sliding windows; using windows with 50% overlap has demonstrated success in past work. The 512-sample window size enabled fast computation of the FFTs used for some of the features. The DC feature used for normalization is the mean acceleration value of the signal over the window. The use of the mean of the maximum acceleration features has been shown to result in accurate recognition of certain postures and activities.
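As an illustration of this windowing scheme (512-sample windows with 50% overlap at 100 Hz, per-axis mean, standard deviation, and maximum, plus a few FFT-based features), the sketch below shows one possible feature extractor. The function and the exact feature choices are ours, written for this review, not the authors' implementation.

```python
import numpy as np

FS = 100      # sampling frequency (Hz)
WIN = 512     # samples per window (about 5.12 s)
STEP = 256    # 50% overlap

def extract_features(acc):
    """acc: (N, 3) tri-axial acceleration; returns one feature row per window."""
    rows = []
    for start in range(0, len(acc) - WIN + 1, STEP):
        w = acc[start:start + WIN]
        dc = w.mean(axis=0)                            # DC value used for normalization
        spec = np.abs(np.fft.rfft(w - dc, axis=0))     # FFT of the DC-removed window
        rows.append(np.hstack([dc,
                               w.std(axis=0),
                               np.abs(w).max(axis=0),
                               spec[1:6].ravel()]))    # a few low-frequency bins per axis
    return np.array(rows)

# Example with synthetic data: 60 s of tri-axial acceleration.
features = extract_features(np.random.randn(60 * FS, 3))
print(features.shape)
```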
3 Results
This section describes the experiments and experimental results of the human posture recognition system. In the pilot test, a subject performed a continuous sequence of posture changes including standing, sitting, lying, walking and running. In the experiment, each posture was recognized in each of the three trials.
Acknowledgments This study was supported by a grant of the Seoul R&BD Program, Republic
of Korea (10526) and the Ministry of Knowledge Economy (MKE) and Korea Industrial
Technology Foundation (KOTEF) through the Human Resource Training Project for Strategic
Technology.
References
1. P. Zimmet, K.G. Alberti, J. Shaw, Global and societal implications of the diabetes epidemic.
Nature 414, 782–787 (2001)
2. S.M. Grundy, B. Hansen, S.C. Smith Jr, J.I. Cleeman, R.A. Kahn, Clinical management of
metabolic syndrome: report of the American Heart Association/National Heart, Lung, and
Blood Institute/American Diabetes Association conference on scientific issues related to
management. Circulation 109, 551–556 (2004)
3. R.H. Eckel, S.M. Grundy, P.Z. Zimmet, The metabolic syndrome. Lancet 365, 1415–1428 (2005)
4. P.D. Thompson, D. Buchner, I.L. Pina, et al., Exercise and physical activity in the prevention
and treatment of atherosclerotic cardiovascular disease: a statement from the Council on
Clinical Cardiology (Subcommittee on Exercise, Rehabilitation, and Prevention) and the
Council on Nutrition, Physical Activity, and Metabolism (Subcommittee on Physical
Activity). Circulation 107, 3109–3116 (2003)
5. J. Tuomilehto, J. Lindstrom, J.G. Eriksson, et al., Prevention of type 2 diabetes mellitus by
changes in lifestyle among subjects with impaired glucose tolerance. N. Engl. J. Med. 344,
1343–1350 (2001)
6. Y. Ohtaki, M. Susumago, A. Suzuki, et al., Automatic classification of ambulatory movements
and evaluation of energy consumptions utilizing accelerometers and a barometer. Microsyst.
Technol. 11, 1034–1040 (2005)
7. D.R. Bassett Jr. Validity and reliability issues in objective monitoring of physical activity.
Res. Q Exerc. Sport 71, S30–S36 (2000)
8. C.L. Craig, A.L. Marshall, M. Sjostrom, A.E. Bauman, et al., International physical activity
questionnaire: 12-country reliability and validity. Med. Sci. Sports Exerc. 35, 1381–1395
(2003)
9. K.M. Allor, J.M. Pivarnik, Stability and convergent validity of three physical activity assess-
ments. Med. Sci. Sports Exerc. 33, 671–676 (2001)
10. M.J. LaMonte, B.E. Ainsworth, C. Tudor-Locke, Assessment of physical activity and energy
expenditure, in Obesity: Etiology, Assessment, Treatment and Prevention, ed. by R.E. Andersen
(Human Kinetics, Champaign, IL, 2003), pp. 111–117
11. C.V.C. Bouten, W.P.H.G. Verboeket-Van Venne, et al., Daily physical activity assessment:
comparison between movement registration and doubly labeled water. J. Appl. Physiol. 81,
1019–1026 (1996)
12. U.S. Department of Health & Human Services, Physical Activity and Health: A Report of the
Surgeon General (U.S. Department of Health and Human Services, Centers for Disease
Control and Prevention, National Center for Chronic Disease Prevention and Health Promo-
tion, The President’s Council on Physical Fitness and Sports, Atlanta, GA, 1996)
13. P.S. Freedson, K. Miller, Objective monitoring of physical activity using motion sensors and heart rate. Res. Quart. Exerc. Sport 71, 21–29 (2000)
14. H.J. Montoye, H.C.G. Kemper, W.H.M. Saris, R.A. Washburn, Measuring Physical Activity
and Energy Expenditure (Human Kinetics, Champaign, IL, 1996)
15. R.K. Dishman, R.A. Washburn, D.A. Schoeller, Measurement of physical activity. Quest. 53,
295–309 (2001)
16. A. Bhattacharya, E.P. McCutcheon, E. Shvartz, J.E. Greenleaf, Body acceleration distribu-
tion and O2 uptake in humans during running and jumping. J. Appl. Physiol. 49, 881–887
(1980)
17. G.J. Welk, Physical Activity Assessments for Health-Related Research (Human Kinetics,
Champaign, IL, 2002)
18. C.V.C. Bouten, A.A.H.J. Sauren, M. Verduin, J.D. Janssen, Effects of placement and orienta-
tion of body-fixed accelerometers on the assessment of energy expenditure during walking.
Med. Biol. Eng. Comput. 35, 50–56 (1997)
19. K.Y. Chen, D.R. Bassett, The technology of accelerometry-based activity monitors: current
and future. Med. Sci. Sport Exerc. 37(11), S490–S500 (2005)
20. E.L. Melanson, P.S. Freedson Physical activity assessment: a review of methods. Crit. Rev.
Food Sci. Nutr. 36, 385–396 (1996)
21. T.G. Ayen, H.J. Montoye, Estimation of energy expenditure with a simulated three-dimensional accelerometer. J. Ambul. Monit. 1(4), 293–301 (1988)
22. K.R. Westerterp, Physical activity assessment with accelerometers. Int. J. Obes. 23(Suppl 3),
S45–S49 (1999)
23. B.G. Steele, B. Belza, K. Cain, C. Warms, J. Coopersmith, J. Howard, Bodies in motion:
monitoring daily activity and exercise with motion sensors in people with chronic pulmonary
disease. J. Rehabil. Res. Dev. 40(Suppl 2) 45–58 (2003)
24. M.J. Lamonte, B.E. Ainsworth, Quantifying energy expenditure and physical activity in the
context of dose response. Med. Sci. Sports Exerc. 33, S370–S378 (2001)
25. R.K. Dishman, R.A. Washburn, D.A. Schoeller, Measurement of physical activity. Quest. 53,
295–309 (2001)
26. K.R. Westerterp, Physical activity assessment with accelerometers. Int. J. Obes. 23(Suppl 3),
S45–S49 (1999)
27. M.J. Mathie, A.C.F. Coster, N.H. Lovell, B.G. Celler, Accelerometry: providing an integrated,
practical method for long-term, ambulatory monitoring of human movement. Physiol. Meas.
25, R1–R20 (2004)
28. C.V.C. Bouten, K.T.M. Koekkoek, M. Verduin, R. Kodde, J.D. Janssen, A triaxial acceler-
ometer and portable data processing unit for the assessment of daily physical activity. IEEE
Trans. Biomed. Eng. 44(3):136–147 (1997)
29. A.K. Nakahara, E.E. Sabelman, D.L. Jaffe, Development of a second generation wearable
accelerometric motion analysis system. Proceedings of the first joint EMBS/BMES confer-
ence, 1999, p. 630
30. K. Aminian, P. Robert, E.E. Buchser, B. Rutschmann, D. Hayoz, M. Depairon, Physical
activity monitoring based on accelerometry. Med. Biol. Eng. Comput. 37, 304–308 (1999)
31. M.J. Mathie, N.H. Lovell, C.F. Coster, B.G. Celler, Determining activity using a triaxial
accelerometer, in Proceedings of the Second Joint EMBS/BMES Conference, 2002,
pp. 2481–2482
Chapter 45
A Study of the Protein Folding Problem
by a Simulation Model
Omar Gaci
Abstract In this paper, we propose a simulation model to study the protein folding
problem. We describe the main properties of proteins and describe the protein folding
problem according to the existing approaches. Then, we propose to simulate the
folding process when a protein is represented by an amino acid interaction network. This is a graph whose vertices are the protein's amino acids and whose edges are the interactions between them. We propose a genetic algorithm for reconstructing the graph of interactions between secondary structure elements, which describes the structural motifs. The performance of our algorithms is validated experimentally.
1 Introduction
O. Gaci
Le Havre University, 25 rue Phillipe Lebon, 76600 Le Havre, France
e-mail: [email protected]
Several tens of thousands of protein sequences are encoded in the human genome. A protein is comparable to an amino acid chain which folds to adopt its tertiary structure; this 3D structure enables the protein to achieve its biological function. In vivo, each protein must quickly find its native, functional structure among innumerable alternative conformations.
Protein 3D structure prediction is one of the most important problems of bioinformatics and yet remains unsolved in the majority of cases. The problem is summarized by the following question: given a protein defined by its sequence of amino acids, what is its native structure? In other words, we want to determine the structure in which the amino acids are correctly organized in three dimensions so that the protein can correctly achieve its biological function.
Unfortunately, the exact answer is not always attainable, which is why researchers have developed study models to provide a feasible solution for unknown sequences. However, models for folding proteins reduce to NP-hard optimization problems [2]. These kinds of models consider a conformational space in which the modeled protein tries to reach its minimum energy level, which corresponds to its native structure.
Therefore, an exact resolution algorithm seems improbable and ineffective; the fact is that, in absolute terms, no study model is yet able to entirely define the general principles of protein folding.
The first observation of spontaneous and reversible folding in vitro was carried out by Anfinsen [3]. He deduced that the native structure of a protein corresponds to a minimum of its free energy.
2.2 Motivations
The existing models for studying the protein folding problem depend, amongst other things, on the way the native structure is supposed to be reached: either a protein folds by following a preferential folding path [9], or a protein folds by searching for the native state in an energy landscape organized as a funnel.
The first hypothesis implies the existence of preferential paths for the folding process. In the simplest case, the folding mechanism is comparable to a linear reaction; thus, when the steps are specific enough, only a local region of the conformational space is explored. This concept is nowadays obsolete, since we know that parallel folding paths exist.
The second hypothesis defines folding in the following way (Dill, 1997):
a parallel flow process of an ensemble of chain molecules; folding is seen as more like
trickle of water down mountainsides of complex shapes, and less like flow through a single
gallery.
Fig. 1 Evolution in the description of folding paths in an energy landscape. Top, the protein
folding according to the Anfinsen theory: a protein can adopt a large number of conformations.
Bottom-left, a protein folds by following only one specific path in the conformational space.
Bottom-right, from a denatured conformation, a protein searches its native structure whose energy
level is minimum in a minimum time
Many systems, both natural and artificial, can be represented by networks, that is,
by sites or vertices bound by links. The study of these networks is interdisciplinary
because they appear in scientific fields like physics, biology, computer science or
information technology.
These studies are carried out with the aim of explaining how elements interact with each other inside the network and what general laws govern the observed network properties.
From physics and computer science to biology and social sciences, researchers
have found that a broad variety of systems can be represented as networks, and that
there is much to be learned by studying these networks. Indeed, studies of the Web [10], of social networks [11] and of metabolic networks [12] have helped bring to light common non-trivial properties of networks which a priori have nothing in common. The ambition is to understand how large networks are structured, how they evolve, and what phenomena act on their constitution and formation.
In [13], the authors propose to consider a protein as an interaction network whose vertices represent the amino acids and whose edges describe a specific type of interaction (which differs according to the object of study). Thus a protein, a molecule composed of atoms, becomes a set of individuals (the amino acids) and interactions (to be defined according to the study) which evolves in a particular environment (describing the experimental conditions).
The vocabulary changes, but the aim remains the same: we want to better understand the protein folding process by means of modeling. The interaction network of a protein is initially the one built from the primary structure. The goal is to predict the graph of the tertiary structure through a discrete simulation process.
define the contact between two amino acids. Our notion is based on spatial proximity, so that the contact map can take non-covalent interactions into account. We say that two amino acids are in contact if and only if the distance between them is below a given threshold. A commonly used threshold is 7 Å [1], and this is the value we use.
Consider a graph with N vertices (each vertex corresponds to an amino acid) and
the contact map matrix as incidence matrix. It is called contact map graph. The
contact map graph is an abstract description of the protein structure taking into
account only the interactions between the amino acids.
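As an illustration of this construction (not the authors' parser/GraphStream pipeline), a contact-map graph can be built from Cα coordinates with the 7 Å threshold mentioned above. The function below is a sketch; reading the coordinates out of a PDB file is assumed to have been done already.

```python
import itertools
import numpy as np

THRESHOLD = 7.0   # angstroms

def contact_map_graph(ca_coords):
    """ca_coords: (N, 3) array of C-alpha coordinates, one row per amino acid.
    Returns the adjacency matrix of the amino acid interaction network."""
    n = len(ca_coords)
    adj = np.zeros((n, n), dtype=int)
    for i, j in itertools.combinations(range(n), 2):
        if np.linalg.norm(ca_coords[i] - ca_coords[j]) <= THRESHOLD:
            adj[i, j] = adj[j, i] = 1   # residues i and j are in contact
    return adj

# Example with random coordinates standing in for a parsed structure.
print(contact_map_graph(np.random.rand(50, 3) * 30).sum() // 2, "contacts")
```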
First, we consider the graph induced by the entire set of amino acids participat-
ing in folded proteins. We call this graph the three dimensional structure elements
interaction network (3DSE-IN), see Fig. 2.
As well, we consider the subgraph induced by the set of amino acids participat-
ing in SSE. We call this graph SSE interaction network (SSE-IN) (see Fig. 2).
In [15] the authors rely on amino acid interaction networks (more precisely they
use SSE-IN) to study some of their properties, in particular concerning the role
played by certain nodes or comparing the graph to general interaction network models. Thus, thanks to this point of view, the protein folding problem can be tackled by graph theory.
To manipulate a SSE-IN or a 3DSE-IN, we need a PDB file which is transformed
by a parser we have developed. This parser generates a new file which is read by the
GraphStream library [16] to display the SSE-IN in two or three dimensions.
In this section, we treat proteins as amino acid interaction networks (see Fig. 2). We
describe a bio-inspired method we use to fold amino acid interaction networks. In
particular, we want to fold a SSE-IN to predict the motifs which describe the
secondary structure.
The concept of genetic algorithms was proposed by John Holland [17] to describe adaptive systems based on biological processes.
Genetic algorithms are inspired by the concept of natural selection proposed by Charles Darwin. The vocabulary employed here is that of evolutionary theory and genetics: we speak of individuals (potential solutions), populations, genes (the variables), chromosomes, parents, descendants, reproduction, etc.
Fig. 2 Protein 1DTP SSE-IN (top) and the 1DTP 3DSE-IN (bottom). From a pdb file a parser we
have developed produces a new file which corresponds to the SSE-IN graph displayed by the
GraphStream library [16]
The transmission of genetic heritage between generations must help to produce individuals that are better and better adapted, converging toward the optimal solution.
In previous work [18], we studied protein SSE-IN. We notably identified some of their properties, such as the degree distribution and the way in which the amino acids interact. That work allowed us to determine criteria discriminating the different structural families, and we established a parallel between structural families and the topological metrics describing a protein SSE-IN.
Using these results, we proposed a method to deduce the family of an unclassified protein based on the topological properties of its SSE-IN; see [19]. Thus, we consider a protein defined by its sequence, in which the amino acids participating in the secondary structure are known. We then apply a method able to associate a family with the sequence, on which we rely to predict the fold shape of the protein. This work consists in associating the family which is the most compatible with the unknown sequence. The following step is to fold the unknown sequence's SSE-IN relying on the family's topological properties.
To fold a SSE-IN, we rely on the Levinthal hypothesis, also called the kinetic hypothesis: the folding process is oriented, and proteins do not explore their entire conformational space. In this paper we use the same approach: to fold a SSE-IN we limit the topological space by associating a structural family with the sequence [19]. Since the structural motifs which describe a structural family are limited, we propose a genetic algorithm (GA) to enumerate all the possibilities.
In this section, we present a method based on a GA to predict the graph whose
vertices represent the SSE and edges represent spatial interactions between two
amino acids involved in two different SSEs; hereafter this graph is called the Secondary Structure Interaction Network (SS-IN) (see Fig. 3).
Fig. 3 2OUF SS-IN (left) and its associated incidence matrix (right). The vertices represent the different α-helices, and an edge exists when two amino acids interact
4.3 Dataset
Thereafter, we use a dataset composed of proteins which do not have fold families in the SCOP v1.73 classification and to which we have associated a family in [19].
Our GA uses the common genetic operators and also a specific topological operator.
The crossover operator uses two parents to produce two children; it produces two new chromosomes and matrices. After generating two random cut positions (one applied to the chromosomes and another to the matrices), we swap the two chromosome parts and the two matrix parts, respectively. This operator can produce incidence matrices which are not compatible with the structural family; the topological operator solves this problem.
The mutation operator is applied to a small fraction (about 1%) of the generated children. It modifies the chromosome and the associated matrix. For the chromosomes, we define two operators: two-position swapping and one-position mutation. For the associated matrix, we define four operators: row translation, column translation, two-position swapping and one-position mutation.
These common operators may produce matrices which describe SS-IN incoherent with the fold family associated to the sequence. To eliminate these wrong cases we developed a topological operator.
4.6 Algorithm
Starting from an initial population of chromosomes from the associated family, the
population evolves according to the genetic operators. When the global popula-
tion fitness cannot increase between two generations, the process is stopped, see
Algorithm 1.
The genetic process is the following: after the initial population is built, we
extract a fraction of parents according to their fitness and we reproduce them to
produce children. Then, we select the new generation by including the chromo-
somes which are not among the parents plus a fraction of parents plus a fraction of
children. It remains to compute the new generation fitness.
When the algorithm stops, the final population is composed of individuals
close to the target protein in terms of SSE length distribution because of the
choice of our fitness function. As a side effect, their associated matrices are
supposed to be close to the adjacency matrix of the studied protein that we want
to predict.
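The overall loop can be summarized by the schematic sketch below. The operator and fitness arguments are placeholders standing in for the SSE-length-based fitness, the crossover/mutation operators and the topological operator described above; this is our illustration, not the authors' implementation of Algorithm 1.

```python
import random

def genetic_algorithm(initial_population, fitness, crossover, mutate,
                      topological_repair, parent_fraction=0.5, mutation_rate=0.01):
    """Evolve until the global population fitness stops increasing."""
    population = list(initial_population)
    prev_total = float('-inf')
    while True:
        ranked = sorted(population, key=fitness, reverse=True)
        total = sum(fitness(ind) for ind in ranked)
        if total <= prev_total:                      # stopping criterion
            return ranked
        prev_total = total
        parents = ranked[:max(2, int(parent_fraction * len(ranked)))]
        children = []
        for p1, p2 in zip(parents[::2], parents[1::2]):
            for child in crossover(p1, p2):          # crossover yields two children
                if random.random() < mutation_rate:  # mutate a small fraction of children
                    child = mutate(child)
                children.append(topological_repair(child))   # discard/repair incompatible SS-IN
        non_parents = [ind for ind in ranked if ind not in parents]
        # next generation: non-parents + a fraction of parents + a fraction of children
        population = non_parents + parents[:len(parents) // 2] + children[:len(children) // 2]
```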
In order to test the performance of our GA, we randomly pick three chromosomes from the final population and compare their associated matrices with the sequence's SS-IN adjacency matrix. To evaluate the difference between two matrices, we use an error rate defined as the number of wrong elements divided by the size of the matrix. The dataset we use is composed of 698 proteins belonging to the
All alpha class and 413 proteins belonging to the All beta class. A structural family
has been associated to this dataset in [19].
The average error rate for the All alpha class is 16.7% and for the All beta class
it is 14.3%. The maximum error rate is 25%. As shown in Fig. 4, the error rate
strongly depends on the initial population size. Indeed, when the initial population contains a sufficient number of individuals, the genetic diversity ensures better SS-IN prediction. When we have a sufficient number of sample proteins from the associated family, we expect more reliable results.
[Fig. 4 panels: error rate versus initial population size, for the All alpha dataset (top) and the All beta dataset (bottom)]
Fig. 4 Error rate as a function of the initial population size. When the initial population size is
more than 10, the error rate becomes less than 15%
Note, for example, that when the initial population contains at least ten individuals, the error rate is always less than 15%.
5 Conclusions
In this paper, we present a simulation model to study the protein folding problem.
We describe the reasons why this biological process is a problem that has not yet been solved.
We summarize related work on how to fold an amino acid interaction network. We need to limit the topological space so that the folding predictions become more accurate. We propose a genetic algorithm that tries to construct the interaction network of SSEs (SS-IN). The GA starts with a population of real proteins from the predicted family. To complement the standard crossover and mutation operators, we introduce a topological operator which excludes individuals incompatible with the fold family. The GA produces SS-IN with a maximum error rate of about 25% in the general case. The performance depends on the number of available sample proteins from the predicted family: when this number is greater than 10, the error rate is below 15%.
The characterization we propose constitutes a new approach to the protein
folding problem. Here we propose to fold a protein SSE-IN relying on topological
properties. We use these properties to guide a folding simulation in the topological
pathway from unfolded to folded state.
References
11. S. Wasserman, K. Faust, Social network analysis: methods and applications. Structural
Analysis in the Social Sciences, vol. 8 (Cambridge University Press, Cambridge, 1994)
12. H. Jeong, B. Tombor, R. Albert, Z.N. Oltvai, A.-L. Barabàsi, The large-scale organization of
metabolic networks. Nature 407(6804), 651–654 (2000)
13. N.V. Dokholyan, L. Li, F. Ding, E.I. Shakhnovich, Topological determinants of protein
folding. Proc. Natl. Acad. Sci. USA 99(13), 8637–8641 (2002)
14. A. Ghosh, K.V. Brinda, S. Vishveshwara, Dynamics of lysozyme structure network: probing
the process of unfolding. Biophys. J. 92(7), 2523–2535, (2007)
15. U.K. Muppirala, Z. Li, A simple approach for protein structure discrimination based on the
network pattern of conserved hydrophobic residues. Protein Eng. Des. Sel 19(6), 265–275
(2006)
16. A. Dutot, F. Guinand, D. Olivier, Y. Pigné, GraphStream: A Tool for bridging the gap between
Complex Systems and Dynamic Graphs, Proc of EPNACS: Emergent Properties in Natural
and Artificial Complex Systems, Dresden, Germany, 137–143 (2007)
17. J.H. Holland, Adaptation in Natural and Artificial System (MIT Press, Cambridge, MA, 1992)
18. O. Gaci, Building a parallel between structural and topological properties. In Advances in
Computational Biology (Springer, 2010)
19. O. Gaci, Building a topological inference exploiting qualitative criteria. Evol. Bioinformatics
(2010)
Chapter 46
Analysing Multiobjective Fitness Function
with Finite State Automata
Abstract This research analyses and discusses the use of a multiobjective fitness function to evolve finite state automata. Such automata can describe a system's behavior mathematically in an efficient manner; however, the system's behavior depends strongly on its input-output specifications. Genetic programming is used, and the fitness function is built to guide the evolutionary process in two different cases. In the first case, a single-point fitness function is used, where the only focus is the correctness of the evolved automata. In the second case, a multiobjective fitness function is used, since every real-world problem involves the simultaneous optimization of several incommensurable and often competing objectives. Multiobjective optimization is defined here as the problem of finding a finite state automaton which satisfies parsimony, efficiency, and correctness. It is shown that, for large and complex problems, it is necessary to divide them into sub-problem(s) and to simultaneously breed both the sub-program(s) and a calling program.
1 Introduction
Nada M.A. Al Salami (1971) is an Assistant Professor in the Management Information
Systems Department of Al Zaytoonah University, Amman, Jordan. She is interested in the
theory of computer science and evolutionary algorithms.
N.M.A.A. Salami
Department of Management Information Systems, Faculty of Economic and Business, Al
Zaytoonah University of Jordan, 130, Amman, 11733, Jordan
e-mail: [email protected]
between different designs. One could then, for example, choose to create three
different cars according to different marketing needs: a slow, low-cost model which
consumes the least fuel, an intermediate solution, and a luxury sports car where speed is
clearly the prime objective. Evolutionary algorithms are well suited to multiobjective
optimization problems as they are fundamentally based on biological processes which are
inherently multiobjective [1–3]. After the first pioneering work on multiobjective
evolutionary optimization in the 1980s by Schaffer [4, 5], several different algorithms
have been proposed and successfully applied to various problems; for more details see
[6, 7]. In this research, a new multiobjective evolutionary algorithm is analyzed and
discussed; it is developed to deal with complex systems. Since every real-life problem is
dynamic, its behavior is highly complex. Complex systems often include chaotic behavior
[8] (the classic example of chaos theory is "the butterfly effect"), which is to say that
the dynamics of these systems are nonlinear and difficult to predict over time, even
though the systems themselves are deterministic machines following a strict sequence of
cause and effect. Genetic programming [9–11] suffers from many serious problems that
make it very difficult to predict chaotic behavior. Natural chaotic systems may be
difficult to predict, but they still exhibit structure that is different from that of
purely random systems [12]; for more details see [13–15]. Such chaotic behavior of
real-life problems is reduced when the induction process is based on the meaning of the
low-level primitives rather than on their structure.
The meaning of a program P is specified by a set of function transformations from
states to states, as given in [12, 16]; hence P effects a transformation on a state
vector X, which consists of an association of the variables manipulated by the
program and their values. A program P is defined as a 9-tuple, called a Semantic
Finite State Automaton (SFSA): P = (x, X, T, F, Z, I, O, g, Xinitial), where x is the
set of system variables, X is the set of system states, X = {Xinitial, ..., Xfinal},
T is the time scale, T = [0, ∞), F is the set of primitive functions, Z is the state
transition function, Z = {(f, X, t) : (f, X, t) ∈ F × X × T, z(f, X, t) = (X′, t′)},
I is the set of inputs, O is the set of outputs, g is the readout function, and
Xinitial is the initial state of the system, Xinitial ∈ X.
All sets involved in the definition of P are arbitrary, except T and F. The time scale
T must be some subset of the set [0, ∞) of nonnegative integers, while the set of
primitive functions F must be a subset of the set CL(FL) of all computable functions in
the language L and sufficient to generate the remaining functions. Two features
characterize the state transition function of P = (x, X, T, F, Z, I, O, g, Xinitial):
for every x ∈ x with Xinitial ∈ X, there exist f ∈ F and z ∈ Z, and a function h
defined over Z with values in X, defined as follows:
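As a reading aid only, the 9-tuple can be mirrored in a small data structure. The Python
representation below (the field names, a `range` for the time scale, callables for Z and
g) is an assumption made for illustration and is not part of the original formulation.

```python
from dataclasses import dataclass
from typing import Callable, Dict, FrozenSet, Hashable, Tuple

State = Hashable
Time = int  # the time scale T is a subset of the nonnegative integers

@dataclass
class SFSA:
    """Illustrative container mirroring the 9-tuple
    P = (x, X, T, F, Z, I, O, g, Xinitial) defined above."""
    variables: FrozenSet[str]                                     # x
    states: FrozenSet[State]                                      # X
    time_scale: range                                             # T
    functions: Dict[str, Callable]                                # F
    transition: Callable[[str, State, Time], Tuple[State, Time]]  # Z
    inputs: FrozenSet[Hashable]                                   # I
    outputs: FrozenSet[Hashable]                                  # O
    readout: Callable[[State], Hashable]                          # g
    initial_state: State                                          # Xinitial
```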
2 Evolutionary Algorithm
On the basis of the theoretical approach sketched in the section above, the evolutionary
algorithm is defined as a 7-tuple (IOS, S, F, a1, Tmax, b, u), as given in [12, 16].
The IOS establishes the input-output boundaries of the system: it describes the inputs
that the system is designed to handle and the outputs that the system is designed to
produce. An IOS is a 6-tuple, IOS = (T, I, O, Ti, TO, Z), where T is the time scale
of the IOS, I is the set of inputs, O is the set of outputs, Ti is a set of input
trajectories defined over T with values in I, TO is a set of output trajectories defined
over T with values in O, and Z is a function defined over Ti whose values are subsets of
TO; that is, Z matches each given input trajectory with the set of all output
trajectories that might, or could, be produced by some system as output when
experiencing the given input trajectory. A system P satisfies the IOS if there is a
state X of P and some non-empty subset U of the time scale T of P such that, for every
input trajectory g in Ti, there is an output trajectory h in TO matched with g by Z such
that the output trajectory generated by P, started in the state X, is:
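This condition can be paraphrased in code. The sketch below is a simplification under
stated assumptions (trajectories comparable by equality, the time-scale subset U
ignored); `run_system` and the callable `Z` are hypothetical helpers, not part of the
chapter's formalism.

```python
def satisfies_ios(run_system, states, input_trajectories, Z):
    """Sketch of the satisfaction test described above (simplified: the
    restriction to a subset U of the time scale is ignored).  run_system(X, g)
    is an assumed helper returning the output trajectory produced by P for
    input trajectory g when started in state X; Z(g) returns the set of
    admissible output trajectories for g."""
    for X in states:
        if all(run_system(X, g) in Z(g) for g in input_trajectories):
            return True   # found a state from which P reproduces the IOS
    return False
```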
S refers to the written form of a program, as far as possible independent of its
meaning. In particular, it concerns the legality or well-formedness of a program
relative to a set of grammatical rules, and parsing algorithms for discovering
the grammatical structure of such well-formed programs. S is the set of rules
governing the construction of allowed or legal system forms.
Each primitive function in F must be coupled with its effect on both the state vector X
and the time scale T of the system. Some primitive functions may serve as building
blocks for more complex functions or even sub-systems.
The parameters Tmax and b are measures of system complexity: size and time,
respectively. It is important to note that there is a fundamental difference between the
time scale T of a system and its execution time. T represents system size, in that it
defines points within the overall system, whereas b is the time required by the machine
to complete system execution and is therefore highly sensitive to the machine type.
developing methods for making programs more reliable: systematized testing and
mathematical proof. Our work uses the systematized-testing approach as a proof plan.
The usual method for verifying by testing that a program is correct is to choose a
finite sample of states X1, X2, ..., Xn and run P on each of them to verify that
P(X1) = f(X1), P(X2) = f(X2), ..., P(Xn) = f(Xn). Formally, if the testing approach is
used for system verification, a system proof is denoted u = (a2, d), where a2 is a
positive real parameter defining the maximum accepted error of the testing process.
a2 focuses on the degree of generality, so the parameters a1 and a2 suggest a
fundamental tradeoff between training and generality. On the other hand, d represents a
set of test-case pairs (Oi, Ki), where Ki is a sequence consisting of an initial state
Xinitial and an input Ii.
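A minimal sketch of this verification step follows; interpreting a2 as a maximum
accepted fraction of failing test cases is an assumption, as are the helper names.

```python
def proof_plan_check(run_program, d, a2):
    """Testing-based proof plan u = (a2, d), sketched under the assumption
    that a2 is the maximum accepted fraction of failing test cases.
    d is a collection of pairs (Oi, Ki) with Ki = (Xinitial, Ii), and
    run_program(Xinitial, Ii) is an assumed helper executing the SFSA."""
    failures = sum(1 for expected, (x_initial, inputs) in d
                   if run_program(x_initial, inputs) != expected)
    return failures / len(d) <= a2
```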
In addition to using the idea of sub-system functions, i.e., sub-FSAs, for complex
software it is better to divide the process of system induction into sub-system and
main-system inductions. Sub-system induction must be accomplished with several
objectives. The first is that a suitable solution to a sub-problem must determine a
solution to the next-higher-level problem. The second is to ensure that the figure of
merit of the sub-system is related to the figure of merit of the top-level problem. The
third is to ensure specific functional relationships between the sub-system proof plans
and the system proof plan of the top-level problem.
3 Evolutionary Process
The basic idea of single-objective EAs is to imitate the natural process of biological
evolution. The problem to be solved is therefore described using an SFSA with a
certain number of parameters (design variables). One then creates a group of n (n > 0)
different parameter vectors and considers it as a population of individuals; the
quantity n is called the population size. The quality of a certain SFSA (i.e., an
individual in the population) is expressed in terms of a scalar-valued fitness function
(objective function). Depending on whether one wants to minimize or maximize the
objective function, individuals (i.e., parameter vectors) with lower or greater fitness
are considered better, respectively. The algorithm then proceeds to choose the best
individuals out of the population to become the parents of the next generation
(natural selection, survival of the fittest). In this work we always minimize the
objective function, i.e., we search for the individual with the lowest fitness value.
In this case, the learning parameter a1 must be set to the minimum accepted degree
of matching between an IOS and the real observed behavior of the SFSA over the
time scale Tx. The learning parameter a1 is ultimately used to guide the search
process.
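The loop just described can be summarized in a short sketch; truncation selection and a
single `vary` operator (standing in for crossover plus mutation) are simplifying
assumptions, not the chapter's exact procedure.

```python
import random

def evolve(initial_population, fitness, vary, a1, max_generations=200):
    """Sketch of the single-objective process described above: lower fitness
    is better, and the run stops once the best SFSA matches the IOS to within
    the learning parameter a1."""
    population = list(initial_population)
    for _ in range(max_generations):
        population.sort(key=fitness)              # best (lowest) fitness first
        if fitness(population[0]) <= a1:          # a1 guides the search
            break
        parents = population[:max(2, len(population) // 2)]
        children = [vary(random.choice(parents), random.choice(parents))
                    for _ in range(len(population) - len(parents))]
        population = parents + children
    return min(population, key=fitness)
```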
Within the context of the suggested mathematical approach, automatically generating a
system means searching for an appropriate SFSA that satisfies the IOS efficiently, with
regard to the learning and complexity parameters. Then, the proof plan u must be applied
to that SFSA to further assess its performance and correctness. If the SFSA behaves well
with respect to u, it may be considered a solution or an approximate solution to the
problem; otherwise, some or all terms in the statement of the system-induction problem
must be modified, and the process is repeated until a good SFSA is found or no further
revisions can be made. The search space of the evolutionary algorithm is the space of
all possible computer programs described as 9-tuple SFSAs. The following equation is
used to express the quality of an SFSA i, where 0 < i ≤ n:
fitness(i) = d1 · Σ_{j=0}^{Tx} |T(j) − R(j)|                    (6)
             + d2 · Ti + d3 · (b − bi)                          (7)
where d1, d2, and d3 are weight parameters with d2, d3 < d1 and d2, d3 ≥ 0, bi is the
run time of individual i, Ti is the time scale of individual i, and R(j) is the actual
calculated input trajectory of individual i. Since the main objective is to satisfy the
IOS, the first weight parameter in Eqs. (6)–(7), i.e. d1, is always greater than d2 and
d3, and greater than or equal to 1. These values are selected by an expert after the
algorithm has been run many times. In addition, the values of d2 and d3 are selected
with respect to different factors concerning the hardware type, the required complexity,
the problem nature, and others. In all cases they must be small real numbers greater
than or equal to zero. Table 1 gives different settings for these weight parameters;
when a value is equal to 0, the corresponding objective is not considered in the
evolutionary process, and when it is equal to 1, the effect of that objective is neither
increased nor decreased. It is clear that the first setting of the parameters represents
a single-point fitness function, as
where M is the population size. Eq. (8) is also adjusted so that the adjusted fitness
lies between 0 and 1:

fitness″(i) = 1 / fitness(i)                                    (9)
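The fitness computation of Eqs. (6)-(9) can be sketched as follows. The alignment of
the target and actual trajectories as equal-length numeric sequences, and the reading of
the reconstructed equations above, are assumptions made for illustration.

```python
def raw_fitness(target, actual, Ti, bi, b, d1, d2, d3):
    """Weighted fitness of one SFSA, following Eqs. (6)-(7) as reconstructed
    above: trajectory mismatch (correctness) weighted by d1, time scale Ti
    (parsimony) by d2, and the run-time term (b - bi) (efficiency) by d3."""
    mismatch = sum(abs(t - r) for t, r in zip(target, actual))   # Eq. (6)
    return d1 * mismatch + d2 * Ti + d3 * (b - bi)               # Eq. (7)

def adjusted_fitness(raw):
    """Eq. (9) as reconstructed: reciprocal of the raw value.  The missing
    Eq. (8) is assumed to guarantee raw >= 1 so the result lies in (0, 1]."""
    return 1.0 / raw
```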
Three types of points are defined in each individual: transition, function, and
function argument. When structure-preserving crossover is performed, a point of any type
anywhere in the first selected individual may be chosen as the crossover point of the
first parent. The crossover point of the second parent must then be chosen only from
among points of the same type. This restriction on the choice of the second crossover
point ensures the syntactic validity of the offspring.
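A minimal sketch of this rule, assuming each individual is flattened into a list of
typed points (the representation and names are hypothetical):

```python
import random

def structure_preserving_crossover(parent1, parent2):
    """Sketch of the typed crossover described above.  Each parent is assumed
    to be a flat list of (point_type, payload) pairs, where point_type is one
    of "transition", "function", or "argument"; the second crossover point
    must have the same type as the first, which keeps the offspring
    syntactically valid."""
    i = random.randrange(len(parent1))
    point_type = parent1[i][0]
    candidates = [j for j, (t, _) in enumerate(parent2) if t == point_type]
    if not candidates:                      # no compatible point: copy parents
        return parent1[:], parent2[:]
    j = random.choice(candidates)
    return parent1[:i] + parent2[j:], parent2[:j] + parent1[i:]
```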
Unfortunately, when we deal with complex systems and real-life problems, strong
feedback (positive as well as negative) and many interactions exist, i.e., chaotic
behavior. Thus, we need to find a way to control chaos, and to understand and predict
what may happen in the long term. In these cases the input and output specifications are
self-organized, which means that trajectory data are collected and enhanced over time as
the genetic generation process runs again and again. Populations with rich trajectory
information converge to the solution better than populations with little trajectory
information. Although the trajectory data change over time, experiments show that the
process is still sensitive to the initial configuration (sensitivity to the initial
conditions). There is a fundamental difference between a crossover occurring in a
sub-SFSA and one occurring in the main-SFSA. Since the latter usually contains multiple
references to the sub-SFSA(s), a crossover occurring in a sub-SFSA is usually leveraged
in the sense that it simultaneously affects the main-SFSA in several places. In
contrast, a crossover occurring in the main-SFSA provides no such leverage. In addition,
because the population is architecturally diverse, parents selected to participate in
the crossover operation will usually possess different numbers of sub-SFSA(s). The
proposed architecture-altering operations are: creating a sub-SFSA, deleting a sub-SFSA,
adding variables, and deleting variables.
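The four operations can be sketched as follows, under the assumption that an individual
is stored as a dictionary holding a main-SFSA, a dictionary of sub-SFSAs and a variable
table; all names are hypothetical.

```python
import random

def create_sub_sfsa(individual, new_sub):
    """Creating a sub-SFSA: add a new callee the main-SFSA may reference."""
    individual["subs"]["sub%d" % len(individual["subs"])] = new_sub

def delete_sub_sfsa(individual):
    """Deleting a sub-SFSA: remove a random callee (dangling references in
    the main-SFSA would have to be repaired by the caller)."""
    if individual["subs"]:
        del individual["subs"][random.choice(list(individual["subs"]))]

def add_variable(individual, name, value=0):
    """Adding a variable to the state vector."""
    individual["variables"][name] = value

def delete_variable(individual, name):
    """Deleting a variable from the state vector, if present."""
    individual["variables"].pop(name, None)
```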
4.2 Performance
If every run succeeds in obtaining a solution, the computational effort required to
get the solution depends primarily on four factors: the population size M, the number of
generations g (g must be less than or equal to the maximum number of generations G),
the amount of processing required for the fitness measure over all fitness cases (we
assume that the processing time needed to measure the fitness of an individual is its
run time), and the amount of processing e required for the test phase. If success occurs
on the same generation in every run, the computational effort E is computed as follows:
E = M · g · e                                                   (10)
Since the value of e is very small with respect to the other factors, we shall not
consider it. However, in most cases success occurs on different generations in
different runs, and the computational effort E is then computed as follows:
E = M · gavr · b                                                (11)
where gavr is the average number of executed generations. Since we use a probabilistic
algorithm, the computational effort is computed in this way: first, determine the number
of independent runs R needed to yield a success with a certain probability; second,
multiply R by the amount of processing required for each run. The number of independent
runs R required to satisfy the success predicate by generation i with probability z
depends on both z and P(M, i), where z is the probability of satisfying the success
predicate by generation i at least once in R runs, defined by:

z = 1 − [1 − P(M, i)]^R                                         (12)

P(M, i) is the cumulative probability of success for all the generations between
generation 0 and generation i. P(M, i) is computed after experimentally obtaining an
estimate of the instantaneous probability Y(M, i) that a particular run with a
population size M yields, for the first time, on a specified generation i, an individual
satisfying the success predicate for the problem. This experimental measurement of
Y(M, i) usually requires a substantial number of runs. After taking logarithms, we find:
R = log(1 − z) / log(1 − P(M, i))                               (13)
The computational effort E is the minimal value of the total number of individuals
that must be processed to yield a solution to the problem with probability z
(e.g., z = 99%):

E = M · (g + 1) · b · R                                         (14)
where g is the first generation at which the success predicate is satisfied. The
computational effort ratio, RE, is the ratio of the computational effort without
sub-SFSA(s) to the computational effort with sub-SFSA(s).
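A short sketch of Eqs. (13)-(14) follows, assuming P(M, i) has already been estimated
experimentally; rounding R up to an integer and the example numbers are illustrative
additions, not taken from the chapter.

```python
import math

def independent_runs(z, P_Mi):
    """Eq. (13): number of independent runs R needed to satisfy the success
    predicate with probability z, given the cumulative success probability
    P(M, i).  Rounding up to an integer is an added assumption."""
    return math.ceil(math.log(1 - z) / math.log(1 - P_Mi))

def computational_effort(M, g, b, z, P_Mi):
    """Eq. (14): total (run-time-weighted) number of individuals processed to
    yield a solution with probability z by generation g."""
    return M * (g + 1) * b * independent_runs(z, P_Mi)

# Illustrative call (all numbers invented): a population of 500 with
# P(M, 20) = 0.3 needs R = 13 runs for z = 99%.
# effort = computational_effort(M=500, g=20, b=1.0, z=0.99, P_Mi=0.3)
```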
5 Conclusion
References
1. K.C. Tan, E.F. Khor, T.H. Lee, Multiobjective Evolutionary Algorithms and Applications
(Springer-Verlag, London, 2005)
2. E. Zitzler, K. Deb, L. Thiele, Comparison of multiobjective evolutionary algorithms:
empirical results. Evol. Comput. 8(2), 173–195 (2000) (Massachusetts Institute of Technology)