Lecture Notes GIS
Lecture Notes GIS
Lecture 1
A gentle introduction to GIS
1
1/30/2018
DATA
Definitions
“…a system of hardware, software, and procedures designed to support the capture,
management, manipulation, analysis, modeling, and display of spatially referenced data
for solving complex planning & management problems” (Rhind, 1989).
“…a computer system capable of assembling, storing, manipulating, and displaying
geographically referenced information…” (USGS, 1997)
“…a set of computer-based systems for managing geographic data and using those
data to solve spatial problems” (Lo & Yeung, 2002).
“….a computer system that allows the analysis and display of data with a spatial
component (Phillips, 2002).
4
2
1/30/2018
3
1/30/2018
Lecture 2
Geographic information and spatial data types
Objectives
Distinguish between different types of geographic phenomena like fields and
objects and between discrete and continuous phenomena.
Understand the principle of “spatial autocorrelation”
Understand the concept of the following computer representation:
• Regular tessellation
• Irregular tessellation
• Point, Line and Polygon
4
1/30/2018
Notions to be grasped
❖ Select a suitable computer representation, based on the different types of
geographic phenomena.
❖ Difference between the real world and computer representations.
❖ Different types of geographic phenomena.
❖ Boundaries (crisp – fuzzy).
❖ Regular versus irregular tessellations (quadtrees).
❖ Vector representations (TIN, point, line, area).
❖ Properties of geometric space used for spatial data (Euclidean space, metric space,
topological space, interior and boundary).
❖ Two dimensional topology.
❖ Scale and Resolution. 9
10
5
1/30/2018
11
Geographic Fields
Fields can be discrete or continuous, and if they are continuous, they can even be
differentiable.
In a continuous field, the underlying function is assumed to be continuous
(temperature). Continuity means that all changes in field values are gradual.
In a differentiable field, we can determine a measure of change per unit of distance
anywhere. (for elevation, this measure would be slope).
12
6
1/30/2018
Geographic Fields
In a discrete field,
Discrete fields, cut up the study space in mutually exclusive, bounded parts,
with all locations in one part having the same field value. (land classification,
geological units)
One may note that discrete fields are a step from continuous fields towards
geographic objects:
• Discrete fields as well as objects make use of ‘bounded’ features.
• A discrete field still assigns a value to every location.
13
14
7
1/30/2018
Geological units
Elevation 15
Spatial Autocorrelation
• locations that are close are more likely to have similar values than
locations that are far apart.
• Principle is used when an interpolation is performed.
• Continuous field phenomena are based on the spatial autocorrelation
concepts.
• These fields although characterized by a continuous function must be
finitely represented.
• This also applies for lines.
16
8
1/30/2018
Boundaries
Which phenomena have boundaries:
-Discrete fields
-Objects
Two different types of boundaries:
• Crisp boundaries
• Fuzzy boundaries
17
18
9
1/30/2018
20
10
1/30/2018
Advantages:
• We know how they partition space. This leads to fast algorithms.
Disadvantages:
• They do not adapt to the spatial phenomenon we want to represent.
• Cell boundaries are both artificial and fixed: they may or may not
coincide with the boundaries of the phenomenon of interest.
21
Irregular Tessellations
Again, these are partitions of space into mutually exclusive cells, but now,
the cells vary in size and shape, allowing them to adapt to the spatial
phenomena they represent.
One example: region quadtree.
It splits up the area into four quadrants.
22
11
1/30/2018
Region Quadtree
This procedure stops when all cells in a quadrant have the same field value.
23
Vector representations
In vector representations, an attempt is made to associate
georeferences with the geographic phenomena explicitly.
12
1/30/2018
Georeferences
Instead, they provide a georeference of the lower left corner of the raster, for
instance, together with an indicator of the raster’s resolution, thereby
implicitly providing georeferences for all cells in the raster.
25
13
1/30/2018
TIN
A plane fitted through the anchor points has a fixed aspect and gradient and
can be used to compute an approximation of e.g. elevation of other
locations.
27
28
14
1/30/2018
Point representations
Points are defined as single coordinate pairs (x,y), when we work in 2D or
coordinate triplets (x,y,z) when we work in 3D
Line representations
• Used to represent one-dimensional objects
(roads, railroads, canals, rivers…).
Line representation
A network is a collection of connected lines.
Area representations
When area objects are stored using a vector approach, the
usual technique is to apply a boundary model. This means
that each area feature is represented by some arc/node
structure that determines a polygon as the area’s
boundary.
30
15
1/30/2018
Area representation.
A simple but naive representation of area features would be to list for each
polygon simply the list of lines that describe its boundary.
Why is this not working?
-Redundancy
-If we want to find neighboring polygons, we have to do time-consuming analysis
(topology)
31
Area representation
32
16
1/30/2018
34
17
1/30/2018
We can define within the topological space, features that are easy to handle
and that can be used as representations of geographic objects.
These features are called simplices as they are the simplest geometric
shapes of some dimension:
▪ point (0-simplex),
▪ line segment (1-simplex),
▪ triangle (2-simplex),
▪ and tetrahedron (3-simplex).
35
Topology of 2D
We use the topological properties of interior and boundary to define
relationships between spatial features.
Region A →Boundary A and Interior A
Region B →Boundary B and Interior B
When the two regions meet this is defined as the Boundary of A intersects
the boundary of B.
The interiors do not intersect and the boundary of one does not intersect the
interior of the other.
36
18
1/30/2018
Topology of 2D
The mathematical definition of meets.
A meets B = interior (A) ∩interior (B) = Ø∧
interior (A)∩boundary (B)= Ø∧
boundary (A)∩interior (B)= Ø∧
boundary (A)∩boundary(B)≠Ø
Spatial Relationships
38
19
1/30/2018
39
2½-D
• Rules of 2D topology apply.
• Nodes have a z-value.
• A node with an x-and y-coordinate can have only one z-value.
• Two different nodes, with identical x, y values, cannot have different z-
values.
• Consequently, true solids cannot be represented in a 2½D GIS.
40
20
1/30/2018
2½-D
41
21
1/30/2018
43
44
22
1/30/2018
45
46
23
1/30/2018
47
Lecture 3
Data Processing Systems
48
24
1/30/2018
Contents
• What are data processing systems?
• Functional GIS components
– input
– output
– data storage
– spatial queries & analysis
• GIS & databases
-databases
• DBMS
• Example
- using GIS & DBMS together 49
50
25
1/30/2018
51
26
1/30/2018
53
Disciplines in GIS
Geography
Remote Sensing
Surveying
Cartography
Photogrammetry
Computer Science Geo-informatics
Mathematics
Archeology
Forestry
Earth Sciences
Urban Planning
Legal Sciences
54
Economy
27
1/30/2018
GIS Components
A GIS is a computer-based system that provides the following four sets of
capabilities to handle geo-referenced data :
1. data capture and preparation
2. data management
3. data manipulation and analysis
4. data presentation
55
Data Capture
And Preparation Storage and Maintenance
Manipulation and
Data Presentation
Analysis
56
28
1/30/2018
Data Input
• Available data
– Tape, CD, Net
• Manual input
– Keyboard, digitizer
– On-screen, photogrammetry
• Semi-automatic digitizing
– Line-following
• Automatic digitizing
– Scanner
57
58
29
1/30/2018
Spatial Proximity
Objects that are near in geographic space should be near in
storage space as well.
59
30
1/30/2018
Spatial analysis
GIS uses spatial analysis functions:
Operators that use spatial data to derive new geo-information.
61
31
1/30/2018
Data Output
• Hardcopy
– printer
– plotter
• Softcopy
– on-screen
• Data sets
– CD, DVD, Internet
63
64
32
1/30/2018
66
33
1/30/2018
67
68
34
1/30/2018
69
70
35
1/30/2018
Other DBMS’s
Relational databases are well suited for:
• Large amount data
• Simple data types
Not setup well to deal with spatial data
• Object-oriented / object-relational models
• GIS applications will rely on DBMS add-on application packages
71
72
36
1/30/2018
Vector
73
Topology
Topological data structure provides:
• Automated way to handle digitizing errors
• Reduced storage space (Poly boundaries stored only once)
• No overlapping polygons
• Advanced spatial analysis (adjacency, networks)
74
37
1/30/2018
75
38
1/30/2018
Lecture 4
• Data quality
• Spatial referencing
77
Objectives
Data quality
• Explain the relationship between accuracy and precision.
• List main components of spatial data quality.
• Understand and apply the RMSE for measuring location accuracy (section 4.3.1).
Spatial Referencing
• Understand basic concepts of spatial referencing pertinent to published maps and spatial data.
• Know the meaning of items in a map legend (vertical and horizontal datum, ellipsoid, coordinate
types, etc).
• Distinguish between local and global reference systems.
• Understand the concepts of height measurements (in relation to the geoid or an ellipsoid).
• Comprehend datum transformations.
• Know the classification of map projections.
• Know the distortion properties of map projections.
• Describe the process of selecting a suitable map projection.
• Grasp the logic of map projection equations and the principles of transforming
78
maps from one
projection system to another.
39
1/30/2018
Inaccurate data
80
40
1/30/2018
41
1/30/2018
Incomplete data
83
Lineage
42
1/30/2018
Lineage
Provide information about;
• transformations,
• interpolations,
• cartographic displacements,
• exaggerations etc.
Any data set created from distinct sources should be provided
with lineage information to identify the quality of the data set
in terms of ‘fitness for use’. 85
43
1/30/2018
87
Example
1.0 mm in source document
0.5 mm in map registration for digitizing
0.2 mm in digitizing
If sources are combined independently, an estimate for the overall accuracy, will be:
44
1/30/2018
89
Computation of RMSE
90
45
1/30/2018
91
46
1/30/2018
Spatial Referencing
Contents
➢ Spatial reference surfaces and datums
• The Geoid – vertical datum
• The Ellipsoid – horizontal (geodetic) datum
➢ Local and global datums
➢ From Earth to map
➢ Coordinate transformations
➢ Map projections
• Classification of map projections
• Universal Transverse Mercator (UTM)
• Map projection selection 93
94
47
1/30/2018
Vertical datums
Starting from the mean sea level points, the heights of points on the
Earth can be measured using a technique known as geodetic
leveling.
48
1/30/2018
97
49
1/30/2018
99
50
1/30/2018
101
102
51
1/30/2018
Mapping organizations do not only coach the user community about the
implications of the geocentric datum. They also develop tools to enable
users to transform coordinates of spatial objects from the new datum to the
old one. This process is known as datum transformation.
The tools are called datum transformation parameters.
104
52
1/30/2018
Datum Transformations
53
1/30/2018
107
Overview
108
54
1/30/2018
109
Coordinate Transformations
110
55
1/30/2018
111
112
56
1/30/2018
113
Datum Transformations
114
57
1/30/2018
115
116
58
1/30/2018
117
Map Projections
118
59
1/30/2018
119
Forward equation
Inverse equation
120
60
1/30/2018
Forward equations
Inverse equations
121
122
61
1/30/2018
124
62
1/30/2018
125
126
* also called Plate Carrée projection
63
1/30/2018
127
128
64
1/30/2018
129
130
65
1/30/2018
UTM-projection
131
UTM-Zones
132
66
1/30/2018
133
134
67
1/30/2018
135
68
1/30/2018
137
69
1/30/2018
139
70
1/30/2018
CHAPTER 5
Data entry and Data preparation
141
71
1/30/2018
143
144
72
1/30/2018
Digitizing Process
145
146
73
1/30/2018
On-screen Digitizing
147
74
1/30/2018
Map Registration
149
Map Registration
150
75
1/30/2018
What to Digitize ?
151
Digitizing Errors
152
76
1/30/2018
153
154
77
1/30/2018
Working Principle
A digital scanner illuminates the document and measures
with a sensor the intensity of the reflected or transmitted light.
155
156
78
1/30/2018
Scanning Resolution
Scanning Resolution can be expressed in:
• in millimeters
• in microns
• in dots per inch
Example:
pixel size = 0.05 mm = 50 μm = 508 dpi
157
79
1/30/2018
Scanning Modes
Line Art, 1 bit
159
160
80
1/30/2018
161
162
81
1/30/2018
Scanner Output
The scanner output is only a digital copy of the source
document in raster cell values.
163
164
82
1/30/2018
165
166
83
1/30/2018
167
168
84
1/30/2018
169
Vector Editing
170
85
1/30/2018
171
Feature Forming
• Splitting lines to form line segments and nodes.
• Joining line segments to form polygons and features.
• Feature coding
172
86
1/30/2018
Vectorized Lines
(after vector editing and data structuring)
173
Vector / Raster
174
87
1/30/2018
175
176
88
1/30/2018
177
Means:
Data in different formats and standards, and with different
quality !
178
89
1/30/2018
Data Preparation
179
180
90
1/30/2018
181
182
91
1/30/2018
183
Data Quality
• Geometric
Are the coordinates correct?
• Topological
Are the relationships correct?
• Semantic ≈ "meaning"
Does the data make sense?
184
92
1/30/2018
185
186
93
1/30/2018
187
Associating Attributes
188
94
1/30/2018
189
Topology Generation
• Topology deals with spatial relationships between features.
• Topological relations are used in a GIS to perform spatial
operations like:
• overlaying
• buffering
• network tracing
• shortest-path routing
190
95
1/30/2018
Topological Relationships
“Green area is … covered by … blue area” (by 2D geographic extent)
191
96
1/30/2018
Differences in Accuracy
Combining two data sets derived from different maps may
lead to sliver polygons.
193
Differences in Representation
194
97
1/30/2018
195
196
98
1/30/2018
197
Interpolation
Very often, we only have a limited number of
observation points, but we're interested in every
location in the area.
How do we find the 'missing' points?
Answer: using interpolation techniques.
198
99
1/30/2018
Discrete Field
• Qualitative
• What is it?
Continuous Field
• Quantitative
• How much is it?
199
100
1/30/2018
Qualitative Quantitative
P = Forest P = 112 meter
Q = Grassland Q = 168 meter
201
Thiessen polygons
202
101
1/30/2018
Interpolation techniques:
• Trend surface fitting
• Moving window averaging
• Triangulation
203
204
102
1/30/2018
Example, 1-D
205
206
103
1/30/2018
207
104
1/30/2018
209
210
105
1/30/2018
Squared
Computed field value = ?
211
10.66
Squared
Computed field value = 10.66
212
106
1/30/2018
Triangulation
Delaunay
213
TIN construction
214
107
1/30/2018
215
216
108
1/30/2018
217
Spatial Filtering
Linear Filtering or Convolution
The difference with moving window averaging is that the moving window in
filtering is itself a little raster, which contains cell values that are used in the
computation for the output cell value. This little raster is known as the filter
218
109
1/30/2018
gradient
219
220
110
1/30/2018
Spatial Filtering
221
222
111
1/30/2018
223
112
1/30/2018
225
226
113
1/30/2018
227
228
114
1/30/2018
229
Summary
From Points to Fields
1. Discrete Fields (Vector!)
2. Continuous Fields (Raster!)
Interpolation
1. Thiessen Polygons
2. Surface fitting, MWA, IDW, Delaunay
230
115
1/30/2018
CHAPTER 6
Spatial data analysis
231
116
1/30/2018
Introduction
233
Introduction
117
1/30/2018
Introduction
235
118
1/30/2018
Measurements
Measurements -Vector
238
119
1/30/2018
Measurements -Vector
239
Measurements -vector
240
120
1/30/2018
Measurement -Vector
241
Measurement -Vector
242
121
1/30/2018
Measurement -Vector
243
Measurement -Raster
244
122
1/30/2018
Measurement -Raster
245
Measurement -Raster
246
123
1/30/2018
Measurement -Summary
Vector:
• Vector measurements include: location (coordinates and
centroids), length, distance and area size.
• Another geometric measurement is the minimal bounding box
of lines and polygons.
• Pythagorean distance function is used.
Raster:
• Raster measurements include: location, distance and area
size.
• Standard Pythagorean distance function applied to the
locations of their mid-points.
247
Interactive
Spatial Selection by Attribute conditions
• Relational operators
• Logical operators
• Combining attribute conditions
124
1/30/2018
249
250
125
1/30/2018
251
252
126
1/30/2018
253
254
127
1/30/2018
255
256
128
1/30/2018
257
258
129
1/30/2018
259
260
130
1/30/2018
(Re) Classification
262
131
1/30/2018
263
264
132
1/30/2018
(Re) Classification
265
In user-controlled classification we
indicate the classification attribute and
the classification method.
This is normally done via a
classification table.
266
133
1/30/2018
Automatic classification
Automatic classification
268
134
1/30/2018
Automatic classification
Automatic classification
The two techniques are illustrated here
again. On the horizontal axis, the original
values.
The classes are the blue vertical lines. In the
equal interval example, the blue lines are
equally spaced. On the vertical axis, the
frequency.
In the equal frequency technique, when the
frequency goes up, the classes become
smaller.
270
135
1/30/2018
Overlay
Standard overlay operators take two input
data layers and assume they are geo-
referenced in the same system and
overlap in study area. If either condition is
not met, the use of an overlay operator is
senseless.
The principle of spatial overlay is to
compare the characteristics of the same
location in both data layers, and to
produce a new output value for each
location.
272
136
1/30/2018
Overlay
274
137
1/30/2018
276
138
1/30/2018
Vector Overlay
277
139
1/30/2018
Output_raster_name : = Raster_calculus_expression
279
Overview:
• Arithmetic overlay operators
• Comparison and logical operators
• Conditional expressions
• Decision table
280
140
1/30/2018
Arithmetic operators
• +, -, *, /
• MOD (modulo division)
• DIV (integer division)
• Goniometric operaters: sin, cos, tan, asin, acos, atan.
• For example:
Raster2 : = Raster1 * 5
281
Logical operators
• AND,OR,NOT, XOR (exclusive)
a XOR b is true if either a or b is true, but not
both.
282
141
1/30/2018
283
142
1/30/2018
286
143
1/30/2018
Overlay - Summary
288
144
1/30/2018
Neighborhood functions
289
Neighborhood functions
145
1/30/2018
Neighborhood functions
Neighborhood functions
292
146
1/30/2018
147
1/30/2018
Raster layers:
• Need target cell(s).
• The distance function applies the Pythagorean
distance between the cell centers.
• Using cell resolution as the unit.
• The distance from a non-target cell to the
target is the minimal distance one can find
between that non-target cell and any target
cell.
295
Thiessen polygons
296
148
1/30/2018
Thiessen polygons
297
Spread Computation
149
1/30/2018
Spread Computation
299
Spread Computation
300
150
1/30/2018
Spread Computation
Spread Computation
302
151
1/30/2018
Spread Computation
Lowest Value
303
Spread Computation
304
152
1/30/2018
Seek Computation
305
Seek Computation
306
153
1/30/2018
Seek Computation
Seek Computation
308
154
1/30/2018
Seek Computation
310
155
1/30/2018
Network analysis
Types of Networks
156
1/30/2018
Types of Networks
Types of Networks
314
157
1/30/2018
Types of Networks
315
Analysis on networks
Network partitioning
158
1/30/2018
Cost factors
• The cost can be defined on both lines
and nodes.
• For lines, the cost can be the same or
different along and against the line
direction.
• The cost on nodes is used to define the
turns.
318
159
1/30/2018
319
320
160
1/30/2018
321
Network partitioning
161
1/30/2018
Network Allocation
Network allocation
162
1/30/2018
Trace Analysis
326
163
1/30/2018
Error propagation
327
Error propagation
328
164
1/30/2018
Error propagation
329
CHAPTER 7
Data Visualization
330
165
1/30/2018
Contents
GIS and maps
The visualization process
Visualization and strategies
The cartographic ‘toolbox’
• data characteristics,
• representation of these characteristics
Examples, how to map:
•qualitative data,
•quantitative data, (terrain or statistical) elevation, time
332
166
1/30/2018
Map Characteristics
Map Characteristics
• simplified
map. • classified
• symbolized
334
167
1/30/2018
Scale
Scale
Scale indications
• verbal e.g. one-inch-to-the mile
• representative fraction e.g. 1 : 100 000
• graphical (scale bar) suitable in digital
environments!
168
1/30/2018
Definition of a Map
A map is:
• a representation or abstraction of
geographic reality; a tool for
representing geographic information in a
way that is visual, digital.
• a reduced and simplified representation
of (parts of) the Earth’s surface on a
plane. 337
Types of Maps
338
169
1/30/2018
Map Dimensions
• Flat
• Flat + Height
• Flat + Height + Time
339
340
170
1/30/2018
Cartographic Tools
171
1/30/2018
Geo-visualization
Geo-visualization
344
172
1/30/2018
Cartographic Visualization
345
173
1/30/2018
347
174
1/30/2018
175
1/30/2018
351
176
1/30/2018
353
354
177
1/30/2018
355
356
178
1/30/2018
357
358
179
1/30/2018
359
Test Yourself
360
180
1/30/2018
Screen maps:
• Often smaller
362
181
1/30/2018
182
1/30/2018
1. Additional information
marginal info (or metadata): makes
the map more usable.
2. Adding text improves the
identification of features.
3. Contrast improves overall map
legibility.
365
Web Maps
366
183
1/30/2018
Summary
References
Arbia, G., Griffith, D., and Haining, R. Error propagation modeling in raster gis: overlay operations. International
Journal of Geographical Information Science 12, 2 (1998), 145–167.
Aronoff, S. Geographic Information Systems: A Management Perspective. WDL Publications, Ottawa, Canada,
1989.
Bertin, J. S´emiology Graphique. Mouton, Den Haag, The Netherlands, 1967. BIJKER, W. Radar for rain forest—A
monitoring system for land cover change in the Colombian Amazon. PhD thesis, International Institute for
Aerospace Survey and Earth Sciences, Enschede, The Netherlands, 1997.
Board, C. Report of the working group on cartographic definitions. Cartographic Journal 29, 1 (1990), 65–69.
Bonham-Carter, G. F. Geographic information systems for geoscientists : Modeling with GIS, vol. 13 of Computer
methods in the geosciences. Pergamon, Kidlington, U.K., 1994.
Burrough, P. A. Natural objects with indeterminate boundaries. In Geographic objects with indeterminate
END OF COURSE
boundaries, P. A. Burrough and A. U. Frank, Eds. Taylor and Francis, London, U.K., 1996, pp. 3–28.
Chrisman, N. R. Errors in categorical maps: testing versus simulation. In Proceedings AutoCarto (1989), pp. 521–
529.
Good Luck
Clarke, D. G., and Clark, M. Lineage. In Elements of Spatial Data Quality, S. C. Guptill and J. L. Morrison, Eds.
Elsevier Science, Oxford, U.K., 1995, pp. 13–30.
Date, C. J. An Introduction to Database Systems, seventh ed. Addison- Wesley, Reading, Ma, 2000.
Dibiase, D. Visualization in earth sciences. Earth and Mineral Sciences, Bulletin of the College of Earth and Mineral
Sciences 59, 2 (1990), 13–18.
ElmasrI, R., AND Navathe, S. B. Fundamentals of Database Systems, second ed. Benjamin/Cummings, Redwood
City, Ca, 1994.
368
184
1/30/2018
Longley, P. A., Goodchild, M. F., Maguire, D. M., and Rhind, D. W., Eds. Geographical Information Systems:
Principles, Techniques, Management, and Applications, second ed., vol. 1. John Wiley & Sons, New York, N.Y.,
1999.
Mccormick, B., Defanti, T. A., and (eds.), M. D. B. Visualization in scientific computing. ACM SIGGRAPH Computer
Graphics—Special issue 21, 6 (1987).
National mapping Division, U. S. Geological Survey. Spatial data transfer standard. Tech. rep., U. S. Department of
the Interior, 1990.
Office of the Surveyor General of Land Information New Zealand. A proposal for geodetic datum development. Tech.
Rep. OSG TR2.1, Land Information New Zealand (LINZ), 1998.
Openshaw, S., Charlton, M., and Carver, S. Error propagation: a Monte Carlo simulation. In Handling geographical
information: methodology and potential applications, I. Masser and M. Blakemore, Eds. Longman, Harlow, U.K.,
1991, pp. 78–101.
Preparata, F. P., and Shamos, M. I. Computational Geometry—An Introduction. Springer-Verlag, New York, NY,
1985.
Robinson, A. H., Morrison, J. L., MuehrckE, P. C., Kimerling, A. J., and Guptill, S. C. Elements of Cartography, sixth
ed. John Wiley & Sons, New York, N.Y., 1995.
Samet, H. The Design and Analysis of Spatial Data Structures. Addison- Wesley, Reading, Ma, 1990.
Strunk Jr., W., and White, E. B. The Elements of Style, third ed. MacMillan Publishing Company, New York, NY,
1979.
Veregin, H. Developing and testing of an error propagation model for GIS overlay operations. International Journal
of Geographical Information Systems 9, 6 (1995), 595–619.
Zadeh, L. A. Fuzzy sets. Information and Control 8 (1965), 338–353.
370
185