HGES408 Module
HGES408 Module
Honours in Geography
and Environmental
Studies
Sigauke Esther
Msc Environmental Policy and Planning (UZ)
BA Humanities and Social Science (Africa University)
Mount Pleasant
Harare, ZIMBABWE
You also need to be open-minded, frank, inquisitive learning package together with the sources to
and should leave no stone unturned as you analyze which you are referred. Fully-fledged lectures
ideas and seek clarification on any issues. It has can, therefore, be misleading as the tutor may
been found that those who take part in tutorials dwell on matters irrelevant to the ZOU course.
actively, do better in assignments and examinations
because their ideas are streamlined. Taking part Distance education, by its nature, keeps the tutor
properly means that you prepare for the tutorial and student separate. By introducing the six hour
beforehand by putting together relevant questions tutorial, ZOU hopes to help you come in touch
and their possible answers and those areas that with the physical being, who marks your
cause you confusion. assignments, assesses them, guides you on
preparing for writing examinations and
Only in cases where the information being assignments and who runs your general academic
discussed is not found in the learning package can affairs. This helps you to settle down in your
the tutor provide extra learning materials, but this course having been advised on how to go about
should not be the dominant feature of the six hour your learning. Personal human contact is,
tutorial. As stated, it should be rare because the therefore, upheld by the ZOU.
information needed for the course is found in the
Note that in all the three sessions, you identify the areas
that your tutor should give help. You also take a very
important part in finding answers to the problems posed.
You are the most important part of the solutions to your
learning challenges.
G
eographical Information Systems (GIS) is a set of computer-based
procedures that enables capture, modelling, storage, retrieval,
sharing, manipulation, analysis, and presentation of geographically
referenced data. The capabilities and functions of GIS in capturing, modelling,
storing, retrieving, sharing, manipulating and analysing data has resulted in a
growing trend in the use of GIS in organisations. Thus, all these capabilities
and functions of GIS have been covered in the module.
Unit One introduces you to the basic concepts of Geographical Information
Systems. These concepts help us to understand what GIS is all about. The
definition of GIS is captured as well as functions and components of GIS.
The advantages and disadvantages of GIS are also explored. In the unit we
also provide an understanding of the Global Positioning Systems (GPS)
including its functions and associated errors and how they can be corrected.
In Unit Two we focus on the conceptual models of the real world phenomena.
This unit is important to understand because we do not store the real world
into a computer but we use models to represent the real world. The key
concepts covred in this unit include raster and vector data models.
Geography Information Systems HGES 408
In addition, the unit covers geographic data types, namely nominal, ordinal,
interval and ratio data. These data types determine how a phenomenon should
be represented in the computer.
Units Three and Four focus on spatial referencing in GIS. Unit Three covers
the concepts such as ellipsoids, geoids and datums. The coordinate systems
such as geographic, polar and Cartesian coordinates are explained in this
unit. Unit Four goes deeper in explaining the concept of spatial referencing by
introducing map projections. The thrust of Unit Four is in different types of
map projections
Unit Five covers spatial data capture and preparation methods. The methods
of spatial data capture include digitising, Global Positioning Systems (GPS)
and keyboard entry. Some of the data sources are primary sources, secondary
sources and others. In this unit, we also discuss the sources of GIS data as
well as sources of error and possible remedies as well. We further discuss
data quality, that is, how we can check for accuracy and precision.
Unit Six focuses on spatial data management and processing systems. That is
systems that facilitate the management and processing of geo-information,
hardware and software trends as well as data handling procedures. In addition
to this, databases, including geographic databases are covered.
Unit Seven covers the methods of transforming data in GIS. These techniques
use both interpolation continuum and discrete data. The interpolation techniques
are nearest neighbour interpolation, Thiessen's polygons, trend surface fitting,
triangulation, moving window averaging and kriging.
Unit Eight and Nine present the various techniques of spatial data analysis.
These include techniques namely, querying, buffering, classification,
reclassification and neighbourhood functions. In Unit Nine we go further in
discussing neighbourhood analysis, where proximity computations (buffer zone
generation and Thiessen's polygon generation), spread and diffuse computations
as well as seek computations, spatial operations on continuous surfaces and
network analysis.
Introduction to Geographical
Information Systems
1.1 Introduction
I
n this unit, you will be introduced to some basic concepts of Geographical
Information Systems (GIS). These concepts will help us to understand
what GIS is all about. We will learn the definition of and functions of
GIS. In the unit we also provide an understanding of the Global Positioning
Systems (GPS) including its functions and associated errors and how they
can be corrected.
Geography Information Systems HGES 408
1.2 Objectives
By the end of this unit you should be able to:
define the terms Geographical Information Systems, data, information
spatial data, model
explain the functions of Geographical Information Systems in Geography
and Environmental Studies
state the advantages and disadvantages of Geographical Information
Systems
describe how Global Position Systems works and its functions
discuss the sources of error in Global Position Systems
From the definitions you can also deduce that, GIS as a tool for spatial analysis,
enables us to answer question on location, condition, trends, patterns and
modelling. The following are examples of the five main types of questions that
a GIS can answer:
1. Location: What is at? This question looks at what is at a particular
location. A location can be described in many ways such as place name,
post code or geographic reference such as latitude or longitude or x
and y. Examples include:
What is at the corner of 3rd Street and Kwame Nkurumah Avenue?
What feature is located at geographic position 165o E, 5o N?
2. Condition: Where is it? This question inquires about the specific
location and requires spatial data to answer. For example:
Arable land within 100m of the main road and with soils suitable for
Soya bean production.
Which hotels are within 600m from the main road and are classified as
three star?
3. Trends: What has changed since? This question inquires about how
conditions have changed over time over the earth's surface. Examples
are:
Have forest areas in Mashonaland West Province decreased in size
over the past 15 years?
Have land use activities in Gokwe changed over the past decade?
4. Patterns: What spatial patterns exist? This question describes and
compares spatial patterns at different locations through the process of
spatial analysis. In this case, we need to know the relationship between
two or more datasets that occupy the same location? For instance:
Is there a relationship between a region's varying elevation and the
amount of rainfall that falls across it?
Do seasonal variations affect flood and drought occurrence?
5. Modelling: What if…? Such questions involve scenarios that differ
when you change the model's parameters. An example includes:
What happens to guinea fowl habitat when a road is constructed through
a forest area?
Which areas are at most risk (worst affected) if a flood occurs?
Activity 1.1
1. Define the term GIS.
? 2. Explain the five main types of questions that GIS can answer.
3. GIS enables you to perform trend and pattern analysis. Discuss.
http://www.differencebetween.net/language/difference-between-data-and-
information/
Thus, we can say that, when facts (data) are interpreted, they become
knowledge (information). In this case, we can also note that it is difficult to
separate the two terms.
Activity 1.2
1. Define the terms: data, spatial data, temporal data and model.
? 2. Give at least 5 examples of spatial data.
3. Distinguish between data and Information.
market for GIS include, Integrated Land and Water Information System
(ILWIS), Arc View and Earth Resources Data Analysis Systems (ERDAS).
1.4.2 Data
It includes any information that is spatial or tabular that relates to geography
and speciality fields. For example, they may include road names, crime
statistics, borehole points and illegal dump sites.
1.4.3 People
GIS technology requires people who manage and develop the system. GIS
users include technical specialists who design and maintain the system and
those who use it in their everyday work.
1.4.4 Methods
GIS works with models and operating practices.
Activity 1.3
1. State the 5 components of GIS.
? 2.
3.
Why are the 5 components important in GIS?
Define the terms software and hardware.
4. Give examples of GIS software and hardware.
1.5.4 Manipulation
When data are stored in a GIS, many manipulation options are available to
users. These functions are often available in the form of "toolkits." A toolkit is
a set of generic functions that a GIS user can employ to manipulate and analyse
geographic data. Toolkits provide processing functions such as data retrieval,
measuring area and perimeter, overlaying maps, performing map algebra, and
reclassifying map data (De By, 2001). Data manipulation tools include
coordinate change, projections, and edge matching, which allow a GIS to
reconcile irregularities between map layers or adjacent map sheets called
tiles.
1.5.5 Analysis
The analysis functions in GIS use the spatial and non-spatial attributes in the
database to answer questions about the real world. Geographic analysis
facilitates the study of real-world processes by developing and applying models.
Activity 1.4
1. What are the functions of GIS?
? 2. List some data input techniques used in GIS.
3. Explain why data analysis is an important function of GIS.
Activity 1.5
1. What are the advantages of using GIS?
? 2. What are the disadvantages of using GIS?
1.7 Summary
In this unit, we discussed the basic concepts of Geographical Information
Systems. These concepts help us to understand what GIS is all about. We
learnt the definition of GIS, functions and the components of GIS. The unit
also provided an understanding of the Global Positioning Systems (GPS)
including its functions and associated errors and how they can be corrected.
In the next unit we focus on conceptual models of real world phenomena.
References
Aronoff, S. (1989). Geographic Information Systems: A Management
Perspective. Ottawa, WDL Publications.
Burrough, P., and McDonnell, R. (1998). Principles of Geographical
Information Systems. (New York: Oxford University Press).
Clarke, K.C. (1997). Getting Started with Geographic Information
Systems. London, Prentice Hall.
De By. (2001). Principles of Geographic Information Systems: An
Introductory Textbook, International Institute of Geo-Information and
Earth Observation, Enschede, De By et al. Educational Textbook Series.
Dueker, K.J. (1979). Land Resource and Information Systems: A review
of 13 years experience-Geo-processing 1.
Goodchild, F.M (2005). GIS and Modeling Overview, National Center for
Geographic Information and Analysis, University of California, Santa
Barbara, California.
Longley, P., Goodchild, M., Maguire, D., and Rhind, D., 2004, Geographic
Information Systems and Science. New York, Wiley.
http://www.ncgia.ucsb.edu/giscc/units/u002/u002.html : Accessed 3/09/2012.
http://dusk.geo.orst.edu/gis/Chapter9_notes.pdf : Accessed 28/08/2012.
http://igis.nust.edu.pk/compogis.htm: Accessed12/09/2012.
http://www.westminster.edu/staff/athrock/GIS/GIS.pdf-Accessed:18/09/
2012.
http://teaching.ust.hk/~gned008/socioecondata09.pdf: Accessed 06/11/2012.
http://www.differencebetween.net/language/difference-between-data-and-
information: Accessed 06/11/2012.
http://www.tiger.esa.int/TrainingCds/cd_01/content_2/sez_2_3/Unit-III-
GIS.pdf: Accessed 21/11/2012.
2.1 Introduction
I
n this unit we introduce you to some conceptual models of the real world
phenomena. It is important to understand these models since we do not
store the real world into a computer but we use models to represent the
real world. We introduce you to geographic data types, namely nominal,
ordinal, interval and ratio data. We learn about raster and vector data models
and structures. We also outline the advantages and disadvantages of these
models.
Geography Information Systems HGES 408
2.2 Objectives
By the end of this unit, you should be able to:
define geographic phenomena, objects and fields
discuss geographic data types (nominal, ordinal, interval and ratio)
represent real world phenomena either as points, lines and polygons
describe vector and raster data models and structures
explain the advantages and disadvantages of raster and vector data
2.3.1 Fields
Continuous surfaces form the basis of geographic phenomena, known as the
field view. The field view represents the real world as a finite number of
variables, each one defined at every possible position. Examples of continuous
field data are elevation, air pressure, temperature, or clay content of the soil
(Burrough and McDonnell 1998).
Fields can also be created from classifications of land, into categories of land
use or soil type. Such fields change suddenly at the boundaries between
different classes. Other types of fields can be defined by continuous variation
along lines rather than across space. Traffic density, for example, can be defined
everywhere on a road network and flow volume can be defined everywhere
on a river (De By, 2001).
2.3.2 Objects
Geographic objects are identified by their dimensionality. Objects that occupy
area, including lakes and forest stands, are termed two-dimensional and
generally referred to as areas or polygons. Other objects that are linear,
including roads, railways, and rivers, are termed one-dimensional and generally
referred to as lines. Objects that are single locations, including individual animals
and buildings, are termed zero-dimensional and generally referred to as points.
Since we have distinguished between discrete and continuous data, now let
us look at the data types that we can use to represent our phenomena.
Specifically, we look at nominal, ordinal, interval and ratio data.
1. Nominal data
2. Ordinal data
Ordinal data values establish order only. Comparisons of size can be made,
but no other mathematical operation. For example, air pollution monitoring
equipment situated in different suburbs enables the suburbs to be rated 1st,
2nd, 3rd, etc., according to their air quality. This information does not tell us
how much worse the 5th and 8th suburbs are compared to the 1st. Another
example is household income can be identified as low, average and high.
3. Interval data
On interval data values, the numbering scale does not start at zero. Interval
data is measured along a scale in which each position is equidistant from one
another.
http://changingminds.org/explanations/research/measurement/types data.htm
4. Ratio data
The ratio scale of measurement has an absolute zero, and the difference
between numbers is significant. Mathematical operation such as addition,
subtraction, and division make sense. For example, the population data coded
for census districts can be manipulated in many ways, in particular, the
population can be divided by area (another ratio scale measurement) to obtain
population density.
Activity 2.1
1. Define the following terms:
? a. Geographic Phenomena
b. Geographic Fields
c. Geographic Objects
2. Distinguish between discrete and continuous data.
3. Explain the four data types that we can use to represent our phenomena.
2.4.1 Model
We can use GIS to help analyse and understand more about processes and
phenomena in the real world. Thus, models can be used in representation of
Figure 2.1 shows various entity types. For example, towns such as Kadoma,
Kwekwe, Chinhoyi and Marondera are represented as points. National Parks
such as Hwange, Matusadonha and Gonarezhou are represented as polygons.
Then roads and rivers are represented by lines.
a. Point representations
Figure 2.2 (a): Point features (Source, Longley et al., and 2004:184)
b. Line representations
Line data are used to represent one-dimensional objects such as roads,
rivers and power lines. Again, in this case, there is need to consider relevance
for application and the scale that the application requires. For example, in a
tourist map, subways and streetcar routes can be line features. Connected
lines may represent a phenomenon that is viewed as networks. Figure 2.2(b)
shows line features.
c. Polygon/area representations
An area fully encompassed by a series of connected lines is a polygon/area
representation. Because lines have direction, the system can determine the
area that falls within the lines comprising the polygon. Polygons are often an
irregular shape. Each polygon contains one type of data. Examples of polygon/
area features are forests and national parks. All of the data points that form
the perimeter of the polygon must connect to form an unbroken line. Figure
2.2(c) illustrates polygon/area features.
d. Network representations
e. Surface representations
A surface entity is used to represent continuous features or phenomena.
For these features, there is a measurement or value at every location as is the
case with elevation, temperature and pressure. The continuous nature of
surface entities distinguishes them from other entity types (points, lines, polygons
& networks) which are discrete, that is either present or absent at a particular
location.
Activity 2.2
1. Define the term model.
? 2. Giving examples, describe the five types of entities used to represent
geographic phenomena.
The size of cells in a tessellated data structure is selected on the basis of the
data accuracy and the resolution needed by the user. A raster data structure is
in fact a matrix where any coordinate can be quickly calculated if the origin
point is known, and the size of the grid cells is known. Since grid-cells can be
handled as two-dimensional arrays in computer encoding, many analytical
operations are easy to program. This makes tessellated data structures a
popular choice for many GIS software (De By, 2001:86).
Figure 2.4 shows how a range of different features represented by five different
entities can be modelled using the raster approaches. You can see that the
hotel is modelled by a single, discrete cell that is a single pixel. Rivers are
modelled by a group of cell into lines that is a string of pixels and forest by
grouping cells into blocks that is a group of pixels. The road network is
modelled by linking cells into networks. The relief of the area has been modelled
by giving every cell in the raster image an altitude value.
The pixel is the basic element of raster data-a term derived from picture
elements, which is cell representing a certain terrain property such as soil type
and altitude. These properties are always represented in a pixel as a numerical
value.
Several data structures have developed for the storage of vector data (points,
lines and polygons). The most popular ones are the spaghetti model and the
topological model.
The Spaghetti Model stores data as string of coordinate pairs, without any
indication regarding their spatial relation. This means that coordinate pairs are
relatively in unstructured form. It is a very simple model and does not require
complex file structures. The model does not pose any problems with the storage
of points (single x,y coordinate pair). However, for the storage of lines and
polygons, this model is less effective because of the following reasons:
No direct relation exists between different lines, so that a network analysis
is very time consuming.
The coordinates which form part of the boundary lines between two
polygons have to be stored twice, which may lead to problems when
file is edited.
Figure 2.4 shows the raster and vector data models that we have discussed in
previous sections. It illustrates how point, line, area/polygon, network and
surface entities have been represented using the raster and the vector models.
Activity 2.3
1. Distinguish between raster model and a vector model.
? 2.
3.
Explain the term tessellation.
Distinguish between a spaghetti model and a topological model.
4. Describe the term topology and its importance in GIS.
5. What are the advantages and disadvantages raster and vector models?
2.7 Summary
In this unit, we discussed about some conceptual models of the real world
phenomena. It has been important to understand these models since we do
not store the real world into a computer but we use models to represent the
real world. We also discussed geographic data types, namely nominal, ordinal,
interval and ration data. We also learnt about raster and vector data models
and their advantages and disadvantages. In the next unit we will focus on
spatial referencing.
References
Burrough, P., and McDonnell, R. (1998). Principles of Geographical
Information Systems. (New York: Oxford University Press).
Cornelius, S. and Heywood, I. (2000). An Introduction to Geographic
Information Systems, London, Prentice Hall.
De By. (2001). International Institute of Geo-Information and Earth
Observation. Principles of Geographic Information Systems: An
Introductory Textbook, Enschede, De By Educational Textbook
Series.
Longley, P.A., Goodchild, M.F., Maguire, D.J. and Rhind, D.W. (2004).
Geographical Information Sciences Volume 1: Principles and
Technical Issues, New York, John Wiley and Sons Ltd.
Longley, P.A., Goodchild, M.F., Maguire, D.J. and Rhind, D.W. (2001).
Geographical Information Systems and Science, Chichester, John
Wiley and Sons Ltd.
http://www.eng.auburn.edu/users/doughmp/webfiles/spatial Accessed 15/02/
2012.
http://bgis.sanbi.org/gis-primer/page Accessed 19/04/2012.
http://www.verlter.virginia.edu Accessed 19/09/2012.
http://adlib.itc.nl Accessed 19/09/2012.
http://wikitravel.org/en/Zimbabwe: Accessed 19/09/2012.
http://www.indiana.edu/~gisci/courses/g338/lectures/introduction_vector.html:
Accessed 10/11/2012.
Spatial Referencing I
3.1 Introduction
I
n this unit we focus on spatial referencing in GIS. The thrust of this unit is
to introduce you to coordinate systems such as geographic coordinates,
polar coordinates and Cartesian coordinates. Other concepts to be learnt
in this unit are ellipsoids, geoids and datums. These are all important concepts
in locating ones position on the earth's surface.
Geography Information Systems HGES 408
3.2 Objectives
By the end of the unit, you should be able to:
define the terms: ellipsoid, datum and geoid
describe the geographic coordinate systems, polar coordinates and
cartesian coordinates
discuss the process of converting polar to cartesian coordinates and
vice versa
explain the term of Georeferencing
Figure 3.1: The geoid and ellipsoid reference surfaces (De By, 2001:192)
3.3.1 Geoid
The geoid is shown on figure 3.2. The earth's geoid is a surface which is
complex to accurately describe mathematically. But it can be identified by
measuring gravity. The earth's geoid is regarded as being equal to mean sea
level. Over open oceans, the geoid and mean sea level approximately the
same, but in continental areas they can differ significantly. However, it must be
noted that this difference is not of any practical consequence for most people
and it is considered reasonable that they are regarded as the same.
Activity 3.1
1. Define spatial referencing.
? 2. Explain the terms geoid and ellipsoid.
3. Explain the importance of geoids and ellipsoids.
3.4 Datum
A datum is a reference surface which is defined mathematically and
approximates the shape of the earth in particular areas. At different areas
across the world, different map datums were (and some still are) used due to
the differences in the earth's general surface shape at different places. Specific
map datums are more applicable to particular areas or regions than others
(http://www.gpswaypoints.co.za).
The motivation to make geodetic results mutually comparable across the globe
has resulted in the need for global horizontal datums. An example is the
World Geodetic System (WGS84) used by the GPS. We also call this
referencing system a geocentric datum since it is positioned with respect to
the centre of the mass of the earth (De By, 2001: 203)
Activity 3.2
1. Define map datum.
? 2. Explain the importance of map datums.
3. Distinguish between a local horizontal datum and a global horizontal
datum.
They form circles on the surface of the ellipsoid. Lines of equal longitude are
called meridians and they form ellipses (meridian ellipses) on the ellipsoid.
Both lines form the graticule when projected onto a map plane. http://
kartoweb.itc.nl/geometrics/Coordinate%20systems/coordsys.html
Lines of latitude run parallel to the equator and divide the earth into 180 equal
portions from north to south (or south to north). The reference latitude is the
equator and each hemisphere is divided into ninety equal portions, each
representing one degree of latitude.
In the northern hemisphere degrees of latitude are measured from zero at the
equator to ninety at the North Pole. In the southern hemisphere degrees of
latitude are measured from zero at the equator to ninety degrees at the South
Pole. To simplify the digitisation of maps, degrees of latitude in the southern
hemisphere are often assigned negative values (0 to -90°). Wherever you are
on the earth's surface, the distance between lines of latitude is the same, so
they conform to the uniform grid criterion assigned to a useful grid system.
Lines of longitude, on the other hand, run perpendicular to the equator and
converge at the poles. The reference line of longitude (the prime meridian)
runs from the North Pole to the South Pole through Greenwich, England.
Subsequent lines of longitude are measured from zero to 180 degrees east or
west (values west of the prime meridian are assigned negative values for use
in digital mapping applications) of the prime meridian. (http://geology.isu.edu/
geostac/Field_Exercises/toppomaps/grid_assign.htm)
Activity 3.3
1. Define the term coordinate system.
? 2. Explain the following coordinate systems:
a) 2D and 3D geographic coordinates
b) 2D polar coordinates
c) 2D cartesian coordinates
3.6 Georeferencing
Georeferencing is the process of aligning geographic data to a known
coordinate system so that it can be queried, and analysed with other geographic
data http://resources.esri.com. This is usually done in two main ways. First
you have another layer of information (e.g. road networks of the area of
interest with a coordinate system) and you can match with your raster image
and the layer together. Secondly, you have a series of control points with
their coordinates and you know where on the map they correspond. The
points on the GIS dataset features that correspond to points on the image are
called control points. Since the coordinates of a GIS dataset reflect the map
projection, by assigning coordinates we are also declaring the image to be
displayed in a particular map projection (www.victoria.ac.nz/sgees/pdf/
georeference-image.pdf). In this case, the issue of map projections becomes
imperative.
Activity 3.4
1. Define georeferencing.
? 2. Explain the steps taken in georeferencing.
3.7 Summary
In this unit we focused on spatial referencing in GIS. We introduced you to
coordinate systems such as geographic coordinates, polar coordinates and
Cartesian coordinates. Other concepts that were learnt in this unit are
ellipsoids, geoids and datums. These are all importance concepts in locating
ones position on the earth's surface. In the next chapter you will learn about
map projections.
References
De By et al. (2001). International Institute of Geo-Information and Earth
Observation Principles of Geographic Information Systems: An
Introductory Textbook, Enschede, De By Educational Textbook
Series.
Schwartz, C.R. (1989). North American Datum of 1983, NOAA
Professional Paper NOS 2, Rockville: National Geodetic Survey.
Smith, J. (1997). Introduction to Geodesy: The History and Concepts of
Modern Geodesy, New York: Wiley.
Zhang, L. (2005). Implications of Geodesy, Spatial Reference Systems,
and Map Projections in Processing, Conversion, Integration, and
Management of GIS Data, eMap Div., Integrated Solution Services
Dept, Saudi Aramco
http://www.icsm.gov.au.mapping/datums1.html: Accessed 12/03/2012.
http://www.gpswaypoints.co.za: Accessed 12/03/2012.
http://www.biology.ualberta.ca: Accessed 12/03/2012.
http://geology.isu.edu/geostac/Field_Exercises/toppomaps/grid_assign.htm:
Accessed 12/03/2012.
http://en.wikipedia.org/wiki/ca_cartesian_coordinate_system: Accessed 14/
03/2012.
http://en.wikipedia.org/wiki/polar_coordinate_system: Accessed 14/03/2012.
http://kartoweb.itc.nl/geometrics/Coordinate%20systems/
coordsys.html: accessed 27/11/2012.
http://resources.esri.com Accessed 10/11/2012.
www.victoria.ac.nz/sgees/pdf/georeference-image.pdf: Accessed 14/10/2012.
Spatial Referencing II
4.1 Introduction
I
n this unit we discuss map projections. We define map projection, name
and describe map projections and methods of changing from one
projection type to another.
Geography Information Systems HGES 408
4.2 Objectives
By the end of this unit, you should be able to:
define map projections
describe different types of map projections
explain methods of changing from one projection to another
Activity 4.1
1. Define map projections.
? 2.
3.
What are the functions of a map projection?
Name types of normal projections.
4. Describe the characteristics of normal projections
surface at a point (plane) or along a closed line for cone and cylinder (De By,
2001: 225; Longley et al., 2004: 119). A cone is used to map different
continents. The Cylinder is used to map the entire world. The cylinder is
tangent to the sphere along a great circle (the circle formed on the surface of
the Earth by a plane passing through the centre of the Earth).
Secant projections are used to reduce/ average scale errors because the line(s)
of intersection are not disturbed on the map. The symmetry axes of the plane,
cone and cylinder coincide with the rotation axis of the ellipsoid or sphere i.e.
a line from the North and South Pole (see figure 4.2).
Activity 4.2
1. Name developable surfaces that are used for map projections.
? 2. Describe the secant, transverse and oblique map projections.
3. What are the differences between secant, transverse and oblique map
projections?
Therefore, angles with short sides and shapes of small areas are shown correctly
on the map ((De By, 2001: 223; Longley et al., 2004: 118).
Shape is preserved locally on conformal maps and angles at any point are
correct, although sizes change. Even though conformal projections preserve
the angles, the directions of lines can be changed in the process of projecting
them.
4.6.4 Direction
A map preserves direction when azimuths (angles from a point on a line to
another point) are portrayed correctly in all directions (De By, 2001: 223).
The type of projections depends on use. Most GIS applications warrant equal
area projection. Usually when doing analysis that requires areal calculations,
an equal area projection should be used. If area is small enough, the difference
between conformal and equal area is not significant.
Activity 4.3
1. Identify types of map distortions.
? 2. Describe types map distortions and their properties.
Where and are given in radians, R is the radius of the spherical reference
surface (for the WGS84 ellipsoid the equatorial radius is 6378137m), o is
the central meridian of the projection, e= 2.7182818 and ln is the natural
logarithm (De By, 2001: 219; Longley et al., 2004: 118).
Activity 4.4
1. Given that the central meridian is 270, =290 and R=6378137m, what
? is the value of x?
2. Given that =170 and R=6378137m, what is the value of y?
The answers will give you the x, y Cartesian coordinates for a particular
location.
3. Given that x=600000, y=7000000 and R=6378137m calculate the
value of and .
The result will be geographic coordinates in degrees.
4. Describe how you change from geographic to projection coordinates.
5. Describe how you change from projection coordinates to geographic
coordinates.
Activity 4.5
1. Describe how you transform polar coordinates into Cartesian
? coordinates.
2. Describe how you can transform Cartesian coordinates into polar
coordinates.
Activity 4.6
1. Identify and describe types of projections.
? 2. Describe how you would change from one projection coordinate to
another?
4.10Summary
In this unit we covered the definition of map projections, types of map
projections and distortion properties of map projections as well as how to
change from one coordinate system to another. Now you have to read widely
on the concept of map projections.
References
De By (2001). International Institute of Geo-Information and Earth
Observation, Principles of Geographic Information Systems: An
Introductory Textbook, Enschede, De By, Educational Textbook
Series.
Longley, P.A., Goodchild, M.F., Maguire, D.J. and Rhind, D.W. (2004).
Geographical Information Systems and Science, Second Edition,
New York, John Wiley and Sons Ltd.
http://mathworld.wolfram.com/topics/MapProjections.html/ accessed 26
November 2012.
5.1 Introduction
I
n this unit, we cover spatial data capture and preparation methods; define
important terms and concepts, types and sources of GIS data, sources
of error and possible remedies as well as data quality. Analysis and
modelling in a GIS requires input of relevant data. Therefore, we need to
learn how we can acquire and input relevant data in our GIS.
Geography Information Systems HGES 408
5.2 Objectives
By the end of this unit, you should be able to:
identify types and sources of GIS data
discuss the methods of capturing and preparing spatial data
describe spatial data capture techniques
discuss the sources of error in spatial data capture and preparation
explain possible remedies to errors in spatial data capture and
preparation
evaluate the concept of data quality
Data input is process of encoding data into computer readable format and
making it compatible with GIS. There are two types of GIS data namely
spatial data and attribute data (De By, et al., 2001: 149).
Spatial data is data whose X and Y coordinates are known (Data whose
physical/ geographic location is known). It can be points, line (segments) or
polygons (areas) (De By, 2001: 27).
Attribute data is data that provides descriptive information for spatial features
e.g. names, length.
5.4.1 Digitising
This refers to the process of converting analogue data into digital data (De
By, 2001: 149). It also includes the process of creating vector data from
raster data (for example a scanned topographical maps). In this process, we
trace features like roads, rivers, farm boundaries, homesteads and schools
from scanned maps. Scanning hardcopy topographic and topological maps
and aerial photography is a method of converting analogue data into digital
format (De By, 2001: 149; Longley et al., 2004: 201). Digitising methods
include on-tablet digitising, on-screen digitising, scanning and vectorisation.
In digitising, we have to consider whether the data we want to extract is point,
line or polygon. Points, lines and areas (polygons) are geographic primitives
(Clarke, 1997: 54). Point objects represent single locations of data e.g. mine,
soil sample. They are represented by a symbol e.g. circle, triangle and diamond
and the point has attributes attached to it for example, road. Line/segment
objects represent the generalised shape of the geographic feature e.g. roads,
rivers, fault lines defined by a set of sequential coordinates (vertices and nodes).
A line is defined by two or more nodes. Polygon (Area) objects are defined
by a line describing the location of its boundary. The first and last points of the
lines have the same coordinate pair.
GPS can be used to determine the exact geographic location with an accuracy
that is much higher than that achieved by a map or a compass. GPS satellites
circle the earth twice a day in a very precise orbit and transmit signal information
to earth. GPS receivers take this information and. essentially, the GPS receiver
compares the time a signal was transmitted by a satellite with the time it was
received. The accuracy of positions determined with a GPS range between
+/- 1m and +/- 300m and is dependent on:
The number of satellites that can be located by GPS rover unit.
The strength of the GPS receiver
The constellation of the satellites in the sky
The topography of the terrain
Atmospheric conditions
The rover unit is a portable GPS that is carried in the field and calculates the
location of the user. The rover unit connects to the satellites, calculates the
location of the unit and saves the location as a rover file. In some rover units,
the data can be stored as one of three different feature types; point, lines or
polygons. The rover unit, consists of three distinct parts namely, the antenna
and the data logger and the GPS receiver.
The antenna acquires signals from the satellites.
The GPS receiver locates and maintains contact with satellites.
The Data logger calculates and stores geographic position data for the
user.
c. The base station
The base station is a stationery GPS unit known as geographic location. Its
position is predetermined with the highest accuracy possible. A base station is
not a requirement of the GPS system, but it does increase accuracy.
GPS accuracy
Signal multi path - This occurs when the GPS signal is reflected off objects
such as tall buildings or large rock surfaces before it reaches the receiver. This
increases the travel time of the signal, thereby causing errors.
Number of satellites visible - The more satellites a GPS receiver can "see,"
the better the accuracy.
Activity 5.1
1. Define the following terms:
? a. data entry
b. spatial data
c. attribute data
2. Explain the five methods of spatial data capture.
3. Describe the major components of the GPS.
4. Explain the major sources of GPS error.
5. Describe how these errors can be corrected.
Activity 5.2
1. Identify sources of geographic data.
? 2. Describe each of the sources of GIS data that you identified.
5.6.2 Completeness
Completeness refers to whether there are data lacking in the database
compared to what exists in the real world. It can be spatial, temporal or
thematic aspects of the data. For example, a property boundary may contain
10 houses instead of 12 houses that are on the ground.
5.6.4 Lineage
Lineage describes the history of the data set, for example, date and scale of
aerial photography. Most datasets can have historical information on them
such as date of acquisition and scale. There may be errors in these recordings.
NB: the absence of inconsistencies does not necessarily imply that data is
accurate
Activity 5.3
1. Identify sources of error in acquiring GIS data.
? 2. Describe the sources of error in GIS data acquisition.
Figure 5.2: Digitizing errors and their remedies (Source: De By, 2001:
306).
Activity 5.4
1. Identify common errors in digitizing GIS data.
? 2. What are the remedies for each particular digitizing error?
Activity 5.5
1. Identify constraints that can be encountered when combining data from
? different sources.
2. What could be possible remedies when you are faced with such
scenarios?
5.9 Summary
In this unit, we discussed spatial data capture and preparation methods, defined
important terms and concepts and types and sources of GIS data as well as
sources of error and possible remedies as well as data quality. Now you
should read widely on spatial data capture. The methods also change and
improve over time. So, keep searching and reading widely.
References
Clarke, K.C. (1997). Getting Started with Geographic Information
Systems, London, Prentice Hall.
De By (2001). International Institute of Geo-Information and Earth
Observation, Principles of Geographic Information Systems: An
Introductory Textbook, Enschede, De By, Educational Textbook
Series.
Longley, P.A., Goodchild, M.F., Maguire, D.J. and Rhind, D.W. (2004).
Geographical Information Systems and Science, Second Edition,
New York, John Wiley and Sons Ltd.
http://www8.garmin.com/aboutGPS/applications.html: Accessed 7/08/2012.
www.worldclim.org.
http://glovis.usgs.gov/
http://earthexplorer.usgs.gov/
http://www.fao.org/geonetwork.
6.1 Introduction
A
GIS should be able to manage and process spatial data. In this unit,
we will discuss the systems that facilitate the management and
processing of geo-information, hardware and software trends as well
as stages in data handling. We also cover databases, including geographic
databases.
Geography Information Systems HGES 408
6.2 Objectives
By the end of this unit, you should be able to:
describe hardware trends that aided the development of GIS
discuss software trends in GIS
explain the components of a GIS
discuss the stages of spatial data handling
analyse reasons for using databases
describe the components of a relational database
explain the steps taken in database creation
evaluate the methods of querying relational databases
Activity 6.1
1. Identify software changes that have taken place over time.
? 2. Describe software trends that have taken place over time and how
they have aided GIS development.
3. Explain hardware changes that have taken place over time
4. Describe hardware trends that have taken place over time and how
they have aided GIS development.
Manual digitising covers coordinate and attribute entry via keyboard, digitising
tablet with mouse cursor, mouse cursor on the computer (heads up digitising)
and digital photogrammetry. Automatic digitising involves scanning documents
with flatbed or drum scanners. Semi-automatic digitising covers vectorisation
of data using line following software or point extracting software. We can
also input available digital data from CD-Rom, DVD-Rom, via computer
networking or internet (including geo - web services) (De By, 2001: 149;
Longley et al., 2004: 201).
Output Devices
Hardcopy Printer
Plotter
Film writer
Activity 6.2
1. Identify the components of a GIS.
? 2. Describe the components of a GIS.
3. Discuss the stages in spatial data handling.
Query is a computer program that extracts data from the database that meets
the conditions indicated in the query. A query uses a set of rules (criteria) to
select specific records from the database (De By et al., 2001: 161).
Primary key is a field that can uniquely identify each record (De By, 2001:
167; Longley et al., 2004: 225).
Table refers to data arranged in rows and columns. All database information
is stored in tables.
Activity 6.3
1. Define the terms: Database Management System, database, query,
? field, form, primary key, record, table, relationship and reports.
2. Explain the importance of using database in GIS.
6.9 Databases
In this section we focus on the eight steps taken in the creation of a database.
Activity 6.4
1. Define:
? a. The relational data model
b. The relation schema
c. The database schema
d. The attribute domain
2. Describe the steps taken in creating a database. Illustrate your answer
with an example.
Figure 6.2: The table Title Deed has a foreign key in its attribute Plot
(De By, 2001: 170)
Activity 6.5
1. What attribute is the primary key of the relation Title Deed?
? 2. Can Deed Date be the primary key of the relation? Explain your
answer.
There are three basic query operators used in performing queries. These are
record selection, attribute projection and joining.
Record selection works like a filter. It allows only records that meet the
selection condition to pass and disallows records that do not meet the
condition. Figure 6.3 is an illustration of record selection. The query selects
fewer records as compared to the entire database as illustrated in 6.3 below.
Attribute projection passes through all records of the input but reshapes each
of them in the same way. The query produces fewer attributes. Figure 6.4 is
an illustration of attribute projection, the table at the top shows three attributes
before attribute projection and the one at the bottom shows two attributes
after record selection.
The join operator is another query operation. It can be used to join tables in
a database (see Figure 6.5). The table at the bottom in Figure 6.5 shows the
result of joining the two tables at the top.
Record selection, attribute projection and the join query can be combined.
This results in a combined selection, projection and join query. The combined
query is illustrated on figure 6.6.
A combined selection/ projection/ join query selecting owners and deed dates
for parcels with a size larger than 1000. The join is carried out first, and then
follows tuples collection on the result tuples of the join. Finally an attribute
projection is carried out (De By, 2001: 177).
The illustrations in Figure 6.8 and 6.9 show that maps (spatial data) can be
linked with attribute data in a spatial enabled database in order to perform
spatial query and retrieve, analyse and present information on demand.
Activity 6.6
1. Name methods used to query relational databases.
? 2. Describe the methods of querying relational databases.
6.10Summary
In this unit, we covered the components of a GIS that facilitate the management
and processing of geo- information, hardware and software trends as well as
stages in data handling. In addition, we covered databases including geographic
databases. Now, you have to read widely on the functional components of a
GIS, stages in spatial data handling and databases.
References
De By (2001). International Institute of Geo-Information and Earth
Observation. Principles of Geographic Information Systems: An
Introductory Textbook, Enschede, De By Educational Textbook
Series.
Longley, P.A., Goodchild, M.F., Maguire, D.J. and Rhind, D.W. (2004).
Geographical Information Systems and Science, Second Edition,
New York, John Wiley and Sons Ltd.
Mather, P.M. (Ed.) (1993). Geographical Information Handling- Research
and Applications, Chichester, John Wiley and Sons Ltd.
7.1 Introduction
I
n this unit, we will cover the methods of transforming data in a GIS. For
example, a sample of points may be available, but we may need to derive
a value for the phenomenon at another location or for the whole study
area. There may be need to transform points into other representations in
order to facilitate interpretation or integration with other data. Examples include,
defining homogeneous area (polygons) from point data and deriving contour
lines. We cover interpolation on continuous and discrete data. We also cover
interpolation techniques such as nearest neighbour interpolation, Thiessen's
polygons, trend surface fitting and triangulation as well as moving window
averaging and kriging.
Geography Information Systems HGES 408
7.2 Objectives
By the end of this unit, you should be able to:
define interpolation
explain the parameters taken into consideration in interpolation
describe methods of interpolation of discrete data
discuss the methods of interpolation of continuous data
7.3 Interpolation
Interpolation is the calculation of a value from "surrounding" observations
(Burrough and MacDonnell, 1989: 203; De By, 2001:320; Longley et al.,
333; Mather, 1993: 109). There are various methods of interpolation. Some
are suitable for continuous data and some for discrete data. We need to
consider the type of data. Basically, we have two data types namely discrete
data and continuous data. We also consider the nature of the surface, scale
and resolution of the data. The surface can be simple or complex.
Activity 7.1
1. Define the term interpolation.
? 2. Identify parameters taken into consideration for interpolation.
3. Describe how the parameters considered for interpolation are taken
into consideration in interpolating data.
The value of a point for a given location (X, Y) is calculated from the nearest
value to the point and assigned that value. Each location is assigned the value
of the closest measured point. The nearest neighbour technique constructs
"zones" around the points of measurement. Each point belonging to a zone is
assigned the same value (De By, 2001:323).
Thiessen polygons
Activity 7.2
1. Identify methods of interpolation discrete data.
? 2. Describe the methods of interpolation discrete data.
The entire study area can be represented by a formula f(X, Y). For a given
location with coordinates (X, Y) we will use an approximate the value of the
field in that location. We have to derive a formula that best describes the field.
Regression techniques can be used to determine the coefficients of the equation
(De By et al., 2001: 326).
Triangulation
Moving window averaging derives raster data from a set of sample points.
Cell values in the output raster are computed one by one. A window known
as a kernel has to be defined.
The averaging function will compute the arithmetic, treating all measurements
equally through the following formula:
Where d = distance from measurement point to the centre of the cell (De By,
2001: 333). The squares in figure 7.3 show the moving window and its centre,
whilst the + (crosses) show the measurement points with their values and
distances to the centre. Some measurement points are inside and some are
outside the window (see Figure 7.4).
Figure 7.4: The inverse distance weighting technique (De By, 2001:
334).
Kriging
There are two important techniques in kriging. The first step is to create a
Semi-variogram. A semi-variogram compares successive pairs of point
measurements. Secondly, weights are calculated. The Variogram is used to
calculate weights used in interpolation (De By, 2001: 93; 337; Longley et al.,
2004: 336).
Activity 7.3
1. Discuss the methods of interpolation of continuous data.
? 2. Describe methods of interpolation continuous data
7.4 Summary
In this unit we covered the methods of transforming data in a GIS. Here we
covered interpolation on continuous and discrete data. Under interpolation
on discrete data we covered nearest neighbour interpolation and Thiessen's
polygons whilst under interpolation on continuous data covered interpolation
techniques such trend surface fitting, triangulation, moving window averaging
and kriging. Now you are required to read widely on these spatial data
transformation techniques.
References
Burrough, P.A. and McDonnell, R.A. (1989). Principles of Geographic
Information Systems, London, Oxford University Press.
De By (2001). International Institute of Geo-Information and Earth
Observation. Principles of Geographic Information Systems: An
Introductory Textbook, Enschede, De By et al. Educational Textbook
Series.
Longley, P.A., Goodchild, M.F., Maguire, D.J. and Rhind, D.W. (2004).
Geographical Information Systems and Science, Second Edition,
New York, John Wiley and Sons Ltd.
Mather, P.M. (Ed.) (1993). Geographical Information Handling- Research
and Applications, Chichester, John Wiley and Sons Ltd.
8.1 Introduction
I
n this unit we introduce you to spatial data analysis. Remember, in Unit
One we learnt that data analysis is one of the functions of GIS. In this
unit, we shall learn the various techniques of spatial data analysis. These
techniques include querying, buffering, classification, reclassification and
neighbourhood functions.
Geography Information Systems HGES 408
8.2 Objectives
By the end of this unit, you should be able to:
explain data analysis techniques in GIS
describe how features are measured in GIS
explain spatial and attribute data are querying in GIS
perform spatial data analysis through use of operators
describe how networks are analysed in GIS
By integrating GIS layers, you can ask the spatial questions outlined in Unit 1:
"What is at…?", "Where is it…?", "What has changed since?", "What spatial
patterns exist?", and "What if…?" The first two of these questions inventory
features and minimally examine feature location and relationships. The last
three questions are more complex. To answer these questions, you must use
or string together some of the analytical functions that you will learn about in
this unit. There are a wide range of functions for data analysis available in
most GIS packages which include measurement techniques, attribute queries,
proximity analysis, and overlay techniques and modelling of surfaces and
networks. Spatial data analysis can be done through the use of operators
namely the Logical operators and Mathematical operators.
a) Boolean Operators
Figure 8.1 shows venn diagrams which illustrate that, AND, OR, XOR, NOT
are boolean operators that can be used in carrying out spatial analysis. Table
8.1 illustrates some examples in which these operators can be used in carrying
out spatial data analysis. For example, on table 8.1, the boolean operator and
requires you to find hotels where both conditions A and B are met. The
condition A, being hotels in luxury category while the condition B, being hotels
with more than 20 bedroom.
A and B Which hotels are in luxury category and have more than 20 bedrooms?
B not A Which hotels are in luxury category but do not have more than 20 bedrooms?
B XOR A Which hotels are either in luxury category or have more than 20 bedrooms?
(A and B)or C Which hotels are in luxury category and have more than 20 bedrooms or have
more than 5 swimming pools?
A and (B or C) Which hotels are in luxury category and have more than 20 bedrooms or more
than 20 pools?
Figure 8.4 illustrates the areas that have forest on map C1, where the there
are zeros the area is not covered by forest.
c) Relational Operators:
> Greater than
< Less than
>= Greater than or equal to
<=Less than or equal to
Map D1-areas with forest (A) and are less than 500m (B)
Map D2- areas with forest (A) or less than 500m (B)
Map D3- areas with forest (A) with either or less than 500m (B)
Map D4- areas with forest (A) and not less than 500m (B)
Figure 8.4 shows how mathematical operators can be used in a GIS. For
example, in map C1 and C2 addition has been done. In C3, subtraction,
addition and multiplication has been used.
Activity 8.1
1. Define the term spatial data analysis.
? 2. Discuss the three types of logical operators used in GIS.
3. Explain the four types of mathematical operators used in data analysis.
Area size measurements are used when one wants to sum up the area sizes of
all polygons belonging to the same class. This could be crop type, for example,
what is the size of the area covered by potatoes? If our crop classification is
in a stored data layer the computation could include:
selecting the potato area
summing up their (stored) area sizes
Activity 8.2
1. Explain how measurements are done in:
? a) Vector data
b) Raster data
Activity 8.3
1. Define the terms: attribute query and spatial query.
? 2. Explain the following terms:
a) intersection
b) proximity
c) adjacency
d) containment
8.7 Classifications
Classification is technique of purposefully removing detail from an input data
set, in the hope of revealing important patterns of (spatial distributions). In the
process, we produce an output data set, so that the input set can be left intact.
We do so by assigning a characteristic value to each element in the input set,
which is usually a collection of spatial features that can be raster cells, points,
lines or polygons.
8.7.1 Reclassification
Reclassification occurs when input data set may have itself been a result of
classification and in such case, we call it reclassification. For example, we
may have a soil map that shows different soil type units and we would like to
show suitability for a specific crop. In this case, it is better to assign to the soil
units an attribute of suitability for the crop.
When identifying areas at most risk of flooding input rasters might be slope,
soil type, and vegetation. Each of these rasters might be reclassified on a
scale of 1 to 10 depending on the susceptibility of each attribute in each raster
to flooding-that is, steep slopes in the slope raster might be given a value of
10 because they are most susceptible to flooding. There are usually four steps
in producing a suitability map (see illustration of figure 8.5):
1. Input datasets. Decide which datasets you need as inputs.
2. Derive datasets. When applicable, create the datasets that you can
derive from your base input datasets, for example, slope and aspect
can be derived from the elevation raster. Create data from existing data
to gain new information.
3. Reclassify datasets. Reclassify each dataset to a common scale (for
example, 1 to 10), giving higher values to more suitable attributes.
4. Weight and combine datasets. Weight datasets that should have more
influence in the suitability model if necessary then combine them to find
the suitable locations.
Figure 8.5 is a flow diagram of a sample for finding the best locations for a
school. The four steps to produce such a suitability map are:
Decide which datasets you need as inputs. The datasets you will use in
this exercise are displayed to the right.
Derive datasets. Create data from existing data to derive new
information.
Reclassify each dataset to a common scale (for example, 1-10) giving
higher values to more suitable attributes.
Weight datasets that should have more influence in the suitable locations.
Figure 8.5 shows that, the input base layers are land use, elevation, recreation
sites, and existing schools. The derived datasets are slope, distance to
recreation sites, and distance to existing schools. Each raster is then reclassified
on a scale of 1 to 10. The reclassified rasters are added together with distance
from recreation sites and other schools having a higher weight.
determines the class breaks. This can be done through equal interval and
equal frequency techniques.
a) In the Equal interval technique the minimum and maximum values of
the classification parameter are determined and the (constant) interval
size for each category is calculated as (Vmax-Vmin)/n, where n is the
number of classes chosen by the user.
b) In the equal frequency technique or the quantile classification, the
objective is to create categories with roughly equal numbers of features
per category. The total number of features is determined first and
calculated. The class break points are then determined by counting of
the features in order of classification parameter values
Activity 8.4
1. Differentiate between classification and reclassification.
? 2. Distinguish between user controlled classification and automatic
classification.
Point in polygon overlay is used to find out the polygon in which a point
falls. For example, one would like to identify the boreholes in a village.
Line in polygon overlay is more complicated than the point in line polygon.
Imagine we want to know where roads pass through forest areas to plan for
a scenic drive. To do this, we need to overlay the road data on a data layer
containing forest polygons. The output contains roads split into smaller
segments representing roads in the forest areas and roads outside forest areas.
Topographical information must be contained; therefore, this is more complex
From figure 8.6, you can observe that the output raster shows areas that are
suitable for the desired model. Forests on alluvial terrain and grassland on
shale are considered suitable.
Activity 8.5
1. Explain how you would carry out overlay functions.
? 2. Differentiate between vector based overlays and raster based overlay.
3. Describe how you would use the overlay techniques in selecting a
suitable nuclear waste repository.
4. Using examples, explain the importance of spatial data analysis in GIS.
8.9 Summary
In this unit, we discussed about spatial data analysis. We learnt the various
techniques of data analysis. These techniques learnt are querying, buffering,
classification, and reclassification. In the next unit, we learn about
neighbourhood analysis as another data analysis method.
References
De By et al. (2001). International Institute of Geo-Information and Earth
Observation. Principles of Geographic Information Systems: An
Introductory Textbook, Enschede, De By. Educational Textbook
Series.
Longley, P.A., Goodchild, M.F., Maguire, D.J. and Rhind, D.W. (2001).
Geographical Information Systems Volume 1: Principles and
Technical Issues, New York, John Wiley and Sons Ltd.
www.geo.hunter.cuny.edu, Accessed 20/11/2012.
http://maps.unomaha.edu/Peterson/gisII/Labs/NewSchool/NewSchool.htm,
Accessed 20/11/2012.
9.1 Introduction
I
n this unit, we discuss neighbourhood analysis. Under neighbourhood
analysis, we cover proximity computations (buffer zone generation and
Thiessen's polygon generation), and spread and diffuse computations as
well as seek computations, spatial operations on continuous surfaces and
applications of neighbourhood analysis. In this unit we also introduce you to
another method of analysing data. This method is called network analysis.
We focus on optimal path finding, trace analysis, network partitioning and
network allocation.
Geography Information Systems HGES 408
9.2 Objectives
By the end of this unit you should be able to:
define neighbourhood analysis
discuss the methods of neighbourhood analysis
explain the applications of neighbourhood analysis
analyse the optimal path finding concept
describe trace analysis
distinguish between network partition and explain network allocation
Buffering creates physical zones around features. These "buffers" are usually
based on specific straight-line distances from selected features. Buffers,
common to both raster and vector systems, are created around point, line, or
polygon features.
In buffer zone generation, we select one or more target locations and then
we determine the area around them within a certain distance. For example,
roads and rivers can be selected as targets and a certain distance around
them can be defined as the buffer. For instance, in the case of stream bank
cultivation, one may define a 30m buffer as per the Laws of Zimbabwe. Figure
9.1 is an example of buffer zone generation (De By, 2001:396). In 9.1a
distances of 25m and 75m were selected for minor and major roads
respectively. In 9.1b zoned buffers of 100m, 200m and 300m form roads
were selected.
Figure 9.1: Buffer zones around streets (roads) (De By et al., 2001:396)
Once complete, buffer layers are used to determine which features (in other
layers) occur either within or outside the buffers (spatial queries), to perform
overlay, or to measure the area of the buffer zone. In some instances zoned
buffers must be determined, for instance in assessments of traffic noise effects.
Activity 9.1
1. Define neighbourhood analysis.
? 2.
3.
Define proximity.
Describe buffer zone generation.
4. What is the importance of buffer zone generation?
5. Describe how Thiessen's polygons are generated.
For example, from the shaded cell on figure 9.3 there are three paths to its
northeast neighbour cell (with local resistance 5). Path 1 = N-E, Path 2 = E-
N and path 3 = NE based on compass direction from the shaded cell.
Path 1 =
Path 2 =
Thus, path 2 is the minimal cost path. Spread computations can be used for
base stations for cellphones, missile launch and assessing nuclear spillage.
Activity 9.2
1. Define spread computations.
? 2. Describe the concept of spread and diffuse computations.
For each cell in the raster, the steepest downward slope to the neighbour cell
is calculated and its direction is stored in a new raster map. The computation
determines the elevation differences between the cell and a neighbour cell and
takes into account cell distance minus one for neighbour cells in the N-S
direction and the square root of 2 cells in the NE-SW and NW-SE directions
(refer to figure 9.4). Amongst the neighbour cells, GIS picks the one with the
steepest path. From the flow direction raster, the GIS can compute an
accumulated flow count raster map. The accumulated flow count raster map
indicates how many cells have water flow into a particular cell. Cells with high
flow counts have concentrated flow and thus, belong to a stream and cells
with zeros are local topographic highs (De By, 2001:403).
Seek functions are used to delimit streams. In delimiting streams they calculate
flow direction, the steepest slope, the flow count raster map and the flow
accumulation raster.
Figure 9.4: Flow computations (a) the original elevation raster (b) the
flow direction raster computed from it (c) accumulated flow count raster
map (De By, 2001:405).
Activity 9.3
1. Define seek computation.
? 2.
3.
Describe the concept behind seek computations.
Differentiate between spread and seek computations.
4. Differentiate between overlay functions and neighbourhood functions.
b=100
b=50
a=100 a=100
% of slope % of slope
= (b/a)*100 = (b/a)*100
= (100÷100)*100 = (50÷100)*100
= 100 = 50
b) Slope Angle/Degree
b=100
b=50
45o 26.6o
a=100 a=100
= (b/a) = (b/a)*100
= (100÷100) = (50÷100)
= 45o = 26.6o
Interpolation
g) Filtering
Figure 9.10: High pass filter and low pass filter (Adapted from: http://
courses.washington.edu/gis250/lessons/raster_analysis2/index.html)
Activity 9.4
1. What is the importance of knowing the slope percent, degree and aspect
? in a crop production study?
2. Distinguish between high pass filters and low pass filters.
Each linear feature affects the resource flow. For example, a street segment
might only provide flow in one direction (a one-way street) and at a certain
speed. Nodes can also affect flow. A stuck valve might allow too much of a
resource to stream out and away from its intended destination. Network
analysis tools help you analyze the "cost" of moving through the network.
Like spread functions, "cost" can represent money, time, distance, or effort.
Network analyses are vector-based applications, but there are similarities
with raster-based spread functions.
Activity 9.5
1. Explain the tern network analysis.
? 2. Describe the importance of network analysis.
and congestion. However, the shortest path may not be defined simply in
terms of distance, For example, for an emergency vehicle to reach an accident,
the quickest route may be needed and this may require the traverse of less
congested minor roads. Several paths may be considered before the route
with the least cumulative impedance is constructed from the intervening network.
Another example, is the waste collection vehicle, which needs to visit a specific
set of clients in a day and to do so, the best route is required. So the question
is: In which order should the stops be visited and which path should
taken between? In GIS network analysis, the ordering of the stops can be
determined by calculating the minimum path between each stop and every
other stop in the list based on impedance met in the network. A trial and error
method can be used to order the visits so that the impedance from the first
stop to the last is minimised.
Activity 9.6
1. What is trace analysis?
? 2. Explain how trace analysis can be used to trace pollutants from a spillage
site.
3. Explain the concept of optimal path finding.
4. Describe how you would apply optimal path finding in waste
management.
9.7 Summary
In this unit, we discussed neighbourhood analysis. Under neighbourhood
analysis, we covered proximity computations (buffer zone generation and
Thiessen's polygon generation), and spread and diffuse computations as well
as seek computations, spatial operations on continuous surfaces and
applications of neighbourhood analysis. Now you are required to read widely
on neighbourhood analysis and explore more literature and examples. In this
unit we introduced you to another method of analysing data called network
analysis. We focused on optimal path finding and trace analysis.
References
Burrough, P.A. (1986). Principles of GIS for Land Resources Assessment,
Oxford, Clarendon Press.
Burrough, P.A. and McDonnell, R.A. (1989). Principles of Geographic
Information Systems, London, Oxford University Press.
De By et al. (2001). International Institute of Geo-Information and Earth
Observation. Principles of Geographic Information Systems: An
Introductory Textbook, Enschede, De By Educational Textbook
Series.
Longley, P.A., Goodchild, M.F., Maguire, D.J. and Rhind, D.W. (1999).
Geographical Information Systems Volume 1: Principles and
Technical Issues, New York, John Wiley and Sons Ltd.
Longley, P.A., Goodchild, M.F., Maguire, D.J. and Rhind, D.W. (2004).
Geographical Information Systems and Science, Second Edition,
New York, John Wiley and Sons Ltd.
http://giscommons.org : Accessed (11/07/2012).
GIS Applications
10.1 Introduction
I
n this unit we focus on Geographic Information Systems applications.
These include disaster risk assessment, species distribution (habitat)
modelling, vector and disease management, fleet management and route
planning, agricultural activities and waste management.
Geography Information Systems HGES 408
10.2 Objectives
By the end of this unit you should be able to:
state the application of GIS in disaster risk assessment
explain the application of GIS in species distribution (habitat) modelling
discuss the application of GIS Waste management
analyse the application of GIS in vector and disease management
discuss application of GIS in fleet management and route planning
describe the application of GIS in agricultural activities
Figure 10.1: Buffalo habitat suitability map (Matawa et al. 2012: 195)
GIS is used by highway authorities who need to decide location of new routes
and to keep track of highway condition. It is also used by logistics companies
(e.g., parcel delivery companies, shipping companies) to organise their
operations, to decide where to place their central sorting warehouses and the
facilities that transfer goods from one mode to another (for example, from
truck to ship), how to route parcels from origins to destinations, and how to
route delivery trucks.
Transit authorities also use GIS. They use it to plan routes and schedules, to
keep track of vehicles and to deal with incidents that delay them, and to
provide information on the system to the travelling public.
10.8Agricultural Activities
GIS has been used to show the spatial distribution of soil properties in
agricultural fields as shown on Figure 10.4. The conventional tillage field has
the highest average bulk density as compared with basin tillage and mulch
ripping fields. The maps on figure 10.4 show that, the conventional tillage field
has darker shading (high bulk density), while the basin tillage and mulch ripping
fields have lighter shading (low bulk density).
a b
Figure 10.4: Spatial Distribution of bulk density in fields (a) basin tillage
(b) mulch ripping (c) conventional tillage in Ward 1, Insiza District,
Zimbabwe (Sigauke et al., 2009 unpublished)
10.9Governance
A GIS can be used to manage populations, develop administrative boundary
and manage supply of public services and utilities as well as for development
planning. Under governance the list is endless. For example it can be used to
define census blocks for enumerating people.
tracking crime
Operational, tactical, and strategic decision making in law enforcement,
health care planning, and managing education systems.
Activity 10.1
1. Explain the importance of GIS to environmentalists.
? 2. Discuss the application of GIS in:
a) Habitat mapping
b) vector and disease management
c) fleet management and route planning
d) waste management
e) agriculture
3. Suppose you work for a city or town Council. Describe how you
would use GIS in selection of a suitable waste disposal site for that
City or Town.
10.10 Summary
In this unit we focused on the applications of GIS. These included disaster
risk assessment; species distribution (habitat) modelling; vector and disease
management; fleet management forest management; governance.
References
Brownstein, J.S., Rosen, H., Purdy, D., Miller, J.R., Merlino, M., Mostashari,
M., Fish, D., (2002). Spatial Analysis of West Nile Virus: Rapid Risk
Assessment of an Introduced Vector-borne Zoonosis. Vector-Borne
and Zoonotic Diseases, 2 (3), pp. 157-164.
Clarke, K.C., McLafferty, S.L. and Tempalski, B.J. (1996). On Epidemiology
and Geographic Information Systems: A Review and Discussion of
Future Directions in Emerging Infectious Diseases, Volume 2 (2),
pp85-92.
Cecchi, G., Mattioli, R.C., Slingenbergh, J., Dela Rocque, S., (2008). Land
cover and tsetse fly distributions in sub -Saharan Africa. Medical and
Veterinary Entomology, 22, pp. 364-373.
Drake, J.M., Randin, C., Guisan, A., (2006). Modelling ecological niches
with support vector machines. Journal of Applied Ecology, 43, pp.
424-432.
Kitron, U., Otieno, L.H., Hungerford, L.L., Odulaja, A., Brigham, W.U.,
Okello, O.O., Joselyn, M., Mohamed-Ahmed, M.M., Cook, E.,
(1996). Spatial analysis of the distribution of tsetse flies in the Lambwe
Valley ,Kenya, using Landsat TM satellite imagery and GIS. Journal
of Animal Ecology, 65, pp. 371-380.
Matawa F., Murwira, A., Schmidt, K.S., (2012). Explaining elephant
(Loxodonta africana) and buffalo (Syncerus caffer) spatial distribution
in the Zambezi Valley using maximum entropy modelling. Ecological
Modelling, 242, pp. 189- 197.
Matawa F., Murwira K.S., Shereni, W., Modelling the distribution of Glossina
spp. in the north western parts of Zimbabwe using remote sensing and
climate data, unpublished.
Rácz, G.R., Bán, E., Ferenczi, E., Berencsi, G., (2006). A simple spatial model
to explain the distribution of human tick-borne encephalitis cases in
Hungary. Vector-Borne and Zoonotic Diseases, 6(4). pp. 369-378.
Sigauke E., Mujere N., Mabiza C. (2009). Effects of conventional and
conversation tillage practices on soil properties, in Ward 1, Insiza District
(unpublished).