GIS Unit 1-5 Notes
GIS Unit 1-5 Notes
Unit-I
Chapter 1: A Gentle Introduction to GIS
Contents
Our world is dynamic. Many aspects of our daily lives and our environment are
constantly changing, and not always for the better. Some of these changes appear
to have natural causes (e.g. volcanic eruptions, meteorite impacts), while others
are the result of human modification of the environment. (e.g. land use changes or
land reclamation from the sea).
The fundamental problem that we face in many uses of GIS is that of under-
standing phenomena that have a spatial or geographic dimension, as well as a
- means that
our object of study has different characteristics for different locations (the
geographic dimension) and also that these characteristics change over time (the
temporal dimension). The El Nino event is a good example of such a phenomenon,
because sea surface temperatures differ between locations, and sea surface
temperatures change from one week to the next. El Nino is an aberrant pattern in
weather and sea water temperature that occurs with some frequency (every4 9
nine years) in the Pacific.
Ocean along the Equator. It is characterized by less strong western winds across
the ocean, less upwelling of cold, nutrient-rich, deep-sea water near the South
American coast, and therefore by substantially higher sea surface temperatures
(see figures below). It is generally believed that El Nino has a considerable impact
on global weather systems, and that it is the main cause for droughts in Wallacea
and Australia, as well as for excessive rains in Peru and the southern U.S.A.
1
GEOGRAPHIC INFORMATION SYSTEMS Unit 1
2
GEOGRAPHIC INFORMATION SYSTEMS Unit 1
capabilities, and today are widely used all over the world for a wide range of
purposes.
GIS can refer to a number of different technologies, processes, and methods. It is
attached to many operations and has many applications related to engineering,
planning, management, transport/logistics, insurance, telecommunications, and
business. For that reason, GIS and location intelligence applications can be the
foundation for many location-enabled services that rely on analysis and
visualization.
GIS is a computer-based system that provides the following four sets of capabilities to
handle georeferenced data:
1. Data capture and preparation
2. Data management, including storage and maintenance
3. Data manipulation and analysis
4. Data presentation
In the El Nino case, data capture refers to the collection of sea water temperatures
and wind speed measurements. This is achieved by placing buoys with measuring
equipment at various places in the ocean. Each buoy measures a number of things:
wind speed and direction; air temperature and humidity; and sea water temperature
at the surface and at various depths down to 500 metres.
A typical buoy is illustrated in Figure 1.2, which shows the placement of various
sensors on the buoy. For monitoring purposes, some 70 buoys were deployed at
strategic places within 10 latitude of the Equator, between the Galapagos Islands
and Papua New Guinea. Figure 1.3 provides a map that illustrates the positions of
these buoys. The buoys have been anchored, so they are stationary. Occasional
malfunctioning is caused by high seas and bad weather or by the buoys becoming
entangled in long-line fishing nets.
All the data that a buoy obtains through its thermometers and other sensors, as
daily.
3
GEOGRAPHIC INFORMATION SYSTEMS Unit 1
3. Data Management:
For our example application, data management refers to the storage and
maintenance of the data transmitted by the buoys via satellite communication. This
phase requires a decision to be made on how best to represent our data, both in
terms of their spatial properties and the various attribute values which we need to
store.
We will from here on assume that the acquired data has been put in digital form,
that is, it has been converted into computer-readable format, so that we can begin
our analysis.
Once the data has been collected and organized in a computer system, we can start
analyzing it. Here, let us look at what processes were involved in the eventual
production of the maps of Figure1.1 . Note that the actual production of maps
belongs to the phase of data presentation that we discuss below.
4
GEOGRAPHIC INFORMATION SYSTEMS Unit 1
Here, we look at how data generated at the buoys was processed before map
production. A closer look at Figure 1.1 reveals that the data being presented are
based on the monthly averages for SST and WS (for two months), not on single
measurements for a specific date. Moreover, the two lower figures provide
Data presentation:
After the data manipulations discussed above, our data is prepared for producing
output. In this case, the maps of Figure 1.1. The data presentation phase deals with
putting it all together into a format that communicates the result of data analysis in
the best possible way. Many issues arise in this phase. Among other things, we
need to consider what the message is that we want to portray, who the audience is,
what kind of presentation medium will be used, which rules of aesthetics apply,
and what techniques are available for representation. These issues may sound a
little abstract, so let us clarify with the El Nino case.
The message we wanted to portray is what are the El Nino and La Nina events,
both in absolute figures, but also in relative figures, i.e. as differences from a
normal situation.
The audience for this data presentation clearly were the readers of this text book,
i.e. students of ITC who want to obtain a better understanding of GIS. The
medium was this book, (printed matter of A4 size) and possibly a website. The
font size. The rules of aesthetics demanded many things: the maps should be
printed north-up; with clear georeferencing; with intuitive use of symbols et
cetera, We actually also violated some rules of aesthetics, for instance, by
applying a different scaling factor in latitude (horizontally) compared to longitude
(vertically).
The techniques that we used included the use of a color scheme and isolines, plus
a number of other techniques
i. An urban planner might want to assess the extent of urban fringe growth in her/his
city, and quantify the population growth that some suburbs are witnessing. S/he
5
GEOGRAPHIC INFORMATION SYSTEMS Unit 1
might also like to understand why these particular suburbs are growing and others
are not.
ii. A biologist might be interested in the impact of slash-and-burn practices on the
populations of amphibian species in the forests of a mountain range to obtain a
better understanding of long-term threats to those populations.
iii. A natural hazard analyst might like to identify the high-risk areas of annual
monsoon-related flooding by investigating rainfall patterns and terrain
characteristics.
iv. A geological engineer might want to identify the best localities for constructing
buildings in an earthquake-prone area by looking at rock formation characteristics.
v. A mining engineer could be interested in determining which prospective copper
mines should be selected for future exploration, taking into account parameters
such as extent, depth and quality of the ore body, amongst others
vi. A geo-informatics engineer hired by a telecommunications company may want to
1.1.6. Components
1. Computer System: The computer system includes the computer and the operating
system to run GIS. Typically the choices are PCs that use the Windows operating
system (e.g., Windows 2000, Windows XP) or workstations that use the UNIX or
Linux operating system. Additional equipment may include monitors for display,
digitizers and scanners for spatial data input. GPS receivers and mobile devices for
fieldwork, and printers and plotters for hard-copy data display.
2. GIS Software: The GIS software includes the program and the user interface for
driving the hardware. Common user interfaces in GIS are menus, graphical icons,
command lines, and scripts.
3. People: People refers to GIS professionals and users who define the purpose and
objectives, and provide the reason and justification for using GIS.
4. Data: Data consist of various kinds of inputs that the system takes to produce
information.
5. Infrastructure (METHOD): The infrastructure refers to the necessary physical,
organizational, administrative, and cultural environments that support GIS
6
GEOGRAPHIC INFORMATION SYSTEMS Unit 1
Spatial data describe the locations of spatial features, which may be discrete or
continuous. Discrete features are individually distinguishable features that do not
exist between observations. Discrete features include points (e.g. wells), lines
(e.g., roads), and areas (e.g., land use types). Continuous features are features that
exist spatially between observations. Examples of continuous features are
elevation and precipitation. A GIS represents these spatial features on the Earth's
surface as map features on a plane surface. This transformation involves two main
issues: the spatial reference system and the data model.
Attribute data describe the characteristics of spatial features. For raster data, each
cell has a value that corresponds to the attribute of the spatial feature at that
location. A cell is tightly bound to its cell value. For vector data, the amount of
attribute data to be associated with a spatial feature can vary significantly. A road
segment may only have the attributes of length and speed limit, whereas a soil
polygon may have dozens of properties, interpretations, and performance data.
How to join spatial and attribute data is therefore important in the case of vector
data.
Key components of spatial data quality include positional accuracy (both horizontal and
vertical), temporal accuracy (that the data is up to date), attribute accuracy (e.g. in
labelling of features or of classifications), Lineage (history of the data including sources),
completeness (if the data set represents all related features of reality), and logical
consistency (that the data is logically structured).
These components play an important role in assessment of data quality for several
reasons:
1. Even when source data, such as official topographic maps, have been subject to
stringent quality control, errors are introduced when these data are input to GIS.
2. Unlike a conventional map, which is essentially a single product, a GIS database
normally contains data from different sources of varying quality.
3. Unlike topographic or cadastral databases, natural resource databases contain data
that are inherently uncertain and therefore not suited to conventional quality
control procedures.
4. Most GIS analysis operations will themselves introduce errors.
7
GEOGRAPHIC INFORMATION SYSTEMS Unit 1
Spatial data Input 1. Data entry: use existing data, create new data
2. Data editing
3. Geometric transformation
4. Projection and reprojection
The discipline that deals with all aspects of the handling of spatial data and
geoinformation.
and software, and also people such as the database creators or administrators,
analysts who work with the software, and the users of the end product. Related
terms include geoinformatics, geomatics, and spatial information science. These
are all similar terms which have much the same meaning, although each approach
8
GEOGRAPHIC INFORMATION SYSTEMS Unit 1
has slight differences in the way it deals with problems, some emphasizing
engineering approaches, others computational solutions, and so on.
Project-based GIS applications usually have a clear-cut purpose, and these
applications can be short-lived: the research is carried out by collecting data,
entering data in the GIS, analyzing the data, and producing informative maps. An
example is rapid earthquake damage assessment. Institutional GIS applications, on
the other hand, usually have as their goal the continued administration of spatial
change and the sustained availability of spatial base data. Their needs for
advanced data analysis are usually less, and the complexity of these applications
lies more in the continued provision of trustworthy data to others. They are thus
long-lived applications. An obvious example are automated cadastral systems.
is a term used in many different ways and which has many different
meanings. A representation of some part of the real world can be considered a
model because the representation will have certain characteristics in common with
the real world. Specifically, those which we have identified in our model design.
This then allows us to study and operate on the model itself instead of the real
world in order to test what happens under various conditions, and help us answer
ata or alter the parameters of the model,
and investigate the effects of the changes.
9
GEOGRAPHIC INFORMATION SYSTEMS Unit 1
structuring a database. This process involves the identification of the kinds of data
that the database will store, as well as the relationships between these kinds of
data.
Most maps and databases can be considered static models. At any point in time,
they represent a single state of affairs. Usually, developments or changes in the
real world are not easily recognized in these models.
Dynamic models or process models address precisely this issue. They emphasize
changes that have taken place, are taking place or may take place sometime in the
future. Dynamic models are inherently more complicated than static models, and
usually require much more computation. Simulation models are an important class
of dynamic models that allow the simulation of real world processes.
1.2.2. Maps
Maps are perhaps the best known (conventional) models of the real world. Maps
have been used for thousands of years to represent information about the real
world, and continue to be extremely useful for many applications in various
domains.
Their conception and design has developed into a science with a high degree of
sophistication. A disadvantage of the traditional paper map is that it is generally
restricted to two-dimensional static representations, and that it is always displayed
in a fixed scale.
The map scale determines the spatial resolution of the graphic feature
representation. The smaller the scale, the less detail a map can show. The
accuracy of the base data, on the other hand, puts limits to the scale in which a
map can be sensibly drawn. Hence, the selection of a proper map scale is one of
the first and most important steps in map design.
A map is always a graphic representation at a certain level of detail, which is
determined by the scale. Map sheets have physical boundaries, and features
spanning two map sheets have to be cut into pieces. Cartography, as the science
and art of map making, functions as an interpreter, translating real world
phenomena (primary data) into correct, clear and understandable representations
for our use. Maps also become a data source for other applications, including the
development of other maps.
With the advent of computer systems, analogue cartography developed into digital
cartography, and computers play an integral part in modern cartography.
Alongside this trend, the role of the map has also changed accordingly, and the
10
GEOGRAPHIC INFORMATION SYSTEMS Unit 1
The traditional role of paper maps as a data storage medium is being taken over
by (spatial) databases, which offe
Notwithstanding these developments, paper maps remain as important tools for
the display of spatial information for many applications.
Databases
epository for storing large amounts of data. It comes with a
1. A database can be used by multiple users at the same time i.e. it allows
concurrent use
2. A database offers a number of techniques for storing data and allows the use of the
most efficient one i.e. it supports storage optimization,
3. A database allows the imposition of rules on the stored data; rules that will be
automatically checked after each update to the data i.e. it supports data integrity,
4. A database offers an easy to use data manipulation language, which allows the
execution of all sorts of data extraction and data updates i.e. it has a query
facility,
5. A database will try to execute each query in the data manipulation language in the
most efficient way i.e. it offers query optimization.
Databases can store almost any kind of data. Modern database systems organize
the stored data in tabular format. A database may have many such tables, each of
which stores data of a certain kind. It is not uncommon for a table to have many
thousands of data rows, sometimes even hundreds of thousands.
Spatial databases and Spatial analysis
A GIS must store its data in some way. For this purpose the previous generation of
there has been an increasing trend in GIS applications that used a GIS for spatial
analysis, and used a database for storage. In more recent years, spatial databases
(also known as geo-databases) have emerged.
Besides traditional administrative data, they can store representations of real world
geographic phenomena for use in a GIS. These databases are special because they
use additional techniques different from tables to store these spatial
representations.
A geo-database is not the same thing as a GIS, though both systems share a
number of characteristics. These include the functions listed above for databases in
general: concurrency, storage, integrity, and querying, specifically, but not only,
spatial data.
A GIS, on the other hand, is tailored to operate on spatial data. It Geodatabases
11
GEOGRAPHIC INFORMATION SYSTEMS Unit 1
Model: GISs help us to analyze and understand more about processes and
phenomena in the real world. In practical terms, this refers to the process of
representing key aspects of the real world digitally (inside a computer). These
representations are made up of spatial data, stored in memory in the form of bits
and bytes, on media such as the hard drive of a computer.
This digital representation can then be subjected to various analytical functions
(computations) in the GIS, and the output can be visualized in various ways.
12
GEOGRAPHIC INFORMATION SYSTEMS Unit 1
As highlighted in in Figure 2.1, the process of translating the relevant aspects of the real
world into a computer representation of it is a domain of expertise by itself. It might be
achieved through direct observations using sensors, and digitizing (converting) the sensor
output for computer usage. This is the domain of remote sensing, the topic of Principles
of Remote Sensing. It may be done by making use of the output of a previous project,
such as a paper map, and re-digitizing it.
We can use the GIS to create visualizations from the computer representation,
either on-screen, printed on paper, or otherwise. It is crucial to understand the
fundamental differences between these notions. The real world, after all, is a
1.4.Geographic phenomena
A GIS operates under the assumption that the relevant spatial phenomena occur in a two-
or three-dimensional Euclidean space, unless otherwise specified. Euclidean space can be
13
GEOGRAPHIC INFORMATION SYSTEMS Unit 1
For instance, in water management, the objects of study might be river basins, agro-
ecologic units, measurements of actual evapo transpiration, meteorological data, ground
water levels, irrigation levels, water budgets and measurements of total water use. Note
that all of these can be named or described, georeferenced and provided with a time
interval at which each exists.
In multipurpose cadastral administration, the objects of study are different: houses, land
parcels, streets of various types, land use forms, sewage canals and other forms of urban
infrastructure may all play a role. Again, these can be named or described, georeferenced
and assigned a time interval of existence.
undetermined.
The array of buoys of the previous chapter is a good example: there is a fixed
number of buoys, and for each we know exactly where it is located. The buoys are
typical examples of (geographic) objects.
14
GEOGRAPHIC INFORMATION SYSTEMS Unit 1
A simple rule-of-thumb is that natural geographic phenomena are usually fields, and
man-made phenomena are usually objects.
1.4.3.Geographic fields
A field is a geographic
area. We can therefore think of a field as a mathematical function f that associates
a specific value with any position in the study area. Hence if (x,y) is a position in
the study area, then f(x,y) stands for the value of the field f at locality (x,y). Fields
can be discrete or continuous.
In a continuous field
15
GEOGRAPHIC INFORMATION SYSTEMS Unit 1
1. Nominal data values are values that provide a name or identifier so that we can
discriminate between different values, but that is about all we can do. Specifically,
we cannot do true computations with these values.
Examples are the names of geological units. This kind of data value is called
categorical data when the values assigned are sorted according to some set of non-
overlapping categories. For example, we might identify the soil type of a given
area to belong to a certain (pre-defined) category.
2. Ordinal data values are data values that can be put in some natural sequence but
that do not allow any other type of computation. Household income, for instance,
natural sequence, but this is all we can say we can not say that a high income is
twice as high as an average income.
3. Interval data values are quantitative, in that they allow simple forms of
computation like addition and subtraction. However, interval data has no
arithmetic zero value, and does not support multiplication or division. For
instance, a temperature of20 C is not twice as warm as 10 C, and thus centigrade
temperatures are interval data values, not ratio data values.
4. Ratio data values allow most, if not all, forms of arithmetic computation. Rational
data have a natural zero value, and multiplication and division of values are
possible operators (distances measured in meters are an example). Continuous
fields can be expected to have ratio data values, and hence we can interpolate
them.
1.4.4.Geographic Objects:
Anyway, at the more refined scale, electric poles, rivers, roads, tramlines, water
pipes, fire hydrants, lakes, vineyards, agricultural land and the forest patches are
examples of geographic objects. Most of the tangible geographic objects have the
defined acceptable boundary and properties such as name, types and status etc.
These are called discrete geographic objects.
16
GEOGRAPHIC INFORMATION SYSTEMS Unit 1
For example, lake Geneva has a defined boundary. Its name (property) is called
Geneva lake. After that boundary, it is not known as Geneva lake. It is one of the
most famous (status) urban lakes in the world. Therefore, Geneva lake is a discrete
geographic object.
1.4.5.Boundaries
Where shape and/or size of contiguous areas matter, the notion of Boundary
comes into play. Location, shape and size are fully determined if we know an
area
A crisp boundary is one that can be determined with almost arbitrary precision,
dependent only on the data acquisition technique applied. Fuzzy boundaries
contrast with crisp boundaries in that the boundary is not a precise line, but rather
17
GEOGRAPHIC INFORMATION SYSTEMS Unit 1
18
GEOGRAPHIC INFORMATION SYSTEMS Unit 1
more likely to have similar values than locations that are far apart commonly referred to
Fields are usually implemented using Tessellation approach and Objects with
Topological (Vector) approach.
1.5.1.Regular tessellations:
A tessellation (or tiling) is a partitioning of space into mutually exclusive cells that
together make up the complete study space. With each cell, some (thematic) value
is associated to characterize that part of space.
Three regular tessellation types are illustrated in Figure 2.5. In a regular
tessellation, the cells are the same shape and size. The simplest example is a
rectangular raster of unit squares, represented in a computer in the 2D case as an
array of n×m elements.
The three most common regular tessellation types: square cells, hexagonal cells,
and triangular cells.
In all regular tessellations, the cells are of the same shape and size, and the field
attribute value assigned to a cell is associated with the entire area occupied by the
cell. In general these cells are named Raster.
A raster is a set of regularly spaced (and contiguous) cells with associated (field)
values. The associated values represent cell values, not point values. This means
that the value for a cell is assumed to be valid for all locations within the cell.
resolution. Sometimes, the word grid is also used, but strictly speaking, a grid
refers to values at the intersections of a network of regularly spaced horizontal and
perpendicular lines (see Figure 2.6). Grids are often used for discrete
measurements that occur at regular intervals. Grid values are often considered
synonymous with raster cells, although they are not.
19
GEOGRAPHIC INFORMATION SYSTEMS Unit 1
The location associated with a raster cell is fixed by convention, and may be the cell
centroid (mid-point) or, for instance, its left lower corner. Values for other positions than
these must be computed through some form of interpolation function, which will use one
or more nearby field values to compute the value at the requested position. This allows us
to represent continuous, even differentiable, functions.
An obvious disadvantage is that they are not adaptive to the spatial phenomenon we want
to represent. The cell boundaries are both artificial and fixed: they may or may not
coincide with the boundaries of the phenomena of interest.
1.5.2.Irregular tessellations:
Irregular tessellations are more complex than the regular ones, but they are also
more adaptive, which typically leads to a reduction in the amount of memory used
to store the data. A well-known data structure in this family upon which many
more variations have been based is the region quadtree. It is based on a regular
tessellation of square cells, but takes advantage of cases where neighboring cells
have the same field value, so that they can together be represented as one bigger
cell. A simple illustration is provided in Figure 2.7.
20
GEOGRAPHIC INFORMATION SYSTEMS Unit 1
The quadtree that represents this raster is constructed by repeatedly splitting up the area
into four quadrants, which are called NW, NE, SE, SW for obvious reasons. This
procedure stops when all the cells in a quadrant have the same field value. The procedure
produces an upside-down, tree-like structure, known as a quadtree.
The vector data model uses the geometric objects of point, line, and area to
represent simple.
Spatial features (Figure 2.8). Dimensionality and property distinguish the three
types of geometric objects as well as the features they represent.
A point has 0 dimension and has only the property of location. A point may also
be called a node, vertex, or 0-cell. A point feature is made of a point or a set of
separate points. Wells, benchmarks, and gravel pits are examples of point
features.
A line is one-dimensional and has the property of length. A line has two end
points and points in between to mark the shape of the line. The shape of a line
may be a smooth curve or a connection of straight-line segments. Smooth curves
are typically fitted by mathematical equations such as splines. Straight-line
segments may represent human-made features such as canals and streets, or they
may simply be approximations of curves. A line is also called an edge, link, chain,
or 1-cell. A line feature is made of lines. Roads, streams, and contour lines are
examples of line features.
An area is two-dimensional and has the properties of area (size) and perimeter.
Made of connected lines, an area may be alone or share boundaries with other
areas. An area may contain holes, such as a national forest containing private land
parcels (holes). The existence of holes means that the area has both external and
internal boundaries. An area is also called a polygon, face, zone, or 2-cell. An
area feature is made of polygons. Examples of area features include timber stands,
land parcels, and water bodies.
21
GEOGRAPHIC INFORMATION SYSTEMS Unit 1
Figure 2.9 Rubber sheet transformation: The space is transformed, Yet many
relationships between the constituents remain unchanged.
The mathematical properties of the geometric space used for spatial data can be
described as follows:
The space is a three-dimensional Euclidean space where for every point we can
determine its three-dimensional coordinates as a triple (x,y,z) of real numbers. In
this space, we can define features like points, lines, polygons, and volumes as
geometric primitives of the respective dimension. A point is zero-dimensional, a
line one-dimensional, a polygon two-dimensional, and a volume is a three-
dimensional primitive.
22
GEOGRAPHIC INFORMATION SYSTEMS Unit 1
The space is a metric space, which means that we can always compute the
distance between two points according to a given distance function. Such a
function is also known as a metric.
The space is a topological space, of which the definition is a bit complicated. In
essence, for every point in the space we can find a neighbourhood around it that
fully belongs to that space as well.
Interior and boundary are properties of spatial features that remain invariant
under topological mappings. This means that under any topological mapping, the
interior and the boundary of a feature remains unbroken and intact.
We can define within the topological space, features that are easy to handle and
that can be used as representations of geographic objects. These features are called
simplices as they are the simplest geometric shapes of some dimension: point (0-
simplex), line segment (1-simplex), triangle (2-simplex), and tetrahedron (3-
simplex).
When we combine various simplices into a single feature, we obtain a simplicial
complex.
23
GEOGRAPHIC INFORMATION SYSTEMS Unit 1
Map scale can be defined as the ratio between the distance on a paper map and the
distance of the same stretch in the terrain. A 1:50,000 scale map means that 1 cm
on the map -
that the ratio is large, so typically it means there is much detail, as in a 1:1,000
-
1:2,500,000 paper map. When applied to spatial data, the term resolution is
commonly associated with the cell width of the tessellation applied.
Digital spatial data, as stored in a GIS, is essentially without scale: scale is a ratio
notion associated with visual output, like a map or on-screen display, not with the
data that was used to produce the map.
Map of Boston area with two different scales 1:100000000 and 1:34000 scale.
Problems:
Q: What is the length in cm of one km on a 1:25,000 map?
Q: The distance between two towns measures 6cm on a map. What is the true distance if
the scale is 1: 50000?
Q: The plans of a house show a room to be 4.0m X 3.6m. The dimensions of the room
on the plans measure 20mm X 18mm. What was the scale used?
Feature Representation
To work in a GIS environment, real world observations (objects or events that can be
recorded in 2D or 3D space) need to be reduced to spatial entities. These spatial entities
can be represented in a GIS as a vector data model or a raster data model.
24
GEOGRAPHIC INFORMATION SYSTEMS Unit 1
Vector
Vector features can be decomposed into three different geometric primitives: points,
polylines and polygons.
Point
Polyline
25
GEOGRAPHIC INFORMATION SYSTEMS Unit 1
no area. And like a point, a line is symbolized using shapes that have a color, width and
style (e.g.
polylines in a GIS.
Polygon
A polygon is composed of one or more lines whose starting and ending coordinate pairs
are the same. Polygons represent both length (i.e. the perimeter of the area) and area.
They also embody the idea of an inside and outside; in fact, the area that a polygon
encloses is explicitly defined in a GIS environment.
Raster
26
GEOGRAPHIC INFORMATION SYSTEMS Unit 1
A raster can be thought of as a special case of an area object where the area is
divided into a regular grid of cells. But a regularly spaced array of marked points
may be a better analogy since rasters are stored as an array of values where each
cell is defined by a single coordinate pair inside of most GIS environments.
Implicit in a raster data model is a value associated with each cell or pixel. This is
in contrast to a vector model that may or may not have a value associated with the
geometric primitive.
The main principle of data organization applied in GIS system is that of a spatial
data layer. A spatial data layer is either representation of a continuous data or
discrete data or a collection of objects of the same kind. Usually data is organized
so that similar elements are in a single data layer. For example, all ATMs would
be in one layer, and all road line objects in another.
A data layer contains spatial data as well as attribute data which further describes
the field or objects in a layer. Attribute data is quite often arranged in a tabular
form, maintained in some kind of geodatabase. Data layers can be overlaid with
each other, inside the GIS package, co as to study combinations of geographic
phenomenon, A GIS can be used to study the spatial relationship between different
phenomenon, requiring compotation which overlays one layer with another .
27
GEOGRAPHIC INFORMATION SYSTEMS Unit 1
Valid time(world time) is the time period during which a fact is true in the real world.
Transaction time is the time period during which a fact stored in the database was known.
Time can be considered to be linear, extending from the past to present and into future.
This view give a single time line. Branching time in which different time lines from
28
GEOGRAPHIC INFORMATION SYSTEMS Unit 1
certain points in time onwards are possible. Cyclic time in which repeating cycles such as
seasons or days of week are recognized ,make more sense and can be useful.
4. Time Granularity-
When measuring time we speak of granularity as the precision of the time value in GIS or
database(e.g. seconds, minutes ,hours, years).In cadastral application, time granularity
might well be a day ,as the law requires deeds to be date marked. In geological mapping
applications, time granularity is more likely in the order of thousand or millions of years.
at
Sample Questions:
29
Geographic Information Systems Unit 2
Unit-II
Data Management and Processing Systems
Contents
2.1 Introduction
2.2 GIS Software.
2.3 Stages of Spatial Data Infrastructure
2.4 Database Management System
2.5 GIS and Spatial Databases
2.1 INTRODUCTION
30
Geographic Information Systems Unit 2
31
Geographic Information Systems Unit 2
supports both raster and vector layers; vector data is stored as either point,
line, or polygon features. Multiple formats of raster images are supported
and the software can georeference images.
5. JUMP GIS / OpenJUMP - UMP is a Java based vector and raster GIS and
programming framework. It reads and writes the file formats ESRI
Shapefile, it reads database datastores PostGIS, SpatiaLite, Oracle Spatial
and MySQL. It writes PostGIS datastore reads raster files (world file
supported) eg. GeoTIFF, TIFF, JPEG, BMP, PNG, FLT, ASC, JPEG
2000.It writes raster eg. GeoTIFF, TIFF, PNG.it supports full geometry and
attribute editing.
6. MapWindow GIS It is an application and set of programmable mapping
components. It has been adopted by the United States Environmental
Protection Agency . MapWindow GIS is distributed as an open source
application under the Mozilla Public License distribution license, It is an
extensible geographic information system. This means that advanced users
or developers can write plug-ins to add additional functionality and pass
these along to any number of the users clients and end users.
7. SAGA GIS (System for Automated Geoscientific Analysis): It is a
geographic information system (GIS) computer program, used to edit
spatial data. It is free and open-source software, developed originally by a
small team at the Department of Physical Geography, University of
Göttingen, Germany, and is now being maintained and extended by an
international developer community. SAGA has a fast-growing set of
geoscientific methods, bundled in exchangeable module libraries. SAGA
GIS is an effective tool with user friendly graphical user interface (GUI)
that requires only about 10 MB disk space. No installation needed
8. uDig[User Friendly Desktop Internet GIS] is a GIS software program
produced by a community led by Canadian-based consulting company
Refractions Research. It is based on the Eclipse platform and features full
layered Open Source GIS. It supports shape files, PostGIS , WMS, and
many other data sources natively. uDig is commonly used as a framework
for building other GIS platforms and applications. Such applications
include DIVA-GIS. Following table represents comparative study of GIS
-sources.
32
Geographic Information Systems Unit 2
Characteristics: It supports remote sensing feature for both vector and raster
processing. Its features include digitizing, editing, analysis and
display of data, and production of quality maps.
Characteristics: It is used to edit spatial data and an effective tool with user friendly
graphical user interface (GUI) that requires only about 10 MB disk
space. No installation needed.
Characteristics: It writes raster eg. GeoTIFF, TIFF, PNG. It supports full geometry
and attribute editing. . It reads and writes the file formats ESRI
Shape file, it reads database data stores PostGIS.
Characteristics: This software allows users to analyze and edit spatial information, in
addition to composing and exporting graphical maps. QGIS supports
both raster and vector layers; vector data is stored as either point,
line, or polygon features. Multiple formats of raster images are
supported and the software can geo-reference images
33
Geographic Information Systems Unit 2
Characteristics: It supports shape files, PostGIS, WMS, and many other data sources
natively. uDig is commonly used as a framework for building other
GIS platforms and applications.
34
Geographic Information Systems Unit 2
As shown in above Fig the architecture consists of 3 tiers: The Client tier that
presents the data to the user; the Middleware Tier consisting of the Web
Application and GIS Servers; and the Server Tier including the database that
stores the spatial and non-spatial data. The roles of the various components of
this architecture include:
1. Client (Presentation) Tier: It supports various web browsers for
presentational purposes. It also includes desktop software for complex
spatial data manipulation and visualization tasks, as well as support for the
increasingly popular mobile devices.
2. Middleware (Application) Tier:
Web server: Map server is an open-source web server that can render maps by
generating objects from the database and its shape files . A scripting language,
such as PHP, provides the interaction between the user and the system and
ge in the database. Open
Database Connectivity (ODBC) protocol is available to connect to the database
and Map server supports numerous OGC standards.
GIS Server: This is distinct from web map servers as it does not provide web
mapping services, but rather provides GIS processing functionality such as
visualization, spatial data.
3. Data (Server) Tier: The geo-database in the data tier efficiently stores,
manages and retrieves data for relevant purposes.
Hardware: It is the computer on which a GIS operates. Today, GIS software runs on
a wide range of hardware types, from centralized computer servers to desktop
computers used in stand-alone or networked configurations.
Software: GIS software provides the functions and tools needed to store, analyze, and
display geographic information. Key software components are
Tools for the input and manipulation of geographic information
A database management system (DBMS)
Tools that support geographic query, analysis, and visualization
A graphical user interface (GUI) for easy access to tools.
Data: The most important component of a GIS is the data. A GIS will integrate spatial
data with other data resources and can even use a DBMS, used by most organizations
to organize and maintain their data, to manage spatial data. In GIS data is classified
into 3 types namely
Spatial Data : It includes co-ordinates, boundaries, wells, road networks
Non-Spatial Data(attribute , a-spatial data): It includes address,
population density, land ownership , Soil PH values
35
Geographic Information Systems Unit 2
People: GIS technology is of limited value without the people who manage the
system and develop plans for applying it to real-world problems.
1. Spatial data Input 1. Data entry: use existing data, create new data
2. Data editing
3. Geometric transformation
4. Projection and re-projection
36
Geographic Information Systems Unit 2
The term spatial data infrastructure was coined in 1993 by the U.S. National Research
Council to denote a framework of technologies, policies, and institutional
arrangements that together facilitate the creation, exchange, and use of geospatial data
and related information resources across an information-sharing community. Such a
framework can be implemented narrowly to enable the sharing of geospatial
information within an organization or more broadly for use at a national, regional, or
global level.
As per Kuhn (2005): "An SDI is a coordinated series of agreements on
technology standards, institutional arrangements, and policies that enable
the discovery and use of geospatial information by users and for purposes
other than those it was created for."
An SDI provides its users with different facilities for finding, viewing,
downloading and processing data. Because the organizations in an SDI are
normally widely distributed over space, computer networks are used as the
means of communication. With the development of the internet, the functional
components of GIS have been gradually become available as web-based
applications.
37
Geographic Information Systems Unit 2
38
Geographic Information Systems Unit 2
The way that data is stored plays a central role in the processing and the
eventual understanding of that data. In most of the available systems, spatial
data is organized in layers by theme and/or scale.
For instance, the data may be organized in thematic categories, such as land
use, topography and administrative subdivisions, or according to map scale.
Spatial data are stored in raster or vector format [comparison of raster and
vector format-covered in types of spatial data].
Maintenance of (spatial) data can best be defined as the combined
activities to keep the data set up-to-date and as supportive as possible to
the user community.
It deals with obtaining new data, and entering them into the system, possibly
replacing outdated data. The purpose is to have an up-to-date stored data set
available.
Spatial queries and process models play an important role in this functionality.
Spatial decision support systems (SDSS) SDSS are a category of information
systems composed of a database, GIS software models, and a so-called
knowledge engine which allow users to deal specifically with locational
problems.
Spatial Analysis of Raster Data: There are four basic functions of raster data
according to which analysis of raster data is done. These are:
39
Geographic Information Systems Unit 2
Buffering
Buffers are common vector analysis tools used to address questions of proximity in a
GIS and can be used on points, lines, or polygons
Geo-processing Operations
- GIS. The term can (and should)
be widely applied to any attempt to manipulate GIS data. The following represents the
most common geo-processing tools.
The dissolve operation combines adjacent polygon features in a single feature
dataset based on a single predetermined attribute.
The append operation creates an output polygon layer by combining the
spatial extent of two or more layers.
The select operation creates an output layer based on a user-defined query that
selects particular features from the input layer. The output layer contains only
those features that are selected during the query.
The merge operation combines features within a point, line, or polygon layer
into a single feature with identical attribute information. Often, the original
features will have different values for a given attribute. In this case, the first
attribute encountered is carried over into the attribute table, and the remaining
attributes are lost.
40
Geographic Information Systems Unit 2
(d) Merge
41
Geographic Information Systems Unit 2
There are various reasons for using a DBMS for data storage and processing.
DBMS supports the storage and manipulation of very large data sets
Some data sets are so big that storing them in text files or spreadsheet files
becomes too awkward for use in practice. The result may be that finding
simple facts takes minutes, and performing simple calculations perhaps
even hours. A DBMS is specifically designed for doing this efficiently.
DBMS supports the concurrent use of the same data set by many users
Many people are usually involved in the data collection, maintenance and
processing. These data sets are often considered to be of a high strategic
value to the owner(s). Moreover, for different users of the database,
different views on the data can be defined.
In this way, users will be under the impression that they operate on their
personal database, and not on one shared by many people. They may all be
DBMS includes data backup and recovery functions and the control of data
redundancy
Regular back-ups of the dataset and automatic recovery schemes provide an
insurance against loss of data. Storing a fact multiple times, a phenomenon
known as data redundancy, can lead to situations in which stored facts may
contradict each other, causing reduced usefulness of the data. A well-
designed database takes care to store single facts only once.
The decision to use a DBMS will depend, among other things, on how
much data there is, what type of use will be made of it, and how many users
might be involved. The alternatives of data management can be traditional
approach.
Data is stored in flat files as records. Records consist of various fields
which are delimited by a space, comma, any special character etc. End of
records and end of files will be marked using any predetermined character
set or special characters in order to identify them. Following are some
problems associated with traditional approaches.
Data Security
Data Redundancy
Data Isolation
Relational Data-Model
A data model is a language that allows the definition of:
The structures that will be used to store the base data,
The integrity constraints that the stored data has to obey at all moments
in time, and
The computer programs used to manipulate the data.
In the relational data model, a database is viewed as a collection of
relations, commonly also known as tables. A table or relation is itself a
collection of tuples (or records). An attribute is a named field of a tuple,
with which each tuple associates a value, the tuple attribute value. A key of
a relation comprises one or more attributes. A value for these attributes
uniquely identifies a tuple.
43
Geographic Information Systems Unit 2
The most common way of defining queries in a relational database is through the
SQL language. SQL stands for Structured Query Language.
Examples: SELECT FROM Parcel WHERE AreaSize > 1000 [tuple selection
from the Parcel relation, using the condition AreaSize > 1000. The indicates that we
want to extract all attributes of the input relation]
The FROM-clause identifies the two input relations; the WHERE-clause states the
join condition.
SELECT Pid, Location FROM Parcel[attribute projection from the Parcel relation.
The SELECT-clause indicates that we only want to extract the two attributes Pid and
Location. There is no WHERE-clause in this query].
GIS software provides support for spatial data and thematic or attribute data.
GISs have traditionally stored spatial data and attribute data separately. This
required the GIS to provide a link between the spatial data (represented with
rasters or vectors), and their non-spatial attribute data.
44
Geographic Information Systems Unit 2
GIS and databases interact to enhance individual strengths and minimize their
weaknesses.
GIS DBMS
The strength of GIS lies in its built-in DBMSs have a long tradition in handling
attribute (i.e., administrative, non-spatial,
functions that derive from this, for purposes tabular, thematic) data in a secure way, for
such as storage, analysis, and map production. multiple users at the same time.
GIS packages themselves can store tabular
Perhaps, DBMSs offer much better table
data.
functionality, since they are specifically
However, they do not always provide a full- designed for this purpose. A lot of the data
fledged query language to operate on the in GIS applications is attribute data, so it
tables. made sense to use a DBMS for it.
A GIS has spatial data represented with The DBMS serves as a centralized data
rasters or vectors, and the attribute data stored repository for all users, while each user
in an external DBMS runs her/his own GIS software, obtaining
its data from the DBMS.
With raster representations, each raster cell This identifier is usually just called the
stores a characteristic value. This value can be objectID featureID
used to look up attribute data in an link the spatial object (as represented in
accompanying database table. vectors) with its attribute data.
45
Geographic Information Systems Unit 2
ti-
polygons - are automatically given a unique
identifier by the system
46
Geographic Information Systems Unit 2
Sample Questions
1.
2. Write a short note on GIS architecture and functionality.
3. Explain SDI in detail.
4. What do you mean by spatial data? Explain stages involved in spatial data
handling.
5. How databases are linked in GIS? Explain reasons for using database.
6. What are the possible alternatives for data management in GIS?
7. Explain relational database with an appropriate example.
8. Explain how GIS and DBMS can be linked together.
9. What do you mean by spatial databases? Explain spatial database
functionality in GIS.
47
Geographic Information Systems Unit 3
Unit-III
Spatial Referencing and Positioning
Contents
A spatial reference is a series of parameters that define the coordinate system and
other spatial properties for each dataset in the geodatabase. It is typical that all
datasets for the same area (and in the same geodatabase) use a common spatial
reference definition.
A spatial reference includes the following:
The coordinate system
The coordinate precision with which coordinates are stored (often referred to as the
coordinate resolution)
Processing tolerances (such as the cluster tolerance)
The spatial extent covered by the dataset (often referred to as the spatial domain)
48
Geographic Information Systems Unit 3
surface. The angles often are measured in degrees (or in grads). The following
illustration shows the world as a globe with longitude and latitude values:
In the spherical system, horizontal lines, or east west lines, are lines of equal
latitude, or parallels. Vertical lines, or north south lines, are lines of equal
longitude, or meridians. These lines encompass the globe and form a gridded
network called a graticule.
The line of latitude midway between the poles is called the equator. It defines
the line of zero latitude. The line of zero longitude is called the prime meridian.
For most GCSs, the prime meridian is the longitude that passes through
Greenwich, England. The origin of the graticule (0,0) is defined by where the
equator and prime meridian intersect.
Latitude and longitude values are traditionally measured either in decimal
degrees or in degrees, minutes, and seconds (DMS). Latitude values are
measured relative to the equator and range from 90° at the south pole to +90°
at the north pole. Longitude values are measured relative to the prime meridian.
They range from 180° when traveling west to 180° when traveling east. If the
prime meridian is at Greenwich, then Australia, which is south of the equator
and east of Greenwich, has positive longitude values and negative latitude
values.
It may be helpful to equate longitude values with x and latitude values with y.
Data defined on a geographic coordinate system is displayed as if a degree is a
linear unit of measure. This method is basically the same as the Plate Carrée
projection. A physical location will usually have different coordinate values in
different geographic coordinate systems.
If two datasets are not referenced to the same geographic coordinate system,
you may need to perform a geographic (datum) transformation. This is a well-
defined mathematical method to convert coordinates between two geographic
coordinate systems. As with the coordinate systems, there are several hundred
predefined geographic transformations that you can access. It is very important
to correctly use a geographic transformation if it is required. When neglected,
49
Geographic Information Systems Unit 3
3.1.2.1Coordinate system
3.1.2.2Map projections
50
Geographic Information Systems Unit 3
earth's surface is clear with the graticule drawn on it. Wrap a piece of paper
around the earth. A light at the center of the earth will cast the shadows of the
graticule onto the piece of paper. You can now unwrap the paper and lay it flat.
The shape of the graticule on the flat paper is different from that on the earth.
The map projection has distorted the graticule.
A spheroid cannot be flattened to a plane any more easily than a piece of
orange peel can be flattened it will tear. Representing the earth's surface in
two dimensions causes distortion in the shape, area, distance, or direction of the
data.
Projection parameters
Each map projection has a set of parameters that you must define. The parameters
specify the origin and customize a projection for your area of interest.
Linear parameters
False easting is a linear value applied to the origin of the x-coordinates.
False northing is a linear value applied to the origin of the y-coordinates. False
easting and northing values are usually applied to ensure that all x- and y-
values are positive. You can also use the false easting and northing parameters
to reduce the range of the x- or y- coordinate values.
For example, if you know all y- values are greater than 5,000,000 meters, you could
apply a false northing of 5,000,000. Height defines the point of perspective above
the surface of the sphere or spheroid for the Vertical Near-Side Perspective projection.
Angular parameters
Azimuth defines the centerline of a projection. The rotation angle measures east
from north. It is used with the azimuth cases of the Hotine Oblique Mercator
projection.
Central meridian defines the origin of the x-coordinates.
51
Geographic Information Systems Unit 3
Longitude of origin defines the origin of the x-coordinates. The central meridian
and longitude of origin parameters are synonymous.
Central parallel defines the origin of the y-coordinates.
i. CLASS: The three classes of map projections are cylindrical, conical and
azimuthal. The Earth's reference surface projected on a map wrapped around
the globe as a cylinder produces a cylindrical map projection. Projected on a
map formed into a cone gives a conical map projection. When projected
directly onto the mapping plane it produces an azimuthal (or zenithal or
planar) map projection. The figure below shows the surfaces involved in these
three classes of projections.
ii. Point of secancy: The planar, conical, and cylindrical surfaces in the figure above
are all tangent surfaces; they touch the horizontal reference surface in one point
(plane) or along a closed line (cone and cylinder) only. Another class of projections is
obtained if the surfaces are chosen to be secant to (to intersect with) the horizontal
reference surface; illustrations are in the figure below. Then, the reference surface is
intersected along one closed line (plane) or two closed lines (cone and cylinder).
Secant map surfaces are used to reduce or average scale errors because the line(s) of
intersection are not distorted on the map (section 4.3 scale distortions on a map).
52
Geographic Information Systems Unit 3
iii. aspect: Projections can also be described in terms of the direction of the projection
plane's orientation (whether cylinder, plane or cone) with respect to the globe. This is
called the aspect of a map projection. The three possible aspects are normal,
transverse and oblique. In a normal projection, the main orientation of the projection
surface is parallel to the Earth's axis (as in the figures above for the cylinder and the
cone). A transverse projection has its main orientation perpendicular to the Earth's
axis. Oblique projections are all other, non-parallel and non-perpendicular, cases. The
figure below provides two examples.
53
Geographic Information Systems Unit 3
A particular map projection can have any one of these three properties. No map
projection can be both conformal and equal-area. A projection can only be equidistant
(true to scale) at certain places or in certain directions.
Map and GIS users are mostly confronted in their work with transformations from one
two-dimensional coordinate system to another. This includes the transformation of
polar coordinates delivered by the surveyor into Cartesian map coordinates (section
2.5) or the transformation from one 2D Cartesian (x,y) system of a specific map
projection into another 2D Cartesian (x,y) system of a defined map projection.
54
Geographic Information Systems Unit 3
3.1.2.Satellite-based Positioning
GPS Segments
The Global Positioning System basically consists of three segments: Space Segment,
Control Segment and User Segment.
i. Space Segment: ( Satellite that orbits the earth and radio signals that they emits)
The Space Segment contains 24 satellites, in 12-hour near-circular orbits at altitude of
about 20000 km, with inclination of orbit 55°. The constellation ensures at least 4
satellites in view from any point on the earth at any time for 3-D positioning and
navigation on world-wide basis. The three axis controlled, earth-pointing satellites
continuously transmit navigation and system data comprising predicted satellite
ephemeris, clock error etc.
ii. Control Segment: (the ground station that monitors and maintain the space
segment component)
This has a Master Control Station (MCS), few Monitor Stations (MSs) and an UpLoad
Station (ULS). The MSs are transportable shelters with receivers and computers;
which passively track satellites, accumulating ranging data from navigation signals.
This is transferred to MCS for processing by computer, to provide best estimates of
satellite position, velocity and clock drift relative to system time. The data thus
processed generates refined information of gravity field influencing the satellite
motion, solar pressure parameters, position, clock bias and electronic delay
characteristics of ground stations and other observable system influences. Future
navigation messages are generated from this and loaded into satellite memory once a
day via ULS which has a parabolic antenna, a transmitter and a computer.
55
Geographic Information Systems Unit 3
Principle of positioning:
There are two, general operating modes from which GPS-derived positions can be
obtained absolute and relative (or differential) positioning. Within each of these two
modes, range measurements to the satellites can be performed by tracking either the
phase of the satellite's carrier signal
3.1.2.Methods of Observations:
The different methods of observations with GPS include, absolute positioning, relative
positioning in translocation mode, relative positioning using differential GPS
technique, and kinematic GPS surveying technique.
56
Geographic Information Systems Unit 3
1. Absolute Positioning:
1. A receiver on or above the planet, also equipped with a clock, receives the
message slightly later, and reads its own clock.
2. From the time delay observed between the two clock readings, and knowing
the speed of radio transmission through the medium between (satellite) sender
and receiver, the receiver can compute the distance to the sender, also known
pseudorange
57
Geographic Information Systems Unit 3
signals may cause error that is an order of magnitude larger than all other
error sources combined.
Incorrect clock reading: Even atomic clocks can be off by a small margin, and since
Einstein, we know that travelling clocks are slower than resident clocks, due to a so-
called relativistic effect. If one understands that a clock that is off by 0.000001 sec
Incorrect orbit position: The orbit of a satellite around our planet is easy to describe
mathematically if both bodies are considered point masses, but in real life they are not.
For the same reasons that the Geoid is not a sim-
gravitation field that a satellite experiences in orbit is not simple either. Moreover, it is
disturbed by solar and lunar gravitation, making its flight path slightly erratic and
difficult to forecast exactly.
Both types of error are strictly monitored by the ground control segment, which is
responsible for correcting any errors of this nature, but it does so by applying an
agreed upon tolerance. A control station can obviously compare results of positioning
computations like discussed above with its accurately known position, flagging any
until errors have been corrected, and brought to within the tolerance. This may be
done by uploading a correction on the clock or orbit settings to the satellite.
Thirdly, the medium between sender and receiver may be of influence to the
radio signals. The middle atmospheric layers of strato- and mesosphere are
relatively harmless and of little hindrance to radio waves, but this is not true of
the lower and upper layer. They are, respectively:
The troposhere:
phenomena that we call the weather. It is an obstacle that delays radio waves in
a rather variable way.
The ionosphere: the most outward part of the atmosphere that starts at an
altitude of 90 km, holding many electrically charged atoms, thereby forming a
protection against various forms of radiation from space, including to some
extent radio waves. The degree of ionization shows a distinct night and day
rhythm, and also depends on solar activity.
58
Geographic Information Systems Unit 3
The latter is a more severe source of delay to satellite signals, which obviously means
that pseudoranges are estimated larger than they actually are. When satellites emit
radio signals at two or more frequencies, an estimate can be computed from
differences in delay incurred for signals of different frequency, and this will allow for
the correction of atmospheric delay, leading to a 10 50% improvement of accuracy. If
this is not the case, or if the receiver is capable of receiving just a single frequency, a
model should be applied to forecast the (especially ionospheric) delay, typically taking
into account the time of day and current latitude of the receiver.
Fourth in this list is the error occurring when a radio signal is received via two
or more paths between sender and receiver, some of which typically via a
bounce off of some nearby surface, like a building or rock face. The term
applied to this phenomenon is multi-path; when it occurs the multiple
receptions of the same signal may interfere with each other . Multi-path is a
difficult to avoid error source.
At any point
59
Geographic Information Systems Unit 3
Up until this point, we have assumed that the receiver determines the range of a
satellite by measuring time delay on the received ranging code. There exists a
more advanced range determination technique known as carrier phase
measurement. This typically requires more advanced receiver technology, and
longer observation sessions. Carrier phase measurement can currently only be
used with relative positioning, as absolute positioning using this method is not
yet well developed.
60
Geographic Information Systems Unit 3
1. GPS
61
Geographic Information Systems Unit 3
The navigation message contains the satellite orbit and satellite clock error in-
formation, as well as some general system information. GPS also carries a fifth,
encrypted military signal carrying the M-code. GPS uses WGS84 as its
reference system. It has been refined on several occasions and is now aligned
with the WGS84 and ITRFITRF at the level of a few centimeters worldwide.
GPS has adopted UTC as its time system.
In the civil market, GPS receivers of varying quality are available, their quality
depending on the embedded positioning features: supporting single- or dual-
frequency, supporting only absolute or also relative positioning, performing
code measurements or also carrier phase measurements. Leica and Trimble are
two of the well-known brands in the high-precision, professional surveying do-
GPS manufacturers main; Magellan and Garmin, for instance, operate in the
lower price, higher volume consumer market range, amongst others for
recreational use in outdoor activities. Many of these are single frequency
receivers, doing only code measurements, though some are capable of relative
positioning. This includes the new generation of GPS-enabled mobile phones.
2. GLONASS
GLONASS radio signals are somewhat similar to that of GPS, but differ in the
details. Satellites use different identifier schemes, and their navigation message
use other parameters. They also use different frequencies: GLONASS L1 is at
62
Geographic Information Systems Unit 3
3. Galileo
n
satellite-based positioning system, to become independent of the GPS
monopoly and to support its own economic growth by providing services of
high reliability under civilian control.
In June 2004, the EU and the US agreed to make Galileo and GPS compatible
by adoption of interchangeable satellite signal set-ups. The effect of this
agreement is that the Galileo/GPS tandem satellite system will have so many
satellites in the sky (close to 60) that a receiver can almost always find an
optimal constellation in view. This will be especially useful in situations where
in the past bad signal reception happened: in built-up areas and forests, for
instance. It will also bring the implementation of a Global Navigation Satellite
System (GNSS) closer as positional accuracy and reliability will improve. With
such a system, eventually one expects to implement fully automated air and
road traffic. Automatic aircraft landing, for instance, requires horizontal
accuracy in the order of 4 m, and vertical accuracy below 1 m: these
requirements can currently not be achieved reliably.
63
Geographic Information Systems Unit 3
Spatial data can be obtained from various sources. It can be collected from
scratch, using direct spatial data acquisition techniques, or indirectly, by
making use of existing spatial data collected by others.
Data input is the operation of encoding data for inclusion into a database. The
creation of accurate databases is a very important part of GIS.
Data collection, and the maintenance of databases, remains the most expensive
and time consuming aspect of setting up a major GIS facility. This typically
costs 60-80% of the overall costs of a GIS project.
There are a number of issues which arise when developing a data base for a
planning or management projects. The first issue is should the data be stored in
vector or raster format. Considerations here include:
The nature of the source data e.g. it is already in raster form
The predominant use to which it will be put
The potential losses that may occur in transition
Storage space (increasingly less important)
Requirements for data sharing with other systems/software
64
Geographic Information Systems Unit 3
Any data which is not captured directly from the environment is known as
secondary data.
In contrast to direct methods of data capture described above, spatial data can
also be sourced indirectly. This includes data derived from existing paper maps
Secondary data through scanning, data digitized from a satellite image,
processed data purchased from data capture firms or international agencies, and
so on. This type of data is known as secondary data:
1. Digitizing
2. Scanning
reflected light with a CCD array. The result is an image as a matrix of pixels,
each of which holds an intensity value. Office scanners have a fixed maximum
resolution, expressed as the highest number of pixels they can identify per inch;
Resolution the unit is dots-per-inch (dpi). For manual on-screen digitizing of a
paper map, a resolution of 200 300 dpi is usually sufficient, depending on the
thickness of the thinnest lines. For manual on-screen digitizing of aerial
photographs, higher resolutions are recommended typically, at least 800 dpi.
65
Geographic Information Systems Unit 3
3. Vectorization
The process of distilling points, lines and polygons from a scanned image is
called vectorization.
As scanned lines may be several pixels wide, they are often first thinned to
retain only the centreline. The remaining centreline pixels are converted to
series of (x, y) coordinate pairs, defining a polyline.
Subsequently, OCR features are formed and attributes are attached to them.
This process may be en-tirely automated or performed semi-automatically,
with the assistance of an op-erator. Pattern recognition methods like Optical
Character Recognition (OCR) for text can be used for the automatic
detection of graphic symbols and text.
Vectorization causes errors such as small spikes along lines, rounded corners,
errors in T- and X-junctions, displaced lines or jagged curves. These errors are
corrected in an automatic or interactive post-processing phase.The phases of
the vectorization process are illustrated in Figure
66
Geographic Information Systems Unit 3
4. Metadata
includes:
Data quality information: Positional, attribute and temporal accuracy, lineage, etc.
In essence, metadata answer who, what, when, where, why, and how questions about
all facets of the data made available. Maintaining metadata is an key part in
maintaining data and information quality in GIS. This is because it can serve different
purposes, from description of the data itself through to providing in structions for data
handling. Depending on the type and amount of metadata provided, it could be used to
determine the data sets that exist for a geographic location, evaluate whether a given
data set meets a specified need, or to process and use a data set.
3.2.2.Data Quality:
With the advent of satellite remote sensing, GPS and GIS technology, and the
increasing availability of digital spatial data, resource managers and others who
formerly relied on the surveying and mapping profession to supply high quality map
products are now in a position to produce maps themselves. At the same time, GISs
are being increasingly used for decision support applications, with in- Application
requirements creasing reliance on secondary data sourced through data providers or
via the internet , through geo-web services. The implications of using low-quality data
in important decisions are potentially severe. There is also a danger that uninformed
GIS users introduce errors by incorrectly applying geometric and other
transformations to the spatial data held in their database.
So far we have used the terms error, accuracy and precision without appropriately
defining them.
67
Geographic Information Systems Unit 3
The surveying and mapping profession has a long tradition of determining and
minimizing errors. This applies particularly to land surveying and
photogrammetry, both of which tend to regard positional and height errors as
undesirable. Cartographers also strive to reduce geometric and attribute errors
in their products, and, in addition, define quality in specifically cartographic
terms, for ex-ample quality of linework, layout, and clarity of text.
It must be stressed that all measurements made with surveying and photogram-
metric instruments are subject to error. These include:
68
Geographic Information Systems Unit 3
Location accuracy is normally measured as a root mean square error (RMSE). The
RMSE is similar to, but not to be confused with, the standard deviation of a statistical
sample. The value of the RMSE is normally calculated from a set of check
measurements (coordinate values from an independent source of higher accuracy for
identical points).
Attribute accuracy
We can identify two types of attribute accuracies. These relate to the type of data we
are dealing with:
For nominal or categorical data, the accuracy of labeling (for example the type
of land cover, road surface, etc).
It follows that depending on the data type, assessment of attribute accuracy may range
from a simple check on the labelling of features for example, is a road classified as a
metaled road actually surfaced or not? to complex statistical procedures for
assessing the accuracy of numerical data, such as the percentage of pollutants present
in the soil.
When spatial data are collected in the field, it is relatively easy to check on the
appropriate feature labels. In the case of remotely sensed data, however, considerable
effort may be required to assess the accuracy of the classification procedures. This is
usually done by means of checks at a number of sample points.
3.2.2.4. Temporal accuracy
As noted, the amount of spatial data sets and archived remotely sensed data
has increased enormously over the last decade. These data can provide useful
temporal information such as changes in land ownership and the monitoring of
environmental processes such as deforestation. Analogous to its positional and
attribute components, the quality of spatial data may also be assessed in terms
of its temporal accuracy. For a static feature this refers to the difference in the
values of its coordinates at two different times. This includes not only the
accuracy and precision of time measurements (for example, the date of a
survey), but also the temporal consistency of different data sets. Because the
positional and attribute components of spatial data may change together or
independently, it is also necessary to consider their temporal validity. For
69
Geographic Information Systems Unit 3
example, the boundaries of a land parcel may remain fixed over a period of
many years whereas the ownership attribute may change more frequently.
The field data are then used to construct an error matrix (also known as a
confuse- Error matrix or misclassification matrix) that can be used to evaluate
the accuracy of the classification. For example there are three land types are
identified. For 62 check points that are forest, the classified image identifies
them as forest. However, two forest check points are classified in the image as
agriculture. Vice versa, five agriculture points are classified as forest.
3.2.2.5.Lineage:
Lineage describes the history of a data set. In the case of published maps, some
lineage information may be provided as part of the metadata, in the form of a
note on the data sources and procedures used in the compilation of the data.
Examples include the date and scale of aerial photography, and the date of field
verification. Especially for digital data sets, however, lineage may be defined
more formally as:
that part of the data quality statement that contains information that de-scribes the
source of observations or materials, data acquisition and compilation methods,
conversions, transformations, analyses and derivations that the data has been
All of these aspects affect other aspects of quality, such as positional accuracy.
Clearly, if no lineage information is available, it is not possible to adequately
3.2.2.6.Completeness
Completeness refers to whether there are data lacking in the database compared
to what exists in the real world. Essentially, it is important to be able to assess
what does and what does not belong to a complete dataset as intended by its
70
Geographic Information Systems Unit 3
3.2.2.6.Logical consistency :
compatibility of data with other data in a data set (e.g. in terms of data format),
3.2.3.Data Preparation:
Spatial data preparation aims to make the acquired spatial data fit for use.
Images may require enhancements and corrections of the classification scheme
of the data. Vector data also may require editing, such as the trimming of over-
shoots of lines at intersections, deleting duplicate lines, closing gaps in lines,
and generating polygons. Data may require conversion to either vector format
or raster format to match other data sets which will be used in the analysis.
Additionally, the data preparation process includes associating attribute data
with the spatial features through either manual input or reading digital attribute
files into the GIS/DBMS.
The intended use of the acquired spatial data may require only a subset of the
original data set, as only some of the features are relevant for subsequent anal-
Intended use y axis subsequent map production. In these cases, data and/or
cartographic generalization can performed on the original data set.
Acquired data sets must be checked for quality in terms of the accuracy,
consistency and completeness parameters discussed above. Often, errors can be
identified automatically, after which manual editing methods can be applied to
correct the errors. Alternatively, some software may identify and automatically
correct certain types of errors. Below, we focus on the geometric, topological,
and attribute components of spatial data.
- rd sequence. For
example, crossing lines are split before dangling lines are erased, and nodes are
created at intersections before polygons are generated.
71
Geographic Information Systems Unit 3
With polygon data, one usually starts with many polylines, in an unwieldy for-
mat known as spaghetti data, that are combined in the first step . This results in
fewer polylines with more internal vertices. Then, polygons can be identified
(c).
Sometimes, polylines that should connect to form closed boundaries do not,
and therefore must be connected (either manually or automatically); this step is
not indicated in the figure. In a final step, the elementary topology of the
polygons can be derived (d).
Rasterization or vectorization
Vectorization produces a vector data set from a raster. We have looked at this
in some sense already: namely in the production of a vector set from a scanned
image. Another form of vectorization takes place when we want to identify
features or patterns in remotely sensed imagery. The keywords here are feature
extraction and pattern recognition, which are dealt with in Principles of
Remote Sensing .
If much or all of the subsequent spatial data analysis is to be carried out on
raster data, one may want to convert vector data sets to raster data. This process
is known as rasterization.
It involves assigning point, line and polygon attribute values to raster cells that
overlap with the respective point, line or polygon. To avoid information loss,
the raster resolution should be carefully chosen on the basis of the geometric
resolution.
A cell size which is too large may result in cells that cover parts of multiple
vector features, and then ambiguity arises as to what value to assign to the cell.
If, on the other hand, the cell size is too small, the file size of the raster may
increase significantly.
er
boundaries are only an approximation
72
Geographic Information Systems Unit 3
A GIS project usually involves multiple data sets, so the next step addresses the issue
of how these multiple sets relate to each other. There are four fundamental cases to be
considered in the combination of data from different sources:
Differences in accuracy
Images come at a certain resolution, and paper maps at a certain scale. This typically
results in differences of resolution of acquired data sets, all the more since map
features are sometimes intentionally displaced to improve readability of the map. For
instance, the course of a river will only be approximated roughly on a small-scale
map, and a village on its northern bank should be depicted Scale north of the river,
even if this means it has to be displaced on the map a little bit. The small scale causes
an accuracy error. If we want to combine a digitized version of that map, with a
digitized version of a large-scale map, we must be aware that features may not be
where they seem to be. Analogous examples can be given for images at different
resolutions.
In Figure , the polygons of two digitized maps at different scales are coincide, and
polygon boundaries cross each other. This causes small, artefact polygons in the
overlay known as sliver polygons.
73
Geographic Information Systems Unit 3
If the map scales involved differ significantly, the polygon boundaries of the
large-scale map should probably take priority, but when the differences are
slight, we need interactive techniques to resolve the issues. There can be good
reasons for having data sets at different scales.
A good example is found in mapping organizations; European organizations
maintain a
single source database that contains the base data. This database is essentially
scale-less and contains all data required for even the largest scale map to be
produced. For each map scale that the mapping organization produces, they
derive a separate database from the foundation data. Such a derived database
may be called a cartographic database as the data stored are elements to be
printed on a map, including, for instance, data on where to place name tags, and
what colour to give them. This may mean the organization has one database for
the larger scale ranges and other databases for the smaller scale ranges. They
maintain a ulti-scale data environment.
Differences in representation
We have already talked about the various ways to represent spatial data. Some-times
data is acquired as point samples or observations, other times it is in the form of
polygons with attribute data.
When points need to be translated into raster, we need to perform something
known as point data transformation
.Some advanced GIS applications require the possibility of representing the
same geographic phenomenon in different ways. These are called multi
representation systems.
The production of maps at various scales is an example, but there are
numerous others. The commonality is that phenomena must sometimes be
viewed as points, and at other times as polygons.
For example, a small-scale national road network analysis may represent
villages as point objects, but a nation-wide urban population density study
should regard all municipalities as represented by polygons.
The complexity that this requirement entails is that the GIS or the DBMS
must keep track of links between different representations for the same
phenomenon, and must also provide support for decisions as to which
representations to use in which situation.
The links between various representations for the same object maintained by
the system allows switching between them, and many fancy applications of
their use seem possible.
When individual data sets have been prepared as described above, they some-
74
Geographic Information Systems Unit 3
Edge matching is the process of joining two or more map sheets, for instance,
after they have separately been digitized.
Merging adjacent data sets can be a major problem. Some GIS functions, such
as line smoothing and data clean-up (removing duplicate lines) may have to be
performed. Figure illustrates a typical situation.
Some GISs have merge or edge-matching functions to solve the problem
arising from merging adjacent data. At the map sheet edges, feature
representations have to be matched in order for them to be combined.
Coordinates of the objects along shared borders are adjusted to match those in
the neighboring data sets. Mismatches may still occur, so a visual check, and
interactive editing is likely to be required.
75
Geographic Information Systems Unit 3
76
Geographic Information Systems Unit 3
The aim is to use measurements to obtain a representation of the entire field using
point samples. In this section we outline four techniques to do so:
regression:
In trend surface fitting, the assumption is that the entire study area can be
represented by a formula f (x, y) that for a given location with coordinates (x,
y) will give us the approximated value of the field in that location. The key
objective in trend surface fitting is to derive a formula that best describes the
field. Various classes of formula exist, with the simplest being the one that
describes a flat, but tilted plane:
f (x, y) = c1 · x + c2 · y + c3
If we believe and this judgement must be based on domain expertise that
the field under consideration can be best approximated by a tilted plane, then
the problem of finding the best plane is the problem of determining best values
for the coefficients c1, c2 and c3.
This is where the point measurements earlier obtained become important.
Statistical techniques known as regression techniques can be used to determine
values for these coefficients c that best fit with the measurements.
A plane will be fitted through the measurements that makes the smallest overall
error with respect to the original measurements.
Triangulation:
77
Geographic Information Systems Unit 3
known as a kernel) is defined, and initially placed over the top left raster cell.
Measurement Moving window averaging points falling inside the window
contribute to the averaging computation, those outside the window do not.
This is why moving window averaging is said to be a local interpolation
method. After the cell value is computed and assigned to the cell, the window
is moved one cell to the right, and the computations are performed for that cell.
Successively, all cells of the raster are visited in this way.
10
-2 -
78
Geographic Information Systems Unit 3
(a) 2 0 2 4 6 8 10 12
Kriging
The first step in the kriging procedure is to compare successive pairs of point
measurements to generate a semi-variogram.
In the second step, the semi-variogram is used to calculate the weights used in
interpolation. Although kriging is Semi-variogram a powerful technique, it
should not be applied without a good understanding of geostatistics, including
the principle of spatial autocorrelation
79
Geographic Information Systems Unit 3
Sample Questions:
80
Geographic Information systems Unit 4
Unit-IV
Spatial Data Analysis
Contents
4.1.1.1- Measurement:
Geometric measurement on spatial features includes counting, distance and area size
computations. Measurement are categorized into two types namely
81
Geographic Information systems Unit 4
82
Geographic Information systems Unit 4
The area size of a selected part of the raster (a group of cells) is calculated as
the number of cells multiplied by the cell area size. The distance between two
raster cells is the standard distance function applied to the locations of their
respective mid-points, obviously taking into account the cell resolution where a
raster is used to represent line features as strings of cells through the raster, the
length of a line feature is computed as the sum of distances between
consecutive cells.
When exploring a spatial data set, the first thing one usually wants is to select certain
features, to (temporarily) restrict the exploration. Such selections can be made on
geometric/spatial grounds, or on the basis of attribute data associated with the spatial
features. We discuss both techniques below.
83
Geographic Information systems Unit 4
conditions using logical connectives like AND, OR, NOT. Examples: Area <
400000 AND LandUse = 80, Area < 400000 OR LandUse = 80, NOT
(LandUse = 80)
Spatial selection using topological relationships: It consist of two steps:
1. To select one or more features as the selection objects, and
2. To apply a chosen spatial relationship function to determine the
selected features that have that relationship with the selection objects.
Selecting features that are inside selection objects: This type of query uses
the containment relationship between spatial objects. polygons can contain
polygons, lines or points, and lines can contain lines or points, but no other
containment relationships are possible.
Selecting features that intersect: The intersect operator identifies features that
are not disjoint.
Selecting features adjacent to selection objects Adjacency is themeetrelation-
ship .It expresses that features share boundaries, and therefore it applies only to
line and polygon features.
Selecting features based on their distance One may also want to use the
distance function of the GIS as a tool in selecting features. Such selections can
be searches within a given distance from the selection objects, at a given
distance, or even beyond a given distance.
4.1.1.3- Classification
At times an input data set may have itself been the result of a classification, and in
such a case we called it as reclassification. For example, we may have a soil map
that shows different soil type units and we would like to show the suitability of
units for a specific crop.
In this case, it is better to assign to the soil units an attribute reclassification of
suitability for the crop. Since different soil types may have the same crop
suitability, a classification may merge soil units of different type into the same
category of crop suitability.
84
Geographic Information systems Unit 4
Fig (1.1)
Point-in-polygon: in the vector model, this overlay determines the points lying
inside a specific polygon. In the example, we are looking for all hotels that are
85
Geographic Information systems Unit 4
located in the settlement areas. In the resulting layer, the points have the additional
information whether or not they are in the settlement area. In the raster model, the
points in question are visible through the addition of the two input layers.
Fig (1.2)
Line-in-polygon: the overlay of lines and polygons is more complex. The example
shows the calculation of road sections located in the settlement area. In the vector
model, the topology changes: the original line is cut into shorter segments by the
intersection points. It has to be specified for each segment whether it is inside or
outside the settlement polygon. In the raster model, a simple addition identifies the
interest areas.
Fig (1.3)
86
Geographic Information systems Unit 4
Feature overlays from vector data are created when one vector layer (points, lines,
or polygons) is merged with one or more other vector layers covering the same
area with points, lines, and/or polygons. A resultant new layer is created that
combines the geometry and the attributes of the input layers.
87
Geographic Information systems Unit 4
Raster overlay involves two or more different sets of data that derive from a
common grid. The separate sets of data are usually given numerical values. These
values then are mathematically merged together to create a new set of values for a
single output layer.Below is an example of raster overlay by addition. Two input
rasters added together to create an output raster with the values for each cell
summed.
This approach is often used to rank attribute values by suitability or risk and then
add them, to produce an overall rank for each cell. The various layers can also be
assigned a relative importance to create a weighted ranking (the ranks in each layer
are multiplied by that layer's weight value before being summed with the other
layers).
computation on it.
Interpolation functions predict unknown values using the known values at nearby
locations. This typically occurs for continuous fields, like elevation, when the data
actually stored does not provide the direct answer for the location(s) of interest.
Topographic functions determine characteristics of an area by looking at the
immediate neighborhood as well. Typical examples are slope computations on
digital terrain models (i.e. continuous spatial fields). The slope in a location is
defined as the plane tangent to the topography in that location.
Various computations can be performed, such as:
88
Geographic Information systems Unit 4
Thus, we can say that the principle of buffer zone generation is simple: we
select one or more target locations, and then determine the area around them,
within a certain distances.
Types of buffering:
A) Data Type
Point Buffering: It involves the creation of a circular polygon about the point
of interest. Buffer distance, in this case the radius of the polygon, can either be
same for each point or the user can define it using attribute table or lookup
table. Example: An area covered by a mobile tower.
Polyline Buffering:A polygon is created at a specific width on a side or around
a line or its segment. Example: An area to be affected by construction of canal
89
Geographic Information systems Unit 4
B) Width
Varying: Buffer distance around a feature varies for different segments of the
features. Example: An area along the banks of a river depending on the
intensity of the adjacent land use.
Single: Buffer zone with just one concerned parameter is created. Example:
Area owned by somebody along a border.
Multiple: Creation of concentric zone around the feature of interest. Example:
An area affected by radiation from a nuclear plant. Creation of zone expanding
more on one direction than the other. Example: An area inundated if a dam
breaks
Other:Buffer zones often have dissolved boundaries so that there are no
overlapping areas between the buffer zones. In some cases though, it may also
be useful for boundaries of buffer zones to remain intact, so that each buffer
zone is a separate polygon and so as to identify the overlapping areas.
90
Geographic Information systems Unit 4
2) Thiessen Polygons:
Procedure:
The process goes through following steps:
1. Collects the points from a point layer (vertices if the source is a polyline or
polygon layer)
2. Clean duplicate points
3. Generates Convex Hull
4. Creates a TIN structure
5. Generates perpendicular bisectors for each tin edge.
6. Builds the Thiessen polygons
7. Clips the Thiessen polygons feature class with the convex hull.
Application Area:
Thiessen polygons are used to generate soil maps based on irregular distributed
sample points. The border between two soil types is assumed to be at the half of the
distance between two sample points that exhibit different soil types.
91
Geographic Information systems Unit 4
metric space can be chosen and weighting factors etc. can be included to the
calculation.
diffusion.
Diffusion
computation involves one or more target locations, which are better called
source. They are the locations of the source of whatever spreads. The
computation also involves a local resistance raster, which for each cell provides
a value that indica
cell.
The value in the cell must be normalized: i.e. valid Resistance for a
location(s) and the local resistance raster, the GIS will be able to compute a
new raster that indicates how much minimal total resistance the spread has
witnessed for reaching a raster cell.
given, least-cost path, determined again by local terrain characteristics. The typical
case arises when we want to determine the drainage patterns in a catchment: the
92
Geographic Information systems Unit 4
derived from a limited set of sample values. These may be based on direct
measurement, such as height values for an elevation surface, or temperature
values for a temperature surface; between these measured locations values are
assigned to the surface by interpolation.
Surfaces can be represented using contour or iso-lines, arrays of points, TINs,
and rasters; however, most surface analysis in GIS is done on raster or TIN
data.
Rasters are rectangular arrays of cells (or pixels), each of which stores a value
for the part of the surface it covers. A given cell contains a single value, so the
amount of detail that can be represented for the surface is limited to the size of
the raster cells.
Rasters are the most commonly used surface models in ArcGIS. The simplicity
of the raster data structure makes calculations on rasters (or comparisons
between rasters) faster for rasters than other surface representations. Rasters are
also used to store imagery, scanned maps, and categorical information, such as
land use class, which is often derived from imagery.
Surface analysis involves several kinds of processing, including extracting new
surfaces from existing surfaces, reclassifying surfaces, and combining surfaces.
Certain tools extract or derive information from a surface, a combination of
surfaces, or surfaces and vector data.
Some of these tools are primarily designed for the analysis of raster terrain
surfaces. These include Slope, Aspect, and Hill shade.
Slope tool calculates the maximum rate of change from a cell to its neighbors,
which is typically used to indicate the steepness of terrain.
Aspect toolcalculates the direction in which the plane fitted to the slope faces
for each cell. The aspect of a surface typically affects the amount of sunlight it
receives (as does the slope); in northern latitudes places with a southerly aspect
tends to be warmer and drier than places that have a northerly aspect.
93
Geographic Information systems Unit 4
Hill shade tool calculates the intensity of lighting on a surface given a light
source at a particular location it can model which parts of a surface would be
shadowed by other parts.
94
Geographic Information systems Unit 4
95
Geographic Information systems Unit 4
Optimal path finding techniques are used when a least-cost path between two
nodes in a network must be found. The two nodes are called origin and
destination, respectively. The aim is to find a sequence of connected lines to
traverse from the origin to the destination at the lowest possible cost.
The cost function, it can be defined as the total length of all lines on the path.
The cost function not only length of the lines, but also their capacity, maximum
transmission (travel) rate and other line characteristics, for instance to obtain a
reasonable approximation of travel time. There can even be in which the nodes
visited add to the cost of the path as well. These may be called turning costs.
Problems related to optimal path finding are ordered optimal path finding and
unordered optimal path finding. Both have an extra requirement that a number of
additional nodes needs to be visited along the path. In ordered optimal path
finding, the sequence in which these extra nodes are visited matters; in unordered
optimal path finding it does not.
2. Network partitioning
on the network, known as the trace origin. For a node or line to be conditionally
96
Geographic Information systems Unit 4
connected, it Means that a path exists from the node/line to the trace origin, and
that the connecting path fulfills the conditions set.
Tracing is the computation that the GIS perform to find the paths from the trace
origin that obey the tracing conditions. It is a rather useful function for many
network-related problems.
and GIS are more or less inseparable, as GIS is itself a tool for modeling
the real world.
The solution to a (spatial) problem usually depends on a (large) number of
parameters. Since these parameters are often interrelated, their interaction is
made more precise in an application model.
Many kinds of application models exist, and they can be classified in many
different ways. Here we identify five characteristics of GIS-based application
models:
1. The purpose of the model : It refers to whether the model is descriptive,
prescriptive or predictive in nature. Descriptive models attempt to answer the
97
Geographic Information systems Unit 4
These various errors may affect the outcome of spatial data manipulations. In
addition, further errors may be introduced during the various processing steps.
One of the most commonly applied operations in geographic information
systems is analysis by overlaying two or more spatial data layers each such
layer will contain errors, due to both inherent inaccuracies in the source data
and errors arising from some form of computer processing, for example,
rasterization.
During the process of spatial overlay, all the errors in the individual data layers
contribute to the final error of the output. The amount of error in the output
depends on the type of overlay operation applied. For example, errors in the
results of overlay using the logical operator AND are not the same as those
created using the OR operator. Following are some common sources of error
introduced into GIS analyses.
98
Geographic Information systems Unit 4
Sample Questions
99
Geographic Information Systems Unit 5
Unit-V
Data Visualization
Contents
100
Geographic Information Systems Unit 5
accurately as possible
Topographic maps are used to represents the
detailed information of a location including
the natural and human made phenomenon.
The 3Dimensional feature are represented
using contours lines
They may include forest land, infrastructure,
land use, relief, hydrology, geographic
names and reference grids. The designs are
based on conventions and the set of symbols using visual variables
Eg : use of blue to represent water
Visual Communication
The main function of map is to communicate geographic information. Inform
the map users about the location and nature of geographic phenomena and
spatial pattern
The main goals of cartographers is to produce a map with cartographic tool
which will be able to convey the main purpose of map.
Since GIS has spread so widely in day today life, that even a simple excel sheet
has mapping caliber. Thinking process is accelerated by continued
developments in hardware and software
Dynamic presentation and user interaction is possible by using media like
DVD-ROMs and WWW
102
Geographic Information Systems Unit 5
also has an important impact on cartography. It offers the users interacting with
the map in real time .
Interaction with the map will stimulate t
add more functionality to the map.
Geovisualization
It refers to the set of tools and techniques supporting the analysis of geospatial
data through the use of interactive visualization. Map based scientific
visualization is also known as geovisualization.
It covers both presentation and exploration function of map. Presentation is
described as public visual communication since it is related to wide audiences.
Exploration is described as private visual thinking because its about an
individual manipulating with data. Exploration involves expertise and
knowledge about dealing with data.Exploration means to search for spatial,
temporal, spatio-temporal patterns or relationships or trends.
Visual Communication Process
103
Geographic Information Systems Unit 5
Types of Data
104
Geographic Information Systems Unit 5
Visual variables :
Variation in symbol appearance are grouped together into six categories called
visual variables. Visual variables are used to make one symbol different from
.
105
Geographic Information Systems Unit 5
106
Geographic Information Systems Unit 5
5.6.3 :- Text
Text is used to convey information. Text can be combined with visual variable
of different colors to specify different objects like blue for water bodies. The
positioning of text also matters with respect to the object that it refers. Text
elements in the map body are also known as labels. Interactive editing is also
provided to improve the final map.
Point features should have names to upper right. Line features are mentioned in
blocks and parallel to course of line feature. Area features have name in
feature, size dependent on extent.
107
Geographic Information Systems Unit 5
Interactive labelling: If the placement does not work well the label can be
moved immediately
Dynamic labelling : where the computer takes over the labelling task using
inbuilt algorithms
108
Geographic Information Systems Unit 5
Sample Questions:
109