Geographic Information Concepts: University of South Carolina Arcview 3.0A Short Course Series
Geographic Information Concepts: University of South Carolina Arcview 3.0A Short Course Series
Copyright © 1998
2
Table of Contents
___________________________________________________________________________________________________________
Prepared by the College of Liberal Arts Computer Lab, USC–- February 1997
3
6.0 Terminology 20
___________________________________________________________________________________________________________
Prepared by the College of Liberal Arts Computer Lab, USC–- February 1997
4
1.0 Background Information be organized. Contact Paul Braun (777-0957) with concerns
about a course you would like offered.
1.1 College of Liberal Arts Support
1.2 Relevant Families of Software
The College of Liberal Arts Computing Lab is one of the
central locations of GIS activities on the University of South 1.2.1 Computer Aided Design (CAD)
Carolina campus. The Lab has developed various ways to
support users throughout campus. Examine the College of CAD is a family of software popular for creating a digital
Liberal Arts web page at http://www.cla.sc.edu/gis/uscgis.html. version of blue prints, drawings, etc. The software is popular
among draftsmen, engineers, architects, and landscape
architects. Many CAD packages have been designed to work
1.1.1 Geospatial Data Server best with one of those particular professions.
Many CAD software packages have a strong emphasis on
The Lab has created a data server that describes what spatial three-dimensional capabilities and shading.
information the lab has gathered and has made that information CAD systems are available on all popular platforms (i.e., PC,
accessible to the public. The server includes spatial data about Macintosh, UNIX). CAD systems tend to organize information
the USC campus, the City of Columbia, Richland and by layers and do not utilize topological structures.
Lexington County, the State of South Carolina, the United
States, and the World. See
http://www.cla.sc.edu/gis/dataindex.html A great deal of marketing literature for CAD software products
confuses the differences between CAD and GIS. Keep in mind
that CAD systems have begun to migrate toward GIS
1.1.2 Course Offerings functionality. AutoDesk’s AutoCAD and Intergraph’s
Microstation are leaders in the CAD software industry.
What courses are offered and when is always up for discussion. ArcView can read AutoCAD and Microstation CAD drawings
A set schedule exists for classes through the spring semester. in their native formats.
Other classes can be added based on enrollment and requests.
If there is enough demand for a particular topic, a course could
1.2.2 Image Processing (IP)
___________________________________________________________________________________________________________
Prepared by the College of Liberal Arts Computer Lab, USC–- February 1997
5
___________________________________________________________________________________________________________
Prepared by the College of Liberal Arts Computer Lab, USC–- February 1997
6
2.0 Data Acquisition Techniques three satellites at any one time) to give the GPS receiver
operator a coordinate location. General measurements can be
obtained immediately (+/- 100s of feet). Measurements can be
2.1 Spatial Data collected and post-processed in a computer to obtain very
accurate measurements (+/- inches).
2.1.1 Digitizing
Digitizing is the process of tracing paper maps on top of a 2.1.4 Remote Sensing
digitizing tablet to convert the drawing to a digital format. On- Remote sensing is the collection of information with a remote
screen digitizing is the process of tracing features off of an object (camera, balloon, satellite, etc.). Imagery can be of any
image on the screen. For example, vegetation information can wavelength in the electro-magnetic spectrum (i.e., infrared,
be digitized from a Landsat image while the image is drawn to visible, radar, etc.). Images from remote sensing are in raster
the screen. Digitizing creates vector data which can be (pixel) format.
converted to raster.
___________________________________________________________________________________________________________
Prepared by the College of Liberal Arts Computer Lab, USC–- February 1997
7
unique feature ID and x and y coordinates. It may also have 495612 3765256 GASOLINE 2
relevant attributes about the features you are mapping. For 497403 3764664 GASOLINE 3
example, 499460 3761745 GASOLINE 3
496054 3759584 GASOLINE 2
Comma Delimited 497089 3761839 GASOLINE 2
Xcoord,Ycoord,type,rating 494612 3761199 CRUDE OIL 1
494612,3764140,GASOLINE,1
494671 3760955 CRUDE OIL 2
494671,3763942,GASOLINE,2 495612 3760000 CRUDE OIL 2
495612,3765256,GASOLINE,2 497403 3762000 CRUDE OIL 3
497403,3764664,GASOLINE,3 499460 3761370 CRUDE OIL 3
499460,3761745,GASOLINE,3
496054 3764000 CRUDE OIL 2
496054,3769584,GASOLINE,2 497089 3763000 CRUDE OIL 2
497089,3761839,GASOLINE,2
494612,3761199,CRUDE OIL,1
As mentioned above, these files can be read into a desktop
494671,3760955,CRUDE OIL,2 mapping package. The x and y fields will be used to represent
495612,3760000,CRUDE OIL,2 the information (type and rating) as points. All other
information will be stored as point attributes.
Tab Delimited
This same file could be tab delimited. A tab delimited file
would replace all the commas with tabs. For example,
___________________________________________________________________________________________________________
Prepared by the College of Liberal Arts Computer Lab, USC–- February 1997
8
___________________________________________________________________________________________________________
Prepared by the College of Liberal Arts Computer Lab, USC–- February 1997
9
Coordinate systems are comprised of spheroids, datums, and Projection is the process of representing a three-dimensional
projections and are specified in terms of units (i.e., feet, meters, surface in two-dimensions. Projections are mathematical
yards, etc.). Coordinate systems locate features by x and y expressions that convert data from a geographic location
coordinates. (latitude and longitude) on a sphere or spheroid to a
representative location on a flat surface (a map). This process
3.1.1 Spheroids distorts at least one of these properties: shape, area, distance, or
direction.
A spheroid is a mathematical description of the earth. Over
time these mathematical expressions have changed from The most popular projections are:
describing the earth as a perfect circle to a spheroid (i.e., an
egg shape). For years the US standard has been Clarke 1866 Conic
but with improvements in measurement techniques, the US • A cone is placed over the globe touching along one or two
standard is moving toward GRS80. standard parallels, and information is transposed onto the
cone,
• Of all the conic projections, the Equidistant Conic, Lambert
3.1.2 Datums Conic Conformal, and Albers Equal-Area Conic
projections are the most popular.
A datum is a set of control points whose geometric
relationships are known, either through measurement or
calculation and is used to define a coordinate system. Datums
are based on a particular spheroid. There are two datums used
almost exclusively in the US, the North American Datum of
1927 (NAD27 based on Clarke 1866) and the North American
Datum of 1983 (NAD83 based on GRS80). Converting digital
data based on NAD27 to NAD83 can migrate features slightly.
___________________________________________________________________________________________________________
Prepared by the College of Liberal Arts Computer Lab, USC–- February 1997
10
Cylindrical
• A cylinder is placed over the globe touching alone one or
two standard parallels, and information is transposed onto
the cylinder,
• Of all cylindrical projections, the Mercator projection is the The most common projections used in commercial or publicly
most popular. available mapping data sets are:
Latitude-longitude
Latitude-Longitude is not a two-dimensional coordinate system
but is commonly referred to as one. Lines of latitude run east
west parallel to the equator. Longitude lines run north-south
and converge at the poles. Therefore, the length of one degree
of longitude varies depending upon the latitude at which it is
Planar measured. For example, one degree of longitude at the equator
is 111 kilometers in length, but the length of one degree of
___________________________________________________________________________________________________________
Prepared by the College of Liberal Arts Computer Lab, USC–- February 1997
11
Most of the data available from the College of Liberal Arts GIS
data server is available in UTM, zone 17 with map units of
meters.
___________________________________________________________________________________________________________
Prepared by the College of Liberal Arts Computer Lab, USC–- February 1997
12
___________________________________________________________________________________________________________
Prepared by the College of Liberal Arts Computer Lab, USC–- February 1997
13
2. Accommodates discrete data such as soils as well as 5. Vector data set graphics tend to be of a higher quality.
continuous data such as topography,
3. Processing algorithms are easier than for vector data sets,
and 3.2.5 Raster Data Structure Resolution
4. Compatible with other data collection and manipulation
software such as in remote sensing. Raster data structure resolution refers to pixel size. A SPOT
monochromatic satellite image has a resolution of 10 meters
while a LANDSAT satellite image has a resolution of 30
3.2.3 Vector Data Structure
meters. The smaller the resolution, the more processing time
and larger the file size. ArcView generally defaults to an
Vector data represents features as points, lines, and polygons. optimized grid cell size for grid creation.
A point feature is an x and y coordinate, a line is a string of
consecutive points, and a polygon is a string of consecutive 3.3 Data Conversion
points that closes back upon itself. Vector data sets can have
topology (see section 3.5). Topology means that, in addition to Just like a WordPerfect document can be converted to a
the position of every feature, the software maintains the spatial Microsoft Word document, so too can raster data be converted
relationships of adjacency and connectivity between features to vector and vice versa. For example, a scanned land use map
(i.e., it “knows” where all features are and how the relate to would be a raster data file. It could be converted into a vector
each other). format in order to have discrete polygons which represent
different landuses.
3.2.4 Benefits of Vector Data Structure
3.4 Topology
1. Less storage space is needed because many pixels don’t
have to be stored for a homogeneous area, Topology is a data structure often used in GIS. Topology is the
2. Feature types can be individually retrieved such as roads or stored relationships between map features. When topology has
water features, been created (such as in an ARC/INFO coverage), the file
3. It is easier to associate a variety of descriptive resource data would “know” its position, “know” what is around it,
with a single resource feature, “understand” its environment by virtue of recognizing its
surroundings, and “know” how to get from A to B.
4. Digitized data does not have to be converted, and
___________________________________________________________________________________________________________
Prepared by the College of Liberal Arts Computer Lab, USC–- February 1997
14
Topological relationships are stored in a series of relational centimeter, or a foot) is equal to 24,000 units of that same size
databases. Each database stores information about a feature. on the surface of the earth.
For example, a database would store the following information
about each individual arc: Small scale maps (e.g., 1:100) show small areas and large
amounts of detail. Large scale maps (e.g., 1:1,000,000) show
large areas and small amounts of detail. Feature display is
• Number of the arc,
scale dependent. For example, a building might be shown as a
• Beginning node number, polygon at 1:100 but as a point at 1:100000. Minimum
• Ending node number, mapping units are used to filter out features and determine if a
• Polygon to its left, and feature should be mapped at that particular scale. For example,
at a scale of 1:24,000 a minimum mapping unit for mapping
• Polygon to its right. wetlands might be 2 acres because any wetland smaller than
that would be too difficult to locate on a 1:24,000 scale map.
In this way, each node, line, and polygon could be defined by However, if the mapping was being done at 1:200 the
its neighbors and who it is connected to. minimum mapping unit could be very small. Therefore, the
smaller the scale (larger the area shown) the more details that
3.5 Map Scale will be left out.
Map scale is shown as a ratio of reduction between real world Scale 2 cm2 1 in2
distances and distances on a map. The following three 1:24,000 23.04 ha 91.8 ac
statements show the same scale: 1:62,500 156 ha 623.0 ac
1:100,000 400 ha 1594.0 ac
1 inch = 2,000 feet 1:250,000 2,500 ha 9964.0 ac
1 inch = 24,000 inches 1:500,000 10,000 ha 39856.0 ac
1:24,000
Area Equivalents of a 2 square centimeters and 1 square inch at
The left side of the ratio is the distance on the map and the different scales
right side of the ratio is the distance on the surface of the earth.
When there are no units expressed in the ration, such as in
1:24,000, that means any unit on the map (e.g., 1 inch, a
___________________________________________________________________________________________________________
Prepared by the College of Liberal Arts Computer Lab, USC–- February 1997
15
___________________________________________________________________________________________________________
Prepared by the College of Liberal Arts Computer Lab, USC–- February 1997
16
4.0 Database Issues Fields within a database are the columns of data. An attribute
of ‘acres’ or ‘owner_name’ is one field. A field is sometimes
4.1 Data Types called an item or a column.
___________________________________________________________________________________________________________
Prepared by the College of Liberal Arts Computer Lab, USC–- February 1997
17
___________________________________________________________________________________________________________
Prepared by the College of Liberal Arts Computer Lab, USC–- February 1997
18
5.0 Data Sources The Liberal Arts Computing Lab has a GIS data server that
contains a great deal of information at multiple scales
There are many places to obtain spatial and database data either (http://www.cla.sc.edu/gis/). That data is useable directly
for free or for a fee. Information can be coded into a across the University’s backbone. However, in today’s current
spreadsheet or relational database manager system and pulled computing environment, it is best to peruse the web site to
into a GIS (if it has coordinate representation of x,y or a discover what data exists, and then come talk to either Lynn
character description such as an address, zip code, city name, Shirley or Paul Braun about accessing that information. The
county name), data can be digitized, scanned, purchased from a lab also has a myriad of CD-ROMs that have been purchased
private vendor, or requested from various public agencies. for distribution. Those data sources include the Digital Chart
of the World, US and South Carolina base map features, and
1990 Census data. These CDs can be viewed at any time by
5.1 Create Your Own stopping in room 3E, Gambrell Hall.
___________________________________________________________________________________________________________
Prepared by the College of Liberal Arts Computer Lab, USC–- February 1997
19
http://sun1.cr.usgs.gov/doc/edchome/ndcdb/ndcdb.html
Census Data
1. Tiger roads,
2. Tract, block group, and block spatial data
3. Databases of demographics, income, housing, etc.
___________________________________________________________________________________________________________
Prepared by the College of Liberal Arts Computer Lab, USC–- February 1997
20
Coverages - An ARC/INFO coverage is the basic unit for Shapefiles - A shapefile is a data structure created for ArcView
handling geographic and attribute features. Technically, a by ESRI. It is a non-topologically structured graphic file that
coverage is a directory with a series of files below it which still has attributes tied to its features. Therefore, it will draw
hold all the graphic location, topology, and attribute faster in ArcView because it doesn’t have to maintain all the
information. A coverage is actually a directory with a series of topology files of a coverage. A shapefile can be converted to a
files below it. coverage or a grid and vice versa. A shapefile is actually made
up of a maximum of 5 different files.
DEM -Digital Elevation Model. A topographic surface
arranged in a data file as a set of regularly-spaced x,y,z Theme - A theme is the generic name for any piece of spatial
locations where z represents elevation. It is also a format for data that is brought into a view within ArcView. A theme
USGS digital elevation data sets and an elevation database for could be a coverage, a shapefile, an AutoCAD drawing
elevation data for the National Mapping Division of the U.S. (*.DWG), an image (*.tif, *.lan, etc.), or a grid.
Geological Survey.
Tiger - Topologically Integrated Geographic Encoding and
DLG - Digital Line Graph. DLG files come from the U.S. Referencing data format used by the US Census Bureau to
Geological Survey and include data from base map categories support census programs and surveys. It was used for the 1990
such as transportation, hydrography, contours, and public land census. TIGER files contain street address ranges along lines
survey boundaries. DLG is also the digital format standards and census tract/block boundaries. This descriptive data can be
published by USGS for exchanging cartographic data files and used to associate address information and census/demographic
for which the USGS delivers Digital Line Graph data sets. data with spatial information.
___________________________________________________________________________________________________________
Prepared by the College of Liberal Arts Computer Lab, USC–- February 1997