Describe how real-world features are represented by GIS data
Explain basic differences between the raster and vector data models
Define map scale and explain how source scale affects the use of map data
Recognize the types of shortcomings associated with GIS data
Correctly cite data sets on a map or in a report
Use ArcGIS Pro to explore data and view maps
TerrSet System Overview
2
3 of 38
3
4 of 38
Types of map data
Source: USGS
Discrete objects exist in a defined location/space
Lookout tower (point).
Stream (line).
Snail habitat (polygon).
Continuous data exist everywhere
Elevation (raster).
4
5 of 38
GIS data models
Source: Esri
Source: USGS
Source: Google Earth and TeleAtlas
The vector model is best for discrete data
state boundaries.
streams.
snail habitat.
The raster model is used for imagery and is best for continuous data
elevation.
aerial photography.
5
6 of 38
Vector data model
The vector data model is best for discrete data
Features are stored map objects
Points, lines, polygons.
A feature class is a collection of similar features stored together, like states or rivers
6
7 of 38
Attribute tables
Source: Esri
Features are linked to tables containing information about the spatial objects
The map object and the table data are connected by a unique integer
7
8 of 38
Raster data model
Source: USGS
The raster model breaks map areas into small squares known as cells or pixels
A single numeric value is stored in each cell, such as elevation
8
9 of 38
Map Layers
A layer refers to the visualization of something on a map.
Layers make maps more contextual and help you focus on specific aspects like assets, roads, and points of interest.
9
10 of 38
10
11 of 38
Map scale
1 cm on map = 100,000 cm on ground
Map scale is the ratio of distance on the map to distance on the ground
It is dimensionless and can be expressed in any units: cm or inches or mm…
11
12 of 38
Generalization
Source: Esri
Generalization is used to simplify map features for clear display
Small scale maps present less detailed versions of objects.
Large scale maps present more detailed versions.
Source scale is the scale at which data are captured
It impacts detail and accuracy of the data set.
12
13 of 38
GIS data and scales
Source: Esri
GIS data may be presented at any scale
It is best not to display it at scales too different from the source scale
Larger scale maps have finer sampling distance (resolution)
13
14 of 38
Impact of source scale
Source: Esri
Two different data sets of the same objects may not align, especially when the source scale is different, as for these Oregon counties from two different data sources
14
15 of 38
15
16 of 38
GIS Accuracy and Precision
17 of 38
Some Basic Definitions
Accuracy is the degree to which information on a map or in a digital database matches true or accepted values.
Precision refers to the level of measurement and exactness of description in a GIS database.
Data quality refers to the relative accuracy and precision of a particular GIS database. These facts are often documented in data quality reports.
Error encompasses both the imprecision of data and its inaccuracies.
18 of 38
Data Quality
Data sets are rarely perfect
No absolute quality standard exists
Quality is defined as the fitness of a data set for a particular purpose
The same data set may be unsuitable for one use but adequate for another
A user has ethical and legal responsibility to determine whether a data set is sufficient for its intended use
18
19 of 38
20 of 38
Types of Error
Positional accuracy and precision
Attribute accuracy and precision
Conceptual accuracy and precision
Logical accuracy and precision
The need for accuracy and precision will vary radically depending on the type of information coded and the level of measurement needed for a particular application.
21 of 38
Sources of Inaccuracy and Imprecision
Burrough (1986) divides sources of error into three main categories:
Obvious sources of error.
Age of data
Areal Cover
Map Scale
Density of Observations
Relevance
Format/Accessibility/Cost
Errors resulting from natural variations or from original measurements.
Measurement Error/Biases
Errors arising through processing.
Numerical Errors
22 of 38
22
23 of 38
Beware of False Precision and False Accuracy!
24 of 38
Geometric accuracy
Is it where it says it is?
Depends on the level of error in original source.
Additional errors may be incurred or propagated during processing.
Assessed by comparing the data to another data set with known high accuracy.
Which road is “correct”? What errors might occur in the locations of both?
Source: Google Earth and TeleAtlas
24
25 of 38
Thematic accuracy
How accurate are the attributes?
One might ask questions about, for example, tree crown density.
How is it measured?
How accurate are the measurements?
What are the possible sources of error?
Source: Esri
Source: Google Earth and TeleAtlas
25
26 of 38
Resolution
Resolution is the sampling interval of measurements during data collection
Spatial sampling resolution.
Distance between GPS points along a road.
Size of pixel for elevation or satellite image raster.
Thematic data resolution.
How fine was the measuring scale?
Were the data classified after measurement?
Temporal data resolution.
How frequently were data sampled?
Daily, monthly, every decade?
Source: Google Earth and TeleAtlas
26
27 of 38
Precision
Precision has two meanings in science
It is NOT the same as accuracy!
Meaning One: the number of significant digits in a measurement
Example: a GPS unit reports locations to the nearest meter. The precision is about 1 meter.
Meaning Two: The statistical variability of a repeated measurement
Example: 20 GPS measurements at same spot have a standard deviation of 5 meters. The precision is about ±10 meters.
Source: Google Earth and TeleAtlas
27
28 of 38
Logical consistency
Logical consistency assesses how well a data set represents the real-world relationships?
Are common boundaries identical.
Do roads actually connect?
Do voting district and county boundaries align?
Source: Esri
28
29 of 38
29
30 of 38
Citing GIS data
Formats may vary for different agencies or companies
A good citation enables someone to find the source should they wish to obtain a copy
Citing data is a professional and ethical responsibility
Data set name (Year published) [source type]. Producer name, producer contact information. Resource URL: [Date accessed].
30
31 of 38
Citation examples
Black Hills RIS Vegetation Database (2008) [downloaded file]. Black Hills National Forest, Custer, SD. URL: http://www.fs.usda.gov/main/blackhills/landmanagement/gis [August, 2010].
Esri™ Data and Maps (2012) [DVD]. Esri™, Inc., Redlands, CA.
National Hydrography Dataset (2015) [downloaded file]. United States Geological Survey on the National Map Viewer. URL: http://viewer.nationalmap.gov/viewer/ [July 23, 2015].
USA Topo Maps (2009) [map service]. Esri™ on ArcGIS Online. URL: http://server.arcgisonline. com/ arcgis/services/USA_Topo_Maps/MapServer [January 1, 2012].
31
32 of 38
32
33 of 38
TerrSet
33
34 of 38
34
35 of 38
Summary
A GIS is a spatially enabled database that uses map and attribute data to answer questions about where things are and how they are related.
Maps may represent discrete features, such as a road or a lake, or continuous measurements, such as elevation or temperature. The vector data model is ideal for discrete data, and the raster data model excels at storing continuous data.
Map scale is the ratio of the size of objects in the map to their size on the ground. The source scale of GIS data affects their accuracy and precision and the map scales for which they are suitable.
35
36 of 38
Summary
Every GIS user has a responsibility to ensure that data are suitable for the proposed application and an obligation to cite the sources of data used.
Data quality is measured in terms of geometric accuracy, thematic accuracy, resolution, and precision.
Metadata store information about GIS data layers to help people use them properly.