10.07.2015 Views

3. Integrated analysis of spatial and attribute data - RTC, Regional ...

3. Integrated analysis of spatial and attribute data - RTC, Regional ...

3. Integrated analysis of spatial and attribute data - RTC, Regional ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

International Training Course on “Climate Analysis <strong>and</strong> Applications”10-19 October 2011WMO <strong>RTC</strong>-Turkey FacilitiesAlanya, Antalya, TURKEYFUNDAMENTALS OFGISDr. Arzu ErenerSelcuk University, Department <strong>of</strong> GeomaticsEngineeringemail: ae76@hotmail.com.tr1


Content1. Fundamentals <strong>of</strong> GIS2. What is GIS?<strong>3.</strong> Geo-Spatial Data4. The Nature <strong>of</strong> Geographic Data5. Organizing Geographic Data For Analysis6. The Components <strong>of</strong> GIS7. Spatial Data Models8. GIS Analysis Functions9. Principles <strong>of</strong> GeoStatistical Analysis2


What is GIS?A widely accepted definition is that:“A GIS is a computer based system that provides the following four sets<strong>of</strong> capabilities to h<strong>and</strong>le georeferenced <strong>data</strong>” (Aron<strong>of</strong>f, 1989):inputmanagement (storage <strong>and</strong> retrieval)manipulation <strong>and</strong> <strong>analysis</strong>output3


What is GIS?‣One <strong>of</strong> the best ways to introduce GIS is to consider the generic types <strong>of</strong>questions they have been designed to answer.‣These include questions about location, patterns, trends <strong>and</strong> conditions:Where are particular features found?What geographical patterns exist?Where have changes occured over a given time period?4


What is GIS?‣Where do certain conditions apply?‣What will the <strong>spatial</strong> impacts be if an organisation takes certain action?5


‣A GIS pr<strong>of</strong>essionals were then asked to find appropriate sources <strong>of</strong> <strong>spatial</strong><strong>data</strong> that could be used to represent these criteria.‣A range <strong>of</strong> sources was identified including:7


It is one <strong>of</strong> the most important elements <strong>of</strong> a GISIs it ‘<strong>spatial</strong>’ or ‘geographic’?Spatial Data: Data with an associated <strong>spatial</strong> location (with respect to agiven reference frame)‣medical images are referenced to the human body‣engineering drawings are referenced to a mechanical object‣architectural drawings are referenced to a buildingGeographic <strong>data</strong>/geo-<strong>spatial</strong> <strong>data</strong>: <strong>data</strong> whose underlying referenceframe is the earth surface.NOTE: Spatia <strong>data</strong> <strong>and</strong> Geo-<strong>spatial</strong> <strong>data</strong> <strong>of</strong>ten used interchangeably8


The Nature <strong>of</strong> Geographic DataGeographic <strong>data</strong> are commonly characterized as having four majorcomponents Geographic Position: geographic <strong>data</strong> are recorded in a coordinatesystem Attributes: characteristics <strong>of</strong> geographic features Relationships: the relationships between the geographic features Time: geographic <strong>data</strong> are referenced to a point in time or a period <strong>of</strong>time9


Geographic PositionThe Nature <strong>of</strong> Geographic Data‣A gis requires that a common coordinate system be used for all the <strong>data</strong> settherefore they can overlay one another.10


AttributesThe Nature <strong>of</strong> Geographic Data‣<strong>spatial</strong> <strong>attribute</strong>: the physical dimension or the width <strong>of</strong> a road‣non-<strong>spatial</strong> <strong>attribute</strong>: a class, such as a vegetation type <strong>and</strong> thedescriptive information, such as the name <strong>of</strong> the owner <strong>of</strong> a parcel11


The Nature <strong>of</strong> Geographic DataSpatial RelationshipsThese relationships are generally very numerous, may be complex, <strong>and</strong> are important.• Spatial relationships:‣• Topological relationships(independent on coordinates)for example: next to, inside‣• Metrical relationships(dependent on coordinates):for example: ten kilometres away• Non-<strong>spatial</strong> relationships:for example: Owner <strong>of</strong>, builder <strong>of</strong>.what it is near ?12


TimeThe Nature <strong>of</strong> Geographic DataGeographic <strong>data</strong> are referenced to a point in time or a period <strong>of</strong> timeMLC map <strong>of</strong> 2006 MLC map <strong>of</strong> 200913


Organizing Geographic Data ForAnalysis‣In GIS a <strong>data</strong> layer consists <strong>of</strong> logicallyrelated geographic features <strong>and</strong> their<strong>attribute</strong>s‣the organizing principal <strong>of</strong> <strong>data</strong> layers is beto group similar feature types..‣each layer is registered positionally toother layers through a common coordinatesystem‣in principle, the number <strong>of</strong> layers isunlimited, restrictions being imposed only bystorage space14


There may be three main reasons for segregating geographic information intoseparate layersOrganizing Geographic Data ForAnalysis‣ to simplify the combination <strong>of</strong> features‣ to perform manipulation <strong>and</strong> <strong>analysis</strong> on multiple <strong>data</strong> sets‣ to let a smaller scale <strong>data</strong>base lead to a larger scale <strong>data</strong>base showing moredetail15


The Components <strong>of</strong> GIS‣All <strong>of</strong> these components need to be in balance for the system to be successful.‣No one part can run without the other.16


The Components <strong>of</strong> GIS‣People include a plethora <strong>of</strong> positions including:‣GIS managers,‣<strong>data</strong>base administrators,‣application specialists,‣systems analysts, <strong>and</strong>‣programmers.17


The Components <strong>of</strong> GISMethods include‣how the <strong>data</strong> will be retrieved,‣input into the system,‣stored,‣ managed,‣transformed,‣analyzed, <strong>and</strong> finally presented in a final output.18


The Components <strong>of</strong> GISHardware consists <strong>of</strong>:‣enough power to run the s<strong>of</strong>tware,‣enough memory to store large amounts <strong>of</strong> <strong>data</strong>,‣<strong>and</strong> input <strong>and</strong> output devices such as scanners,‣digitizers,‣GPS <strong>data</strong> loggers,‣media disks, <strong>and</strong> printers. (Carver, 1998)19


The Components <strong>of</strong> GISS<strong>of</strong>twareAll packages must be capable <strong>of</strong>:‣ <strong>data</strong> input,‣ storage,‣ management,‣ transformation,‣ <strong>analysis</strong>, <strong>and</strong> output.Some examples for the s<strong>of</strong>tware are:‣ ArcInfo‣ Intergraph‣ MapInfo‣ Smallworld‣ SICAD20


Data InputThe Components <strong>of</strong> GISThe most time consuming <strong>and</strong> costly aspectProcedure <strong>of</strong> encoding <strong>data</strong>The creation <strong>of</strong> an accurate <strong>and</strong> well documented <strong>data</strong>-base is critical to theoperation <strong>of</strong> GISḊata quality information includes the:‣date <strong>of</strong> collection‣the positional accuracy‣classification accuracy‣complateness‣<strong>and</strong> the method used to collect <strong>and</strong>encode the <strong>data</strong>21


Data InputData QualityErrors in the <strong>data</strong> set can addunpleasant <strong>and</strong>costly hoursto implementing a GIS <strong>and</strong> the results <strong>and</strong> conclusions <strong>of</strong> the GIS <strong>analysis</strong>most likely will be wrong.Several guidelines to look at include:LineagePositional Accuracy Attribute Accuracy Logical ConsistencyCompletenes22


Data QualityLineageThe source material from which the <strong>data</strong> were derivedhistory, the source <strong>of</strong> <strong>data</strong> <strong>and</strong>processing stepsInclude all dates <strong>of</strong> the source material <strong>and</strong> updates <strong>and</strong> changes made to it.(Guptill, 1995)23


Positional AccuracyData Qualityexpected deviance in the geographic location <strong>of</strong> an object from its true ground positionincludes measures <strong>of</strong> thehorizontal <strong>and</strong>vertical accuracyThere are two components <strong>of</strong> positional accuracy:the bias <strong>and</strong>the precision24


Attribute AccuracyData QualityAn <strong>attribute</strong> is a fact about some location, set <strong>of</strong> locations, or features on the surface<strong>of</strong> the earth.The source <strong>of</strong> error usually lies within the collection <strong>of</strong> these facts.CompletenesA check to see if relevant <strong>data</strong> is missing with regards to the features <strong>and</strong> the <strong>attribute</strong>s.Grouped into two categories:Complateness <strong>of</strong> coverageClassification25


SPATIALDATAMODELS26


GIS Spatial Data ModelsThere are two fundamental approaches <strong>of</strong> representing geographic features:‣Raster Data Model <strong>and</strong>‣Vector Data ModelRaster DataThe real worldVector Data27


GIS Spatial Data ModelsRepresentation <strong>of</strong> Raster <strong>and</strong> Vector Data Model28


Comparison <strong>of</strong> Vector <strong>and</strong> Raster Models29


Raster Data ModelIn the raster <strong>data</strong> model, l<strong>and</strong> cover is represented by a regular grid <strong>of</strong> aquareEach cell will have a value corresponding to its l<strong>and</strong> cover type.Raster <strong>data</strong> are good at:‣representing continuous <strong>data</strong> (e.g., slope, elevation, chemicalconcentrations)‣representing multiple feature types (e.g., points, lines, <strong>and</strong> polygons) assingle feature types (cells)‣rapid computations ("map algebra") in which raster layers are treated aselements in mathematical expressions‣<strong>analysis</strong> <strong>of</strong> multi-layer or multivariate <strong>data</strong> (e.g., satellite image processing<strong>and</strong> <strong>analysis</strong>)‣hogging disk space30


Raster Data Model‣The size <strong>of</strong> the file increases rapidly with resolution.‣Since the file size is related to the area <strong>of</strong> coverage, it increases by thesquare <strong>of</strong> the increase in resolution.31


Raster Data ModelRaster file can be achieved by using various methods <strong>of</strong> <strong>data</strong> compression such as:‣ 1. Traditional raster encoding‣ 2. Run_Length raster encoding‣<strong>3.</strong> Quadtree raster encoding32


1. Traditional Raster EncodingIt repeats storing the same value for each cell33


1. Traditional Raster Encoding34


2. Run_Length raster encoding‣In runlength encoding the adjacent cells along a row that have the same valueare treated as a group termed a run.35


<strong>3.</strong> Quadtree raster encoding‣provides a more compact raster representation.‣Here consider that you dont use one size cells instead you use a variable sized gridcell dividing the area.‣A coarse resolution (large cells) is used to encode large homogenious areas.‣A finer resolution is used for areas <strong>of</strong> high <strong>spatial</strong> variability.36


<strong>3.</strong> Quadtree raster encodingRoot : the point from which all other branches expended.Leaf : the point from which there is no further branching.Nodes: All other points in the tree.İllustration <strong>of</strong> the components <strong>of</strong> quadtree37


Vector Data ModelIn the vector <strong>data</strong> model, features on the earthare represented aspointslines / routespolygons / regionsTINs (triangulated irregular networks)Every position in the map space has a uniquecoordinate valueVector <strong>data</strong> are good ataccurately representing true shape <strong>and</strong>sizerepresenting non-continuous <strong>data</strong> (e.g.,rivers, political boundaries, road lines,mountain peaks)creating aesthetically pleasing mapsconserving disk space38


Vector Data ModelSpaghetti Model‣It is the simpliest vector <strong>data</strong> model‣ In this model the paper map is translated line-for-line into a list <strong>of</strong> x,ycoordinates‣ Although all the <strong>spatial</strong> features are recorded, the <strong>spatial</strong> relationshipsbetween these features are not encoded39


Vector Data Model‣The <strong>data</strong> model is really a mapexpressed in cartesiancoordinates.‣The spagethi model is efficientfor digitally reproducing maps.‣However, inefficient for types <strong>of</strong><strong>spatial</strong> <strong>analysis</strong> since any <strong>spatial</strong>relationship must be derived bycomputation.40


Vector Data ModelTopological Model‣Topology is geometry without coordinates‣Topology is the mathematical method used to define <strong>spatial</strong> relationships.‣Mental maps are mostly topologicalNotes: The instruction “Go ahead until you hit the next intersection <strong>and</strong> turn left”is topological41


Topological ModelTopological relationships will not be affected by continuous distortions42


Topological ModelNodes, Arcs, <strong>and</strong> PolygonsTopological <strong>data</strong> structures are <strong>of</strong>ten based on nodes, arcs, <strong>and</strong> polygons‣Arc: a series <strong>of</strong> points that starts <strong>and</strong> end at a node.‣Node: intersection point where two or more arcs meet.‣Polygon: comprised <strong>of</strong> a closed chain <strong>of</strong> arcs that represent the boundries <strong>of</strong>the area.43


Topological ModelOther names for Nodes, Arcs, <strong>and</strong> Polygons Nodes: vertices, 0-cells Arcs: chains, links, edges, 1-cells Polygons: faces, areas, 2-cells44


Topological ModelArc-Node <strong>data</strong> model45


Some Basic Topological RelationshipsSpatial queries can be processed much more quickly using topology tablesthan they can be done by calculation from the coordinate <strong>data</strong>.‣Connectivity describes, if two points are connected by a line‣Enclosure describes, if an entity is enclosed by another‣Adjacency describes, if two polygons share a common border‣Contiguity is the <strong>spatial</strong> relationship <strong>of</strong> adjacencyForexample: biologists might be interested in the habitats that occur next to each other (Contiguity).46


Triangulated Irregular Network (TIN)‣A vector-based topological <strong>data</strong> model‣A TIN represents the terrain surface as a set <strong>of</strong> interconnected triangles‣each triangle's surface are defined by the X,Y,Z coordinates <strong>of</strong> thethree corner points <strong>and</strong> represented by a plane‣ each piece <strong>of</strong> the triangle will fit with its neighboring pieces <strong>and</strong>therefore, the surface will be continuous47


Triangulated Irregular Network (TIN)Delanuay Triangulation:‣ Of the algorithms available, the Delanuay Triangulation is widely used‣ Artificial boundary points are defined to form a perimeter around theedges <strong>of</strong> the <strong>data</strong> set area‣ The triangulation starts with any pair <strong>of</strong> the artificial boundary pointsknown as inital known neighbours (A <strong>and</strong> B in the Figure)48


Triangulated Irregular Network (TIN)Delanuay Triangulation:‣The search for the next neighbour is made by constructing a circle with the baseAB diameter <strong>and</strong> searching to the clockwise to find if any point falls within this circle‣ If no <strong>data</strong> point lies within the circle, the circle is increased in size to perhaps twicethe area <strong>of</strong> the original circle49


GIS ANALYSIS FUNCTIONS50


A Classification <strong>of</strong> GIS Analysis Functions:1. Maintenance <strong>and</strong> <strong>analysis</strong> <strong>of</strong> the <strong>spatial</strong> <strong>data</strong>‣ Format transformations‣ Geometric transformations‣ Transformations between map projections‣ Conflation‣ Edge matching‣ Editing <strong>of</strong> graphic elements‣ Line coordinate thinning2. Maintenance <strong>and</strong> <strong>analysis</strong> <strong>of</strong> <strong>attribute</strong> <strong>data</strong>‣ Attribute editing functions‣ Attribute query functions51


A Classification <strong>of</strong> GIS Analysis Functions:<strong>3.</strong> <strong>Integrated</strong> <strong>analysis</strong> <strong>of</strong> <strong>spatial</strong> <strong>and</strong> <strong>attribute</strong> <strong>data</strong> Retrieval/classification‣ retrieval‣ classification‣ measurement Overlay operations Neighbourhood operations‣ search‣ line in polygon <strong>and</strong> point in polygon‣ topographic functions‣ thiessen polygons‣ interpolation‣ contour generation Connectivity functions‣ contiguity measures‣ proximity‣ network‣ spread‣ seek‣ intervisibility‣ illumination‣ perspective view52


A Classification <strong>of</strong> GIS Analysis Functions:4. Output formatting‣ Map annotation‣ Text labels‣ Texture patterns <strong>and</strong>53


1. Maintenance <strong>and</strong> <strong>analysis</strong> <strong>of</strong> the <strong>spatial</strong> <strong>data</strong>Format Transformation: the transformation <strong>of</strong> files into the <strong>data</strong> structure <strong>and</strong>file formats used internally by the GISGeometric Transformation: Used to assign ground coordinates to a map or<strong>data</strong> layer whitin the GIS or to adjust one <strong>data</strong> layer so that it can becorrrectly overlayed on another <strong>of</strong> the same area. (the procedure is calledregistration)1. Registration by Relative Position2. Registration by Absolute Position54


1. Maintenance <strong>and</strong> <strong>analysis</strong> <strong>of</strong> the <strong>spatial</strong> <strong>data</strong>Geometric Transformation:1. Registration by Relative PositionOne <strong>data</strong> layer, termed as the slave, is registred to a second <strong>data</strong> layer, termedthe master.2. Registration by Absolute PositionEach <strong>data</strong> layer is seperately registered to the same geographic coorinate system(such as UTM coordinates)55


Edge-matching :Maintenance <strong>and</strong> <strong>analysis</strong> <strong>of</strong> the <strong>spatial</strong> <strong>data</strong>‣An editing procedure to adjust the position <strong>of</strong> features extending across adjacentmap sheet boundaries.‣This function ensures that all features that cross adjacent map sheets have thesame edge locations.‣Links are used when matching features in adjacent coverages.56


Maintenance <strong>and</strong> <strong>analysis</strong> <strong>of</strong> the <strong>spatial</strong> <strong>data</strong>Editing Functions :Editing functions are used to add, delete, or manipulate the geographic position <strong>of</strong>features.Sliver or splinter polygons are thin polygons that occur along the borders <strong>of</strong> polygonsfollowing digitizing <strong>and</strong> the topological overlay <strong>of</strong> two or more coverages.57


Maintenance <strong>and</strong> <strong>analysis</strong> <strong>of</strong> the <strong>spatial</strong> <strong>data</strong>Line Coordinate ThinningUsed to reduce the quantity <strong>of</strong> coordinate <strong>data</strong> that must be stored in the GIS whenmore coordinates are entered than needed (Aran<strong>of</strong>f, 1989)58


2. Maintenance <strong>and</strong> <strong>analysis</strong> <strong>of</strong> <strong>attribute</strong> <strong>data</strong>This group <strong>of</strong> functions is used to edit, check, <strong>and</strong> analyze the non-<strong>spatial</strong><strong>attribute</strong> <strong>data</strong>Attribute editing functions - allow <strong>attribute</strong>s to be retrieved, examined <strong>and</strong>changed.Can add new <strong>attribute</strong>s <strong>and</strong> delete old ones.Attribute query functions - retrieve records in the <strong>attribute</strong> <strong>data</strong>baseaccording to a set <strong>of</strong> conditions specified by the operator (Aran<strong>of</strong>f, 1989).Forexample: <strong>attribute</strong>s <strong>of</strong> forest cover<strong>data</strong> stored in two tables can queried together togenerate a report <strong>of</strong> the total area <strong>of</strong> forest more than 30 years old.59


Attribute query functions2. Maintenance <strong>and</strong> <strong>analysis</strong> <strong>of</strong> <strong>attribute</strong> <strong>data</strong>60


<strong>3.</strong> <strong>Integrated</strong> <strong>analysis</strong> <strong>of</strong> <strong>spatial</strong> <strong>and</strong> <strong>attribute</strong> <strong>data</strong>The power <strong>of</strong> GIS lies in its ability to analyse <strong>spatial</strong> <strong>and</strong> <strong>attribute</strong> <strong>data</strong>together.The group <strong>of</strong> functions have been subdivided into four categories:‣Retrieval/classification‣Overlay operations‣Neighbourhood operations‣Connectivity functions61


<strong>3.</strong> <strong>Integrated</strong> <strong>analysis</strong> <strong>of</strong> <strong>spatial</strong> <strong>and</strong> <strong>attribute</strong> <strong>data</strong>Retrieval/classification‣Retrieval operations involved selective search, manipulation <strong>and</strong> output <strong>of</strong> <strong>data</strong>‣Information from <strong>data</strong>base tables can be accessed directly through the map, ornew maps can be created using information in the tabular <strong>data</strong>base.Commonly used retrievalfunctions are:‣ browsing‣query windowgenerationFor example:“select all polygons <strong>of</strong> a certain type <strong>of</strong> soil within 20km diameter radius <strong>of</strong> aspecified location”62


<strong>3.</strong> <strong>Integrated</strong> <strong>analysis</strong> <strong>of</strong> <strong>spatial</strong> <strong>and</strong> <strong>attribute</strong> <strong>data</strong>Classification is the procedure <strong>of</strong> identifying a set <strong>of</strong> features belonging to agroup <strong>and</strong> assigning a name to that group.63


<strong>3.</strong> <strong>Integrated</strong> <strong>analysis</strong> <strong>of</strong> <strong>spatial</strong> <strong>and</strong> <strong>attribute</strong> <strong>data</strong>‣Generalization, called map dissolve, is the process <strong>of</strong> making a classification lessdetailed by combining classes.‣Generalization is <strong>of</strong>ten used to reduce the level <strong>of</strong> classification detail to make anunderlying pattern more apparent64


<strong>3.</strong> <strong>Integrated</strong> <strong>analysis</strong> <strong>of</strong> <strong>spatial</strong> <strong>and</strong> <strong>attribute</strong> <strong>data</strong>•Measurement functions is the calculation <strong>of</strong> distances between points, length <strong>of</strong>lines, perimeters <strong>and</strong> areas <strong>of</strong> polygons <strong>and</strong> the size <strong>of</strong> a group <strong>of</strong> cells with thesame class.Spatial measurementsinclude:‣distances betweenpoints‣• length <strong>of</strong> lines‣• perimeters <strong>and</strong>areas <strong>of</strong> polygons‣• the size <strong>of</strong> a group<strong>of</strong> cells with the sameclass‣• 3-D measurements65


<strong>3.</strong> <strong>Integrated</strong> <strong>analysis</strong> <strong>of</strong> <strong>spatial</strong> <strong>and</strong> <strong>attribute</strong> <strong>data</strong>Overlay Operation‣Overlay <strong>analysis</strong> is a technique <strong>of</strong> deriving new information from two or morelayers <strong>of</strong> <strong>data</strong> covering the same area‣Overlay operations are usually performed more efficiently in raster basedsystems.‣Overlay operations include:‣ Arithmetic overlay‣ Logical overlay66


<strong>3.</strong> <strong>Integrated</strong> <strong>analysis</strong> <strong>of</strong> <strong>spatial</strong> <strong>and</strong> <strong>attribute</strong> <strong>data</strong>Overlay OperationArithmetic operations include:‣• addition‣• subtraction‣• multiplication‣• divisionA logical overlay involves finding those area where a specified set <strong>of</strong>conditions occur (or not ocur ) together67


<strong>3.</strong> <strong>Integrated</strong> <strong>analysis</strong> <strong>of</strong> <strong>spatial</strong> <strong>and</strong> <strong>attribute</strong> <strong>data</strong>Addition68


Subtraction<strong>3.</strong> <strong>Integrated</strong> <strong>analysis</strong> <strong>of</strong> <strong>spatial</strong> <strong>and</strong> <strong>attribute</strong> <strong>data</strong>‣Image subtraction is <strong>of</strong>ten used to identify changes that have occurredbetween images collected on different dates69


<strong>3.</strong> <strong>Integrated</strong> <strong>analysis</strong> <strong>of</strong> <strong>spatial</strong> <strong>and</strong> <strong>attribute</strong> <strong>data</strong>MultiplicationThe precedure is being used to convert the <strong>data</strong> from units <strong>of</strong> inches to units inmilimiters by multiplying each rainguige value by 25.4 mm/inc70


<strong>3.</strong> <strong>Integrated</strong> <strong>analysis</strong> <strong>of</strong> <strong>spatial</strong> <strong>and</strong> <strong>attribute</strong> <strong>data</strong>Logical Overlay OperationsExample: Desirable areas for groving a certain type <strong>of</strong> crop might bedefined as:“ those areas that have an agricultural l<strong>and</strong> <strong>and</strong> have fertile soil “If the l<strong>and</strong> use <strong>and</strong> soil <strong>data</strong> are represented as separate <strong>data</strong> layers then, alogical overlay operation can identify the locations where these conditions occurtogether71


Logical Operations on Raster: LOGICAL AND72


Logical Operations on Raster: LOGICAL OR73


Overlay Analysis‣One <strong>of</strong> the major factors affecting the performance <strong>of</strong> the overlayoperations is the <strong>data</strong> model‣Vector <strong>and</strong> raster models differ significantly in the way overlayingoperations are implemented74


Overlay AnalysisVector overlay generates new: nodes arcs polygons• The vector overlay generally requires significantly more processingtime than raster overlay• The new polygons inherit the <strong>attribute</strong>s from the original input layers75


Vector overlayThe process <strong>of</strong> subdividing polygons is termed clippingClipping makes overlay operations more complex in vector domain than in rasterdomain.76


Raster overlay‣Raster overlay processing involves retrieving <strong>and</strong> comparing thecorresponding pixels from both layers‣The overlay operation is carried out two layers at a time‣There is no need to calculate intersections <strong>of</strong> boundaries or make anymodifications to take the feature boundaries77


Comparison <strong>of</strong> Vector <strong>and</strong> Raster OverlayOperations‣Overlay operations are easier in raster <strong>data</strong>‣ However, sparse <strong>data</strong> sets require as much processing as denselypopulated ones‣ In vector <strong>data</strong>, only the <strong>data</strong> <strong>of</strong> interest are processed‣ In many GISs, a hybrid approach, which takes the advantage <strong>of</strong> thecapabilities <strong>of</strong> both <strong>data</strong> models, is used78


Connectivity (Network) Operations:Many types <strong>of</strong> <strong>analysis</strong> can be carried out on networks for;‣ transportation‣ planning‣ utility management‣ airline scheduling‣navigation etc.For example;‣ finding the shortest path through a network between selected point‣ finding the parts <strong>of</strong> the network which can be reached within a giventravel time from a selected point79


Connectivity (Network) Operations:‣Each connectivity function must include the following:‣1. a specification <strong>of</strong> the ways <strong>spatial</strong> elements (such as roads) areinterconnected‣2. a set <strong>of</strong> rules that specify the allowed movement along theseinterconnections‣<strong>3.</strong> a unit <strong>of</strong> measurement‣ The widely used connectivity operations are:‣ contiguity measures‣proximity <strong>analysis</strong>‣ network operations80


Connectivity (Network) Operations:Contiguity measures evaluate the characteristics <strong>of</strong> <strong>spatial</strong> units that areconnected A contiguous area consists <strong>of</strong> a group <strong>of</strong> <strong>spatial</strong> units that are connectedForexample seach a l<strong>and</strong> unit to be used as a park might be specified as a contiguous l<strong>and</strong> unit <strong>of</strong>forest having minimum area <strong>of</strong> 1000 square km.81


Connectivity (Network) Operations:Proximity‣Proximity is a measure <strong>of</strong> distance between features‣It is most commonly measured in units <strong>of</strong> length but can be measuredin other units, such as travel time or noise level‣Four parameters must be specified to measure proximity:‣1. target location (e.g. a road, a school, a park)‣2. unit <strong>of</strong> measure (e.g. distance in meters)‣<strong>3.</strong> function to calculate proximity (e.g straight line distance,travel time)‣4. area to be analyzed82


Connectivity (Network) Operations:ProximityForexample: The 300 ft buffer zone drawn around the roads definesthe forest area where logging is not permitted.83


Connectivity (Network) Operations:Proximity‣A buffer zone is an area <strong>of</strong> a specified width drawn around one or moregeographic features‣Buffering creates a new area, enclosing the buffered object84


Network FunctionsConnectivity (Network) Operations:‣A network is composed <strong>of</strong> interconnected linear features‣ Networks are commonly used for moving resources from one location toanother‣ Network <strong>analysis</strong> operations are performed on topologically structuredvector <strong>data</strong>‣ A GIS is used to perform thee principal types <strong>of</strong> network <strong>analysis</strong>:‣ prediction <strong>of</strong> network loading‣ route optimization‣ resource allocation85


Network FunctionsConnectivity (Network) Operations:86


Connectivity (Network) Operations:Neighbourhood Operations‣Neighbourhood operations evaluate the characteristics <strong>of</strong> the area surrounding aspecified location‣For example:“Counting the number <strong>of</strong> residential buildings within a 5 km radius <strong>of</strong> a fire station”87


Connectivity (Network) Operations:Neighbourhood Operations‣Every neighbourhood function requires the specification <strong>of</strong> at least three basicparameters:1. target location(s)2. specification <strong>of</strong> the neighbourhood<strong>3.</strong> function to be performed88


OUTPUT FUNCTIONSMap AnnotationTitles, Legends, Scale Bars, <strong>and</strong> North Arrows are the simplest forms <strong>of</strong>depicting information concerning the map.Graphic SymbolsGraphic Symbols areused to portray thevarious entities depictedon the map.89


The principles <strong>of</strong> geostatistical <strong>analysis</strong>90


Geostatistical <strong>analysis</strong>‣Sample points taken at different locations in a l<strong>and</strong>scape <strong>and</strong> creates(interpolates) a continuous surface.‣The sample points are measurements <strong>of</strong> some phenomenon such as:‣oil spill,‣ elevation heights‣Soil <strong>and</strong> rock properties‣Climate measurements‣Etc.‣The Geostatistical Analyst derives a surface using the values from themeasured locations to predict values for each location in the l<strong>and</strong>scape.91


Geostatistical Analyst‣A major challenge facing most GIS modelers is to generate the most accuratepossible surface.‣The deterministic methods (eg. IDW) weights the surrounding measured valuesto derive a prediction for each location.‣the weights are based only on the distance between the measured points‣However, the geostatistical methods that are based on statistical models theweights are based not only on the distance between the measured points <strong>and</strong>the prediction location but also on the overall <strong>spatial</strong> arrangement among themeasured points.‣To use the <strong>spatial</strong> arrangement in the weights, the <strong>spatial</strong> autocorrelationmust be quantified.92


Calculate the empirical semivariogram‣Kriging, like most interpolation techniques,is built on the assumption that” things thatare close to one another are more alikethan those farther away (quantified here as<strong>spatial</strong> autocorrelation).”‣The empirical semivariogram is a meansto explore this relationship.93


Calculate the empirical semivariogram‣The Semivariogram/Covariance Cloud allows you to examine the <strong>spatial</strong>autocorrelation between the measured sample points.‣Since closer locations should be more alike, in the semivariogram the closelocations (far left on the x-axis) should have small semivariogram values (low onthe y-axis)‣As pairs <strong>of</strong> locations become farther apart (moving to the right on the x-axis <strong>of</strong>the semivariogram cloud), they should become more dissimilar94


Underst<strong>and</strong>ing a semivariogram—the range, sill, <strong>and</strong> nugget‣ Sample locations separated by distances closer than the range are <strong>spatial</strong>lyautocorrelated, whereas locations farther apart than the range are not.‣range= The distance where the model first flattens out‣sill = The value that the semivariogram model attains at the range (the valueon the y-axis).Theoretically, at zero separation distance (i.e., lag = 0), the semivariogram valueshould be zero.However, at an infinitesimally small separation distance, the difference betweenmeasurements <strong>of</strong>ten does not tend to zero.This is called the nugget effect.For example, if the semivariogram model intercepts the yaxis at 1.34, then 95thenugget is 1.34.


Modeling a semivariogram‣<strong>spatial</strong> modeling <strong>of</strong> the semivariogram, begin with a graph <strong>of</strong> the empiricalsemivariogram, computed for all pairs <strong>of</strong> locations separated by distance h as:Semivariogram = 0.5 * average [ (value at location i - value at location j) 2 ]The empirical semivariogram is a graph <strong>of</strong> the averaged semivariogram values onthe y-axis <strong>and</strong> distance (or lag) on the x-axis96


Calculate the empirical semivariogramThe configuration <strong>of</strong> the points is displayed in orange on the map theThe <strong>spatial</strong> coordinates each point are given as (X,Y).You will use ordinary kriging to predict a value for location (1, 4) (yellow)which is called the prediction location.97


Calculate the empirical semivariogramThe ordinary kriging model is:‣where s =(X,Y) location; one <strong>of</strong> the sample locations is s = (1,5),<strong>and</strong>‣Z(s) is the value at that location; for example, Z(1,5) = 100.‣The model is based on a constant mean m for the <strong>data</strong> (no trend) <strong>and</strong> r<strong>and</strong>omerrors e(s) with <strong>spatial</strong> dependenceThe predictor is formed as a weighted sum <strong>of</strong> the <strong>data</strong>:where‣Z(si) is the measured value at the ith location, for example, Z(1,5) = 100;‣ג i is an unknown weight for the measured value at the ith location; ‣s 0is the prediction location, for example, (1,4); <strong>and</strong> N = 5 for the fivemeasured values98


Calculating the empirical semivariogram‣In a semivariogram, half the differencesquared between the pairs <strong>of</strong> locations (the y-axis) is plotted relative to the distance thatseparates them (the x-axis).‣The distance between two locations is calculated by using the Euclidean distance:100-105=5 2 =25Theempiricalsemivariance=0.5 timesthe difference squared:0.5 * average[(value atlocation i - value atlocation j) 2 ].99


Binning the empirical semivariogram‣With larger <strong>data</strong>sets (more measured samples) the number <strong>of</strong> pairs <strong>of</strong>locations will increase rapidly <strong>and</strong> will quickly become unmanageable.‣Therefore, you can group the pairs <strong>of</strong> locations, which is referred to asbinning.‣1. form pairs <strong>of</strong> points‣2. group the pairs with common distance <strong>and</strong> direction.Shows all possible pairwise links among all12 locations.pairs are groupedbased on common distances <strong>and</strong>directions100


Calculating the empirical semivariogram‣In this example, a bin is a specified range <strong>of</strong> distances.‣That is, all points that are within 1 to 2 meter apart are grouped into the first bin,within 2 to 3 meters apart are grouped into the second bin..so on101


Calculating the empirical semivariogram(1.414+2)/2 (12.5+0+0)/3=4.167102


Calculating the empirical semivariogramthe goal is to solve the equations for all <strong>of</strong>the ג is (the weights), so the predictor canbe formed by usingThe matrix formula for ordinary kriging is:‣The gamma matrix ק contains the modeled semivariogram valuesbetween all pairs <strong>of</strong> sample locations, where ɣ ijdenotes the modeledsemivariogram values based on the distance between the two samplesidentified as the ith <strong>and</strong> jth locations.‣The vector g contains the modeled semivariogram values between eachmeasured location <strong>and</strong> the prediction location, where ɣ i0denotes themodeled semivariogram values based on the distance between the ithsample location <strong>and</strong> the prediction location.103


Fitting a model‣Now you can plot the averagesemivariance versus average distance <strong>of</strong>the bins onto a graph.‣the empirical semivariogram.‣But the empirical semivariogram values cannot be used directly inthe ק matrix because you might get negative st<strong>and</strong>ard errors for thepredictions;‣instead, you must fit a model to the empirical semivariogram.‣Once the model is fit, you will use the fitted model whendetermining semivariogram values for various distances.104


Fitting a model to the empirical semivariogram‣Geostatistical Analyst provides the following functions to choose from tomodel the empirical semivariogram:‣Circular,‣Spherical,‣Tetraspherical,‣ Pentaspherical,‣Exponential,‣Gaussian,‣Rational Quadratic,‣Hole Effect,‣K-Bessel,‣J-Bessel, <strong>and</strong>‣Stable.‣The selected model influences the prediction <strong>of</strong> the unknown values,particularly when the shape <strong>of</strong> the curve near the origin differs significantly.‣The steeper the curve near the origin, the more influence the closestneighbors will have on the prediction.‣As a result, the output surface will be less smooth.‣Each model is designed to fit different types <strong>of</strong> phenomena more accurately.105


Different types <strong>of</strong> semivariogram modelsThe Spherical modelThe Exponential modelForexample:The Spherical model shows a progressive decrease <strong>of</strong> <strong>spatial</strong> autocorrelation(equivalently, an increase <strong>of</strong> semivariance) until some distance, beyond whichautocorrelation is zero.This model is applied when <strong>spatial</strong> autocorrelation decreases exponentially withincreasing distance, disappearing completely only at an infinite distance106


Fitting a model‣For simplicity, the model that you will fit is a least-squares regression line, <strong>and</strong>you will force it to have a positive slope <strong>and</strong> pass through zero.‣The formula to determine the semivariance at any given distance in thisexample is: Semivariance = Slope * DistanceSlope (1<strong>3.</strong>5) = slope <strong>of</strong> the fitted model.Distance (h) = distance between pairs <strong>of</strong> locations107arise due to unbiasednessconstraints


108


‏;ג In order to solve for109


Creation <strong>of</strong> g vector for the unmeasured locationThe g vector for (1,4) is1<strong>3.</strong>5x2=27110


Make a predictionThe kriging weights for the measured Values used, to calculate a prediction forthe location with the unknown value.kriging weights vector is solved by using ק matrix <strong>and</strong> g vector by:46.757+10.3257+……-1.679=102.6218111


Make a prediction‣the weights decrease with distance but are more refined than a straight distanceweighting‣since they account for the <strong>spatial</strong> arrangement <strong>of</strong> the <strong>data</strong>.112


Kriging variance‣One <strong>of</strong> the strengths <strong>of</strong> using a statistical approach is that it is possible to alsocalculate a statistical measure <strong>of</strong> uncertainty for the prediction.‣To do so, multiply each entry in the ג vector times each entry in the g vector <strong>and</strong>add them together to obtain what is known as the predicted kriging variance.‣The square root <strong>of</strong> the kriging variance is called the kriging st<strong>and</strong>ard error.113


‣The kriging st<strong>and</strong>ard error value is <strong>3.</strong>6386.‣If it is assumed that the errors are normally distributed, 95 percent predictionintervals can be obtained in the following way:‣Kriging Predictor + 1.96*sqrt(kriging variance)Which means:‣If predictions are made again <strong>and</strong> again from the same model, in thelong run 95 percent <strong>of</strong> the time the prediction interval will contain the valueat the prediction location.‣In the example, the prediction interval ranges from:‣ 95.49 to 109.75 (102.62 + 1.96 * <strong>3.</strong>64 )114


LABRATORY WORK‣ You will use the mean annualrainfall <strong>data</strong>set.‣ The stations are in the eastern<strong>and</strong> north eastern part <strong>of</strong> Turkey‣ interpolate the rain values at thelocations where values are notknown1. using the default settings <strong>of</strong> theGeostatistical Analyst.2. By analysing <strong>spatial</strong>autocorrelations<strong>3.</strong> Compare the result models‣Exploring your <strong>data</strong>‣Examining the distribution <strong>of</strong> your <strong>data</strong>‣Histogram‣QQ Plots‣Identifying global trends in your <strong>data</strong>‣Underst<strong>and</strong>ing <strong>spatial</strong> autocorrelation <strong>and</strong> directional influences115


THANK YOU116


117


118


119


120


121

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!