SlideShare a Scribd company logo
1 of 8
Download to read offline
Tehnici moderne de evaluare a calitatii datelor in cadastru
             Modern techniques for evaluation of spatial data quality

                                                            s.l. dr. Droj Gabriela - gaby@oradea.ro
                                                     s.l. drd. ing. Suba Stefan –subast@yahoo.com
                                              s.l. drd. Buda Aurelian - budaaurelian@yahoo.com


        For centuries ago cartographers, geographers, surveyors and geodesists have been
involved in collection, storage, analysis and visualization of spatial data. The collections of
spatial data are subject to random, systematic and gross errors. In the last decade GIS became
the main tool for analyzing spatial data in many real life activities. The quality of the
geographical information systems is dependent on the quality of the stored and queried data. In
comparison with the non-geographic information systems, in GIS applications, the quality of data
has a greater influence on the results. Despite the arguments in favor of explicit treatment of
spatial data quality, usually Cadastre and GIS practice continues to proceed as if data were
perfect. The objectives of this study are to identify different techniques for evaluation of spatial
data quality.

       Keywords: GIS, Spatial Data Quality, cadastre

         1. Introduction
         For centuries ago cartographers, geographers, surveyors and geodesists have been
involved in collection, storage, analysis and visualization of spatial data. The collections of
spatial data are subject to random, systematic and gross errors.
         In the last decade GIS became the main tool for analyzing spatial data in many real life
activities. The quality of the geographical information systems is dependent on the quality of the
stored and queried data. In comparison with the non-geographic information systems, in GIS
applications, the quality of data has a greater influence on the results.
         The five main reasons for the main concerns about spatial data quality issues were
identified as:
    a) There is an increasing availability, exchange and use of spatial data;
    b) There is a growing group of users less aware of spatial data quality;
    c) GIS enables the use of spatial data in all sorts of applications, regardless of the data
        quality;
    d) Current GIS offers hardly any tools for handling spatial quality;
    e) There is an increasing distance between those who use the spatial data (the end users) and
         those who are creating the spatial databases [van Oort].
        Despite the arguments in favor of explicit treatment of spatial data quality, usually
Cadastre and GIS practice continues to proceed as if data were perfect. Data are used uncritically
without consideration of the error contained and this can lead to erroneous results, misleading
information, unwise environmental decisions and increased costs.
        The objectives of this study are to identify different techniques for evaluation of spatial
data quality.
2. The quality of spatial data
        The quality of spatial data is defined by a set of characteristic classified in [Dro09]:
        • Dependability characteristics refer to the time related aspects of quality: availability,
            temporal correctness. The temporal correctness is given by the moment of collection
            of the data and by the level of updating.
        • Integrity characteristics represent the applicability of information, the number of
            collected data and their properties: completeness, consistency, correctness.
        • Accuracy characteristics refer the positional accuracy and attribute accuracy.
        The requirements for the quality of the spatial data are depending on the specificity of the
informational system. For example the quality requirements for the cadastres data are higher than
for the environmental application. On the other hand the quality requirements for cadastre data
are usually imposed by local and regional standards.
        ISO 19113 provides a classification scheme for these errors. They are categorized into
different elements and sub elements depending on the nature of the error. Regarding the ISO
19113 standard the data quality aspects are [ISO]:
• Completeness – presence and absence of features and their attributes and relationships:
        o Commission – excess data presents in a dataset,
        o Omission - data absent from dataset,
• Logical consistency – degree of adherence to logical rules of data structure, attribution and
    relationships:
        o Conceptual consistency - adherence to rules of conceptual schema,
        o Domain consistency – adherence of values to the value domain,
        o Formal consistency – degree of which data is stored in accordance with physical
            structure of dataset,
        o Topological consistency – correctness of explicitly encoded topological characteristics
            of a dataset
• Positional accuracy – accuracy of the position of feature
        o Absolute or external accuracy - closeness of reported coordinate values to values
            accepted as or being true,
        o Relative or internal accuracy – closeness of the relative positions of features in a
            dataset to their respective relative positions accepted as or being true,
        o Gridded data position accuracy - closeness of gridded data position values to values
            accepted as or being true,
• Temporal accuracy – accuracy of temporal attributes and temporal relationships of features
        o Accuracy of a time measurement – correctness of a temporal references of an item
            (reporting of error in time measurement)
        o Temporal consistency – correctness of ordered events or sequences, if reported,
        o Temporal validity – validity of data with respect to time
• Thematic accuracy – accuracy of quantitative attributes and the correctness of non-
    quantitative attributes and of the classifications of features and their relationships
        o Classification correctness – comparison of the classes assigned to features or their
            attributes to a universe of discourse (ground truth or reference dataset),
        o Non-quantitative attribute correctness – correctness of non-quantitative attributes,
        o Quantitative attribute accuracy – accuracy of quantitative attributes.
3. Cadastral issues
        Cadastral survey or cadastral map is a comprehensive register of the metes-and-bounds
real property of a country. A cadastre commonly includes details of the ownership, the precise
location, the dimensions (and area), the cultivations if rural and the value of individual parcels of
land. Cadastral information are important in both aspects the legal and spatial part of the
information, however the legal issues is mostly dominating. Cadastre in European Union does not
reply to a unique model, there are used various types of cadastre inventory systems based on
different patterns and functions. The purpose of European Union is not to create a new cadastral
system but to this variety without reducing efficiency of future common developments.
        European cadastre’s can be categorized as: [Jak09]
        • Positional-accurate cadastre in positional-accurate type cadastres the attribute
            (ownership records) and the location are both up-to-date and accurate. These can be
            found in Austria, the Netherlands and Finland (urban areas).
        • Index map cadastre In index map type cadastres, the actual representation of the
            property units can be found either from legal documents or in the field but the
            ownership records are accurate and up-to-date. Index maps present the topological
            relations between the cadastral units. Examples of these cadastres can be found in
            Sweden and Finland (outside urban areas).
        • Mosaic cadastre In mosaic type cadastres there in no legal requirement to register the
            ownership and therefore the cadastre often covers only part of the country. The
            Cadastre contains information of individual ownerships, but there is no direct
            relationship with neighboring parcels. Combinations of different title maps may be
            compiled or reproduced but the topological relations remain undefined. These
            cadastres exist in Great Britain.
        Regarding the requirements of the new cadastral law (7/1996), the Romanian cadastre will
be in the category of positional accurate cadastre, but nowadays the are several parallel cadastral
systems in use in Romania. The cadastral parcel is an important component not only in national
but also in European spatial data infrastructures. In the context of global economy is necessary to
create effective infrastructures and promote efficiency, therefore the systems of spatial data had
to be interoperable.

        4. Standards of spatial data
        To achieve interoperability of geographic information is necessary to establish a set of
minimum standards and policies. This standards and policies must define at three levels[Dro09]:
    • Institutional level represented by organizational rules like access rules, data protection
        and copyright,
    • Technical level consists in all the hardware, software and communication protocols
        compatibility,
    • Semantic level - data standards, the set of public data and process standards.
        In this paper we will focus on the semantic level. Data standards are semantic definitions
that are structured in a model. They describe the minimum requirements of objects, features or
items that are (will be) collected, automated, or affected by process. Process standards also
referred as service standards describe how the procedures to follow, methodologies to apply,
procedures to present information, or business process rules to be followed to implement other
standards. Process standards are used: (1) to establish a threshold for minimally acceptable data,
(2) to determine the best data for an application, or (3) to promote interoperability and broad
based use of data.
        Current standards in geographic information may be classified as industry standards also
called de-facto standards and official standards, national, regional, or international standards. The
de- facto standards are generated by industry, in the case of spatial data this is the Open GIS
Consortium (Open Geospatial). The official standards can be also divided in international
standards (ISO), regional standards (European standards INSPIRE) and national standards.
        These standards specifies all the methods, tools and services for data management
(including definition and description), acquiring, processing, analyzing, accessing, presenting and
transferring such data in digital/electronic form between different users, systems and locations.

                                                               ISO
                                                       Base standards –
               ISO                                      19100 series
                                               Reference                              amendment

                                           consensus
                                                             Abstract
               OGC             Essential
                                Model                      specification
                                               RFP proces                             amendment

                                                       Implementation
                                                        specification

                                       Development                         revision

                                                       Implementation
               Market
                                                          services
                     Figure 1 - Relationship between ISO and OGC standards [Oos02]

        Being international standards ISO is respected by the entire official standards or de –
facto. The standardization in the field of digital geographic information is done by TC211
committee. Their work aims to establish a structured set of standards known as the ISO 19100
series of geographic information standards. These standards are meant to enable geospatial
datasets to interact between different data models and different applications. The most important
quality-related standards, for spatial data, in the ISO 19100 families are:
        • ISO 19113 Geographic Information – Quality principles
        • ISO 19114 Geographic Information – Quality evaluation procedures
        • ISO 19138 Geographic Information – Data quality measures
        • ISO 19115 Geographic Information – Metadata
        • ISO 19115 Geographic Information – Metadata –Part 2 : Extensions for imagery and
            gridded data
        • ISO 19122 Geographic Information – Qualifications and certification of personnel
        • ISO 19127 Geographic Information – Geodetic codes and parameters
• ISO 19131 Geographic Information – Data product specifications
        • ISO 19139 Geographic Information – Metadata – XML schema implementation
        The must important industrial standards for spatial data specification are the OpenGIS.
OpenGIS ® Standards and Specifications are technical documents that detail interfaces or
encodings. Software developers use these documents to build support for the interfaces or
encodings into their products and services [Ogis].
        There is a close link between the ISO and OpenGIS standards. These relations are
presented in figure 1.[Oos02]:
        Each country, based on ISO 19100 series, OpenGIS Standards and Specifications and
national land management laws, tries to standardize the spatial data in order to establish a
National Spatial Information Data. For European countries this NSDIs must also fulfill the
regulation of INSPIRE.
        INSPIRE (Infrastructure for Spatial Information in Europe) is meant to create a legal
framework in order to establish and operate an Infrastructure for Spatial Information in Europe,
for the purpose of formulating, implementing, monitoring and evaluating Community policies at
all levels and providing public information.

        5. Techniques for evaluation of spatial data quality
        The international ISO standards provides principles not only for the principles how
geospatial data can be described but also offer guidance on assessing the quality of actual data
sets and what should be the quality evaluation procedures. The procedures for evaluation of data
quality according to the ISO 19114 standard it must be done in five steps[Jak09, Joo06]:
           • Identifying the data quality scope: elements and subelements,
           • Identifying the data quality measure
           • Selecting the evaluation method,
           • Determine data quality results,
           • Determine conformance.
Figure 2 - Five steps for quality evaluation according ISO 19114

        The five steps of quality evaluating procedures and the relationship between them are
presented in the upper picture.
        The quality evaluation information shall be reported as metadata. A separate quality
evaluation report is required when metadata result is only pass/fail or when aggregate quality
results are generated.
        The standard specifies the fields to be filled in when reporting on assessment as a quality
evaluation report (Annex I in the standard)[Jak09]:
        • Identification of reporting document
        • Scope observed
        • Measure used (formula, resulting values, result unit, reliability, reliability unit)
        • Confidence in conformance test (confidence value, confidence unit, documents
            explaining the method)
        • Type of quality evaluation method used (direct external etc., inspection strategy
            applied)
        • Description of quality method used (basic assumptions, processing algorithms,
            definition of parameters, parameter values for the specific test, parameter units),
        • Possible aggregation of results (unit for aggregated values, resulting values, statistics
            used for aggregation, computation date, pointer to aggregation report)
        The evaluation of spatial data quality can be done by using two methods: by direct
observation, with visual comparisons of the resulted models, and by using statistical parameters.
        The visual comparisons are very difficult to be done and impossible to be quantified. In
the case of quality evaluation made through visual comparison is required to have images of the
both models, in order to model a real life situation and the generated model.
In order to quantify the uncertainty of the spatial data, it is necessary to use statistical
methods. To decide the quality of the data by statistical analysis, the comparison of the known
data with the resulted data is required. To analyze the pattern of deviation between two sets of
data, conventional ways are to yield statistical expressions of the accuracy, such as the root mean
square error, standard deviation, mean, variance and coefficient of variation [ Dro08].
        The quality requirements for cadastre data are also dependent of the type of inventory
system used. In the following table we present the role of quality in the different type of cadastre.
[Jak06, Jak09]

Cadastre        Type Ownership         Location data              Potential quality issues
                data
Mosaic          Ownership records      Ownership maps             Reliability
                                       (general boundaries)       No other quality information
                                                                  available
Index           Ownership as           Uniform index map          Positional Accuracy
                attribute                                         Topological consistency
                information                                       Process management
Positional-     Ownership as           Accurate location of       Logical consistency
accurate        attribute              boundary points            Security issues
                information                                       Process management

        The ISO 19138 specifies the data quality measures; it defines a set of data quality
measures that can be used when reporting data quality for the sub-elements in ISO 19113. In the
table below we presented the most important quality measures for cadastral issues selected from
ISO 19138[Jak06, Jak09].

Data quality element     Data     quality    Data quality measure     Data quality basic
                         sub-element                                  measure
Completeness             Commission          Number of excess items   Error count
Completeness             Omission            Number of missing items  Error count
Logical consistency      Topological         Number of faulty point- Error count
                         consistency         curve connections
Logical                  Topological         Number of missing point- Error count
consistency              consistency         connections
Positional accuracy      Absolute      or    Covariance matrix
                         external
                         accuracy
Positional accuracy      Absolute      or RMSE          or    standard
                         external         deviation, where the true
                         accuracy         value is not estimated from
                                          the observation but known a
                                          priori
Temporal accuracy        Absolute of time Mean       value  of    date Difference between
                         measurement      attributes                   date in database and
                                                                       date
Temporal accuracy         Temporal            Number of items not in Error count
                          validity            conformance with their
                                              value
Thematic accuracy         Classification      Number       of    incorrectly Error count
                          correctness         classified features
Thematic accuracy         Classification      Misclassification
                          correctness         matrix

       6. Conclusion

        In this paper we analyzed the main issues regarding spatial data quality in general and in
cadastre. We studied a classification of the characteristics which represent the quality of spatial
data and the main elements and subelements of spatial data quality as it is regulated by ISO
standards. Also we present a brief introduction in the guidelines established by different
standardization organization and the relationships between them.
        The main conclusion which is pointed up by the studies is the importance of standards in
data quality assessment.
        The second conclusion consists in the usage of spatial data standardization as a technique
for evaluation of spatial data quality.
        The last conclusion is that, there are two keys to the improvement of data quality – they
are prevention and correction. Error prevention is considered to be far superior to error detection,
since detection is often costly and can never guarantee to be 100% successful, and the best way
of error prevention is to follow all the regulation regarding specification, structure, data sets, and
implementation and evaluation procedures.

       References:

[Dro08]Droj, G - Quantifying the Uncertainty of Spatial Data, ZAC, Cluj, 2008
[Dro09] Droj, G, Suba, S. - Role of Data Quality in GIS Decision Support Tools, Szekesfehervar
       GIS Open 2009
[Jak09]Jakobsson, A. Giversen, J. - Guidelines for Implementing the ISO 19100 Geographic
       Information Quality Standards in National Mapping and Cadastral Agencies,
       www.eurogeographics.org, visited in march 2009
[Jak06]Jakobsson, A. Giversen, J. - On the Future of Topographic Base Information
       Management in Finland and Europe, Dissertation for degree of Doctor in science, 2006,
[Joo06]Joos G. – Data Quality Standards, XXIII FIG Congress , 2006
[Ogis99] OpenGIS® - Simple Features Specification For SQL Revision 1.1, Open GIS
       Consortium 1999
[Ogis05] OpenGIS® - Implementation Specification for Geographic Information - Simple feature
       access - Part 2: SQL option, Open Geospatial Consortium 2005
[Ost02] Østensenstensen, O; Smits P. - ISO/TC211: standardisation of Geographic Information
       and Geo-Informatics, Canada International Geoscience and Remote Sensing International
       Symposium 2002
[van Oort] van Oort, P. -Spatial data quality: from description to application, NCG Nederlandse
       Commissie voor Geodesie Netherlands Geodetic Commission Delft, December 2005.

More Related Content

What's hot

TYBSC IT PGIS Unit II Chapter I Data Management and Processing Systems
TYBSC IT PGIS Unit II Chapter I Data Management and Processing SystemsTYBSC IT PGIS Unit II Chapter I Data Management and Processing Systems
TYBSC IT PGIS Unit II Chapter I Data Management and Processing SystemsArti Parab Academics
 
Spatial Analysis and Geomatics
Spatial Analysis and GeomaticsSpatial Analysis and Geomatics
Spatial Analysis and GeomaticsRich Heimann
 
QUERY AND NETWORK ANALYSIS IN GIS
QUERY AND NETWORK ANALYSIS IN GISQUERY AND NETWORK ANALYSIS IN GIS
QUERY AND NETWORK ANALYSIS IN GISDEVANG KAPADIA
 
TYBSC IT PGIS Unit IV Spacial Data Analysis
TYBSC IT PGIS Unit IV  Spacial Data AnalysisTYBSC IT PGIS Unit IV  Spacial Data Analysis
TYBSC IT PGIS Unit IV Spacial Data AnalysisArti Parab Academics
 
Database gis fundamentals
Database gis fundamentalsDatabase gis fundamentals
Database gis fundamentalsSumant Diwakar
 
4.2 spatial data mining
4.2 spatial data mining4.2 spatial data mining
4.2 spatial data miningKrish_ver2
 
Geographic query and analysis
Geographic query and analysisGeographic query and analysis
Geographic query and analysisMohsin Siddique
 
Exploration in the House 2015: NSW Seamless Geology Project: Progress to date...
Exploration in the House 2015: NSW Seamless Geology Project: Progress to date...Exploration in the House 2015: NSW Seamless Geology Project: Progress to date...
Exploration in the House 2015: NSW Seamless Geology Project: Progress to date...NSW Environment and Planning
 
Spatial analysis and modeling
Spatial analysis and modelingSpatial analysis and modeling
Spatial analysis and modelingTolasa_F
 
Components of Spatial Data Quality in GIS
Components of Spatial Data Quality in GISComponents of Spatial Data Quality in GIS
Components of Spatial Data Quality in GISKaium Chowdhury
 

What's hot (20)

GIS Data Types
GIS Data TypesGIS Data Types
GIS Data Types
 
TYBSC IT PGIS Unit II Chapter I Data Management and Processing Systems
TYBSC IT PGIS Unit II Chapter I Data Management and Processing SystemsTYBSC IT PGIS Unit II Chapter I Data Management and Processing Systems
TYBSC IT PGIS Unit II Chapter I Data Management and Processing Systems
 
Spatial Analysis and Geomatics
Spatial Analysis and GeomaticsSpatial Analysis and Geomatics
Spatial Analysis and Geomatics
 
QUERY AND NETWORK ANALYSIS IN GIS
QUERY AND NETWORK ANALYSIS IN GISQUERY AND NETWORK ANALYSIS IN GIS
QUERY AND NETWORK ANALYSIS IN GIS
 
TYBSC IT PGIS Unit IV Spacial Data Analysis
TYBSC IT PGIS Unit IV  Spacial Data AnalysisTYBSC IT PGIS Unit IV  Spacial Data Analysis
TYBSC IT PGIS Unit IV Spacial Data Analysis
 
215 spatial db
215 spatial db215 spatial db
215 spatial db
 
Spatial databases
Spatial databasesSpatial databases
Spatial databases
 
Spatial Data Model 2
Spatial Data Model 2Spatial Data Model 2
Spatial Data Model 2
 
Types of GIS Data
Types of GIS DataTypes of GIS Data
Types of GIS Data
 
Gis
GisGis
Gis
 
Topology in GIS
Topology in GISTopology in GIS
Topology in GIS
 
Database gis fundamentals
Database gis fundamentalsDatabase gis fundamentals
Database gis fundamentals
 
Optimizing spatial database
Optimizing spatial databaseOptimizing spatial database
Optimizing spatial database
 
4.2 spatial data mining
4.2 spatial data mining4.2 spatial data mining
4.2 spatial data mining
 
Geographic query and analysis
Geographic query and analysisGeographic query and analysis
Geographic query and analysis
 
Exploration in the House 2015: NSW Seamless Geology Project: Progress to date...
Exploration in the House 2015: NSW Seamless Geology Project: Progress to date...Exploration in the House 2015: NSW Seamless Geology Project: Progress to date...
Exploration in the House 2015: NSW Seamless Geology Project: Progress to date...
 
Spatial analysis and modeling
Spatial analysis and modelingSpatial analysis and modeling
Spatial analysis and modeling
 
Gis Concepts 1/5
Gis Concepts 1/5Gis Concepts 1/5
Gis Concepts 1/5
 
TerraWorld
TerraWorldTerraWorld
TerraWorld
 
Components of Spatial Data Quality in GIS
Components of Spatial Data Quality in GISComponents of Spatial Data Quality in GIS
Components of Spatial Data Quality in GIS
 

Viewers also liked

Justifying Modelin Uncertanty Agora
Justifying Modelin Uncertanty AgoraJustifying Modelin Uncertanty Agora
Justifying Modelin Uncertanty AgoraUniversity of Oradea
 
Semantic 3D City Models with CityGML
Semantic 3D City Models with CityGMLSemantic 3D City Models with CityGML
Semantic 3D City Models with CityGMLICGCat
 
Study: The Future of VR, AR and Self-Driving Cars
Study: The Future of VR, AR and Self-Driving CarsStudy: The Future of VR, AR and Self-Driving Cars
Study: The Future of VR, AR and Self-Driving CarsLinkedIn
 

Viewers also liked (6)

Justifying Modelin Uncertanty Agora
Justifying Modelin Uncertanty AgoraJustifying Modelin Uncertanty Agora
Justifying Modelin Uncertanty Agora
 
Improving Dtm Accuracy
Improving Dtm AccuracyImproving Dtm Accuracy
Improving Dtm Accuracy
 
Fuzzy In Remote Classification
Fuzzy In Remote ClassificationFuzzy In Remote Classification
Fuzzy In Remote Classification
 
Dtm Quality Assesment
Dtm Quality AssesmentDtm Quality Assesment
Dtm Quality Assesment
 
Semantic 3D City Models with CityGML
Semantic 3D City Models with CityGMLSemantic 3D City Models with CityGML
Semantic 3D City Models with CityGML
 
Study: The Future of VR, AR and Self-Driving Cars
Study: The Future of VR, AR and Self-Driving CarsStudy: The Future of VR, AR and Self-Driving Cars
Study: The Future of VR, AR and Self-Driving Cars
 

Similar to Tehnici Moderne De Evaluare A Calitatii Datelor In Cadastru

Geographic information system (gis)
Geographic information system (gis)Geographic information system (gis)
Geographic information system (gis)Shailendra Singh
 
A hybrid approach for analysis of dynamic changes in spatial data
A hybrid approach for analysis of dynamic changes in spatial dataA hybrid approach for analysis of dynamic changes in spatial data
A hybrid approach for analysis of dynamic changes in spatial dataijdms
 
PIAS 2013-GIS.pptxfskjczjsbchdbfscnnND dHSA
PIAS 2013-GIS.pptxfskjczjsbchdbfscnnND  dHSAPIAS 2013-GIS.pptxfskjczjsbchdbfscnnND  dHSA
PIAS 2013-GIS.pptxfskjczjsbchdbfscnnND dHSAFloridaTLaoaten
 
Spatial data analysis
Spatial data analysisSpatial data analysis
Spatial data analysisJohan Blomme
 
Fundamentals of Geographical Information System.ppt
Fundamentals of Geographical Information System.pptFundamentals of Geographical Information System.ppt
Fundamentals of Geographical Information System.pptbimalp7
 
Phenomenological Structuring of Geographic Information
Phenomenological Structuring of Geographic InformationPhenomenological Structuring of Geographic Information
Phenomenological Structuring of Geographic InformationRobert (Bob) Williams
 
GEOGRAPHIC INFORMATION SYSTEM.pptx
GEOGRAPHIC INFORMATION SYSTEM.pptxGEOGRAPHIC INFORMATION SYSTEM.pptx
GEOGRAPHIC INFORMATION SYSTEM.pptxFizaNaaz8
 
What is Geography Information Systems (GIS)
What is Geography Information Systems (GIS)What is Geography Information Systems (GIS)
What is Geography Information Systems (GIS)John Lanser
 
Components of gis
Components of gisComponents of gis
Components of gisPramoda Raj
 

Similar to Tehnici Moderne De Evaluare A Calitatii Datelor In Cadastru (20)

Gis
Gis Gis
Gis
 
ch1
ch1ch1
ch1
 
Geographic information system (gis)
Geographic information system (gis)Geographic information system (gis)
Geographic information system (gis)
 
Final ies
Final iesFinal ies
Final ies
 
DCW Data Quality 1992
DCW Data Quality 1992DCW Data Quality 1992
DCW Data Quality 1992
 
A hybrid approach for analysis of dynamic changes in spatial data
A hybrid approach for analysis of dynamic changes in spatial dataA hybrid approach for analysis of dynamic changes in spatial data
A hybrid approach for analysis of dynamic changes in spatial data
 
What is gis
What is gisWhat is gis
What is gis
 
PIAS 2013-GIS.pptxfskjczjsbchdbfscnnND dHSA
PIAS 2013-GIS.pptxfskjczjsbchdbfscnnND  dHSAPIAS 2013-GIS.pptxfskjczjsbchdbfscnnND  dHSA
PIAS 2013-GIS.pptxfskjczjsbchdbfscnnND dHSA
 
Env. mon
Env. monEnv. mon
Env. mon
 
Gis dr rahul
Gis dr rahulGis dr rahul
Gis dr rahul
 
Vector.pdf
Vector.pdfVector.pdf
Vector.pdf
 
Spatial data analysis
Spatial data analysisSpatial data analysis
Spatial data analysis
 
Why The Historic Environment Needs A Spatial Data Infrastructure
Why The Historic Environment Needs A Spatial Data InfrastructureWhy The Historic Environment Needs A Spatial Data Infrastructure
Why The Historic Environment Needs A Spatial Data Infrastructure
 
Fundamentals of Geographical Information System.ppt
Fundamentals of Geographical Information System.pptFundamentals of Geographical Information System.ppt
Fundamentals of Geographical Information System.ppt
 
Geographical Information System
Geographical Information SystemGeographical Information System
Geographical Information System
 
AIC Darwin Phenomenology 1990
AIC Darwin Phenomenology 1990AIC Darwin Phenomenology 1990
AIC Darwin Phenomenology 1990
 
Phenomenological Structuring of Geographic Information
Phenomenological Structuring of Geographic InformationPhenomenological Structuring of Geographic Information
Phenomenological Structuring of Geographic Information
 
GEOGRAPHIC INFORMATION SYSTEM.pptx
GEOGRAPHIC INFORMATION SYSTEM.pptxGEOGRAPHIC INFORMATION SYSTEM.pptx
GEOGRAPHIC INFORMATION SYSTEM.pptx
 
What is Geography Information Systems (GIS)
What is Geography Information Systems (GIS)What is Geography Information Systems (GIS)
What is Geography Information Systems (GIS)
 
Components of gis
Components of gisComponents of gis
Components of gis
 

Tehnici Moderne De Evaluare A Calitatii Datelor In Cadastru

  • 1. Tehnici moderne de evaluare a calitatii datelor in cadastru Modern techniques for evaluation of spatial data quality s.l. dr. Droj Gabriela - gaby@oradea.ro s.l. drd. ing. Suba Stefan –subast@yahoo.com s.l. drd. Buda Aurelian - budaaurelian@yahoo.com For centuries ago cartographers, geographers, surveyors and geodesists have been involved in collection, storage, analysis and visualization of spatial data. The collections of spatial data are subject to random, systematic and gross errors. In the last decade GIS became the main tool for analyzing spatial data in many real life activities. The quality of the geographical information systems is dependent on the quality of the stored and queried data. In comparison with the non-geographic information systems, in GIS applications, the quality of data has a greater influence on the results. Despite the arguments in favor of explicit treatment of spatial data quality, usually Cadastre and GIS practice continues to proceed as if data were perfect. The objectives of this study are to identify different techniques for evaluation of spatial data quality. Keywords: GIS, Spatial Data Quality, cadastre 1. Introduction For centuries ago cartographers, geographers, surveyors and geodesists have been involved in collection, storage, analysis and visualization of spatial data. The collections of spatial data are subject to random, systematic and gross errors. In the last decade GIS became the main tool for analyzing spatial data in many real life activities. The quality of the geographical information systems is dependent on the quality of the stored and queried data. In comparison with the non-geographic information systems, in GIS applications, the quality of data has a greater influence on the results. The five main reasons for the main concerns about spatial data quality issues were identified as: a) There is an increasing availability, exchange and use of spatial data; b) There is a growing group of users less aware of spatial data quality; c) GIS enables the use of spatial data in all sorts of applications, regardless of the data quality; d) Current GIS offers hardly any tools for handling spatial quality; e) There is an increasing distance between those who use the spatial data (the end users) and those who are creating the spatial databases [van Oort]. Despite the arguments in favor of explicit treatment of spatial data quality, usually Cadastre and GIS practice continues to proceed as if data were perfect. Data are used uncritically without consideration of the error contained and this can lead to erroneous results, misleading information, unwise environmental decisions and increased costs. The objectives of this study are to identify different techniques for evaluation of spatial data quality.
  • 2. 2. The quality of spatial data The quality of spatial data is defined by a set of characteristic classified in [Dro09]: • Dependability characteristics refer to the time related aspects of quality: availability, temporal correctness. The temporal correctness is given by the moment of collection of the data and by the level of updating. • Integrity characteristics represent the applicability of information, the number of collected data and their properties: completeness, consistency, correctness. • Accuracy characteristics refer the positional accuracy and attribute accuracy. The requirements for the quality of the spatial data are depending on the specificity of the informational system. For example the quality requirements for the cadastres data are higher than for the environmental application. On the other hand the quality requirements for cadastre data are usually imposed by local and regional standards. ISO 19113 provides a classification scheme for these errors. They are categorized into different elements and sub elements depending on the nature of the error. Regarding the ISO 19113 standard the data quality aspects are [ISO]: • Completeness – presence and absence of features and their attributes and relationships: o Commission – excess data presents in a dataset, o Omission - data absent from dataset, • Logical consistency – degree of adherence to logical rules of data structure, attribution and relationships: o Conceptual consistency - adherence to rules of conceptual schema, o Domain consistency – adherence of values to the value domain, o Formal consistency – degree of which data is stored in accordance with physical structure of dataset, o Topological consistency – correctness of explicitly encoded topological characteristics of a dataset • Positional accuracy – accuracy of the position of feature o Absolute or external accuracy - closeness of reported coordinate values to values accepted as or being true, o Relative or internal accuracy – closeness of the relative positions of features in a dataset to their respective relative positions accepted as or being true, o Gridded data position accuracy - closeness of gridded data position values to values accepted as or being true, • Temporal accuracy – accuracy of temporal attributes and temporal relationships of features o Accuracy of a time measurement – correctness of a temporal references of an item (reporting of error in time measurement) o Temporal consistency – correctness of ordered events or sequences, if reported, o Temporal validity – validity of data with respect to time • Thematic accuracy – accuracy of quantitative attributes and the correctness of non- quantitative attributes and of the classifications of features and their relationships o Classification correctness – comparison of the classes assigned to features or their attributes to a universe of discourse (ground truth or reference dataset), o Non-quantitative attribute correctness – correctness of non-quantitative attributes, o Quantitative attribute accuracy – accuracy of quantitative attributes.
  • 3. 3. Cadastral issues Cadastral survey or cadastral map is a comprehensive register of the metes-and-bounds real property of a country. A cadastre commonly includes details of the ownership, the precise location, the dimensions (and area), the cultivations if rural and the value of individual parcels of land. Cadastral information are important in both aspects the legal and spatial part of the information, however the legal issues is mostly dominating. Cadastre in European Union does not reply to a unique model, there are used various types of cadastre inventory systems based on different patterns and functions. The purpose of European Union is not to create a new cadastral system but to this variety without reducing efficiency of future common developments. European cadastre’s can be categorized as: [Jak09] • Positional-accurate cadastre in positional-accurate type cadastres the attribute (ownership records) and the location are both up-to-date and accurate. These can be found in Austria, the Netherlands and Finland (urban areas). • Index map cadastre In index map type cadastres, the actual representation of the property units can be found either from legal documents or in the field but the ownership records are accurate and up-to-date. Index maps present the topological relations between the cadastral units. Examples of these cadastres can be found in Sweden and Finland (outside urban areas). • Mosaic cadastre In mosaic type cadastres there in no legal requirement to register the ownership and therefore the cadastre often covers only part of the country. The Cadastre contains information of individual ownerships, but there is no direct relationship with neighboring parcels. Combinations of different title maps may be compiled or reproduced but the topological relations remain undefined. These cadastres exist in Great Britain. Regarding the requirements of the new cadastral law (7/1996), the Romanian cadastre will be in the category of positional accurate cadastre, but nowadays the are several parallel cadastral systems in use in Romania. The cadastral parcel is an important component not only in national but also in European spatial data infrastructures. In the context of global economy is necessary to create effective infrastructures and promote efficiency, therefore the systems of spatial data had to be interoperable. 4. Standards of spatial data To achieve interoperability of geographic information is necessary to establish a set of minimum standards and policies. This standards and policies must define at three levels[Dro09]: • Institutional level represented by organizational rules like access rules, data protection and copyright, • Technical level consists in all the hardware, software and communication protocols compatibility, • Semantic level - data standards, the set of public data and process standards. In this paper we will focus on the semantic level. Data standards are semantic definitions that are structured in a model. They describe the minimum requirements of objects, features or items that are (will be) collected, automated, or affected by process. Process standards also referred as service standards describe how the procedures to follow, methodologies to apply, procedures to present information, or business process rules to be followed to implement other
  • 4. standards. Process standards are used: (1) to establish a threshold for minimally acceptable data, (2) to determine the best data for an application, or (3) to promote interoperability and broad based use of data. Current standards in geographic information may be classified as industry standards also called de-facto standards and official standards, national, regional, or international standards. The de- facto standards are generated by industry, in the case of spatial data this is the Open GIS Consortium (Open Geospatial). The official standards can be also divided in international standards (ISO), regional standards (European standards INSPIRE) and national standards. These standards specifies all the methods, tools and services for data management (including definition and description), acquiring, processing, analyzing, accessing, presenting and transferring such data in digital/electronic form between different users, systems and locations. ISO Base standards – ISO 19100 series Reference amendment consensus Abstract OGC Essential Model specification RFP proces amendment Implementation specification Development revision Implementation Market services Figure 1 - Relationship between ISO and OGC standards [Oos02] Being international standards ISO is respected by the entire official standards or de – facto. The standardization in the field of digital geographic information is done by TC211 committee. Their work aims to establish a structured set of standards known as the ISO 19100 series of geographic information standards. These standards are meant to enable geospatial datasets to interact between different data models and different applications. The most important quality-related standards, for spatial data, in the ISO 19100 families are: • ISO 19113 Geographic Information – Quality principles • ISO 19114 Geographic Information – Quality evaluation procedures • ISO 19138 Geographic Information – Data quality measures • ISO 19115 Geographic Information – Metadata • ISO 19115 Geographic Information – Metadata –Part 2 : Extensions for imagery and gridded data • ISO 19122 Geographic Information – Qualifications and certification of personnel • ISO 19127 Geographic Information – Geodetic codes and parameters
  • 5. • ISO 19131 Geographic Information – Data product specifications • ISO 19139 Geographic Information – Metadata – XML schema implementation The must important industrial standards for spatial data specification are the OpenGIS. OpenGIS ® Standards and Specifications are technical documents that detail interfaces or encodings. Software developers use these documents to build support for the interfaces or encodings into their products and services [Ogis]. There is a close link between the ISO and OpenGIS standards. These relations are presented in figure 1.[Oos02]: Each country, based on ISO 19100 series, OpenGIS Standards and Specifications and national land management laws, tries to standardize the spatial data in order to establish a National Spatial Information Data. For European countries this NSDIs must also fulfill the regulation of INSPIRE. INSPIRE (Infrastructure for Spatial Information in Europe) is meant to create a legal framework in order to establish and operate an Infrastructure for Spatial Information in Europe, for the purpose of formulating, implementing, monitoring and evaluating Community policies at all levels and providing public information. 5. Techniques for evaluation of spatial data quality The international ISO standards provides principles not only for the principles how geospatial data can be described but also offer guidance on assessing the quality of actual data sets and what should be the quality evaluation procedures. The procedures for evaluation of data quality according to the ISO 19114 standard it must be done in five steps[Jak09, Joo06]: • Identifying the data quality scope: elements and subelements, • Identifying the data quality measure • Selecting the evaluation method, • Determine data quality results, • Determine conformance.
  • 6. Figure 2 - Five steps for quality evaluation according ISO 19114 The five steps of quality evaluating procedures and the relationship between them are presented in the upper picture. The quality evaluation information shall be reported as metadata. A separate quality evaluation report is required when metadata result is only pass/fail or when aggregate quality results are generated. The standard specifies the fields to be filled in when reporting on assessment as a quality evaluation report (Annex I in the standard)[Jak09]: • Identification of reporting document • Scope observed • Measure used (formula, resulting values, result unit, reliability, reliability unit) • Confidence in conformance test (confidence value, confidence unit, documents explaining the method) • Type of quality evaluation method used (direct external etc., inspection strategy applied) • Description of quality method used (basic assumptions, processing algorithms, definition of parameters, parameter values for the specific test, parameter units), • Possible aggregation of results (unit for aggregated values, resulting values, statistics used for aggregation, computation date, pointer to aggregation report) The evaluation of spatial data quality can be done by using two methods: by direct observation, with visual comparisons of the resulted models, and by using statistical parameters. The visual comparisons are very difficult to be done and impossible to be quantified. In the case of quality evaluation made through visual comparison is required to have images of the both models, in order to model a real life situation and the generated model.
  • 7. In order to quantify the uncertainty of the spatial data, it is necessary to use statistical methods. To decide the quality of the data by statistical analysis, the comparison of the known data with the resulted data is required. To analyze the pattern of deviation between two sets of data, conventional ways are to yield statistical expressions of the accuracy, such as the root mean square error, standard deviation, mean, variance and coefficient of variation [ Dro08]. The quality requirements for cadastre data are also dependent of the type of inventory system used. In the following table we present the role of quality in the different type of cadastre. [Jak06, Jak09] Cadastre Type Ownership Location data Potential quality issues data Mosaic Ownership records Ownership maps Reliability (general boundaries) No other quality information available Index Ownership as Uniform index map Positional Accuracy attribute Topological consistency information Process management Positional- Ownership as Accurate location of Logical consistency accurate attribute boundary points Security issues information Process management The ISO 19138 specifies the data quality measures; it defines a set of data quality measures that can be used when reporting data quality for the sub-elements in ISO 19113. In the table below we presented the most important quality measures for cadastral issues selected from ISO 19138[Jak06, Jak09]. Data quality element Data quality Data quality measure Data quality basic sub-element measure Completeness Commission Number of excess items Error count Completeness Omission Number of missing items Error count Logical consistency Topological Number of faulty point- Error count consistency curve connections Logical Topological Number of missing point- Error count consistency consistency connections Positional accuracy Absolute or Covariance matrix external accuracy Positional accuracy Absolute or RMSE or standard external deviation, where the true accuracy value is not estimated from the observation but known a priori Temporal accuracy Absolute of time Mean value of date Difference between measurement attributes date in database and date
  • 8. Temporal accuracy Temporal Number of items not in Error count validity conformance with their value Thematic accuracy Classification Number of incorrectly Error count correctness classified features Thematic accuracy Classification Misclassification correctness matrix 6. Conclusion In this paper we analyzed the main issues regarding spatial data quality in general and in cadastre. We studied a classification of the characteristics which represent the quality of spatial data and the main elements and subelements of spatial data quality as it is regulated by ISO standards. Also we present a brief introduction in the guidelines established by different standardization organization and the relationships between them. The main conclusion which is pointed up by the studies is the importance of standards in data quality assessment. The second conclusion consists in the usage of spatial data standardization as a technique for evaluation of spatial data quality. The last conclusion is that, there are two keys to the improvement of data quality – they are prevention and correction. Error prevention is considered to be far superior to error detection, since detection is often costly and can never guarantee to be 100% successful, and the best way of error prevention is to follow all the regulation regarding specification, structure, data sets, and implementation and evaluation procedures. References: [Dro08]Droj, G - Quantifying the Uncertainty of Spatial Data, ZAC, Cluj, 2008 [Dro09] Droj, G, Suba, S. - Role of Data Quality in GIS Decision Support Tools, Szekesfehervar GIS Open 2009 [Jak09]Jakobsson, A. Giversen, J. - Guidelines for Implementing the ISO 19100 Geographic Information Quality Standards in National Mapping and Cadastral Agencies, www.eurogeographics.org, visited in march 2009 [Jak06]Jakobsson, A. Giversen, J. - On the Future of Topographic Base Information Management in Finland and Europe, Dissertation for degree of Doctor in science, 2006, [Joo06]Joos G. – Data Quality Standards, XXIII FIG Congress , 2006 [Ogis99] OpenGIS® - Simple Features Specification For SQL Revision 1.1, Open GIS Consortium 1999 [Ogis05] OpenGIS® - Implementation Specification for Geographic Information - Simple feature access - Part 2: SQL option, Open Geospatial Consortium 2005 [Ost02] Østensenstensen, O; Smits P. - ISO/TC211: standardisation of Geographic Information and Geo-Informatics, Canada International Geoscience and Remote Sensing International Symposium 2002 [van Oort] van Oort, P. -Spatial data quality: from description to application, NCG Nederlandse Commissie voor Geodesie Netherlands Geodetic Commission Delft, December 2005.