Assessing Trust in OSM Features Using Edit History

Carsten Keßler a,b and René de Groot a
a Institute for Geoinformatics, University of Münster | b soon: Hunter College, CUNY
http://carsten.io | @carstenkessler
Trust as a Proxy Measure for the
Quality of VGI in the Case of OSM

The Idea
‣ Develop a measure to assess the degree to which a data
consumer can trust the quality of a feature
‣ Trust measure is based on a feature’s editing history
‣ Benefits
‣ Works at feature level
‣ Filter features by quality
‣ Spot problematic features

Does this work?
Can we reliably assess the quality of a feature in
OpenStreetMap based on its editing history?

Does this work?
amenity = university
name = Institute for Geoinformatics
v1

Does this work?
building = yes
v1 v2

Does this work?
building = yes
addr:city = Münster
addr:country = DE
addr:housenumber = 253
addr:street = Weseler Straße
building = yes
wheelchair = limited
v1 v2 v3 …

OSM Heatmap Kudos: Johannes Trame

OSM Provenance Ontology
http://carsten.io/osm/osm-provenance.rdf
[Diagram of the ontology; labels recoverable from the figure:]
‣ OSM terms: Changeset, Edit, FeatureState, NodeState, WayState, User,
includesEdit, changesGeometry, addsTag, removesTag, changesValueOfKey, hasTag
‣ prv: terms: prv:DataCreation, prv:DataItem, prv:HumanActor, prv:CreationGuideline, prv:Tag,
prv:createdBy, prv:precededBy, prv:usedData, prv:performedBy
‣ Other: rdfs:Literal, subClassOf
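
The edit types named in the ontology (addsTag, removesTag, changesValueOfKey) can be derived by comparing the tags of two consecutive versions of a feature. The following is a minimal sketch under that assumption; the function name `classify_tag_edits` and the sample tag values are made up for illustration and are not taken from the study.

```python
def classify_tag_edits(old_tags, new_tags):
    """Compare the tag dictionaries of two consecutive feature versions.

    Returns a dict keyed by the ontology's edit types. The mapping from
    a plain dictionary diff to these types is an assumption of this sketch.
    """
    old_keys, new_keys = set(old_tags), set(new_tags)
    added = {k: new_tags[k] for k in new_keys - old_keys}
    removed = {k: old_tags[k] for k in old_keys - new_keys}
    changed = {
        k: (old_tags[k], new_tags[k])
        for k in old_keys & new_keys
        if old_tags[k] != new_tags[k]
    }
    return {
        "addsTag": added,
        "removesTag": removed,
        "changesValueOfKey": changed,
    }


# Hypothetical versions of one building (illustrative values only).
v2 = {"building": "yes", "wheelchair": "limited", "note": "check entrance"}
v3 = {"building": "yes", "wheelchair": "yes", "addr:city": "Münster"}

edits = classify_tag_edits(v2, v3)
print(edits)
```

A geometry change (changesGeometry) would need the coordinates of both versions as well and is left out here.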

Does this work?
‣ Get a first idea whether this is a viable approach
‣ Compare results of
‣ a simple trust measure and
‣ observed feature quality
‣ Is there a correlation between the two?

Study area: Münster’s old town

Feature Selection
‣ Re-mapping the whole district was not feasible
‣ Up to 100 features were manageable
‣ Selection based on minimum number of versions
‣ 74 features with 6+ versions

Trust measure
‣ Positive factors:
‣ Versions
‣ Users
‣ Indirect confirmations =
edits in the direct vicinity
(50 m)
‣ Negative factors:
‣ Tag corrections
‣ Rollbacks
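
Counting indirect confirmations could look roughly like the sketch below. It assumes that features and edits are reduced to single (lat, lon) points and that distance is measured as great-circle distance; the slides state only the 50 m radius. The names `haversine_m` and `count_indirect_confirmations` and the coordinates are hypothetical.

```python
import math

EARTH_RADIUS_M = 6371000.0  # mean Earth radius in metres


def haversine_m(lat1, lon1, lat2, lon2):
    """Great-circle distance in metres between two WGS84 points."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlambda = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlambda / 2) ** 2)
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(a))


def count_indirect_confirmations(feature_pos, edit_positions, radius_m=50.0):
    """Count edits whose position lies within radius_m of the feature."""
    lat, lon = feature_pos
    return sum(
        1 for elat, elon in edit_positions
        if haversine_m(lat, lon, elat, elon) <= radius_m
    )


# Hypothetical positions: one edit about 22 m away, one about 111 m away.
feature = (51.9625, 7.6256)
nearby_edits = [(51.9627, 7.6256), (51.9635, 7.6256)]
confirmations = count_indirect_confirmations(feature, nearby_edits)
print(confirmations)
```

Whether only edits made after the feature's latest version should count is a modelling choice this sketch does not address.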

Trust measure (contd.)
‣ Classification for each factor: 5 equal classes
‣ Combined into one classification
‣ Equal weights
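
One possible reading of this classification, as a sketch: "5 equal classes" is taken to mean equal-interval classes over each factor's observed range, negative factors are inverted so that a higher class always means more trust, and "combined with equal weights" is taken to be the plain mean of the class values. All three readings are assumptions, as are the function names, factor names and sample ranges.

```python
def equal_interval_class(value, lo, hi, n_classes=5):
    """Map value from the range [lo, hi] to a class 1..n_classes."""
    if hi == lo:
        return 1
    idx = int((value - lo) * n_classes / (hi - lo))
    return min(max(idx, 0), n_classes - 1) + 1


def trust_class(factors, ranges,
                negative=("tag_corrections", "rollbacks"), n_classes=5):
    """Combine per-factor classes into one value using equal weights."""
    classes = []
    for name, value in factors.items():
        lo, hi = ranges[name]
        c = equal_interval_class(value, lo, hi, n_classes)
        if name in negative:
            c = n_classes + 1 - c  # many corrections -> low trust class
        classes.append(c)
    return sum(classes) / len(classes)


# Hypothetical factor values for one feature and hypothetical value ranges.
factors = {"versions": 11, "users": 3, "confirmations": 30,
           "tag_corrections": 1, "rollbacks": 0}
ranges = {"versions": (6, 16), "users": (1, 6), "confirmations": (0, 40),
          "tag_corrections": (0, 5), "rollbacks": (0, 2)}

score = trust_class(factors, ranges)
print(score)
```

With equal-count (quantile) classes instead of equal intervals, only `equal_interval_class` would need to change.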

Field Survey
‣ Thematic accuracy
4 classes:
1. Main tag wrong
2. Other tags wrong
3. Thematic ambiguities
4. Thematically correct
‣ Results:
‣ 6 features (~8%)

Field Survey (contd.)
‣ Topological consistency

‣ Is the feature correctly
positioned relative to the
surrounding features?

‣ Results:
‣ 73 out of 74 features (~99%)

Field Survey (contd.)
‣ Information completeness
‣ TF-IDF measure to identify
relevant tags per main tag
‣ Results:
‣ ~37% tags missing (avg.)
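
The TF-IDF step might be sketched as follows. The slides do not give the exact formulation, so this version makes its own assumptions: each main tag is treated as one "document", the other tag keys on features with that main tag are its "terms", and the natural logarithm is used for the IDF part. The names `relevant_keys` and `missing_share` and the sample features are hypothetical.

```python
import math
from collections import Counter, defaultdict


def relevant_keys(features, top_n=3):
    """Rank tag keys per main tag by TF-IDF.

    features: list of (main_tag, set_of_other_keys) pairs.
    """
    counts_per_tag = defaultdict(Counter)
    for main_tag, keys in features:
        counts_per_tag[main_tag].update(keys)

    n_groups = len(counts_per_tag)
    doc_freq = Counter()  # in how many main-tag groups a key occurs
    for counts in counts_per_tag.values():
        doc_freq.update(counts.keys())

    result = {}
    for main_tag, counts in counts_per_tag.items():
        total = sum(counts.values())
        scores = {
            key: (c / total) * math.log(n_groups / doc_freq[key])
            for key, c in counts.items()
        }
        ranked = sorted(scores, key=lambda k: (-scores[k], k))
        result[main_tag] = ranked[:top_n]
    return result


def missing_share(feature_keys, expected_keys):
    """Fraction of the expected keys that a feature lacks."""
    if not expected_keys:
        return 0.0
    missing = [k for k in expected_keys if k not in feature_keys]
    return len(missing) / len(expected_keys)


# Hypothetical sample data.
features = [
    ("amenity=restaurant", {"name", "cuisine", "opening_hours"}),
    ("amenity=restaurant", {"name", "cuisine"}),
    ("shop=bakery", {"name", "opening_hours"}),
    ("amenity=place_of_worship", {"name", "religion"}),
]
top = relevant_keys(features, top_n=2)
print(top["amenity=restaurant"])
print(missing_share({"name"}, top["amenity=restaurant"]))
```

A key such as `name` that occurs under every main tag gets a score of zero, which is the intended effect: it says nothing specific about any one feature type.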

Observed quality: combined results

mean quality class: ~4.2
mean trust class: ~2.8

Do we get the trend right?
‣ Removed outliers
‣ Kendall’s τ: 0.52
‣ Moderate, but significant
positive correlation
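
For reference, a rank correlation of this kind can be computed as in the sketch below. It implements the tau-b variant, which corrects for ties and so suits class values that repeat often; the slides do not say which variant was used, so that choice is an assumption. The sample class values are invented and do not reproduce the reported 0.52.

```python
import math
from itertools import combinations


def kendall_tau_b(x, y):
    """Kendall's tau-b for two equally long sequences of ordinal values."""
    concordant = discordant = ties_x = ties_y = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        dx, dy = x1 - x2, y1 - y2
        if dx == 0 and dy == 0:
            continue  # tied in both variables: excluded from both totals
        if dx == 0:
            ties_x += 1
        elif dy == 0:
            ties_y += 1
        elif dx * dy > 0:
            concordant += 1
        else:
            discordant += 1
    untied = concordant + discordant
    denom = math.sqrt((untied + ties_x) * (untied + ties_y))
    return (concordant - discordant) / denom if denom else float("nan")


# Hypothetical trust and quality classes for five features.
trust = [1, 2, 2, 3, 4]
quality = [2, 3, 4, 4, 5]
tau = kendall_tau_b(trust, quality)
print(round(tau, 3))
```

This pairwise version runs in quadratic time, which is unproblematic for a sample of 74 features.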

Conclusions
‣ Initial study
‣ A feature’s history can determine its trustworthiness
‣ Trust values correlate with observed quality
‣ Even with a very simple model
‣ Outliers cannot be explained yet

Tons of Future Work
‣ Extend and refine the trust model:
Classification, weighting, positive vs negative aspects, …
‣ Social aspects: Who has edited a feature?
‣ Repeat study without spatial focus
‣ How to scale the data collection?
‣ Learn the trust model from the data

Thank you!
All data used in this research © OpenStreetMap contributors.
carsten.kessler@uni-muenster.de | http://carsten.io | @carstenkessler
Carsten Keßler | René de Groot
