SlideShare a Scribd company logo
1 of 18
Download to read offline
1 / 20 20 / 03 / 2013
Towards Preservation of semantically
enriched Architectural Knowledge
Stefan Dietze (L3S Research Center, Leibniz University Hanover, DE)
Stefan Dietze, Jakob Beetz, Ujwal Gadiraju, Georgios
Katsimpras, Raoul Wessel, René Berndt
2 / 20 20 / 03 / 201327/09/13
Challenges
 Diversity of data - interoperability: low-level
point clouds & legacy 3D models up to enriched
Building Information Models (BIM), higher-level
semantics and Web data / knowledge
 Diverse stakeholders: architects, building
operators, urban planners, archivists, …
 Building, model and data evolution: document
temporal evolution to prevent information loss
Goals and Challenges (1/2)
Goal
 Methods and tools for sustainable long-term
preservation of architectural knowledge
Stefan Dietze (L3S Research Center)
3 / 20 20 / 03 / 201327/09/13
Challenges
 “Semantic” enrichment of
architectural knowledge: exploiting
Web data and knowledge to enrich
low-level architectural data.
 Inconsistent vocabularies: adopting
state of the art (LD) vocabularies and
schemas towards sustainability
 Long-term readability / renderability
of architectural models: addressing
digital decay (eg due to deprecated
file formats) and model evolution
Architectural Archives Architectural Web Data
Goals and Challenges (2/2)
Stefan Dietze (L3S Research Center)
4 / 20 20 / 03 / 201327/09/13
UBO: Universität Bonn
- Technical Coordinator
- WP4/WP5: change management, shape
recognition
Fraunhofer Austria
- WP2: system specification
& integration
TUE, Department of the Built Environment,
Eindhoven University of Technology
- WP3:,semantics & metadata
CITA, Center for Information Technology
and Architecture Copenhagen
- WP7: data, evaluation, test
Luleå University of Technology
- WP8: dissemination/exploitation
Catenda, SME
- User perspective, market requirements, evaluation
LUH: German National Library of
Science and Technology (TIB) &
L3S Research Center Hannover
-Coordinator
- WP3 Semantic Enrichment
- WP6 leader, long-term preservation
Consortium
Stefan Dietze (L3S Research Center)
5 / 20 20 / 03 / 201327/09/13
Why interlinking & semantic enrichment?
Stefan Dietze (L3S Research Center)
policies
traffichistory
environment
infrastructure
1. research 2. design 3. monitoring (over time)
A very simplistic view on urban planning/architectural lifecycle today
DURAARK approach - exploiting Web data to help architects and urban planners to
answer questions like:
 What‘s the legal, social and environmental context of a structure (sustainability policies etc)?
 How did buildings and their contexts (traffic, surroundings, usage and functionality, popularity, etc)
evolve over time?
 How did an architectural change impact surrounding traffic/environment?
(examples: bridges, airports)
 How did an architectural change impact popularity and attractiveness of a building?
 ….
6 / 20 20 / 03 / 201327/09/13
Architectural Data Preservation
3D Models
Point Clouds
Stefan Dietze (L3S Research Center)
Building Information
Models (BIM)
= structured „Building
Model Metadata“
7 / 20 20 / 03 / 201327/09/13
Architectural Data Preservation
SDA Scope
Stefan Dietze (L3S Research Center)
 Semantic enrichment of low-level architectural models
(gradual process)
 Interlinking of related models/data
(across different abstraction levels, model types, datasets
and repositories…)
 Preservation & temporal analysis: tracking the evolution
of models, buildings and related data
8 / 20 20 / 03 / 2013
Example: GDR’s People’s Palace - static vs evolving data/links
Social & Semantic Web for enrichment
9 / 20 20 / 03 / 201327/09/13
Semantic enrichment – schema/knowledge types
Challenges
 Selection of suitable datasets from
wealth of diverse datasets
 Preservation: dealing with evolution
of distributed datasets (i.e. the
semantics & context of the
structure/models)
10 / 20 20 / 03 / 201327/09/13 Stefan Dietze (L3S Research Center)
Data selection: too few information about too many datasets
 Lack of reliable dataset metadata but wide diversity (eg, DBpedia vs traffic stats London vs … ) :
 Spatial and temporal coverage ?
 Dynamics ? (evolution, frequency of changes…)
 Resource types & topics ? (policy documents vs traffic statistics)
 Currentness, availability, provenance, ….
Enrichment & Preservation
http://datahub.io/dataset/transport-data-gov-uk
329.527.661 triples
metadata
LOD cloud: 300++ datasets
DataHub: 6000++ datasets
11 / 20 20 / 03 / 201327/09/13
<geoLatLong:52/13>
Stefan Dietze (L3S Research Center)
Data preservation: handling evolution of distributed data
 Preservation needs to address evolution of distributed datasets / semantics of links
 In RDF graphs (such as the LOD Cloud), „all“ nodes are connected:
 Which datasets to preserve (only direct links or also more distant neighbours)?
(semantic relatedness, see [ESWC2013])
 Propagation of changes in LOD graph => measuring relevance of changes for specific entities
Enrichment & Preservation
<dbp:Berlin(east)>
<dura:GDR Peoples Palace>
<dbp:Berlin>
12 / 20 20 / 03 / 201327/09/13
<geoLatLong:52/13>
Stefan Dietze (L3S Research Center)
Data preservation: handling evolution of distributed data
 Preservation needs to address evolution of distributed datasets / semantics of links
 In RDF graphs (such as the LOD Cloud), „all“ nodes are connected:
 Which datasets to preserve (only direct links or also more distant neighbours)?
(semantic relatedness, see [ESWC2013])
 Propagation of changes in LOD graph => measuring relevance of changes for specific entities
 Preservation strategies dependent on dataset dynamics
 Simple linking (archiving) for static datasets (eg statistics over past periods in data.gov.uk)
 Recurring link computation and graph archival for dynamic datasets (frequency?)
Enrichment & Preservation
<dbp:Berlin(east)>
<dura:GDR Peoples Palace>
<dbp:Berlin>
Traffic statistics
(1986-1989)
Traffic statistics
(2013-…)
Energy efficiency policies
13 / 20 20 / 03 / 201327/09/13
Approach: dataset profiling
 Enrichment & preservation = intertwined process!
 Dataset selection & cataloging: via DataHub.io
(similar to LOD cloud)
 Dataset profiling: metadata about dataset dynamics, size,
types, topics, evolution, temporal/spatial coverage etc
=> Data observatory (see also [ESWC2013], [ISWC2013])
 Vocabulary curation (expert-based)
Web Data Curation for Building-related Data
DURAARK
Data Observatory
Automated processing to generate:
 Descriptive Dataset Profiles
 Data Interlinking & Correlation
Stefan Dietze (L3S Research Center)
describes
Endpoint
Retrieval & Graph
Extraction
Schema
Extraction and
Mapping
Sample Graph
Extraction
(per dataset)
NER & NED
(per resource)
Interlinking & Co-
Resolution
(cross-dataset)
Profiling
(topics, coverage,
dynamics,…)http://datahub.io/group/linked-building-data
14 / 20 20 / 03 / 201327/09/13
Endpoint
Retrieval & Graph
Extraction
Schema
Extraction and
Mapping
Sample Graph
Extraction
(per dataset)
NER & NED
(per resource)
Interlinking & Co-
Resolution
(cross-dataset)
Profiling
(topics, coverage,
dynamics,…)
Dataset
Catalog/Index
Links/
Cross-references
rdfs:label:„…ECB….“ ?
Dataset metadata (RDF/VoID):
 Schema mappings
(types, properties)
 Entities & categories
 Topic relevance scores
 Availability, currentness
data (tbc)
dbpedia:Finance
dbpedia:Sports
dbpedia:England-Wales-Cricket-Board
dbpedia:European_Central_Bank
Goals:
 RDF catalog of datasets
 Tracking the evolution
of datasets according
to, eg, topics,
dynamics, spatial
coverage, accessability
 Links and coreferences
=> unified view on data
=> Linked Building Data
Graph
 Infrastructure & APIs for
federated queries
Dataset profiling: processing workflow
Towards a Web Data ”Observatory”
Stefan Dietze (L3S Research Center)
dbpedia:Frankfurt
15 / 20 20 / 03 / 201327/09/13
Pipeline
 Demo categories!
Web Data Observatory – ongoing work
Stefan Dietze (L3S Research Center)
http://data.linkededucation.org/linkedup/categories-explorer
16 / 20 20 / 03 / 201327/09/13
Vocabulary Curation & Data Interlinking
Stefan Dietze (L3S Research Center)
 Using dataset profiles for semi-
automated data interlinking:
 Manual alignment of schemas &
vocabularies into unified RDF
graph
 Automated interlinking (and
preservation) techniques
 Preservation metadata (PREMIS RDF?)
 Expert-based curation of building-
related vocabularies
 BuildingSmartDD
(http://www.buildingsmart.org
/standards/ifd)
 OMNIClass, UNIClass
 SFB-NL (http://nl-
sfb.bk.tudelft.nl)
 CROW Library for
infrastructural objects
(http://www.gww-ob.nl/)
 …
17 / 20 20 / 03 / 201327/09/13
Conclusions
Summary
 “Data Observatory” as generic platform and domain-specific instantiation
(profiling building-related dataset aspects in DURAARK)
 Preservation/linking strategies for SDA based on dataset profiles (eg dynamics, relevance)
SDA Scope
Outlook
 Dataset selection: populating DataHub-
group
 Schema and vocabulary curation and
alignment
 Dataset profiling: establishing LDO,
considering range of metadata aspects
 Building SDA: data interlinking & dataset
preservation
DURAARK
Data Observatory
Stefan Dietze (L3S Research Center)
ongoing work
future work
18 / 20 20 / 03 / 201327/09/13
Thank you!
http://purl.org/dietze | @stefandietze
http://www.duraark.eu

More Related Content

What's hot

What's hot (20)

Preserving Computer-Aided Design, Digital Preservation Coalition Report
Preserving Computer-Aided Design, Digital Preservation Coalition ReportPreserving Computer-Aided Design, Digital Preservation Coalition Report
Preserving Computer-Aided Design, Digital Preservation Coalition Report
 
DURAARK Preserving Architectural Knowledge
DURAARK Preserving Architectural KnowledgeDURAARK Preserving Architectural Knowledge
DURAARK Preserving Architectural Knowledge
 
Towards a Linked Data Publishing Methodology
Towards a Linked Data Publishing MethodologyTowards a Linked Data Publishing Methodology
Towards a Linked Data Publishing Methodology
 
BDE SC3.3 Workshop - BDE review: Scope and Opportunities
 BDE SC3.3 Workshop -  BDE review: Scope and Opportunities BDE SC3.3 Workshop -  BDE review: Scope and Opportunities
BDE SC3.3 Workshop - BDE review: Scope and Opportunities
 
3D ICONS IPR experience
3D ICONS IPR experience 3D ICONS IPR experience
3D ICONS IPR experience
 
Using Open Research Data for Public Policy Making: Opportunities of Virtual R...
Using Open Research Data for Public Policy Making: Opportunities of Virtual R...Using Open Research Data for Public Policy Making: Opportunities of Virtual R...
Using Open Research Data for Public Policy Making: Opportunities of Virtual R...
 
Metadata, the CARARE aggregation service and 3D ICONS
Metadata, the CARARE aggregation service and 3D ICONSMetadata, the CARARE aggregation service and 3D ICONS
Metadata, the CARARE aggregation service and 3D ICONS
 
Potential usage of 3D data and IPR issues, presented by Sheena Basset
Potential usage of 3D data and IPR issues, presented by Sheena BassetPotential usage of 3D data and IPR issues, presented by Sheena Basset
Potential usage of 3D data and IPR issues, presented by Sheena Basset
 
Introduction to 3D ICONS
Introduction to 3D ICONSIntroduction to 3D ICONS
Introduction to 3D ICONS
 
The last mile of 3DIcons: making available 3D contents and their metadata thr...
The last mile of 3DIcons: making available 3D contents and their metadata thr...The last mile of 3DIcons: making available 3D contents and their metadata thr...
The last mile of 3DIcons: making available 3D contents and their metadata thr...
 
Metadata for 3D models, Sheena Bassett
Metadata for 3D models, Sheena BassettMetadata for 3D models, Sheena Bassett
Metadata for 3D models, Sheena Bassett
 
Big Data Europe SC6 WS #3: Big Data Europe Platform: Apps, challenges, goals ...
Big Data Europe SC6 WS #3: Big Data Europe Platform: Apps, challenges, goals ...Big Data Europe SC6 WS #3: Big Data Europe Platform: Apps, challenges, goals ...
Big Data Europe SC6 WS #3: Big Data Europe Platform: Apps, challenges, goals ...
 
SC1 Workshop 2 Pilot instantiations
SC1 Workshop 2 Pilot instantiationsSC1 Workshop 2 Pilot instantiations
SC1 Workshop 2 Pilot instantiations
 
Josep Maria Salanova - Introduction to BDE+SC4
Josep Maria Salanova - Introduction to BDE+SC4Josep Maria Salanova - Introduction to BDE+SC4
Josep Maria Salanova - Introduction to BDE+SC4
 
The DURAARK Workbench and PREMIS
The DURAARK Workbench and PREMISThe DURAARK Workbench and PREMIS
The DURAARK Workbench and PREMIS
 
Big Data Europe SC6 WS #3: PILOT SC6: CITIZEN BUDGET ON MUNICIPAL LEVEL, Mart...
Big Data Europe SC6 WS #3: PILOT SC6: CITIZEN BUDGET ON MUNICIPAL LEVEL, Mart...Big Data Europe SC6 WS #3: PILOT SC6: CITIZEN BUDGET ON MUNICIPAL LEVEL, Mart...
Big Data Europe SC6 WS #3: PILOT SC6: CITIZEN BUDGET ON MUNICIPAL LEVEL, Mart...
 
Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...
Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...
Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...
 
Data management plans – EUDAT Best practices and case study | www.eudat.eu
Data management plans – EUDAT Best practices and case study | www.eudat.euData management plans – EUDAT Best practices and case study | www.eudat.eu
Data management plans – EUDAT Best practices and case study | www.eudat.eu
 
The Next Generation of the Microdata Information System MISSY - An Integrated...
The Next Generation of the Microdata Information System MISSY - An Integrated...The Next Generation of the Microdata Information System MISSY - An Integrated...
The Next Generation of the Microdata Information System MISSY - An Integrated...
 
2016 SDMX Experts meeting, National Accounts business case (validation, data ...
2016 SDMX Experts meeting, National Accounts business case (validation, data ...2016 SDMX Experts meeting, National Accounts business case (validation, data ...
2016 SDMX Experts meeting, National Accounts business case (validation, data ...
 

Viewers also liked

KnowEscape workshop, OKCon 2013
KnowEscape workshop, OKCon 2013KnowEscape workshop, OKCon 2013
KnowEscape workshop, OKCon 2013
Stefan Dietze
 

Viewers also liked (7)

DURAARK at Bibliotheksymposium Wildau
DURAARK at Bibliotheksymposium WildauDURAARK at Bibliotheksymposium Wildau
DURAARK at Bibliotheksymposium Wildau
 
Grapp2014 presentation
Grapp2014 presentationGrapp2014 presentation
Grapp2014 presentation
 
Turning Data into Knowledge (KESW2014 Keynote)
Turning Data into Knowledge (KESW2014 Keynote)Turning Data into Knowledge (KESW2014 Keynote)
Turning Data into Knowledge (KESW2014 Keynote)
 
What's all the data about? - Linking and Profiling of Linked Datasets
What's all the data about? - Linking and Profiling of Linked DatasetsWhat's all the data about? - Linking and Profiling of Linked Datasets
What's all the data about? - Linking and Profiling of Linked Datasets
 
DURAARK at AUdS 2015
DURAARK at AUdS 2015DURAARK at AUdS 2015
DURAARK at AUdS 2015
 
A Scalable Approach for Efficiently Generating Structured Dataset Topic Profiles
A Scalable Approach for Efficiently Generating Structured Dataset Topic ProfilesA Scalable Approach for Efficiently Generating Structured Dataset Topic Profiles
A Scalable Approach for Efficiently Generating Structured Dataset Topic Profiles
 
KnowEscape workshop, OKCon 2013
KnowEscape workshop, OKCon 2013KnowEscape workshop, OKCon 2013
KnowEscape workshop, OKCon 2013
 

Similar to Towards preservation of semantically enriched architectural knowledge

LinkedUp - Linked Data Europe Workshop 2014
LinkedUp - Linked Data Europe Workshop 2014LinkedUp - Linked Data Europe Workshop 2014
LinkedUp - Linked Data Europe Workshop 2014
Stefan Dietze
 
Web Science Synergies: Exploring Web Knowledge through the Semantic Web
Web Science Synergies: Exploring Web Knowledge through the Semantic WebWeb Science Synergies: Exploring Web Knowledge through the Semantic Web
Web Science Synergies: Exploring Web Knowledge through the Semantic Web
Stefan Dietze
 

Similar to Towards preservation of semantically enriched architectural knowledge (20)

Linked Data for Architecture, Engineering and Construction (AEC)
Linked Data for Architecture, Engineering and Construction (AEC)Linked Data for Architecture, Engineering and Construction (AEC)
Linked Data for Architecture, Engineering and Construction (AEC)
 
To architect or engineer? Lessons from DataPool on building RDM repositories
To architect or engineer? Lessons from DataPool on building RDM repositoriesTo architect or engineer? Lessons from DataPool on building RDM repositories
To architect or engineer? Lessons from DataPool on building RDM repositories
 
Retrieval, Crawling and Fusion of Entity-centric Data on the Web
Retrieval, Crawling and Fusion of Entity-centric Data on the WebRetrieval, Crawling and Fusion of Entity-centric Data on the Web
Retrieval, Crawling and Fusion of Entity-centric Data on the Web
 
LinkedUp - Linked Data Europe Workshop 2014
LinkedUp - Linked Data Europe Workshop 2014LinkedUp - Linked Data Europe Workshop 2014
LinkedUp - Linked Data Europe Workshop 2014
 
Open Education Challenge 2014: exploiting Linked Data in Educational Applicat...
Open Education Challenge 2014: exploiting Linked Data in Educational Applicat...Open Education Challenge 2014: exploiting Linked Data in Educational Applicat...
Open Education Challenge 2014: exploiting Linked Data in Educational Applicat...
 
UW Libraries Data Services Forum
UW Libraries Data Services ForumUW Libraries Data Services Forum
UW Libraries Data Services Forum
 
Web Science Synergies: Exploring Web Knowledge through the Semantic Web
Web Science Synergies: Exploring Web Knowledge through the Semantic WebWeb Science Synergies: Exploring Web Knowledge through the Semantic Web
Web Science Synergies: Exploring Web Knowledge through the Semantic Web
 
Claremont Report on Database Research: Research Directions (Le Gruenwald)
Claremont Report on Database Research: Research Directions (Le Gruenwald)Claremont Report on Database Research: Research Directions (Le Gruenwald)
Claremont Report on Database Research: Research Directions (Le Gruenwald)
 
MODEL_FOR_SEMANTICALLY_RICH_POINT_CLOUD.pdf
MODEL_FOR_SEMANTICALLY_RICH_POINT_CLOUD.pdfMODEL_FOR_SEMANTICALLY_RICH_POINT_CLOUD.pdf
MODEL_FOR_SEMANTICALLY_RICH_POINT_CLOUD.pdf
 
Developing and applying the CARARE metadata schema for 3D documentation, pres...
Developing and applying the CARARE metadata schema for 3D documentation, pres...Developing and applying the CARARE metadata schema for 3D documentation, pres...
Developing and applying the CARARE metadata schema for 3D documentation, pres...
 
General concepts: DDI
General concepts: DDIGeneral concepts: DDI
General concepts: DDI
 
remotesensing-12-01253.pdf
remotesensing-12-01253.pdfremotesensing-12-01253.pdf
remotesensing-12-01253.pdf
 
Donders Institute - Research Data Management
Donders Institute - Research Data Management Donders Institute - Research Data Management
Donders Institute - Research Data Management
 
Knowledge Graph Introduction
Knowledge Graph IntroductionKnowledge Graph Introduction
Knowledge Graph Introduction
 
From Data to Knowledge - Profiling & Interlinking Web Datasets
From Data to Knowledge - Profiling & Interlinking Web DatasetsFrom Data to Knowledge - Profiling & Interlinking Web Datasets
From Data to Knowledge - Profiling & Interlinking Web Datasets
 
Survey of Object Oriented Database
Survey of Object Oriented DatabaseSurvey of Object Oriented Database
Survey of Object Oriented Database
 
NoSQL Databases - Lecture 12 - Introduction to Databases (1007156ANR)
NoSQL Databases - Lecture 12 - Introduction to Databases (1007156ANR)NoSQL Databases - Lecture 12 - Introduction to Databases (1007156ANR)
NoSQL Databases - Lecture 12 - Introduction to Databases (1007156ANR)
 
Beyond Linked Data - Exploiting Entity-Centric Knowledge on the Web
Beyond Linked Data - Exploiting Entity-Centric Knowledge on the WebBeyond Linked Data - Exploiting Entity-Centric Knowledge on the Web
Beyond Linked Data - Exploiting Entity-Centric Knowledge on the Web
 
9th International Conference on Data Mining and Database Management Systems (...
9th International Conference on Data Mining and Database Management Systems (...9th International Conference on Data Mining and Database Management Systems (...
9th International Conference on Data Mining and Database Management Systems (...
 
Data integration with a façade. The case of knowledge graph construction.
Data integration with a façade. The case of knowledge graph construction.Data integration with a façade. The case of knowledge graph construction.
Data integration with a façade. The case of knowledge graph construction.
 

More from Stefan Dietze

WWW2014 Tutorial: Online Learning & Linked Data - Lessons Learned
WWW2014 Tutorial: Online Learning & Linked Data - Lessons LearnedWWW2014 Tutorial: Online Learning & Linked Data - Lessons Learned
WWW2014 Tutorial: Online Learning & Linked Data - Lessons Learned
Stefan Dietze
 
Open Data & Education Seminar, ITMO, St Petersburg, March 2014
Open Data & Education Seminar, ITMO, St Petersburg, March 2014Open Data & Education Seminar, ITMO, St Petersburg, March 2014
Open Data & Education Seminar, ITMO, St Petersburg, March 2014
Stefan Dietze
 

More from Stefan Dietze (20)

AI in between online and offline discourse - and what has ChatGPT to do with ...
AI in between online and offline discourse - and what has ChatGPT to do with ...AI in between online and offline discourse - and what has ChatGPT to do with ...
AI in between online and offline discourse - and what has ChatGPT to do with ...
 
An interdisciplinary journey with the SAL spaceship – results and challenges ...
An interdisciplinary journey with the SAL spaceship – results and challenges ...An interdisciplinary journey with the SAL spaceship – results and challenges ...
An interdisciplinary journey with the SAL spaceship – results and challenges ...
 
Research Knowledge Graphs at NFDI4DS & GESIS
Research Knowledge Graphs at NFDI4DS & GESISResearch Knowledge Graphs at NFDI4DS & GESIS
Research Knowledge Graphs at NFDI4DS & GESIS
 
Research Knowledge Graphs at GESIS & NFDI4DataScience
Research Knowledge Graphs at GESIS & NFDI4DataScienceResearch Knowledge Graphs at GESIS & NFDI4DataScience
Research Knowledge Graphs at GESIS & NFDI4DataScience
 
Human-in-the-loop: the Web as Foundation for interdisciplinary Data Science M...
Human-in-the-loop: the Web as Foundation for interdisciplinary Data Science M...Human-in-the-loop: the Web as Foundation for interdisciplinary Data Science M...
Human-in-the-loop: the Web as Foundation for interdisciplinary Data Science M...
 
Human-in-the-Loop: das Web als Grundlage interdisziplinärer Data Science Meth...
Human-in-the-Loop: das Web als Grundlage interdisziplinärer Data Science Meth...Human-in-the-Loop: das Web als Grundlage interdisziplinärer Data Science Meth...
Human-in-the-Loop: das Web als Grundlage interdisziplinärer Data Science Meth...
 
Towards research data knowledge graphs
Towards research data knowledge graphsTowards research data knowledge graphs
Towards research data knowledge graphs
 
Beyond research data infrastructures: exploiting artificial & crowd intellige...
Beyond research data infrastructures: exploiting artificial & crowd intellige...Beyond research data infrastructures: exploiting artificial & crowd intellige...
Beyond research data infrastructures: exploiting artificial & crowd intellige...
 
From Web Data to Knowledge: on the Complementarity of Human and Artificial In...
From Web Data to Knowledge: on the Complementarity of Human and Artificial In...From Web Data to Knowledge: on the Complementarity of Human and Artificial In...
From Web Data to Knowledge: on the Complementarity of Human and Artificial In...
 
Using AI to understand everyday learning on the Web
Using AI to understand everyday learning on the WebUsing AI to understand everyday learning on the Web
Using AI to understand everyday learning on the Web
 
Analysing User Knowledge, Competence and Learning during Online Activities
Analysing User Knowledge, Competence and Learning during Online ActivitiesAnalysing User Knowledge, Competence and Learning during Online Activities
Analysing User Knowledge, Competence and Learning during Online Activities
 
Analysing & Improving Learning Resources Markup on the Web
Analysing & Improving Learning Resources Markup on the WebAnalysing & Improving Learning Resources Markup on the Web
Analysing & Improving Learning Resources Markup on the Web
 
Big Data in Learning Analytics - Analytics for Everyday Learning
Big Data in Learning Analytics - Analytics for Everyday LearningBig Data in Learning Analytics - Analytics for Everyday Learning
Big Data in Learning Analytics - Analytics for Everyday Learning
 
Mining and Understanding Activities and Resources on the Web
Mining and Understanding Activities and Resources on the WebMining and Understanding Activities and Resources on the Web
Mining and Understanding Activities and Resources on the Web
 
Towards embedded Markup of Learning Resources on the Web
Towards embedded Markup of Learning Resources on the WebTowards embedded Markup of Learning Resources on the Web
Towards embedded Markup of Learning Resources on the Web
 
Semantic Linking & Retrieval for Digital Libraries
Semantic Linking & Retrieval for Digital LibrariesSemantic Linking & Retrieval for Digital Libraries
Semantic Linking & Retrieval for Digital Libraries
 
Dietze linked data-vr-es
Dietze linked data-vr-esDietze linked data-vr-es
Dietze linked data-vr-es
 
WWW2014 Tutorial: Online Learning & Linked Data - Lessons Learned
WWW2014 Tutorial: Online Learning & Linked Data - Lessons LearnedWWW2014 Tutorial: Online Learning & Linked Data - Lessons Learned
WWW2014 Tutorial: Online Learning & Linked Data - Lessons Learned
 
Demo: Profiling & Exploration of Linked Open Data
Demo: Profiling & Exploration of Linked Open DataDemo: Profiling & Exploration of Linked Open Data
Demo: Profiling & Exploration of Linked Open Data
 
Open Data & Education Seminar, ITMO, St Petersburg, March 2014
Open Data & Education Seminar, ITMO, St Petersburg, March 2014Open Data & Education Seminar, ITMO, St Petersburg, March 2014
Open Data & Education Seminar, ITMO, St Petersburg, March 2014
 

Recently uploaded

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Recently uploaded (20)

Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 

Towards preservation of semantically enriched architectural knowledge

  • 1. 1 / 20 20 / 03 / 2013 Towards Preservation of semantically enriched Architectural Knowledge Stefan Dietze (L3S Research Center, Leibniz University Hanover, DE) Stefan Dietze, Jakob Beetz, Ujwal Gadiraju, Georgios Katsimpras, Raoul Wessel, René Berndt
  • 2. 2 / 20 20 / 03 / 201327/09/13 Challenges  Diversity of data - interoperability: low-level point clouds & legacy 3D models up to enriched Building Information Models (BIM), higher-level semantics and Web data / knowledge  Diverse stakeholders: architects, building operators, urban planners, archivists, …  Building, model and data evolution: document temporal evolution to prevent information loss Goals and Challenges (1/2) Goal  Methods and tools for sustainable long-term preservation of architectural knowledge Stefan Dietze (L3S Research Center)
  • 3. 3 / 20 20 / 03 / 201327/09/13 Challenges  “Semantic” enrichment of architectural knowledge: exploiting Web data and knowledge to enrich low-level architectural data.  Inconsistent vocabularies: adopting state of the art (LD) vocabularies and schemas towards sustainability  Long-term readability / renderability of architectural models: addressing digital decay (eg due to deprecated file formats) and model evolution Architectural Archives Architectural Web Data Goals and Challenges (2/2) Stefan Dietze (L3S Research Center)
  • 4. 4 / 20 20 / 03 / 201327/09/13 UBO: Universität Bonn - Technical Coordinator - WP4/WP5: change management, shape recognition Fraunhofer Austria - WP2: system specification & integration TUE, Department of the Built Environment, Eindhoven University of Technology - WP3:,semantics & metadata CITA, Center for Information Technology and Architecture Copenhagen - WP7: data, evaluation, test Luleå University of Technology - WP8: dissemination/exploitation Catenda, SME - User perspective, market requirements, evaluation LUH: German National Library of Science and Technology (TIB) & L3S Research Center Hannover -Coordinator - WP3 Semantic Enrichment - WP6 leader, long-term preservation Consortium Stefan Dietze (L3S Research Center)
  • 5. 5 / 20 20 / 03 / 201327/09/13 Why interlinking & semantic enrichment? Stefan Dietze (L3S Research Center) policies traffichistory environment infrastructure 1. research 2. design 3. monitoring (over time) A very simplistic view on urban planning/architectural lifecycle today DURAARK approach - exploiting Web data to help architects and urban planners to answer questions like:  What‘s the legal, social and environmental context of a structure (sustainability policies etc)?  How did buildings and their contexts (traffic, surroundings, usage and functionality, popularity, etc) evolve over time?  How did an architectural change impact surrounding traffic/environment? (examples: bridges, airports)  How did an architectural change impact popularity and attractiveness of a building?  ….
  • 6. 6 / 20 20 / 03 / 201327/09/13 Architectural Data Preservation 3D Models Point Clouds Stefan Dietze (L3S Research Center) Building Information Models (BIM) = structured „Building Model Metadata“
  • 7. 7 / 20 20 / 03 / 201327/09/13 Architectural Data Preservation SDA Scope Stefan Dietze (L3S Research Center)  Semantic enrichment of low-level architectural models (gradual process)  Interlinking of related models/data (across different abstraction levels, model types, datasets and repositories…)  Preservation & temporal analysis: tracking the evolution of models, buildings and related data
  • 8. 8 / 20 20 / 03 / 2013 Example: GDR’s People’s Palace - static vs evolving data/links Social & Semantic Web for enrichment
  • 9. 9 / 20 20 / 03 / 201327/09/13 Semantic enrichment – schema/knowledge types Challenges  Selection of suitable datasets from wealth of diverse datasets  Preservation: dealing with evolution of distributed datasets (i.e. the semantics & context of the structure/models)
  • 10. 10 / 20 20 / 03 / 201327/09/13 Stefan Dietze (L3S Research Center) Data selection: too few information about too many datasets  Lack of reliable dataset metadata but wide diversity (eg, DBpedia vs traffic stats London vs … ) :  Spatial and temporal coverage ?  Dynamics ? (evolution, frequency of changes…)  Resource types & topics ? (policy documents vs traffic statistics)  Currentness, availability, provenance, …. Enrichment & Preservation http://datahub.io/dataset/transport-data-gov-uk 329.527.661 triples metadata LOD cloud: 300++ datasets DataHub: 6000++ datasets
  • 11. 11 / 20 20 / 03 / 201327/09/13 <geoLatLong:52/13> Stefan Dietze (L3S Research Center) Data preservation: handling evolution of distributed data  Preservation needs to address evolution of distributed datasets / semantics of links  In RDF graphs (such as the LOD Cloud), „all“ nodes are connected:  Which datasets to preserve (only direct links or also more distant neighbours)? (semantic relatedness, see [ESWC2013])  Propagation of changes in LOD graph => measuring relevance of changes for specific entities Enrichment & Preservation <dbp:Berlin(east)> <dura:GDR Peoples Palace> <dbp:Berlin>
  • 12. 12 / 20 20 / 03 / 201327/09/13 <geoLatLong:52/13> Stefan Dietze (L3S Research Center) Data preservation: handling evolution of distributed data  Preservation needs to address evolution of distributed datasets / semantics of links  In RDF graphs (such as the LOD Cloud), „all“ nodes are connected:  Which datasets to preserve (only direct links or also more distant neighbours)? (semantic relatedness, see [ESWC2013])  Propagation of changes in LOD graph => measuring relevance of changes for specific entities  Preservation strategies dependent on dataset dynamics  Simple linking (archiving) for static datasets (eg statistics over past periods in data.gov.uk)  Recurring link computation and graph archival for dynamic datasets (frequency?) Enrichment & Preservation <dbp:Berlin(east)> <dura:GDR Peoples Palace> <dbp:Berlin> Traffic statistics (1986-1989) Traffic statistics (2013-…) Energy efficiency policies
  • 13. 13 / 20 20 / 03 / 201327/09/13 Approach: dataset profiling  Enrichment & preservation = intertwined process!  Dataset selection & cataloging: via DataHub.io (similar to LOD cloud)  Dataset profiling: metadata about dataset dynamics, size, types, topics, evolution, temporal/spatial coverage etc => Data observatory (see also [ESWC2013], [ISWC2013])  Vocabulary curation (expert-based) Web Data Curation for Building-related Data DURAARK Data Observatory Automated processing to generate:  Descriptive Dataset Profiles  Data Interlinking & Correlation Stefan Dietze (L3S Research Center) describes Endpoint Retrieval & Graph Extraction Schema Extraction and Mapping Sample Graph Extraction (per dataset) NER & NED (per resource) Interlinking & Co- Resolution (cross-dataset) Profiling (topics, coverage, dynamics,…)http://datahub.io/group/linked-building-data
  • 14. 14 / 20 20 / 03 / 201327/09/13 Endpoint Retrieval & Graph Extraction Schema Extraction and Mapping Sample Graph Extraction (per dataset) NER & NED (per resource) Interlinking & Co- Resolution (cross-dataset) Profiling (topics, coverage, dynamics,…) Dataset Catalog/Index Links/ Cross-references rdfs:label:„…ECB….“ ? Dataset metadata (RDF/VoID):  Schema mappings (types, properties)  Entities & categories  Topic relevance scores  Availability, currentness data (tbc) dbpedia:Finance dbpedia:Sports dbpedia:England-Wales-Cricket-Board dbpedia:European_Central_Bank Goals:  RDF catalog of datasets  Tracking the evolution of datasets according to, eg, topics, dynamics, spatial coverage, accessability  Links and coreferences => unified view on data => Linked Building Data Graph  Infrastructure & APIs for federated queries Dataset profiling: processing workflow Towards a Web Data ”Observatory” Stefan Dietze (L3S Research Center) dbpedia:Frankfurt
  • 15. 15 / 20 20 / 03 / 201327/09/13 Pipeline  Demo categories! Web Data Observatory – ongoing work Stefan Dietze (L3S Research Center) http://data.linkededucation.org/linkedup/categories-explorer
  • 16. 16 / 20 20 / 03 / 201327/09/13 Vocabulary Curation & Data Interlinking Stefan Dietze (L3S Research Center)  Using dataset profiles for semi- automated data interlinking:  Manual alignment of schemas & vocabularies into unified RDF graph  Automated interlinking (and preservation) techniques  Preservation metadata (PREMIS RDF?)  Expert-based curation of building- related vocabularies  BuildingSmartDD (http://www.buildingsmart.org /standards/ifd)  OMNIClass, UNIClass  SFB-NL (http://nl- sfb.bk.tudelft.nl)  CROW Library for infrastructural objects (http://www.gww-ob.nl/)  …
  • 17. 17 / 20 20 / 03 / 201327/09/13 Conclusions Summary  “Data Observatory” as generic platform and domain-specific instantiation (profiling building-related dataset aspects in DURAARK)  Preservation/linking strategies for SDA based on dataset profiles (eg dynamics, relevance) SDA Scope Outlook  Dataset selection: populating DataHub- group  Schema and vocabulary curation and alignment  Dataset profiling: establishing LDO, considering range of metadata aspects  Building SDA: data interlinking & dataset preservation DURAARK Data Observatory Stefan Dietze (L3S Research Center) ongoing work future work
  • 18. 18 / 20 20 / 03 / 201327/09/13 Thank you! http://purl.org/dietze | @stefandietze http://www.duraark.eu