3. • KM deals with…
Knowledge Creation
Knowledge Accumulation
Knowledge Transfer
Knowledge Sharing
Knowledge Utilization
• All the above for better decision making in
organizations
4.
5. • Facts are connected to form information
• Information when contextualized becomes Knowledge
• The underlying structure in our brain is similar to a network or
a graph with nodes connected by edges
• Nodes are data
• Edges are relations
• Nodes + Edges = Information
• A group of Nodes and Edges form Hyper Graph (KNOWLEDGE)
6. • Q: How is data represented in the real world?
• Ans: ROWS AND COLUMNS (classic Relational Algebra)
7. Data Silo
No natural
integration
Re-invention of
wheel
Cumbersome
process
8.
9. Resource Description Framework (RDF)
Subject-Predicate -Object
Jurong belongs to the West Zone
Linked Data Representation Forma
http://data.gov.sg/resource/area/Jurong_West
http://data.gov.sg/ontology/property/has_zone
http://data.gov.sg/resource/zone/West
Subject
Predicate
Object
http://w3.org/2003/01/geo/wgs84_pos#/lat
http://w3.org/2003/01/geo/wgs84_pos#/long12.5555
0.21222
Resource
Vocabulary
& Resource
Resource
11. • Dbpedia (http://dbpedia.org/) is the linked data representation
of Wikipedia
• Wiki page:
http://en.wikipedia.org/wiki/Boon_Lay_(Jurong_West)
• DBpedia page:
http://dbpedia.org/page/Boon_Lay_(Jurong_West)
• Example for Querying Linked Data from Dbpedia using
SPARQL
http://dbpedia.org/sparql
select ?actor, count(?a) where
{
?a <http://dbpedia.org/ontology/starring> ?actor .
?actor <http://dbpedia.org/ontology/birthPlace> <http://dbpedia.org/resource/Singapore> .
}
14. • Corporate and private data is a no-go
• Public data
• Freely available data
• Data of global importance
• Data that doesn’t require much domain knowledge
• Ideal Data: Government Open Data
16. iDA Singapore launched data.gov.sg portal and mGov@SG public services during
June 2011
Data.gov.sg provides 5000+ public data sets from 50 government agencies
Purpose: Building applications, research and for creating applications using the
data
data.gov.sg
17. Why did we do this project?
To prescribe a migrational framework for linked data for
data.gov.sg (DGS) data sets
First hand view of the required migration activities
Issues anticipated at each step
Evaluation & Recommendation on Linked Data tools
To help IDA in understanding the benefits of Linked Data
18. ABC Water Proj (R)
Agency Websites
Singstat
publicationsMINISTRIES
XLS
HTML
PDF
Accountant-General's Department
Accounting and Corporate Regulatory Authority
Agency For Science, Technology & Research
Attorney-General’s Chambers
Building & Construction Authority
Central Narcotics Bureau
Central Provident Fund Board
Civil Aviation Authority of Singapore
Department of Statistics
Economic Development Board
Energy Market Authority
Health Sciences Authority
Housing & Development Board
Immigration & Checkpoints Authority
Infocomm Development Authority of Singapore
Inland Revenue Authority of Singapore
Institute of Technical Education
Intellectual Property Office of Singapore
JTC Corporation
Judiciary, Subordinate Courts
Judiciary, Supreme Court
Land Transport Authority
Majlis Ugama Islam Singapura
Maritime & Port Authority of Singapore
Monetary Authority of Singapore
Nanyang Polytechnic
National Environment Agency
National Heritage Board
National Library Board
National Parks Board
Ngee Ann Polytechnic
People's Association
Public Service Division
Public Transport Council
Public Utilities Board
Republic Polytechnic
Sentosa Development Corporation
Singapore Civil Defence Force
Singapore Customs
Singapore Land Authority
Singapore Police Force
Singapore Polytechnic
Singapore Sports Council
Singapore Workforce Development Agency
Spring Singapore
Temasek Polytechnic
Urban Redevelopment Authority
Ministry of Community Development, Youth & Sports
Ministry of Education
Ministry of Foreign Affairs
Ministry of Health
Ministry of Law –Community Mediation Unit
Ministry of Manpower
Ministry of Transport
Media Development Authority
BFABuildings(C)
GreenBuilding(E)
C- Community
Cul - Culture
E- Environment
Emp- Employment
Edu - Education
H- Health
F- Family
R- Recreation
S- Sports
Breast Screen (H)
Cervical Screen (H)
Healthier Dining (H)
Quit Centers (H)
Infocomm Access (C)
Silver infocomm (C)
Wireless Hotspots (R)
Child care (F)
Disability (F)
Elder care (F)
Family (F)
Family Friendly Estab (F)
Student Care (F)
Comm Mediation Center (C)
After Death Facilities (E)
Funeral Palours (E)
Dengue Cluster (H)
Hawker Center (E)
NEA Offices (E)
Recycling Bins (E)
Waste Disposal Site (E)
Waste Treatment (E)
Heritage sites(Cul)
Monuments(Cul)
Museums(Cul)
Libraries (Cul)
Streets and Places(Cul)
CD Councils (C)
Community Clubs (C)
Constituency offices (C)
Other facilities (C)
Other Pan networks (C)
PA head quarters (C)
Residents Committee(C)
Water Venture (C)
National Parks (R)
Skyrise greenery (E)
Sports clubs (S)
CET Centers(Emp)
WDA Service points(Emp)
Kindergartens (Edu)
Get TokenAddress
SearchAgency Data
SearchStatic Map
Get Layer InfoMashup
Get Related Data
Get Directions
Public Transportation
Reverse Geocode
Map-related APIs from various agencies
Traffic-related APIs from Land Transport Authority
Tourism-related APIs from the Singapore Tourism Board
Environment-related APIs from the National Environment Agency
Library-related data feeds & web services from National Library Board
DGS Eco System
SG DATA
TEXTUAL
SPATIAL
API
THEMES OPERATIONSCATEGORIES
UNSTRUCTURED DATA
STRUCTURED DATA
STRUCTURED DATA
STATUTORY
BOARDS
SG Government Data Eco Syst
19. Proposed Linked Data Migrational
Framework for DGS
Specification Identfication Analysis
Object Modeling
Ontology Modeling
URI Naming
RDF Creation
External Linking
Datasets Publication
Discovery & Exploitation
Re-use Create
S2R D2R A2R
Govt Agencies and IDA
Govt Agencies Domain
Matter Experts
Ontology Modelers
IDA and Web Architects
Developers
Developers and Domain
Experts
Developers
Web Architects
Objectives
Specifications
Project Duration
Dataset Prioritization
Dataset License Setting
Impln Mode Selection
Roadmap
Architecture
Overview
Relational Model
Dataset Overview
Drawing Objects in
Whiteboard
Conceptual View
Conceptual View
Public Vocabularies
Re-use of Existing
Vocabularies
Creation of New
Vocabularies
OWL, RDFS, RDF
Vocabulary files
Resources
Class and Properties
Visualization of URI
mining process
URI Administration
URI Lifecycle
ER Model
Spreadsheets,
DBMS, API
Conversion to RDF triples
using Mapping files
RDF Triples
Government and
external data sets
Linking based on
Similarity Algorithms
Outbound Links
RDF Triples
Ontologies
SPARQL, API
Data Insertion
VOID Modeling
Data Retrieval
API to SPARQL conversion
VOID Triples
JSON data
Actual Data
Existing Apps
Gamification
Crowdsourcing
Catalog Registration
External Reference
New Apps
INPUT
PROCESS
OUTPUT
INPUT
PROCESS
OUTPUT
INPUT
PROCESS
OUTPUT
INPUT
PROCESS
OUTPUT
INPUT
PROCESS
OUTPUT
INPUT
PROCESS
OUTPUT
INPUT
PROCESS
OUTPUT
INPUT
PROCESS
OUTPUT
Resource
Allocation
10
Resource
Allocation
15
Resource
Allocation
15
Resource
Allocation
5
Resource
Allocation
20
Resource
Allocation
5
Resource
Allocation
15
Resource
Allocation
15
21. • Linked Data’s representation of data sets eases
Knowledge Creation and facilitates Knowledge
Transfer
• Data in LD format becomes universally
recognizable and consumable (ex:
http://live.dbpedia.org/page/Lee_Kuan_Yew)
• The prescribed framework outlines the steps to
be executed for data conversion to LD format for
SG Govt Datasets
• Probable issues and best Linked Data software
libraries and tools were also put forth
22. Thank
You
Link to Original CI presentation
http://www.slideshare.net/aravindsraamkumar/proposed-linked-data-migration-
framework-for-singapore-government-datasets
23. Linking Enterprise Data by David Wood (2010)
• http://www.springer.com/gp/book/9781441976642
Beginner book:
Linked Data: Evolving the Web into a Global Data Space
• http://linkeddatabook.com/editions/1.0/