This document proposes an 8-step framework for migrating Singapore government data to a linked data format. The framework includes steps for specification identification, data set analysis, object modeling, ontology modeling, URI naming, RDF creation, external linking, and dataset publication. It evaluates existing linked data implementations and tools to recommend approaches for each step. The goal is to standardize terms across agencies and make government data more conveniently reusable and joinable through linked data principles.
This presentation focuses on providing means for exploring Linked Data. In particular, it gives an overview of current visualization tools and techniques, looking at semantic browsers and applications for presenting the data to the end used. We also describe existing search options, including faceted search, concept-based search and hybrid search, based on a mix of using semantic information and text processing. Finally, we conclude with approaches for Linked Data analysis, describing how available data can be synthesized and processed in order to draw conclusions.
ESWC 2016 Tutorial on RDF Benchmarks
(This project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 688227.)
This presentation covers the whole spectrum of Linked Data production and exposure. After a grounding in the Linked Data principles and best practices, with special emphasis on the VoID vocabulary, we cover R2RML, operating on relational databases, Open Refine, operating on spreadsheets, and GATECloud, operating on natural language. Finally we describe the means to increase interlinkage between datasets, especially the use of tools like Silk.
This presentation focuses on providing means for exploring Linked Data. In particular, it gives an overview of current visualization tools and techniques, looking at semantic browsers and applications for presenting the data to the end used. We also describe existing search options, including faceted search, concept-based search and hybrid search, based on a mix of using semantic information and text processing. Finally, we conclude with approaches for Linked Data analysis, describing how available data can be synthesized and processed in order to draw conclusions.
ESWC 2016 Tutorial on RDF Benchmarks
(This project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 688227.)
This presentation covers the whole spectrum of Linked Data production and exposure. After a grounding in the Linked Data principles and best practices, with special emphasis on the VoID vocabulary, we cover R2RML, operating on relational databases, Open Refine, operating on spreadsheets, and GATECloud, operating on natural language. Finally we describe the means to increase interlinkage between datasets, especially the use of tools like Silk.
Building a Self-service Data Download Application For AZGeoSafe Software
The presenters will share there experience in designing, developing and deploying the self-service web-application built on FME server, Openlayers and Angular JS. Future enhancements to the application will be discussed as well.
Experimental transformation of ABS data into Data Cube Vocabulary (DCV) form...Alistair Hamilton
Presentation by Al Hamilton and Cody Johnson to Canberra Semantic Web Meetup Group on why producers of official statistics are interested in semantic web community (including Linked Open Data) and outlining experimental work by Cody Johnson on transforming selected Population Census data released by the ABS in SDMX-ML to RDF Data Cube Vocabulary format.
Towards Digital Twin standards following an open source approachFIWARE
Digital Twins are gaining momentum when designing smart solutions in different application domains. However, there is a lack of open standards that warrant interoperability and portability of solutions, avoiding vendor lock-in.
During the presentation, we will review major developments in this area, focused on the adoption of a standard API for accessing Digital Twin Data and Smart Data Models. We will review how a Digital Twin approach enables data integration at different levels: architecting vertical smart solutions, within smart organizations and across organizations. At all levels interfacing with IoT, BigData, AI/ML, Blockchain, or Robotics technologies.
Tim Davies, SMART Infrastructure Facility E-Research Coordinator, presented the SMART Data Management Systement as part of the SMART Seminar Series on Thursday, 5th March 2015.
SmartCities increase citizens’ quality of life and improve the efficiency and quality of the services provided by governing entities and business
“The city must become like the Internet, i.e. enabling creative development and easy deployment of applications which aim to empower the citizen” - THE APPS FOR SMART CITIES MANIFESTO
This view can be achieved by leveraging:
Available infrastructure such as Open Government Data and deployed sensor networks in cities
Citizens’ participation through apps in their smartphones
The IES CITIES project promotes user-centric mobile micro-services that exploit open data and generate user-supplied data
Hypothesis: Users may help on improving, extending and enriching the open data in which micro-services are based
Its platform aims to:
Facilitate the generation of citizen-centric apps that exploit urban data in different domains
Enable user supplied data to complement, enrich and enhance existing datasets about a city
FIWARE Wednesday Webinars - NGSI-LD and Smart Data Models: Standard Access to...FIWARE
NGSI-LD and Smart Data Models: Standard Access to Digital Twin Data - 15 July 2020
Corresponding webinar recording: https://youtu.be/MBx23ypORLk
Understanding the basis of context information management, NGSI-LD and smart Data Models
Chapter: Core
Difficulty: 2
Audience: Any Technical
Speaker: Juanjo Hierro (CTO, FIWARE Foundation), Alberto Abella (Data Modeling Expert and Technical Evangelist, FIWARE Foundation)
Maximizing Your Data’s Potential: DOTs & DPWs EditionSafe Software
Whether you’re dealing with roads, highways, runways, waters, or all of the above – valuable data that drives decision-making is at the centre of it all. But data management across all elements of the transportation industry can feel like an ongoing challenge. You may have software and systems in place, but getting them to accomplish all your data integration needs seamlessly may leave you feeling much to be desired.
That's where FME will save your day (quite literally). During this Department of Transportation (DOT) and Department of Public Works (DPW) edition webinar, we’ll walk you through:
- Managing transportation data
- Connecting to different data silos or web systems
- Processing real-time data streams
Let us help you put data integration & automation at the forefront of your data analytics strategy using the power of FME: the data integration with the best support for spatial data. Register to learn how other DOTs and transportation companies have made this possible, and how you can too.
Building a Self-service Data Download Application For AZGeoSafe Software
The presenters will share there experience in designing, developing and deploying the self-service web-application built on FME server, Openlayers and Angular JS. Future enhancements to the application will be discussed as well.
Experimental transformation of ABS data into Data Cube Vocabulary (DCV) form...Alistair Hamilton
Presentation by Al Hamilton and Cody Johnson to Canberra Semantic Web Meetup Group on why producers of official statistics are interested in semantic web community (including Linked Open Data) and outlining experimental work by Cody Johnson on transforming selected Population Census data released by the ABS in SDMX-ML to RDF Data Cube Vocabulary format.
Towards Digital Twin standards following an open source approachFIWARE
Digital Twins are gaining momentum when designing smart solutions in different application domains. However, there is a lack of open standards that warrant interoperability and portability of solutions, avoiding vendor lock-in.
During the presentation, we will review major developments in this area, focused on the adoption of a standard API for accessing Digital Twin Data and Smart Data Models. We will review how a Digital Twin approach enables data integration at different levels: architecting vertical smart solutions, within smart organizations and across organizations. At all levels interfacing with IoT, BigData, AI/ML, Blockchain, or Robotics technologies.
Tim Davies, SMART Infrastructure Facility E-Research Coordinator, presented the SMART Data Management Systement as part of the SMART Seminar Series on Thursday, 5th March 2015.
SmartCities increase citizens’ quality of life and improve the efficiency and quality of the services provided by governing entities and business
“The city must become like the Internet, i.e. enabling creative development and easy deployment of applications which aim to empower the citizen” - THE APPS FOR SMART CITIES MANIFESTO
This view can be achieved by leveraging:
Available infrastructure such as Open Government Data and deployed sensor networks in cities
Citizens’ participation through apps in their smartphones
The IES CITIES project promotes user-centric mobile micro-services that exploit open data and generate user-supplied data
Hypothesis: Users may help on improving, extending and enriching the open data in which micro-services are based
Its platform aims to:
Facilitate the generation of citizen-centric apps that exploit urban data in different domains
Enable user supplied data to complement, enrich and enhance existing datasets about a city
FIWARE Wednesday Webinars - NGSI-LD and Smart Data Models: Standard Access to...FIWARE
NGSI-LD and Smart Data Models: Standard Access to Digital Twin Data - 15 July 2020
Corresponding webinar recording: https://youtu.be/MBx23ypORLk
Understanding the basis of context information management, NGSI-LD and smart Data Models
Chapter: Core
Difficulty: 2
Audience: Any Technical
Speaker: Juanjo Hierro (CTO, FIWARE Foundation), Alberto Abella (Data Modeling Expert and Technical Evangelist, FIWARE Foundation)
Maximizing Your Data’s Potential: DOTs & DPWs EditionSafe Software
Whether you’re dealing with roads, highways, runways, waters, or all of the above – valuable data that drives decision-making is at the centre of it all. But data management across all elements of the transportation industry can feel like an ongoing challenge. You may have software and systems in place, but getting them to accomplish all your data integration needs seamlessly may leave you feeling much to be desired.
That's where FME will save your day (quite literally). During this Department of Transportation (DOT) and Department of Public Works (DPW) edition webinar, we’ll walk you through:
- Managing transportation data
- Connecting to different data silos or web systems
- Processing real-time data streams
Let us help you put data integration & automation at the forefront of your data analytics strategy using the power of FME: the data integration with the best support for spatial data. Register to learn how other DOTs and transportation companies have made this possible, and how you can too.
A critical discussion on the statement with a BI framework - “Enterprises today have access to large amounts of information from internal as well as external sources. The information
typically comes in either structured or less structured forms. However, enterprises generally do not make the best use of the information they have access to, tending instead to focus on just internal structured data generated by core transactional
systems.”
Bridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptxBrad Spiegel Macon GA
Brad Spiegel Macon GA’s journey exemplifies the profound impact that one individual can have on their community. Through his unwavering dedication to digital inclusion, he’s not only bridging the gap in Macon but also setting an example for others to follow.
This 7-second Brain Wave Ritual Attracts Money To You.!nirahealhty
Discover the power of a simple 7-second brain wave ritual that can attract wealth and abundance into your life. By tapping into specific brain frequencies, this technique helps you manifest financial success effortlessly. Ready to transform your financial future? Try this powerful ritual and start attracting money today!
# Internet Security: Safeguarding Your Digital World
In the contemporary digital age, the internet is a cornerstone of our daily lives. It connects us to vast amounts of information, provides platforms for communication, enables commerce, and offers endless entertainment. However, with these conveniences come significant security challenges. Internet security is essential to protect our digital identities, sensitive data, and overall online experience. This comprehensive guide explores the multifaceted world of internet security, providing insights into its importance, common threats, and effective strategies to safeguard your digital world.
## Understanding Internet Security
Internet security encompasses the measures and protocols used to protect information, devices, and networks from unauthorized access, attacks, and damage. It involves a wide range of practices designed to safeguard data confidentiality, integrity, and availability. Effective internet security is crucial for individuals, businesses, and governments alike, as cyber threats continue to evolve in complexity and scale.
### Key Components of Internet Security
1. **Confidentiality**: Ensuring that information is accessible only to those authorized to access it.
2. **Integrity**: Protecting information from being altered or tampered with by unauthorized parties.
3. **Availability**: Ensuring that authorized users have reliable access to information and resources when needed.
## Common Internet Security Threats
Cyber threats are numerous and constantly evolving. Understanding these threats is the first step in protecting against them. Some of the most common internet security threats include:
### Malware
Malware, or malicious software, is designed to harm, exploit, or otherwise compromise a device, network, or service. Common types of malware include:
- **Viruses**: Programs that attach themselves to legitimate software and replicate, spreading to other programs and files.
- **Worms**: Standalone malware that replicates itself to spread to other computers.
- **Trojan Horses**: Malicious software disguised as legitimate software.
- **Ransomware**: Malware that encrypts a user's files and demands a ransom for the decryption key.
- **Spyware**: Software that secretly monitors and collects user information.
### Phishing
Phishing is a social engineering attack that aims to steal sensitive information such as usernames, passwords, and credit card details. Attackers often masquerade as trusted entities in email or other communication channels, tricking victims into providing their information.
### Man-in-the-Middle (MitM) Attacks
MitM attacks occur when an attacker intercepts and potentially alters communication between two parties without their knowledge. This can lead to the unauthorized acquisition of sensitive information.
### Denial-of-Service (DoS) and Distributed Denial-of-Service (DDoS) Attacks
Multi-cluster Kubernetes Networking- Patterns, Projects and GuidelinesSanjeev Rampal
Talk presented at Kubernetes Community Day, New York, May 2024.
Technical summary of Multi-Cluster Kubernetes Networking architectures with focus on 4 key topics.
1) Key patterns for Multi-cluster architectures
2) Architectural comparison of several OSS/ CNCF projects to address these patterns
3) Evolution trends for the APIs of these projects
4) Some design recommendations & guidelines for adopting/ deploying these solutions.
1.Wireless Communication System_Wireless communication is a broad term that i...JeyaPerumal1
Wireless communication involves the transmission of information over a distance without the help of wires, cables or any other forms of electrical conductors.
Wireless communication is a broad term that incorporates all procedures and forms of connecting and communicating between two or more devices using a wireless signal through wireless communication technologies and devices.
Features of Wireless Communication
The evolution of wireless technology has brought many advancements with its effective features.
The transmitted distance can be anywhere between a few meters (for example, a television's remote control) and thousands of kilometers (for example, radio communication).
Wireless communication can be used for cellular telephony, wireless access to the internet, wireless home networking, and so on.
test test test test testtest test testtest test testtest test testtest test ...
Semantic web design for www.data.gov.sg - Presentation
1.
2. • Basics of Linked Data
• Purpose of this project
• Migrational Framework
• Eight Steps
• Conclusion
3. What is Linked Data?
• Linked Data is an alternative data representation
format.
• Actually, its just a repackaging of Semantic Web
elements
• It is different from relational database concepts
such as tables, rows, columns…
4. RDF
Subject-Predicate -Object
Jurong belongs to the West Zone
Linked Data Representation Format
http://data.gov.sg/resource/area/Jurong_West
http://data.gov.sg/ontology/property/has_zone
http://data.gov.sg/resource/zone/West
Subject
Predicate
Object
http://w3.org/2003/01/geo/wgs84_pos#/lat http://w3.org/2003/01/geo/wgs84_pos#/long
1°20'040.2"N
103°42'24.54"E
Traditional representation - Tables
5. Linked Data Components
• Data talks about itself. Humans and Machines
both understand data - How?
• URIs - lots of them
(http://data.gov.sg/PlanningArea/Kallang)
• RDF - Data model (Jurong Point is a location)
• Ontologies - Enforces a structure to data (Land
Hierarchy) – represented as RDFs
• SPARQL - Does the same job as SQL and a bit
more...
7. Linked Data Cloud (Web of Data)
Linked Data becomes Linked Open Data(LOD) by
publishing it with “appropriate” license
Provides opportunity to link with other useful data
sets
Provides variety of information about the same
resource
8. Linked Data and Government Data - a
natural compatibility
• Why?
• Govt data is used by all
• Govt data needs to be transparent and easily
understandable
• Govt data is mainly factual – a direct fit!
• Standardized representation of Govt data across
the globe can facilitate comparison without
hassles.
• Best way to propagate a useful agenda to the
private arena...
9. Who have implemented Linked Data?
• UK, US, Brazil Governments
• Private Corporations? Yes
– BBC
– Nature
– World Bank
– New York Times
– FAO
– CIA Factbook
?Provide Links?
11. ABC Water Proj (R)
Agency Websites
Singstat
publicationsMINISTRIES
XLS
HTML
PDF
Accountant-General's Department
Accounting and Corporate Regulatory Authority
Agency For Science, Technology & Research
Attorney-General’s Chambers
Building & Construction Authority
Central Narcotics Bureau
Central Provident Fund Board
Civil Aviation Authority of Singapore
Department of Statistics
Economic Development Board
Energy Market Authority
Health Sciences Authority
Housing & Development Board
Immigration & Checkpoints Authority
Infocomm Development Authority of Singapore
Inland Revenue Authority of Singapore
Institute of Technical Education
Intellectual Property Office of Singapore
JTC Corporation
Judiciary, Subordinate Courts
Judiciary, Supreme Court
Land Transport Authority
Majlis Ugama Islam Singapura
Maritime & Port Authority of Singapore
Monetary Authority of Singapore
Nanyang Polytechnic
National Environment Agency
National Heritage Board
National Library Board
National Parks Board
Ngee Ann Polytechnic
People's Association
Public Service Division
Public Transport Council
Public Utilities Board
Republic Polytechnic
Sentosa Development Corporation
Singapore Civil Defence Force
Singapore Customs
Singapore Land Authority
Singapore Police Force
Singapore Polytechnic
Singapore Sports Council
Singapore Workforce Development Agency
Spring Singapore
Temasek Polytechnic
Urban Redevelopment Authority
Ministry of Community Development, Youth & Sports
Ministry of Education
Ministry of Foreign Affairs
Ministry of Health
Ministry of Law –Community Mediation Unit
Ministry of Manpower
Ministry of Transport
Media Development Authority
BFABuildings(C)
GreenBuilding(E)
C- Community
Cul - Culture
E- Environment
Emp- Employment
Edu - Education
H- Health
F- Family
R- Recreation
S- Sports
Breast Screen (H)
Cervical Screen (H)
Healthier Dining (H)
Quit Centers (H)
Infocomm Access (C)
Silver infocomm (C)
Wireless Hotspots (R)
Child care (F)
Disability (F)
Elder care (F)
Family (F)
Family Friendly Estab (F)
Student Care (F)
Comm Mediation Center (C)
After Death Facilities (E)
Funeral Palours (E)
Dengue Cluster (H)
Hawker Center (E)
NEA Offices (E)
Recycling Bins (E)
Waste Disposal Site (E)
Waste Treatment (E)
Heritage sites(Cul)
Monuments(Cul)
Museums(Cul)
Libraries (Cul)
Streets and Places(Cul)
CD Councils (C)
Community Clubs (C)
Constituency offices (C)
Other facilities (C)
Other Pan networks (C)
PA head quarters (C)
Residents Committee(C)
Water Venture (C)
National Parks (R)
Skyrise greenery (E)
Sports clubs (S)
CET Centers(Emp)
WDA Service points(Emp)
Kindergartens (Edu)
Get TokenAddress
SearchAgency Data
SearchStatic Map
Get Layer InfoMashup
Get Related Data
Get Directions
Public Transportation
Reverse Geocode
Map-related APIs from various agencies
Traffic-related APIs from Land Transport Authority
Tourism-related APIs from the Singapore Tourism Board
Environment-related APIs from the National Environment Agency
Library-related data feeds & web services from National Library Board
DGS Eco System
SG DATA
TEXTUAL
SPATIAL
API
THEMES OPERATIONSCATEGORIES
UNSTRUCTURED DATA
STRUCTURED DATA
STRUCTURED DATA
STATUTORY
BOARDS
SG Government Data Eco System
12. Different levels of
granularity
Multiple End points
Meta data only at
data set levels
Data already
cooked !!
Hierarchies not
captured
Vocabulary Conflict
in spatial and
textual data
Few design issues spotted through the Linked Data lens
13. Benefits of using Linked Data for iDA
Singapore
• An opportunity to standardize common terms
across agencies
• Re-use of resources (through URIs) ex:
http://data.gov.sg/zone/central
• Centralized control?
• Single endpoint for all govt data - Linked Data
API
• Very convenient for developers to join data
from different agencies. eg: combining data
from SLA and URA
14. URA Sites for Sales dataset(Urban Planning)
DOS Population and Household Characteristics dataset (Population Demographics)
Age Pyramid of Resident Population
Old Age Support Ratio
Datasets Used for Framework Evaluation
15. Framework Formulation Process
• Work was split into three phases – Analysis, Design
and Evaluation
• Based on study of Linked Data Migration Research
Papers and cookbooks published by the World Wide
Web Consortium(W3C)
• Analysis of Linked Data implementations in UK ,US
and Brazil
• Evaluation of Linked Data tools with Singapore data
sets for recommendation in each step of the
framework
• Contemplating on probable issues that could be
faced during implementation
16. Proposed Linked Data Migrational
Framework for DGS
Specification Identfication Analysis
Object Modeling
Ontology Modeling
URI Naming
RDF Creation
External Linking
Datasets Publication
Discovery & Exploitation
Re-use Create
S2R D2R A2R
Govt Agencies and IDA
Govt Agencies Domain
Matter Experts
Ontology Modelers
IDA and Web Architects
Developers
Developers and Domain
Experts
Developers
Web Architects
Objectives
Specifications
Project Duration
Dataset Prioritization
Dataset License Setting
Impln Mode Selection
Roadmap
Architecture
Overview
Relational Model
Dataset Overview
Drawing Objects in
Whiteboard
Conceptual View
Conceptual View
Public Vocabularies
Re-use of Existing
Vocabularies
Creation of New
Vocabularies
OWL, RDFS, RDF
Vocabulary files
Resources
Class and Properties
Visualization of URI
mining process
URI Administration
URI Lifecycle
ER Model
Spreadsheets,
DBMS, API
Conversion to RDF triples
using Mapping files
RDF Triples
Government and
external data sets
Linking based on
Similarity Algorithms
Outbound Links
RDF Triples
Ontologies
SPARQL, API
Data Insertion
VOID Modeling
Data Retrieval
API to SPARQL conversion
VOID Triples
JSON data
Actual Data
Existing Apps
Gamification
Crowdsourcing
Catalog Registration
External Reference
New Apps
INPUT
PROCESS
OUTPUT
INPUT
PROCESS
OUTPUT
INPUT
PROCESS
OUTPUT
INPUT
PROCESS
OUTPUT
INPUT
PROCESS
OUTPUT
INPUT
PROCESS
OUTPUT
INPUT
PROCESS
OUTPUT
INPUT
PROCESS
OUTPUT
Resource
Allocation
10
Resource
Allocation
15
Resource
Allocation
15
Resource
Allocation
5
Resource
Allocation
20
Resource
Allocation
5
Resource
Allocation
15
Resource
Allocation
15
1
2
3
4
5
6
7
8
17. Step 1 Step 2 Step 3 Step 4 Step 5 Step 6 Step 7 Step 8
Specification Home
1) Design the High Level Architecture
2) Set the “Migration Potential” for data sets
3) Decide the “Perspective” – Vertical vs Horizontal -> Agency vs Application (We recommend
Agency perspective)
Data set
Data set
URL Data Type Agency
Utility
Level
Interlinking
Possibility
Potential
Level
Annual Vehicle Population by Type of
Fuel Use URL
Textual
(PDF) LTA H L M
Administrative Data - Employment
Statistic URL
Textual
(HTML) MOM H M H
18. Step 1 Step 2 Step 3 Step 4 Step 5 Step 6 Step 7 Step 8
Specification Home
4) Setting up of License for data sets
5) Implementation Method – “Linked Data + RDF”
Other options - 1) Just URIs 2) URI for data sets only
Analysis of Data sets
Study of System specifications, design & integration documents (including database) of
the selected data sets
• Understand Metadata, Schema design and Entity Relationship (ER) models
Data Set
Data Set
URL Data Type Agency License
Access
Rights Data Access Modes
Annual Vehicle Population by Type of Fuel
Use URL Textual (PDF) LTA PDDL R
API, SPARQL, RDF
Dump
Administrative Data - Employment
Statistic URL
Textual
(HTML) MOM PDDL R
API, SPARQL, RDF
Dump
19. Step 1 Step 2 Step 3 Step 4 Step 5 Step 6 Step 7 Step 8
Object Modeling
This is modeling without usage context.
*Requires normalization of database model in 3NF form
Issues
Possibility of applying high abstraction and
high granularity to objects
Key Learning
Ease in identifying the use of common
objects across data sets
Facilitates brainstorming of relationships
between objects
Home
20. Step 1 Step 2 Step 3 Step 4 Step 5 Step 6 Step 7 Step 8
Ontology Modeling
Takes the conceptual diagram from Object Modeling as input.
Design Ontologies
1. Identify classes and subclasses
2. Identify hierarchy structure
3. Connect classes through relationship
4. Create rules for inference (optional)
5. Output OWL vocabulary files
Ontology modelling is carried out in two ways:- 1) Using and extending public
ontologies 2) Designing a local ontology from scratch
Home
21. Step 1 Step 2 Step 3 Step 4 Step 5 Step 6 Step 7 Step 8
Ontology Modeling
Date fields, location fields and fields related to
measurements in DGS have scope for
vocabulary re-use
Vocabulary for the identified data sets
(developed using Protege) with screenshots
List of vocabularies required for LOGD
implementation
List of tools used for ontology modeling
OUTPUT?
ALLOCATION PERCENTAGE?
PERSONNEL INVOLVED
Home
22. Step 1 Step 2 Step 3 Step 4 Step 5 Step 6 Step 7 Step 8
URI Naming
ABOX TBOX
http://data.gov.sg/ontology/Ministry/ http://data.gov.sg/ministry/MOH
http://data.gov.sg/ontology/Agency/ http://data.gov.sg/agency/SLA
http://data.gov.sg/ontology/SiteLocation http://data.gov.sg/location/pioneer_road_north
http://data.gov.sg/ontology/Race http://data.gov.sg/race/chinese
Dataset ID URAstaticfile001
Dataset http://data.gov.sg/dataset/ URAstaticfile001/
Class http://data.gov.sg/terms/class/URAstaticfile001/sitesforsale
Property http://data.gov.sg/terms/property/URAstaticfile001/time
Row 1 http://data.gov.sg/dataset/URAstaticfile001/1
Row 1 - A generic column http://data.gov.sg/dataset/URAstaticfile001/1/columnName
Dataset URIs
Home
1) “URI Administration” Mode
Maintained centrally in the DGS platform (resultant URIs will start with http://data.gov.sg/)
-> RECOMMENDED
vs
Maintained by individual agencies (resultant URIs will start with http://ura.gov.sg or
http://sla.gov.sg).
vs
Maintained externally by third party platforms such as Kasabi (resultant URIs will start with
http://data.kasabi.org) – No longer valid as Kasabi service has been shut down
2) Setup of URI Taxonomy
23. Step 1 Step 2 Step 3 Step 4 Step 5 Step 6 Step 7 Step 8
RDF Creation Home
RDF triples are generated by converting data from source format with the necessary transformation
Type Nature Example of Singapore data sets Source format
S2R (Static Files) Static
URA Site for Sales, Singstat’s Population
Household Characteristics XLS, CSV, TXT files and other static files
D2R (RDBMS) Dynamic DGS tables RDBMS
A2R (APIs/Web
Services) Dynamc
OneMap API, myTransport API, NLB web
services
Application Programming Interface (API) and Web
Services(SOAP, REST)
24. Step 1 Step 2 Step 3 Step 4 Step 5 Step 6 Step 7 Step 8
RDF Creation Home
Evaluated 3 tools for each mode of conversion
Google Refine - S2R
RDF Views - D2R
RDF Sponger - A2R
Google Refine Demo for S2R!
ER models from RDBMS are to be converted into corresponding
vocabularies/Ontologies for D2R process using STDTrip methodology
For A2R, External Cartridges (mapping files) are to be created for mapping API
parameters to vocabularies. This can be done in RDF Sponger
“We feel that Linked Data is best suited for data from Static files and not for data that is
real-time and dynamic in nature unless conformity to structure can be trusted”
25. Step 1 Step 2 Step 3 Step 4 Step 5 Step 6 Step 7 Step 8
External Linking
External Linking is connecting with other data sets in the web of data
Data.gov.sg
WorldBank
CIA World
Factbook
DBpedia FAO Geonames
Supreme
Court
Flickr
<http://data.gov.sg/location/bugis> <owl:sameAs> <http://www.dbpedia.org/resource/Bugis>
<http://data.gov.sg/race/malay> <owl:sameAs> <http://www.dbpedia.org/resource/Malay_race>
Issues
•The outbound links made to data sets outside of IDA’s purview can be risky
•Dead links are a vivid possibility during the change of resource URIs or system
downtime
Home
26. Step 1 Step 2 Step 3 Step 4 Step 5 Step 6 Step 7 Step 8
Datasets Publication Home
Triple Store or RDF Store is the data structure used to store Linked Data.
• We used Virtuoso Universal Server’s built-in triple store for evaluation
• It is visualized that the triple store will be centralized at iDA
SPARQL (pronounced as SPARKLE) will be the main output terminal for Linked Data
• SPARQL can be used to SELECT, INSERT , DELETE, UPDATE data
• SPARQL is gateway to any operation on Linked Data. APIs and Applications are
built on top of it
Triple Store and SPARQL Demo!
We had some information about External Linked Data Hosting but we had to remove it
as the major provider Talis has closed its own hosting service Kasabi!
27. Step 1 Step 2 Step 3 Step 4 Step 5 Step 6 Step 7 Step 8
Datasets Publication Home
Linked Data API is the common API endpoint that will be used by developers and public
users to access government data.
- This solves the problem of maintaining multiple end points!
ex: http://gov.tso.co.uk/transport/api/transport/doc/bus-stop-point.xml
28. Step 1 Step 2 Step 3 Step 4 Step 5 Step 6 Step 7 Step 8
Discovery & Exploitation
Key Theme
1) Internal discovery within Singapore for local citizens – idea4apps (link)
2) External discovery for attracting usage of Singapore government data in
international economic & political research and global issues(water scarcity, Carbon
Footprint etc.)
• Entry in CKAN registry ->http://thedatahub.org/tag/registry
Home
Gamification? Promoted by LinkedGov.org
29. Original
data
provided
by URA
Possible because of
the re-use of the
common resource
URI Pasir Ris across
data sets
Similarly, location based
data from OneMap API is
retrieved for Pasir Ris
Interlinked Datasets Post-Migration
30. Other Interesting Use Cases
Definitely not Science Fiction!
Q & A Engine that works on top of government linked data. Inspired by www.trueknowledge.com
31. Sense-Making
Question: Which recent year had a growth rate close to 50% for majority of Singapore
based SME?
Step1: Spot the resources in this query
Dbpedia Spotlight does just that! – Semantic Information Extraction
Which recent year had a growth rate close to 50% for majority of Singapore based SME
Step2: Identify the relationship between the resources
SME is instance of the Organization class Organization class comes under Singapore country
Growth rate is a property of Sales class Year is a class by itself
Majority is subset of Group class
Step3: Use NLP technique – Syntactic Analysis (Stanford Parser) followed by Focus
Extraction for understanding the question
2010 is retuned as the result!
Step 4: Look for RDF triples that meet the criteria
Syntactic Parse tree is generated followed by Access Pattern
32. Key Challenges
• Dense data - lot of additional RDF triples will get created along with the
required RDF triples as a resource belongs to multiple ontologies
Demographics dataset stats:-
Rows:~300 Columns:16 in excel file
Resultant triples count in RDF/Triple store:13711
Reason: Majority of the generated triples are for machine understanding.
• URI administration could be an intense activity as dead URIs can cause
damage to applications eg: what will happen if
http://data.gov.sg/area/jurong doesn't work?
• Changes to structure of static files and RDMBS tables require changes in
RDF mapping files - might be a long process if not properly regulated
• Not readily suitable for real-time data
33. Summary
Four in-person
discussion sessions
with IDA, NIIT and SLA
Analysis of Five
data.gov.sg system
specifications
Evaluation of Four
existing Migration
Frameworks
Prototyping with Six
core Linked Data Tools
Dataset Publication
Virtuoso Universal Server Linked Data API
External Linking
SILK LIMES
RDF Creation
Google Refine RDF Views RDF Sponger
URI Naming
Pubby
Ontology Modeling
Protégé
Object Modeling
Concept Map
34. Summary
• Applicability of the framework to Singapore
Government Data
• Issues identified in existing Data Eco System
• Recommended tools and best practices for each step
• Launchpad for SG Linked Data implementation
Final Thoughts…
• ROI is not a key metric for Linked Data implementation
• Benefits of moving to Linked Data is intangible and may
not be immediately realizable
• Volume of work is huge compared to traditional
systems
35. We are thankful to Prof Chris Khoo
for his supervision and iDA staff Soy Boon Lim
for providing overview of data.gov.sg and also
for furnishing DGS design documents...
36. Why are we doing this project?
To prescribe a Linked Data migrational framework for
data.gov.sg (DGS) data sets
First hand view of the required migration activities
Issues anticipated at each step
Evaluation & Recommendation on Linked Data tools
To help IDA in realizing - What more can be done with existing
data ? A closer look at Government counterparts – UK and US !
In totality, iDA can use this report as a guide for the various
aspects related to Linked Data implementation
37. Basic Thought Process of Linked Data
Publishing
• Select data sets that appear apt for Linked
Data
• Identify the data sources for the data sets
• Find out what type of transformations are
needed
• Publish it!
38. iDA Singapore launched Data.gov.sg portal and mGov@SG public services during June 2011
Data.gov.sg provides 5000+ public data sets from 50 government agencies
Purpose: Building applications, research and for creating applications using the data
Data.Gov.Sg
Editor's Notes
Dbpedia – Places and Events
CIA and World bank- Economic Analysis
Flickr – places
FAO – export and import commodities
Supreme Court – Facts