The document discusses open data and linked data. It begins by distinguishing between open data, which makes government data available in any format, and linked data, which structures data using URIs, HTTP, RDF vocabularies and standards. It then provides examples of how open data and linked data have been used by initiatives like Data.gov and Data.gov.uk to increase government transparency and enable public participation.
A complete introduction to open data in the context of local transportation, including definitions, examples, rationales, implementation challenges and guidelines.
This document discusses establishing an Open Knowledge Foundation (OKF) chapter in Korea. It provides background on OKF and its goals of promoting open data and knowledge. It outlines reasons for starting an OKF Korea, including learning best practices from other countries, making Korean open data more accessible worldwide, and building better communities around open data. Plans are described to collaborate with existing groups doing related work and to launch open data projects and events. The vision is for OKF Korea to help advance the quality, accessibility and use of open data in Korea.
The amount of data in our world today is substantially outsized. Many of the personal and non-personal aspects of our day-to-day activities are aggregated and stored as data by both businesses and governments. The increasing data captured through multimedia, social media, and the Internet are a phenomenon that needs to be properly examined. In this article, we explore this topic and analyse the term data ownership. We aim to raise awareness and trigger a debate for policy makers with regard to data ownership and the need to improve existing data protection, privacy laws, and legislation at both national and international levels.
Haiku Deck allows users to create presentations by providing templates and designs for Haiku, a traditional Japanese poetic form. Users can get started making their own Haiku Deck presentation by accessing templates on SlideShare. The document encourages the reader to create their own Haiku Deck presentation.
A complete introduction to open data in the context of local transportation, including definitions, examples, rationales, implementation challenges and guidelines.
This document discusses establishing an Open Knowledge Foundation (OKF) chapter in Korea. It provides background on OKF and its goals of promoting open data and knowledge. It outlines reasons for starting an OKF Korea, including learning best practices from other countries, making Korean open data more accessible worldwide, and building better communities around open data. Plans are described to collaborate with existing groups doing related work and to launch open data projects and events. The vision is for OKF Korea to help advance the quality, accessibility and use of open data in Korea.
The amount of data in our world today is substantially outsized. Many of the personal and non-personal aspects of our day-to-day activities are aggregated and stored as data by both businesses and governments. The increasing data captured through multimedia, social media, and the Internet are a phenomenon that needs to be properly examined. In this article, we explore this topic and analyse the term data ownership. We aim to raise awareness and trigger a debate for policy makers with regard to data ownership and the need to improve existing data protection, privacy laws, and legislation at both national and international levels.
Haiku Deck allows users to create presentations by providing templates and designs for Haiku, a traditional Japanese poetic form. Users can get started making their own Haiku Deck presentation by accessing templates on SlideShare. The document encourages the reader to create their own Haiku Deck presentation.
This document discusses business process management (BPM) and how it relates to service-oriented architecture (SOA). It provides an overview of BPM, including a comparison to business process management systems (BPMS). When choosing a BPMS, the complexity and cost of BPM should be considered. The document also outlines the vision of BPM in enabling flexible modification of process definitions through graphical tools. It notes that while BPM improves agility within a business unit, cross-organizational processes require distributed standards.
This document outlines John Theodore Goetz's PhD defense presentation on hyperon photoproduction from threshold to 5.4 GeV using the CEBAF Large Acceptance Spectrometer. The presentation has two parts, with part I discussing cascade hyperons, the g12 experiment that collected data on hyperon photoproduction, and g12 kaon data analysis. Part II will present results from g12, including excitation functions of the Ξ hyperon, searches for higher mass Ξ* states and iso-exotic particles, and conclusions.
The document discusses using open data and linked data on the web. It begins by defining open government data and its benefits like transparency and participation. It then explains how the semantic web uses linked data to connect related data across the web. Examples are given of government and other datasets that are available as linked open data. The presentation concludes by proposing future interdisciplinary collaboration to further develop applications using open and linked data.
20+ years experience spread across different Industry domain like BPO, Automobile, Consumer durable & Computer peripherals. Experienced in driving & managing different roles across the domains in Sales, Marketing, Customer Service, Recruitment and Global operations.
Since 2006, the focus has been in ITeS industry with expertise in Talent Transformation & Recruitment, managing & driving global delivery models spread across Asia, EMEA & US, delivery expertise across CRM, Health Care, Banking, Supply chain, Procurement. Managing clients, P&L responsibility, growth of accounts, contract compliance, Site operations, HR bent to understand people & behavior as key skills.
As Recruitment Leader, the key is to manage numbers, build relationship with stake holders, build capacity & capability at the location which can help the organization benefits. Key skills are driving business. Business centric, People oriented and keep the drive on organisation benefits. Someone who clearly understands the ground reality and manage the organization requirements ( 0 to 30000 ft)
Passion - Recruitment & Delivery Operations management, Site operations
This document provides an overview of linked data and semantic web technology. It discusses key concepts like URIs, RDF, SPARQL, and OWL. URIs are used to identify things on the web so they can be referred to and looked up. RDF is a general method for conceptual description or modeling of information using subject-predicate-object triples. SPARQL is a query language for retrieving and manipulating semantic data in RDF format. OWL builds on RDF and RDF Schema to provide additional vocabulary for describing properties and classes of semantic web resources.
Overview of Open Data, Linked Data and Web ScienceHaklae Kim
This document provides an overview of open data, linked data, and web science through conceptual discussions, case studies, and proposed next steps. It begins with definitions of key concepts like open data and the semantic web. Case studies demonstrate current applications of open data through government initiatives and technologies like Google's Knowledge Graph and Apple's Siri. The document concludes by acknowledging challenges with open data strategies and advocating for interdisciplinary collaboration to realize the potential of linked open government data.
The open semantic enterprise enterprise data meets web dataGeorg Guentner
Presentation in workshop at the 2nd B2B Software Days (11.04.2013, Vienna), together with Herbert Beilschmidt (Oracle Austria):
The Open Semantic Enterprise.Enterprise Data meets Web Data.
The technologies of the “Web od Data” have reached a degree of maturity and acceptance allowing the productive use in enterprises for the support of their business processes. Though the focus is currently on the adoption and use of Open (Linked) Data, the underlying principles can also be applied to the closed data sources and proprietary data structures usually available in enterprises.
The workshop outlines the conceptual and architectural approaches to open enterprise data sources and interweave them with the Web of Data. It shows concrete application scenarios of an open source “semantic toolset” that can be integrated with enterprise information and content management systems to open data silos, establish a layer of adaptive integrated views of the enterprise information and support decision processes thus paving the way to an “open semantic enterprise”.
The topical semantic toolset for enterprise content integration includes Apache Stanbol (knowledge extraction), Apache Marmotta (Linked Data Platform), the Linked Media Framework (networked knowledge) und VIE (interactive knowledge).
State-of-the-art big data platforms need to process massive quantities of data in batch and in parallel - filtering, transforming and sorting it before loading it into an enterprise data warehouse. In order to realize an Open Semantic Enterprise, a big data platform has to be optimized for acquiring, organizing, and loading unstructured data. Technological approaches such as NoSQL databases and connectors for Apache Hadoop complement big data solutions for the open world of a semantic enterprise.
This document discusses business process management (BPM) and how it relates to service-oriented architecture (SOA). It provides an overview of BPM, including a comparison to business process management systems (BPMS). When choosing a BPMS, the complexity and cost of BPM should be considered. The document also outlines the vision of BPM in enabling flexible modification of process definitions through graphical tools. It notes that while BPM improves agility within a business unit, cross-organizational processes require distributed standards.
This document outlines John Theodore Goetz's PhD defense presentation on hyperon photoproduction from threshold to 5.4 GeV using the CEBAF Large Acceptance Spectrometer. The presentation has two parts, with part I discussing cascade hyperons, the g12 experiment that collected data on hyperon photoproduction, and g12 kaon data analysis. Part II will present results from g12, including excitation functions of the Ξ hyperon, searches for higher mass Ξ* states and iso-exotic particles, and conclusions.
The document discusses using open data and linked data on the web. It begins by defining open government data and its benefits like transparency and participation. It then explains how the semantic web uses linked data to connect related data across the web. Examples are given of government and other datasets that are available as linked open data. The presentation concludes by proposing future interdisciplinary collaboration to further develop applications using open and linked data.
20+ years experience spread across different Industry domain like BPO, Automobile, Consumer durable & Computer peripherals. Experienced in driving & managing different roles across the domains in Sales, Marketing, Customer Service, Recruitment and Global operations.
Since 2006, the focus has been in ITeS industry with expertise in Talent Transformation & Recruitment, managing & driving global delivery models spread across Asia, EMEA & US, delivery expertise across CRM, Health Care, Banking, Supply chain, Procurement. Managing clients, P&L responsibility, growth of accounts, contract compliance, Site operations, HR bent to understand people & behavior as key skills.
As Recruitment Leader, the key is to manage numbers, build relationship with stake holders, build capacity & capability at the location which can help the organization benefits. Key skills are driving business. Business centric, People oriented and keep the drive on organisation benefits. Someone who clearly understands the ground reality and manage the organization requirements ( 0 to 30000 ft)
Passion - Recruitment & Delivery Operations management, Site operations
This document provides an overview of linked data and semantic web technology. It discusses key concepts like URIs, RDF, SPARQL, and OWL. URIs are used to identify things on the web so they can be referred to and looked up. RDF is a general method for conceptual description or modeling of information using subject-predicate-object triples. SPARQL is a query language for retrieving and manipulating semantic data in RDF format. OWL builds on RDF and RDF Schema to provide additional vocabulary for describing properties and classes of semantic web resources.
Overview of Open Data, Linked Data and Web ScienceHaklae Kim
This document provides an overview of open data, linked data, and web science through conceptual discussions, case studies, and proposed next steps. It begins with definitions of key concepts like open data and the semantic web. Case studies demonstrate current applications of open data through government initiatives and technologies like Google's Knowledge Graph and Apple's Siri. The document concludes by acknowledging challenges with open data strategies and advocating for interdisciplinary collaboration to realize the potential of linked open government data.
The open semantic enterprise enterprise data meets web dataGeorg Guentner
Presentation in workshop at the 2nd B2B Software Days (11.04.2013, Vienna), together with Herbert Beilschmidt (Oracle Austria):
The Open Semantic Enterprise.Enterprise Data meets Web Data.
The technologies of the “Web od Data” have reached a degree of maturity and acceptance allowing the productive use in enterprises for the support of their business processes. Though the focus is currently on the adoption and use of Open (Linked) Data, the underlying principles can also be applied to the closed data sources and proprietary data structures usually available in enterprises.
The workshop outlines the conceptual and architectural approaches to open enterprise data sources and interweave them with the Web of Data. It shows concrete application scenarios of an open source “semantic toolset” that can be integrated with enterprise information and content management systems to open data silos, establish a layer of adaptive integrated views of the enterprise information and support decision processes thus paving the way to an “open semantic enterprise”.
The topical semantic toolset for enterprise content integration includes Apache Stanbol (knowledge extraction), Apache Marmotta (Linked Data Platform), the Linked Media Framework (networked knowledge) und VIE (interactive knowledge).
State-of-the-art big data platforms need to process massive quantities of data in batch and in parallel - filtering, transforming and sorting it before loading it into an enterprise data warehouse. In order to realize an Open Semantic Enterprise, a big data platform has to be optimized for acquiring, organizing, and loading unstructured data. Technological approaches such as NoSQL databases and connectors for Apache Hadoop complement big data solutions for the open world of a semantic enterprise.
This document discusses using linked open data and semantic technologies to support next generation science. It provides background on the increasing availability of open data and opportunities for citizen science contributions. Semantic technologies can help integrate and link diverse scientific data sources. Linked data principles allow disparate datasets to be connected through shared identifiers and relationships. Examples are provided of existing projects that use semantic approaches to enable scientific data discovery, analysis and collaboration across domains like population health, water quality monitoring and climate change. Overall, the document argues that semantic technologies are mature and can help scientists address large, distributed problems by facilitating data integration and knowledge sharing.
1) Big data is becoming broader as more varied data becomes available on the web from sources like open government and e-commerce.
2) Broad data presents challenges that are different than traditional databases as it includes data from many sources that is only partially structured.
3) Semantics and metadata are crucial for integrating and making sense of broad data from multiple sources that may use different terms or collect data in different ways.
This talk introduces Linked Data and Semantic Web by using two examples - population sciences grid and semantAqua - a semantically enabled environmental monitoring. It shows a few tools and the semantic methodology and opens discussion for LOD and team science
Coping with Data Variety in the Big Data Era: The Semantic Computing ApproachAndre Freitas
Big Data is based on the vision of providing users and applications with a more complete picture of the reality supported and mediated by data. This vision comes with the inherent price of data variety, i.e. data which is semantically heterogeneous, poorly structured, complex and with data quality issues. Despite the hype on technologies targeting data volume and velocity, solutions for coping with data variety remain fragmented and with limited adoption. In this talk we will focus on emerging data management approaches, supported by semantic technologies, to cope with data variety. We will provide a broad overview of semantic computing approaches and how they can be applied to data management challenges within organizations today. This talk will allow the audience to have a glimpse into the next-generation, Big Data-driven information systems.
The document discusses several topics related to search engines and online information, including:
1) The PageRank algorithm and its extensions over time to provide more contextually relevant search results.
2) Concerns about privacy and concentration of power as collective intelligence and user data is concentrated within large tech companies.
3) Differences in search results between engines and regions due to factors like censorship and localized information.
Open Data, by definition, provides the chance to re-shape and publish heterogeneous pieces and fragments of information which are open, namely anyone is free to use, reuse, and redistribute it. In order for users to fully benefit this idea, Open Data Systems of tomorrow must provide high quality data, relying on real time and ubiquitous services, along with a deep integration with mobile and smart devices and infrastructures.
In this session, we present a syntheses of Whitehall proposal addressed a this vision: is addressed at building Open Data in a fully-fledged Big Data infrastructure, realized using graph based and NoSQL technologies. This idea is shaped in a cultural heritage scenario, where data in envisaged at valorizing one of the main assets of Italy: cultural heritage.
The document discusses strategies for modeling and publishing open government data as linked data. It outlines a process that includes identifying data, modeling exemplar records, naming resources with URIs, describing resources with vocabularies, converting data to RDF, and publishing and maintaining the data. The key steps are to focus on modeling real-world objects without consideration for specific applications, take an iterative approach, and be forgiving of imperfect initial models. Content management systems and wiki systems are not optimal for structured linked data, so a linked data management system like Callimachus is recommended.
This document provides an overview of foundational research propelled by text analytics. It begins with an outline that discusses text analytics in the big data era, information extraction systems and formalisms, foundational research challenges, and conclusions. It then discusses how text analytics has become important for applications like semantic search, life science mining, e-commerce, CRM/BI, and log analysis. It notes the need for database management systems and general-purpose development and management systems to facilitate value extraction from big data by a wide range of users and skills. Core information extraction tasks like named entity recognition, relation extraction, event extraction, temporal information extraction, and coreference resolution are discussed. Several formalisms for information extraction are presented, including X
Search Solutions 2011: Successful Enterprise Search By DesignMarianne Sweeny
When your colleagues say they want Google, they don’t mean the Google Search Appliance. They mean the Google Search user experience: pervasive, expedient and delivering the information that they need. Successful enterprise search does not start with the application features, is not part of the information architecture, does not come from a controlled vocabulary and does not emerge on its own from the developers. It requires enterprise-specific data mining, enterprise-specific user-centered design and fine tuning to turn “search sucks” into search success within the firewall. This presentation looks at action items, tools and deliverables for Discovery, Planning, Design and Post Launch phases of an enterprise search deployment.
The document proposes an approach called BLOOMS to identify schema level relationships between ontologies in the Linked Open Data cloud by leveraging the structured knowledge in Wikipedia categories to build context trees for ontology classes and identify alignments based on the overlap between the trees. BLOOMS pre-processes ontology classes, maps them to Wikipedia articles, constructs category trees up to 4 levels, prunes irrelevant categories, and determines subclass relationships between ontologies based on the overlap between their respective trees. The approach aims to enhance existing ontology matching techniques for the noisy and loosely structured ontologies commonly found in the LOD cloud.
Linked Open Data Alignment and Enrichment Using Bootstrapping Based TechniquesPrateek Jain
The recent emergence of the “Linked Data” approach for publishing data represents a major step forward in realizing the original vision of a web that can “understand and satisfy the requests of people and machines to use the web content” – i.e. the Semantic Web. This new approach has resulted in the Linked Open Data (LOD) Cloud, which includes more than 70 large datasets contributed by experts belonging to diverse communities such as geography, entertainment, and life sciences. However, the current interlinks between datasets in the LOD Cloud – as we will illustrate – are too shallow to realize much of the benefits promised. If this limitation is left unaddressed, then the LOD Cloud will merely be more data that suffers from the same kinds of problems, which plague the Web of Documents, and hence the vision of the Semantic Web will fall short.
This thesis presents a comprehensive solution to address these issues using a bootstrapping based approach. It showcases using bootstrapping based methods to identify and create richer relationships between LOD datasets. The BLOOMS project (http://wiki.knoesis.org/index.php/BLOOMS) and the PLATO project, both built as part of this research, have provided evidence to the feasibility and the applicability of the solution.
Open Government Data, Linked Data, and the Missing Blocks in Korea Haklae Kim
This presentation discusses open government data and linked data. It provides examples of how open data initiatives from different governments have increased transparency and civic participation. Linked data practices are presented as a way to interconnect disparate datasets using semantic web standards. While Korea has strong e-government infrastructure, the presentation argues more can be done to implement open data and linked data practices. Participatory approaches are advocated to help design open data policies and solutions.
Infinit.e is an open analytics platform built using MongoDB and Hadoop. It allows users to collect, store, enrich, retrieve, analyze and visualize both structured and unstructured documents at scale. The platform utilizes open source technologies like Elasticsearch, MongoDB and Hadoop. MongoDB is used for document storage and development due to its flexible document model and ease of data model changes. Hadoop is used for large-scale analytics due to its proven scalability and availability of machine learning libraries. The presentation concludes with a demonstration of Infinit.e's capabilities.
Bringing Machine Learning and Knowledge Graphs Together
Six Core Aspects of Semantic AI:
- Hybrid Approach
- Data Quality
- Data as a Service
- Structured Data Meets Text
- No Black-box
- Towards Self-optimizing Machines
Sentara Linked Data Workshop - Sept 10, 20123 Round Stones
One day workshop to Sentara Healthcare on using a Linked Data approach for enterprise architecture. Topics include: Open Government Data initiatives, demo of Weather Health Web application; leveraging open data from NIH, NLM, NOAA, EPA, HHS; Callimachus Enterprise, a Linked Data Management System for the enterprise.
Trends in Human-Computer Interaction in Information SeekingRich Miller
The document discusses trends in human-computer interaction for information seeking. It provides 1) a framework for understanding information seeking behavior based on Marchionini's process model, 2) a vision of integrating new technologies into interfaces to enhance access and organization of growing amounts of information, and 3) an overview of significant technologies expected to impact future interfaces, such as natural language, visualization, ubiquitous computing and more. The framework and trends can be used to develop more effective next-generation user interfaces.
Introduction to question answering for linked data & big dataAndre Freitas
This document discusses question answering (QA) systems in the context of big data and heterogeneous data scenarios. It outlines the motivation and challenges for developing natural language interfaces for databases. The document covers the basic concepts and taxonomy of QA systems, including question types, answer types, data sources, and domains. It also discusses the anatomy and components of a typical QA system.
Similar to 전문가토크릴레이 2탄 Open data and linked data (김학래 박사) (20)
The document discusses future devices and user interfaces. It notes that past predictions about technology have often been wrong or limited in vision. It then examines emerging technologies like augmented reality displays, gesture controls, brain interfaces, and ubiquitous sensors. The author argues that devices will become integrated into everyday objects and environments through displays, networks, processors, artificial intelligence and other components. The future will see "things everywhere" rather than distinct devices, with new approaches to user experience and interaction beyond what is possible now.
The document discusses predictions for trends in web and HTML5 for 2013. The top 10 trends are: 1) Increased use of HTML5-enabled devices, 2) HTML5 becoming the standard, 3) Plans to finalize HTML5.1 in 2014, 4) Specialization of web apps, 5) Increased web APIs, 6) Emergence of web operating systems, 7) Expansion of the web into new areas, 8) Responsive web design, 9) Need for policies around the web, and 10) Rethinking the web ecosystem. The document provides details on each trend and discusses related topics like HTML5 standardization.
10. Let’s
Think
“This
led
to
changes
in
the
cons6tu6on
and
the
establishment
of
a
more
open
government”
–
WikiLeaks
WikiLeaks
Data.gov
11. Quick Summary
Open Data vs Linked Data
Open Data starts with making available the data that you already have, in whatever format.
• Equal access for all
Open Data • Licensing, legal issues
• Transparency
• Changing the way government works
• URIs
Linked Data • HTTPs
• RDF vocabularies
• Standards
3
12. Today
This Presentation .....
Open Data and The Semantic Web
Introduction What We Will Do
Open Government Linked Data
Data
4
13. Let’s Start
Web in Evolution
“a steady progression from a document-centric Web to one that is data-centric, including the mediation of semantics”
(Source: Mike, 2007)
5
14. Question
What is the Semantic Web for?
Standards
Inference
Search
Intelligence
6
15. Case Studies
Google’s Semantic Search
People should be able to ask questions and we should understand their meaning, or they should be able to
talk about things at a conceptual level. ... A lot of people will turn to things like the semantic Web as a possible
answer to that.“ - Google Vice President of Search Products User Experience Marissa Mayer
an initiative launched on 2 June 2011 by Bing, Google and Yahoo!
to create and support a common set of schemas for
structured
data markup on web pages.
Freebase is an open, Creative Commons licensed repository
of structured data of almost 22 million entities. An entity is a
single person, place, or thing connected by a graph.
The Knowledge Graph is a collection of information sources that
help discern a user’s specified intent with each individual query.
The graph is actually an encyclopedia with structured http://schema.org/docs/full.html
information obtained from the web. (currently, 200 million
entities)
7
16. Case Studies
Apple’s Siri
Ask Siri how Apple recorded the best quarter in history for a tech company, and her answer should be: Me.
Siri (Speech Interpretation and Recognition Interface) is Knowledge Navigator (1987)
an intelligent personal assistant and knowledge a concept described by former Apple Computer CEO John
navigator which works as an application for Apple's iOS. Sculley in his 1987 book, Odyssey.
A Brief History
- In December 2007 Siri, Inc. was formed by Dag Kittlaus
(CEO), Adam Cheyer (VP Engineering), and Tom Gruber
(CTO/VP Design).
- Siri Inc. went after funding and by November 2009 it had
secured $15.5 million investment, resulted in the creation of
the first Siri application, which debuted on the iPhone 3GS in
February 2010.
- Siri acquired by Apple; iPhone becomes the Virtual Personal
Assistant
(Source: http://www.youtube.com/watch?v=QRH8eimU_20)
8
17. Case Studies
Active Ontology
A processing formalism where distinct processing elements are arranged according to ontology notions;
an execution environment.
Basic concepts
* Ontology : A data structure
- Formal representation for domain knowledge
- Classes, attributes, relations
* Active Ontology : A processing environment
- Processing elements arranged according to ontology
notions
- Communication channels
P movie
P genre P actor P rating
rule set
rule
rule
rule
condition
condition
condition
action
action
action
(Baur et al., 2007)
9
18. Today
This Presentation .....
Open Data and The Semantic Web
Introduction What We Will Do
Open Government Linked Data
Data
10
19. Let’s Start
Big Data
“data that becomes large enough that it cannot be processed using conventional methods”
“Big Data is like Sex in High School–Lots of people are talking about it, but few are having it.”
-Eric Hansen, SiteSpect founder and CEO
11
22. Definition
What is Open (Government) Data?
“Open”
material (data) is open if it can be freely used,
reused and redistributed by anyone
“Government data”
data and information produced
or
commissioned by government or
government controlled entities.
Source: Open Knowledge Foundation, 2010
14
23. • Transparency
• Participation
• Collaboration
“My administration is committed to creating an unprecedented
level of openness in Government.” – Barack Obama
“Memorandum for the Heads of Executive Departments and Agencies – Transparency and Open Government” Jan 2009
24. Data.gov
Launched
on
May
21,
2009,
Data.gov
allows
ci;zens
to
par;cipate
by
leveraging
federal
data
sets
to
build
applica;ons,
conduct
analysis,
and
perform
research.
• The
first
phase
of
Data.gov
features
downloadable
federal
data
sets
organized
by
category
and
federal
organiza6on.
• Data
sets
are
available
for
download
in
XML,
CSV,
and
shape
file
formats.
16
25. Data.gov.uk
Prime
Minister,
David
Cameron,
writes
to
all
government
departments,
31
May
2010:
instruc;ng
them
to
free
up
more
datasets
as
part
of
Transparency
Agenda
Establishment
of
the
Public
Sector
Transparency
Board
chaired
by
Francis
Maude,
Minister
for
the
Cabinet
Office
The
Board
will
be
responsible
for
seRng
open
data
standards
across
the
public
sector,
publishing
further
datasets
on
the
basis
of
public
demand
17
29. Open Data Strategies
Open data instruments
“The application of the four types of instruments by the five countries is depicted – the larger
the circle the more instruments are applied” – Huijboom Van den Broek, 2011.
Education and training Voluntary approaches
US
AU ES UK DK
DK
UK
ES AU US
ES
DK
US ES
DK AU
AU
UK
UK US
Economic instruments Legislation and control
21
30. Critical factors
Drivers and barries of open data policy implementation
1 Strategies and experience in front runner countries Closed government culture
2 Political leadership Privacy legislation
3 Regional initiatives Limited quality of data
4 Citizen initiatives Limited user-friendliness/information overload
5 Market initiatives Lack of standardization of open data policy
6 Emerging technologies Security threats
7 European legislation Existing charging models
8 Thought leaders Uncertain economic impact
9 Possibility of monitoring government Digital divide
10 Budgets cuts Network overload
Source:
Huijboom
and
Van
den
Broek,
2011
22
31. Open Data Portals
CKAN – Open Source Data Portal
Makes it easy to publish, share, and find dataset. Integrated data storage, processing,
viewing and visualization
23
32. Today
This Presentation .....
Open Data and The Semantic Web
Introduction What We Will Do
Open Government Linked Data
Data
24
33. Let’s Start
The Web as a Global Data Platform
.. a system of interlinked hypertext documents accessed via the Internet
25
35. All data including documents, services, people ...
DATA DATA links
The Semantic Web is not about links between web pages.
27
36. Overview
Linked Data The Semantic Web
“The Semantic Web isn't just about putting data on the web. It is about making links, so that a person or
machine can explore the web of data. With linked data, when you have some of it, you can find other,
related, data” - TBL.
5 Stars Open linked data
★ Make your stuff available on the Web
★★ Make it avaiable as structured data
★★★ Use open, standard formats (instead of
excel)
★★★★
Use a open data format – URLs,
★★★★★ descriptions
Link your data to other people’s data
28
37. Overview
Growth of Interlinks
… Linked Data provides the means to reach the goal of the Semantic Web – “the
emergence of a Web of Data”
2007-05-01
2007-10-08
2007-11-10
2008-02-28
2008-03-31
2008-09-18
2009-03-05
2009-03-27
2009-07-14
2010-09-22
29
38. Structured Wikipedia Multimedia Content
DBpedia BBC
Commercial Product Government Data
Best Buy UK Gov
October, 2011 30
295 interlinked datasets, approximately 31 billions triples
41. Today
This Presentation .....
Open Data and The Semantic Web
Introduction What We Will Do
Open Government Linked Data
Data
33
42. Conceptual Architecture
Roadmap of linked open government data
“the combination of machine power and human power and deliver higher-quality data to a wide
range of data consumers via visualization, mashups, and more.”
34
(Ding et al., 2012)
43. How to Start
“We won’t get there tomorrow, but maybe the day after” –
Rufus Pollock
Low-hanging fruit, Less conversational data and quick wins.
Fireout Rebuild
Expand, with
more…..
Data
Services
Efficiency
Costs saving
Transparency
Participation
Inclusion
35
44. References
- Charles Baur, Adam Cheyer, Didier Guzzoni, Active, a platform for building intelligent software
- Noor Huijboom and Tijs Van den Broek, Open Data: an international comparison of strategies, European journal of ePractices,
March/April 2011
- Li Ding, Vassilios Peristeras, and Michael Hausenblas, Linked Open Government Data, IEEE Intelligent Systems, May/June 2012
- Page 1: http://www.w3.org/DesignIssues/diagrams/websci/Marius%20Watz%20-%20Web%20Science%20artwork.png
- Page 4: http://www.go-gulf.com/60seconds.jpg
- Page 9: http://cloud.frontpagemag.com/wp-content/uploads/2012/03/obama11.jpg
- Page 27: http://www.patentlyapple.com/.a/6a0120a5580826970c0168e5ccdd81970c-800wi
- Page 29: http://programminggeeks.com/wp-content/uploads/2010/05/Programming-Geeks-Web-Science.jpg
- Page 29: http://3.bp.blogspot.com/-C0Kyck90Djo/T4KZTg3k1XI/AAAAAAAAAsE/RUp165S0FCQ/s1600/Commitment.jpeg
Page 2 Case Studies
- http://www.guardian.co.uk/commentisfree/2012/aug/03/london-2012-olympics-open-data
- http://www.bbc.co.uk/news/uk-19050139
- http://london2012.nytimes.com/results
- http://www.guardian.co.uk/sport/interactive/2012/jul/23/could-you-be-a-medallist
- http://www.guardian.co.uk/sport/datablog/2012/aug/13/olympics-2012-data-journalism
- http://www.guardian.co.uk/sport/datablog/interactive/2012/jul/26/london-2012-price-olympic-games-visualised
36
45. For more information
contact Haklae Kim via
haklae.kim@gmail.com
Twitter: haklaekim
Or see more activities
at:
http://blogweb.co.kr
http://thedatahub.kr
http://getthedata.kr