Linked Data Cookbook for US Government Agencies by Bernadette Hyland, 3 Round Stones, Inc. and W3C Government Linked Data co-chair.
Presented at the Semantic Technology Conference, December 2011, Washington DC.
Linked Data Cookbook for Government Agencies, SemTech East, Washington DC 1-Dec-2011
1. A Linked Data Cookbook
for Government Agencies
Semantic Technology Conference, Washington DC
01-Dec-2011 8:30AM
Bernadette Hyland
CEO, 3 Round Stones &
co-chair W3C Government Linked Data Working Group
bhyland@3roundstones.com
Twitter @BernHyland
Monday, November 28, 11
2. • Linked Data is about publishing and consuming data using international data standards
• Based on a 20 year old idea
• Goal is to solve organizational issues related to data silos, requirements for faster data integration and an environment of reduced IT budgets
3. Linking Government Data
• 42 contributors
• ...from 8 countries
• 10 chapters
• Publication date:
November 2011
4. Agenda
• Why publishing Linked Open Data matters
• What governments are doing today
• How government use of Open Standards &
Open Source Software saves lives and money
• Social contract as a government publisher
• Next steps
5. Two sides of the
Open Government Coin
Short and long term public interests
Increasing transparency
Helping with informed civic engagement
#2 Data sharing for informed research, policy &
regulation
My talk today focuses on #2
6. Why should we care?
• Reducing data silos has long been discussed ...
• Linked Data, based on international data exchange standards, avoids vendor lock-in
• Reduces the need to create & maintain data silos
• Encourages private and public partnerships
• Sows the seeds for economic growth from the top down and bottom up
7. ACCEPTABLE ROI FOR IT
[Pie chart: acceptable ROI period for IT. Legend: 6 months, 12 months, 18 months, 24 months, more than 24 months. Segment labels: 4%, 17%, 13%, 16%, 49%.]
19. Where is Open Source deployed?
International Standards and Open Source are the reason
• The Web has become the most extensible, robust
information network ever created
• US Dept of Defense is a big customer of commercially
supported Open Source software
• US Army cites Open Source as saving lives and hundreds of
millions of dollars.
• 100k instances deployed in missile defense systems &
armored personnel carriers
20. In 3 brief years ...
• Starting in 2008, a few heads of state directed open
government data to be published on the Web ...
• Three months ago (September 2011), Presidents
Obama (USA) and Rousseff (Brazil) endorsed the
Open Government Partnership, along with
7 other nations
• Each launched their government’s National Plans
during the meeting of the UN General Assembly
21. World changing phenomenon
• Using a Linked Data approach, we can begin to
address data silos and interoperability using
data exchange standards
• We can combine information sources
• The W3C has defined standards that enable
interoperability and allow us to freely move
data
23. What is next?
• We’re already seeing signs of things to
come.
• Structured data on the Web is becoming
mainstream.
24. Government Linked
Data Working Group
• Started June 2011; runs to May 2013
• Chartered to provide standards & develop standards
track documents to help all governments share
their data as high quality (“5 star”) Linked
Data
• 39 participants from 25 organizations
• 50% in non-US locations
26. Deliverables
Community Directory
Best Practices for Publishing Linked Data
• Procurement, vocabulary selection, URI construction,
versioning, stability, legacy data issues
• Cookbook for Linked Open Data
Standard Vocabularies
• Metadata, Statistical “Cube” Data, People,
Organizational structures
27. Beta: http://dir.w3.org
email support@3roundstones.com for login to
add your organization’s details
37. Preparation
1. Leverage what exists
• Request a copy of the logical and physical model of the
database(s)
• Obtain data extracts (i.e., databases and/or spreadsheets)
or create data in a way that can be replicated.
38. Model the data
2. Model data without context to allow for
reuse and easier merging of data sets
• Traditional DBAs organize data for specified
Web services or applications.
• With LD, application logic does not drive the
data schema, concepts, etc.
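A minimal sketch of why application-independent modeling makes merging easier, assuming two hypothetical agencies that describe the same facility with the same URI. The `example.gov` URIs and property names are invented for illustration, and literals are kept as plain strings for brevity, which real RDF tooling would not do.

```python
# Hypothetical data: two agencies describe the same facility using the
# same URI, without sharing any application schema.
FACILITY = "http://example.gov/id/facility/42"  # assumed URI
LABEL = "http://www.w3.org/2000/01/rdf-schema#label"

agency_a = {
    (FACILITY, LABEL, "Riverside Plant"),
    (FACILITY, "http://example.gov/def/locatedIn", "http://example.gov/id/state/VA"),
}
agency_b = {
    (FACILITY, "http://example.gov/def/annualEmissionsTons", "1200"),
    (FACILITY, LABEL, "Riverside Plant"),
}

def merge(*graphs):
    """Merge graphs as a set union; duplicate triples collapse."""
    merged = set()
    for graph in graphs:
        merged |= graph
    return merged

def match(graph, s=None, p=None, o=None):
    """Return sorted triples matching a pattern; None acts as a wildcard."""
    return sorted(
        t for t in graph
        if (s is None or t[0] == s)
        and (p is None or t[1] == p)
        and (o is None or t[2] == o)
    )

merged = merge(agency_a, agency_b)
facts = match(merged, s=FACILITY)
```

Because each statement stands alone as a subject, predicate, object triple, no schema migration is needed to combine the two sources; the shared label simply appears once in the merged set.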
39. Model the data
3. Look for real-world objects of interest (e.g.,
people, places, things, locations) and
model them.
• Investigate how others are already modeling
similar or related data.
• Look for duplication and normalize the data
• Use common sense to decide whether or
not to make a link
40. Model the data ...
4. Connect data from different sources and
authoritative vocabularies (see list of popular
vocabularies below).
• Use URIs as names for your
objects
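As an illustration of naming objects with URIs and reusing an established vocabulary, here is a small sketch. The `http://example.gov/id/` namespace and the helper names are assumptions made up for this example; the FOAF terms (`foaf:Person`, `foaf:name`) come from the published FOAF vocabulary.

```python
from urllib.parse import quote

BASE = "http://example.gov/id/"        # hypothetical agency namespace
FOAF = "http://xmlns.com/foaf/0.1/"    # FOAF vocabulary namespace
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"

def mint_uri(kind, local_id):
    """Build a URI that names one real-world object."""
    return f"{BASE}{quote(kind)}/{quote(str(local_id), safe='')}"

def describe_person(local_id, name):
    """Return triples naming a person with a URI and FOAF terms."""
    person = mint_uri("person", local_id)
    return [
        (person, RDF_TYPE, FOAF + "Person"),
        (person, FOAF + "name", name),
    ]

triples = describe_person("jane doe", "Jane Doe")
```

Reusing FOAF rather than inventing a local "person" class is what lets other publishers' data about people line up with yours.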
41. Model the data ...
• Put aside immediate needs of any
application
• Don’t think about how an application will
use your data
• Do think about time and how the data will
change over time.
42. Convert, Publish & Maintain
5. Write a script or process to convert the
data set repeatedly
6. Publish to the Web and announce it! (more
details shortly)
7. Maintenance strategy (more details in the
social contract at the end)
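The conversion step might look like the following sketch, which turns a CSV extract into N-Triples lines. The column names, the `example.gov` namespace and the `state` property are hypothetical; a real conversion would follow the agency's own model and would likely use an RDF library instead of string formatting.

```python
import csv
import io

BASE = "http://example.gov/id/facility/"   # hypothetical namespace
LABEL = "http://www.w3.org/2000/01/rdf-schema#label"
STATE = "http://example.gov/def/state"     # hypothetical property

def escape(text):
    """Escape a string for use as an N-Triples literal."""
    return (text.replace("\\", "\\\\").replace('"', '\\"')
                .replace("\n", "\\n").replace("\r", "\\r"))

def convert(csv_text):
    """Convert CSV rows to sorted N-Triples lines (same input, same output)."""
    lines = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        subject = f"<{BASE}{row['id']}>"
        lines.append(f'{subject} <{LABEL}> "{escape(row["name"])}" .')
        lines.append(f'{subject} <{STATE}> "{escape(row["state"])}" .')
    return sorted(lines)

# Assumed sample extract, standing in for a database or spreadsheet export.
SAMPLE = "id,name,state\n2,Lakeside Works,MD\n1,Riverside Plant,VA\n"
ntriples = convert(SAMPLE)
```

Sorting the output keeps repeated runs byte-for-byte comparable, so a diff between two conversions shows only what changed in the source data.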
43. Take the plunge ... Be forgiving
• Simplistic data models can still be useful
• Better to make progress with something
rather than do nothing because we cannot
be comprehensive and complete
44. Take an iterative approach
1. Review of modeling decisions
2. Review vocabularies chosen and developed
3. Modify/update data conversion scripts
4. Do a maintenance walk-through with real use cases
5. Show how to explore data with SPARQL and
visualizations
6. Discuss a persistent identifier strategy (think PURLs)
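To make the persistent identifier point concrete, here is a toy sketch of the idea behind PURLs: the published identifier stays fixed while the location it redirects to can change. The table, paths and URLs are invented for illustration, and the 302 status is only one of the redirect types a real PURL service may use.

```python
# Hypothetical PURL table: persistent path -> current location.
PURLS = {
    "/id/facility/1": "http://data.example.gov/2011/facilities/1",
}

def resolve(path):
    """Return (status, location), roughly as a redirecting server would."""
    target = PURLS.get(path)
    if target is None:
        return 404, None
    return 302, target

def move(path, new_location):
    """Repoint an existing persistent identifier when the data moves."""
    if path not in PURLS:
        raise KeyError(path)
    PURLS[path] = new_location

before = resolve("/id/facility/1")
move("/id/facility/1", "http://data.example.gov/2012/facilities/1")
after = resolve("/id/facility/1")
```

Consumers who linked to `/id/facility/1` keep working after the move, which is the property a maintenance walk-through should check.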
46. Linked Data Management System
Callimachus (kəlĭm'əkəs) is a framework for data-driven
applications based on Linked Data principles.
Callimachus allows Web authors to quickly and easily create
semantically-enabled Web applications.
47. Web 2.0 developers can create data-driven applications
with templates in hours
Triples up & down (no MySQL under the covers)
Wiki editing of content
Access control
Collaboration via Web
Change tracking (history)
Page/form Templates
58. Join the Community
Callimachus has benefited from 2+ years of corporate support
We’re using it for real world Web applications in environmental
protection, finance and publishing
Open Source project
Visit callimachusproject.org
59. What we covered today
• Why government authorities are publishing information as
Linked Open Data
• The process for converting data into RDF
• Using Open Standards and Open Source to publish
Open Data
• Note: Commercial support & products are
critical for government publishing & consumption of Open
Data
• Announcing agency Open Data & your social contract
60. Further Reading
http://linkeddatabook.com/editions/1.0/
http://3roundstones.com/linking-enterprise-data/
http://3roundstones.com/linking-government-data/
http://www.linkeddatadeveloper.com/
61. Recommended talk
Thursday, 1-Dec 2011 @ 9:30
by Michael Pendleton &
David G. Smith, US EPA
LINKED GOVERNMENT
DATA:
ENVIRONMENTAL
PROTECTION PERSPECTIVES