Presented by Tony Mathys at a Current Issues and Applications of the Geospatial Technologies Lecture, Department of Geography and Environment, Aberdeen University, 24 February 2012
Implementing Open Access: Effective Management of Your Research DataMartin Hamilton
The slides from my session with the DCC's Martin Donnelly at the Understanding ModernGov "Implementing Open Access" event in June 2014. Our talk is all about the support available from Jisc and the DCC to help you manage your research data, and potential future initiatives that might help institutions to handle the move to "open science".
Open Data in a Big Data World: easy to say, but hard to do?LEARN Project
Presentation at 3rd LEARN workshop on Research Data Management, “Make research data management policies work”
Helsinki, 28 June 2016, by Sarah Callaghan, STFC Rutherford Appleton Laboratory
Presented by Tony Mathys at a Current Issues and Applications of the Geospatial Technologies Lecture, Department of Geography and Environment, Aberdeen University, 24 February 2012
Implementing Open Access: Effective Management of Your Research DataMartin Hamilton
The slides from my session with the DCC's Martin Donnelly at the Understanding ModernGov "Implementing Open Access" event in June 2014. Our talk is all about the support available from Jisc and the DCC to help you manage your research data, and potential future initiatives that might help institutions to handle the move to "open science".
Open Data in a Big Data World: easy to say, but hard to do?LEARN Project
Presentation at 3rd LEARN workshop on Research Data Management, “Make research data management policies work”
Helsinki, 28 June 2016, by Sarah Callaghan, STFC Rutherford Appleton Laboratory
Cal Poly - Data Management for ResearchersCarly Strasser
October 17, 2013 @ 1 Robert E. Kennedy Library, Data Studio, California Polytechnic State University.
Researchers rarely learn about good data management practices. Instead we develop our own systems that are often unintelligible to others. In this talk, Strasser, PhD, will focus on the common mistakes that scientists make and how to avoid them. She will provide best practices for data management, which will facilitate data sharing and reuse, and introduce tools you can use.
Cal Poly - Data Management and the DMPToolCarly Strasser
October 17, 2013 @ Robert E. Kennedy Library, Data Studio, California Polytechnic State University.
Many funders now require researchers to submit a Data Management Plan alongside their project proposals. The DMPTool is a free, online wizard that helps you create a data management plan specific to your project, and provides you with links and resources for ensuring your plan is successful.
Collaboration to Curation: The High Rise Project meets Edinburgh DataShare University of Edinburgh
Slides describing the evolution of the Edinburgh DataShare repository and The High-Rise Project and the (potential) collaborative mechanisms that will enable the digital content to be ingested and preserved in the Edinburgh DataShare DSpace repository environment
Linking EUDAT services to the EGI Fed-Cloud - EUDAT Summer School (Hans van P...EUDAT
The main goal of the EGI-EUDAT collaboration is to harmonise the two eInfrastructures, including technical interoperability, authentication, authorisation and identity management, policy and operations. As main objective, this work is to provide end-users with a seamless access to an integrated infrastructure offering both EGI and EUDAT services and then, pairing data and high-throughput computing resources together. Selected user communities are able to bring requirements and help assign the right priorities to each of them. In this way, the integration activity has been driven by the end users from the start. The use case permits a user of either e-infrastructure to instantiate a VM on the EGI Cloud Federation for the execution of a computational job consuming data preserved onto EUDAT resources. The results of such analysis can be staged back to EUDAT storages, and if needed, allocated with Persistent identifiers (PIDs) for future use. To implement all the steps of this use case the following integration activities between the two infrastructures has to be fulfilled: (1) harmonisation between the authentication and authorisation model, (2) definition and implementation of the interfaces between the involved EGI and EUDAT services.
Visit: https://www.eudat.eu/eudat-summer-school
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014Jisc
The analysis of government data, data held by business, the web, social science survey data will support new research directions and findings. Big Data is one of David Willetts’ 8 great technologies, and in order to secure the UK’s competitive advantage new investments have been made by the Economic Social Science Research Council ( ESRC) in Big Data, for example the Business Datasafe and Understanding Populations investments. In this session the benefits of the use of Big Data in social science , and the ESRCs Big Data strategy will be explained by Professor David De Roure.of the Oxford e-Research Centre and advisor to the ESRC.
Overview of data management policies and data management plans, including the DMPTool. For Ecological Society of America 2013 Meeting in Minneapolis, MN 5 August 2013.
We looked at the data. Here’s a breakdown of some key statistics about the nation’s incoming presidents’ addresses, how long they spoke, how well, and more.
Cal Poly - Data Management for ResearchersCarly Strasser
October 17, 2013 @ 1 Robert E. Kennedy Library, Data Studio, California Polytechnic State University.
Researchers rarely learn about good data management practices. Instead we develop our own systems that are often unintelligible to others. In this talk, Strasser, PhD, will focus on the common mistakes that scientists make and how to avoid them. She will provide best practices for data management, which will facilitate data sharing and reuse, and introduce tools you can use.
Cal Poly - Data Management and the DMPToolCarly Strasser
October 17, 2013 @ Robert E. Kennedy Library, Data Studio, California Polytechnic State University.
Many funders now require researchers to submit a Data Management Plan alongside their project proposals. The DMPTool is a free, online wizard that helps you create a data management plan specific to your project, and provides you with links and resources for ensuring your plan is successful.
Collaboration to Curation: The High Rise Project meets Edinburgh DataShare University of Edinburgh
Slides describing the evolution of the Edinburgh DataShare repository and The High-Rise Project and the (potential) collaborative mechanisms that will enable the digital content to be ingested and preserved in the Edinburgh DataShare DSpace repository environment
Linking EUDAT services to the EGI Fed-Cloud - EUDAT Summer School (Hans van P...EUDAT
The main goal of the EGI-EUDAT collaboration is to harmonise the two eInfrastructures, including technical interoperability, authentication, authorisation and identity management, policy and operations. As main objective, this work is to provide end-users with a seamless access to an integrated infrastructure offering both EGI and EUDAT services and then, pairing data and high-throughput computing resources together. Selected user communities are able to bring requirements and help assign the right priorities to each of them. In this way, the integration activity has been driven by the end users from the start. The use case permits a user of either e-infrastructure to instantiate a VM on the EGI Cloud Federation for the execution of a computational job consuming data preserved onto EUDAT resources. The results of such analysis can be staged back to EUDAT storages, and if needed, allocated with Persistent identifiers (PIDs) for future use. To implement all the steps of this use case the following integration activities between the two infrastructures has to be fulfilled: (1) harmonisation between the authentication and authorisation model, (2) definition and implementation of the interfaces between the involved EGI and EUDAT services.
Visit: https://www.eudat.eu/eudat-summer-school
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014Jisc
The analysis of government data, data held by business, the web, social science survey data will support new research directions and findings. Big Data is one of David Willetts’ 8 great technologies, and in order to secure the UK’s competitive advantage new investments have been made by the Economic Social Science Research Council ( ESRC) in Big Data, for example the Business Datasafe and Understanding Populations investments. In this session the benefits of the use of Big Data in social science , and the ESRCs Big Data strategy will be explained by Professor David De Roure.of the Oxford e-Research Centre and advisor to the ESRC.
Overview of data management policies and data management plans, including the DMPTool. For Ecological Society of America 2013 Meeting in Minneapolis, MN 5 August 2013.
We looked at the data. Here’s a breakdown of some key statistics about the nation’s incoming presidents’ addresses, how long they spoke, how well, and more.
My books- Hacking Digital Learning Strategies http://hackingdls.com & Learning to Go https://gum.co/learn2go
Resources at http://shellyterrell.com/emoji
Artificial intelligence (AI) is everywhere, promising self-driving cars, medical breakthroughs, and new ways of working. But how do you separate hype from reality? How can your company apply AI to solve real business problems?
Here’s what AI learnings your business should keep in mind for 2017.
Study: The Future of VR, AR and Self-Driving CarsLinkedIn
We asked LinkedIn members worldwide about their levels of interest in the latest wave of technology: whether they’re using wearables, and whether they intend to buy self-driving cars and VR headsets as they become available. We asked them too about their attitudes to technology and to the growing role of Artificial Intelligence (AI) in the devices that they use. The answers were fascinating – and in many cases, surprising.
This SlideShare explores the full results of this study, including detailed market-by-market breakdowns of intention levels for each technology – and how attitudes change with age, location and seniority level. If you’re marketing a tech brand – or planning to use VR and wearables to reach a professional audience – then these are insights you won’t want to miss.
Presentation given at the Consorcio Madrono conference on Data Management Plans in Horizon 2020 http://www.consorciomadrono.es/info/web/blogs/formacion/217.php
Data accessibility and the role of informatics in predicting the biosphereAlex Hardisty
The variety, distinctiveness and complexity of life – biodiversity in other words and by implication the ecosystems in which it is situated – is our life support system. It is absolutely essential and more important than almost everything else but it is typically taken for granted. Today’s big societal challenges – food and water security, coping with environmental change and aspects of human health – are beyond the abilities of any one individual or research group to solve. Solving them depends not only on collaboration to deliver the appropriate scientific evidence but increasingly on vast amounts of data from multiple sources (environmental, taxonomic, genomic and ecological) gathered by manual observation and automated sensors, digitisation, remote sensing, and genetic sequencing. In April 2012 we called the biodiversity and ecosystems research communities to arms to formulate a consensus view on establishing an infrastructure to improve the accessibility of the ever-increasing volumes of biological data. We published the whitepaper: “A decadal view of biodiversity informatics: challenges and priorities” that has since been viewed more than 24,000 times. We envisage a shared and maintained multi-purpose network of computationally-based processing services sitting on top of an open data domain. By open data domain we mean data that is accessible i.e., published, registered and linked. BioVeL, pro-iBiosphere, ViBRANT and other FP7 funded projects have all explored aspects of this vision.
Processing Open Data using Terradue Cloud Platformterradue
We gave this talk at the "Open Data Projects cluster meeting" organised by the European Commission on 07-08th September 2015, in Brussels.
It was part of the Session IV: Sustainability and business strategies.
LIBER Webinar: Turning FAIR Data Into RealityLIBER Europe
These slides relate to a LIBER Webinar given on 23 April 2018. Turning FAIR Data Into Reality — Progress and Plans from the European Commission FAIR Data Expert Group.
In this webinar, Simon Hodson, Executive Director of CODATA and Chair of the FAIR Data Expert Group, and Sarah Jones, Associate Director at the Digital Curation Centre and Rapporteur, reported on the Group’s progress.
This module supported the training on Linked Open Data delivered to the EU Institutions on 30 November 2015 in Brussels. https://joinup.ec.europa.eu/community/ods/news/ods-onsite-training-european-commission
Multi-faceted Classification of Big Data Use Cases and Proposed Architecture ...Geoffrey Fox
Keynote at Sixth International Workshop on Cloud Data Management CloudDB 2014 Chicago March 31 2014.
Abstract: We introduce the NIST collection of 51 use cases and describe their scope over industry, government and research areas. We look at their structure from several points of view or facets covering problem architecture, analytics kernels, micro-system usage such as flops/bytes, application class (GIS, expectation maximization) and very importantly data source.
We then propose that in many cases it is wise to combine the well known commodity best practice (often Apache) Big Data Stack (with ~120 software subsystems) with high performance computing technologies.
We describe this and give early results based on clustering running with different paradigms.
We identify key layers where HPC Apache integration is particularly important: File systems, Cluster resource management, File and object data management, Inter process and thread communication, Analytics libraries, Workflow and Monitoring.
See
[1] A Tale of Two Data-Intensive Paradigms: Applications, Abstractions, and Architectures, Shantenu Jha, Judy Qiu, Andre Luckow, Pradeep Mantha and Geoffrey Fox, accepted in IEEE BigData 2014, available at: http://arxiv.org/abs/1403.1528
[2] High Performance High Functionality Big Data Software Stack, G Fox, J Qiu and S Jha, in Big Data and Extreme-scale Computing (BDEC), 2014. Fukuoka, Japan. http://grids.ucs.indiana.edu/ptliupages/publications/HPCandApacheBigDataFinal.pdf
In recent years governments and research institutions have emphasized the need for open data as a fundamental component of open science. But we need much more than the data themselves for them to be reusable and useful. We need descriptive and machine-readable metadata, of course, but we also need the software and the algorithms necessary to fully understand the data. We need the standards and protocols that allow us to easily read and analyze the data with the tools of our choice. We need to be able to trust the source and derivation of the data. In short, we need an interoperable data infrastructure, but it must be a flexible infrastructure able to work across myriad cultures, scales, and technologies. This talk will present a concept of infrastructure as a body of human, organisational, and machine relationships built around data. It will illustrate how a new organization, the Research Data Alliance, is working to build those relationships to enable functional data sharing and reuse.
FAIR data: what it means, how we achieve it, and the role of RDASarah Jones
Presentation on FAIR data, the FAIR Data Action Plan developed by the European Commission Expert Group and the role of the Research Data Alliance on implementing FAIR. The presentation was given at the RDAFinland workshop held on 6th June - https://www.csc.fi/web/training/-/rda_and_fair_supporting_finnish_researchers
ODI Node Vienna: Best Practise Beispiele für: Open Innovation mittels Open DataMartin Kaltenböck
Vortrag im Rahmen des Data Pioneers Workshop am 10.10.2016 am BMVIT zum Thema Open Innovation und Open Data (Open Innovation mittels Open Data) seitens Elmar Kiesling (TU Wien) und Martin Kaltenböck (SWC) für den ODI (Open Data Institute) Node Vienna.
Similar to SemWeb 4 Gov – opportunities and challenges (20)
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdfPeter Spielvogel
Building better applications for business users with SAP Fiori.
• What is SAP Fiori and why it matters to you
• How a better user experience drives measurable business benefits
• How to get started with SAP Fiori today
• How SAP Fiori elements accelerates application development
• How SAP Build Code includes SAP Fiori tools and other generative artificial intelligence capabilities
• How SAP Fiori paves the way for using AI in SAP apps
DevOps and Testing slides at DASA ConnectKari Kakkonen
My and Rik Marselis slides at 30.5.2024 DASA Connect conference. We discuss about what is testing, then what is agile testing and finally what is Testing in DevOps. Finally we had lovely workshop with the participants trying to find out different ways to think about quality and testing in different parts of the DevOps infinity loop.
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...DanBrown980551
Do you want to learn how to model and simulate an electrical network from scratch in under an hour?
Then welcome to this PowSyBl workshop, hosted by Rte, the French Transmission System Operator (TSO)!
During the webinar, you will discover the PowSyBl ecosystem as well as handle and study an electrical network through an interactive Python notebook.
PowSyBl is an open source project hosted by LF Energy, which offers a comprehensive set of features for electrical grid modelling and simulation. Among other advanced features, PowSyBl provides:
- A fully editable and extendable library for grid component modelling;
- Visualization tools to display your network;
- Grid simulation tools, such as power flows, security analyses (with or without remedial actions) and sensitivity analyses;
The framework is mostly written in Java, with a Python binding so that Python developers can access PowSyBl functionalities as well.
What you will learn during the webinar:
- For beginners: discover PowSyBl's functionalities through a quick general presentation and the notebook, without needing any expert coding skills;
- For advanced developers: master the skills to efficiently apply PowSyBl functionalities to your real-world scenarios.
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Albert Hoitingh
In this session I delve into the encryption technology used in Microsoft 365 and Microsoft Purview. Including the concepts of Customer Key and Double Key Encryption.
Sudheer Mechineni, Head of Application Frameworks, Standard Chartered Bank
Discover how Standard Chartered Bank harnessed the power of Neo4j to transform complex data access challenges into a dynamic, scalable graph database solution. This keynote will cover their journey from initial adoption to deploying a fully automated, enterprise-grade causal cluster, highlighting key strategies for modelling organisational changes and ensuring robust disaster recovery. Learn how these innovations have not only enhanced Standard Chartered Bank’s data infrastructure but also positioned them as pioneers in the banking sector’s adoption of graph technology.
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfPaige Cruz
Monitoring and observability aren’t traditionally found in software curriculums and many of us cobble this knowledge together from whatever vendor or ecosystem we were first introduced to and whatever is a part of your current company’s observability stack.
While the dev and ops silo continues to crumble….many organizations still relegate monitoring & observability as the purview of ops, infra and SRE teams. This is a mistake - achieving a highly observable system requires collaboration up and down the stack.
I, a former op, would like to extend an invitation to all application developers to join the observability party will share these foundational concepts to build on:
Communications Mining Series - Zero to Hero - Session 1DianaGray10
This session provides introduction to UiPath Communication Mining, importance and platform overview. You will acquire a good understand of the phases in Communication Mining as we go over the platform with you. Topics covered:
• Communication Mining Overview
• Why is it important?
• How can it help today’s business and the benefits
• Phases in Communication Mining
• Demo on Platform overview
• Q/A
Dr. Sean Tan, Head of Data Science, Changi Airport Group
Discover how Changi Airport Group (CAG) leverages graph technologies and generative AI to revolutionize their search capabilities. This session delves into the unique search needs of CAG’s diverse passengers and customers, showcasing how graph data structures enhance the accuracy and relevance of AI-generated search results, mitigating the risk of “hallucinations” and improving the overall customer journey.
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024Neo4j
Neha Bajwa, Vice President of Product Marketing, Neo4j
Join us as we explore breakthrough innovations enabled by interconnected data and AI. Discover firsthand how organizations use relationships in data to uncover contextual insights and solve our most pressing challenges – from optimizing supply chains, detecting fraud, and improving customer experiences to accelerating drug discoveries.
Epistemic Interaction - tuning interfaces to provide information for AI supportAlan Dix
Paper presented at SYNERGY workshop at AVI 2024, Genoa, Italy. 3rd June 2024
https://alandix.com/academic/papers/synergy2024-epistemic/
As machine learning integrates deeper into human-computer interactions, the concept of epistemic interaction emerges, aiming to refine these interactions to enhance system adaptability. This approach encourages minor, intentional adjustments in user behaviour to enrich the data available for system learning. This paper introduces epistemic interaction within the context of human-system communication, illustrating how deliberate interaction design can improve system understanding and adaptation. Through concrete examples, we demonstrate the potential of epistemic interaction to significantly advance human-computer interaction by leveraging intuitive human communication strategies to inform system design and functionality, offering a novel pathway for enriching user-system engagements.
In his public lecture, Christian Timmerer provides insights into the fascinating history of video streaming, starting from its humble beginnings before YouTube to the groundbreaking technologies that now dominate platforms like Netflix and ORF ON. Timmerer also presents provocative contributions of his own that have significantly influenced the industry. He concludes by looking at future challenges and invites the audience to join in a discussion.
Pushing the limits of ePRTC: 100ns holdover for 100 daysAdtran
At WSTS 2024, Alon Stern explored the topic of parametric holdover and explained how recent research findings can be implemented in real-world PNT networks to achieve 100 nanoseconds of accuracy for up to 100 days.
Securing your Kubernetes cluster_ a step-by-step guide to success !KatiaHIMEUR1
Today, after several years of existence, an extremely active community and an ultra-dynamic ecosystem, Kubernetes has established itself as the de facto standard in container orchestration. Thanks to a wide range of managed services, it has never been so easy to set up a ready-to-use Kubernetes cluster.
However, this ease of use means that the subject of security in Kubernetes is often left for later, or even neglected. This exposes companies to significant risks.
In this talk, I'll show you step-by-step how to secure your Kubernetes cluster for greater peace of mind and reliability.
The Art of the Pitch: WordPress Relationships and SalesLaura Byrne
Clients don’t know what they don’t know. What web solutions are right for them? How does WordPress come into the picture? How do you make sure you understand scope and timeline? What do you do if sometime changes?
All these questions and more will be explored as we talk about matching clients’ needs with what your agency offers without pulling teeth or pulling your hair out. Practical tips, and strategies for successful relationship building that leads to closing the deal.
UiPath Test Automation using UiPath Test Suite series, part 4DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 4. In this session, we will cover Test Manager overview along with SAP heatmap.
The UiPath Test Manager overview with SAP heatmap webinar offers a concise yet comprehensive exploration of the role of a Test Manager within SAP environments, coupled with the utilization of heatmaps for effective testing strategies.
Participants will gain insights into the responsibilities, challenges, and best practices associated with test management in SAP projects. Additionally, the webinar delves into the significance of heatmaps as a visual aid for identifying testing priorities, areas of risk, and resource allocation within SAP landscapes. Through this session, attendees can expect to enhance their understanding of test management principles while learning practical approaches to optimize testing processes in SAP environments using heatmap visualization techniques
What will you get from this session?
1. Insights into SAP testing best practices
2. Heatmap utilization for testing
3. Optimization of testing processes
4. Demo
Topics covered:
Execution from the test manager
Orchestrator execution result
Defect reporting
SAP heatmap example with demo
Speaker:
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...James Anderson
Effective Application Security in Software Delivery lifecycle using Deployment Firewall and DBOM
The modern software delivery process (or the CI/CD process) includes many tools, distributed teams, open-source code, and cloud platforms. Constant focus on speed to release software to market, along with the traditional slow and manual security checks has caused gaps in continuous security as an important piece in the software supply chain. Today organizations feel more susceptible to external and internal cyber threats due to the vast attack surface in their applications supply chain and the lack of end-to-end governance and risk management.
The software team must secure its software delivery process to avoid vulnerability and security breaches. This needs to be achieved with existing tool chains and without extensive rework of the delivery processes. This talk will present strategies and techniques for providing visibility into the true risk of the existing vulnerabilities, preventing the introduction of security issues in the software, resolving vulnerabilities in production environments quickly, and capturing the deployment bill of materials (DBOM).
Speakers:
Bob Boule
Robert Boule is a technology enthusiast with PASSION for technology and making things work along with a knack for helping others understand how things work. He comes with around 20 years of solution engineering experience in application security, software continuous delivery, and SaaS platforms. He is known for his dynamic presentations in CI/CD and application security integrated in software delivery lifecycle.
Gopinath Rebala
Gopinath Rebala is the CTO of OpsMx, where he has overall responsibility for the machine learning and data processing architectures for Secure Software Delivery. Gopi also has a strong connection with our customers, leading design and architecture for strategic implementations. Gopi is a frequent speaker and well-known leader in continuous delivery and integrating security into software delivery.
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf91mobiles
91mobiles recently conducted a Smart TV Buyer Insights Survey in which we asked over 3,000 respondents about the TV they own, aspects they look at on a new TV, and their TV buying preferences.
20240605 QFM017 Machine Intelligence Reading List May 2024
SemWeb 4 Gov – opportunities and challenges
1. SemWeb 4 Gov – opportunities and challenges
Dr. Andrew Woolf
Acting Assistant Director (Climate & Water IT Services), Bureau of Meteorology
2. Acknowledgements…
• Josh Bobruk, Robert Boczek, Karl Braganza, Sarah
Callaghan, Shirley Crompton, Armin Haller, Colin Harpham,
Mike Jackson, Bryan Lawrence, Laurent Lefort, Brian
Matthews, Tim Osborn, Clinton Rakich, Will Rogers, Arif
Shaon, Jeremy Tandy, Kerry Taylor, Blair Trewin
3. Outline
• Drivers
• Use case exemplars
• SemWeb as enabler
• A few projects
• Next steps
• Key lessons
4. Outline
• Drivers
• Use case exemplars
• SemWeb as enabler
• A few projects
• Next steps
• Key lessons
10. United States
My Administration will take appropriate action … to
disclose information rapidly in forms that the public
can readily find and use. Executive departments and
agencies should harness new technologies to put
information about their operations and decisions online
and readily available to the public.
Memorandum on Transparency and Open Government,
Federal Register Vol. 74, No. 15
11. United Kingdom
Our plans include:
•Radically opening up data and public information,
releasing thousands of public data sets – including
Ordnance Survey mapping data, real-time railway
timetables, data underpinning NHS choices, and more
detailed departmental spending data – and making
them free for re-use.
Putting the Frontline First: smarter government
Cabinet Office, 7 December 2009
12. Europe
…open access is reaching the tipping point, with around
50% of scientific papers published in 2011 now available for
free.
…open access will be mandatory for all scientific publications
produced with funding from Horizon 2020, the EU's Research &
Innovation funding programme for 2014-2020.
…Under Horizon 2020 … the Commission will also start a
pilot on open access to data collected during publicly funded
research…
European Commission - IP/13/786, Brussels, 21 August 2013
13. United Nations
We also call for a data revolution for sustainable
development, with a new international initiative to
improve the quality of statistics and information
available to citizens.
A NEW GLOBAL PARTNERSHIP: ERADICATE POVERTY AND
TRANSFORM ECONOMIES THROUGH SUSTAINABLE DEVELOPMENT
The Report of the High-Level Panel of Eminent Persons on the Post2015 Development Agenda
United Nations, 30 May 2013
14. Australia
Principle 1: Open access to information - a default position
Principle 2: Engaging the community
Principle 3: Effective information governance
Principle 4: Robust information asset management
Principle 5: Discoverable and useable information
Principle 6: Clear reuse rights
Principle 7: Appropriate charging for access
Principle 8: Transparent enquiry and complaints processes
Principles on open public sector information
OAIC, May 2011
15. Australia
We will [establish] policies to:
•accelerate Government 2.0 efforts to engage online,
make agencies transparent and provide expanded
access to useful public sector data;
…The next wave of opportunities to improve the quality
and effectiveness of government services are likely to be
driven by access to (appropriately anonymized) public
sector data sets and ‘big data’.
The Hon Andrew Robb AO MP, The Hon Malcolm Turnbull MP
Liberal Party of Australia, 2 Sep 2013
16. Government Open Data
“We commit to pro-actively provide
high-value information, including
raw data, in a timely manner, in
formats that the public can easily
locate, understand and use, and in
formats that facilitate reuse.”
17. Outline
• Drivers
• Use case exemplars
• SemWeb as enabler
• A few projects
• Next steps
• Key lessons
19. Independent reviews
“We … conclude that there is independent verification… of the results and
conclusions of the Climate Research Unit at the University of East Anglia. ...
We … consider that climate scientists should take steps to make
available all the data used to generate their published work, including
raw data”
House of Commons Science and Technology Committee (31 Mar 2010)
CRU should make available sufficient information, concurrent with
any publications, to enable others to replicate their results. …
It would benefit the global climate research community if a
standardised way of defining station metadata and station data
could be agreed, preferably through a standards body, or perhaps the
WMO.
The Independent Climate Change E-mails Review, Sir Muir Russell
(7 Jul 2010)
25. National Plan for Environmental
Information
“an environmental information system
to support the delivery and discovery
of priority environmental information”
http://www.bom.gov.au/environment
/
26. Outline
• Drivers
• Use case exemplars
• SemWeb as enabler
• A few projects
• Next steps
• Key lessons
31. Linked Data
Linked Data Principles
1. Use URL (Web
addresses) as identifiers
for objects
2. Publish them on the
Web
3. Use Semantic Web
standards to model the
data
4. Link objects in your
dataset to objects in other
datasets
‘5 stars’ maturity model
32. Outline
• Drivers
• Use case exemplars
• SemWeb as enabler
• A few projects
• Next steps
• Key lessons
33. Geospatial Transformation with
OGSA-DAI (GeoTOD) [2010]
•Project aims:
– Exploit a high-profile
outcome from the >£200M
UK government-funded eScience program
– Implement the UK Cabinet
Office guidelines on ‘URI
Sets for Location’
– Enable dynamic
transformation of existing
large spatial datasets
http://data.gov.uk/sites/default/files/
Designing_URI_Sets_for_Location-V1.0_10.pdf
34. Designing Location URIs
‘Spatial T
hing’ http:/ea.gov.uk/
/
id
/ Y/ atercourse/ hames
H W
T
‘303 See other:’
‘web document’
http://ea.gov.uk/doc/HY/Watercourse/Thames
rdfs:seeAlso http://ea.gov.uk/so/ Y/ atercourse/
H W
ea-UK
rivers/
e7w1
rdfs:seeAlso http://geotod/so/ Y/ atercourse/
H W
stfc-strategi/
4a97
ov:similarTo http://ceh.nerc/so/ Y/ atercourse/
H W
nerc-hydrodb/
thames-001
‘Spatial Object’
http://geotod/so/ Y/ atercourse/
H W
stfc-strategi/
4a97.rdf
http://geotod/so/ Y/ atercourse/
H W
stfc-strategi/
4a97.html
http://geotod/so/ Y/ atercourse/
H W
stfc-strategi/
4a97.kml
http://geotod/so/ Y/ atercourse/
H W
stfc-strategi/
4a97.gml
‘ content negotiation’
36. OGSA-DAI SO store
orkflow
L
inked SO W
Store
OGSA-DAI
service
GM query
L
GeoServer
W
S
W resources
eb
SP
ARQL query
+
output formatting
SQL
RDB
Generate
RDB
-specific SQL
using mapping file
GeotodD2RQ
Relational resources
D2RQ
M
apping
F
ile
37. GeoTOD
• Challenges:
– Pragmatic interpretation of URI guidelines
– Mapping UML geospatial conceptual models to RDF
• GeoTOD demo:
– http://tiger.dl.ac.uk:8080/geotodls
• UML-to-RDF schema generator:
– http://tiger.dl.ac.uk:8080/rdfsgenerator
38. Advanced Climate Research Infrastructure
for Data (ACRID) [2010]
https://www.uea.ac.uk/mac/
comm/media/press/2010/july/
climatedataproject
•Project aims:
–
–
–
–
Address Climategate concerns re publishing climate data
Enable seamless link from research publication to data
Include dataset provenance information
Verify linked-data principles for this problem
40. DOI and OAI-ORE
•CrossRef DOIs (e.g.
http://dx.doi.org/10.1126/science.11
57784) are linked-data-enabled (as
of April 2011) with conneg:
– RDF/XML, TTL, ATOM
•Open Archives Initiative Object
Reuse and Exchange (OAI-ORE)
– description and exchange of
aggregations of Web
resources
– http://www.openarchives.org
/ore/
41. ACRID – dataset workflows /
provenance
•Various choices
– Open Provenance Model
– Provenance Markup Language
– ISO 19156 (Observations and
Measurements)
– Climate Science Modelling
Language (CSML)
•Adopted CSML
– Observation measures a
Property of a Feature-of-interest
using a Procedure and
generating a Result
42. Australian Climate Observations
Reference Network – Surface Air
Temperature (ACORN-SAT) [2012]
•High-quality daily surface temperature
(min/max) timeseries’
•112 stations
•Over 100 years of records
•Homogenised for
– Site relocations
– Instrument replacement
– Local changes
43. ACORN-SAT
• Project aims:
– Establish the first Australian Government linked data
under data.gov.au
– Trial linked-data for large time-series observation dataset
– Gain experience in applying linked data to information
sharing in support of the National Plan for Environmental
Information
52. Outline
• Drivers
• Use case exemplars
• SemWeb as enabler
• A few projects
• Next steps
• Key lessons
53. Australian Government Linked Data
Working Group (AGLDWG)
•Terms of reference
– Develop technical guidelines
and best practice on the use of
‘linked-data’ by AG agencies
– Inform the development of
data.gov.au as a platform for
publishing Commonwealth PSI
– Promote the benefits and
encourage adoption of ‘linkeddata’ for publishing
Commonwealth PSI
– Where appropriate, undertake
specific activities and coordinate
projects in pursuit of these
objectives
AGLDWG meeting with Sir Tim Berners-Lee,
31 Jan 2013 (Canberra)
54. LD-enabling CKAN
• CKAN used by a number of Government open data
platforms (incl. UK, AU, US)
• Could it be made more LD-friendly?
– Add registry functionality (e.g. for simple term
dictionaries)
– Support namespace-forwarding (e.g. for proxying manyto-many agency-to-subdomain mappings)
• UK Gov LD WG has a prototype already
– https://github.com/der/ukl-registry-poc/wiki
– http://www.slideshare.net/der42/ukgovld-registrywebinarv3
55. Outline
• Drivers
• Use case exemplars
• SemWeb as enabler
• A few projects
• Next steps
• Key lessons
56. 1. Believe in SemWeb 4 Gov!
• Domain/agency-neutral data publishing mechanism
• Encourages information points-of-truth
• Assists ‘naturally’ with cross-agency data integration
• BUT:
– Need to demonstrate value (pilots, prototypes, etc.)
– Agencies will have security concerns
– Deployment behind government firewalls is difficult
57. 2. Address agency needs
• Simple dictionary publishing would be a good start!
– http://test.wmocodes.info/
• Create robust guidance on simple things
– GeoRSS
– schema.org
– URI Rules
– Vocabulary management
– Effective use of CKAN
58. 3. Be pragmatic
• At this stage of the adoption curve, more important to get
something up than establishing complete semantics
• Agency people mostly won’t like sitting through multi-day
ontology workshops!
59. 4. Establish enabling infrastructure
• May be difficult to deploy own triplestores
• Encourage cloud solutions (public, government, research)
e.g. NCI in Australia
• Build SemWeb into collaborations around data.gov(.xxx),
e.g. AGLDWG, Cross Jurisdictional Open Government Data
Working Group
60. 5. ‘Geo’ as killer app for LD
• Much government data is spatially-enabled
• Huge value proposition in technology enabling linkage by
location of: health, education, statistics, transport,
environmental data, etc
• Note geospatial semantics standards work
– ISO 19150 (Geographic information – Ontology)
– OGC GeoSPARQL
61. 6. Skill-up the ICT contractor pool
• Government uses contractors
• Need to build up a SemWeb ‘cottage industry’
– critical mass issue
• Research partners are essential, but also need industry
partners
62. 7. Engage with Gov
• Chat to your local friendly Gov IT geeks
• Be aware of stuff already happening
– e.g. in environmental information sharing: WaterML,
GeoSciML, GWML, INSPIRE
63. Thank you…
Acknowledgements (again) to collaborators: Josh Bobruk,
Robert Boczek, Karl Braganza, Sarah Callaghan, Shirley
Crompton, Armin Haller, Colin Harpham, Mike Jackson,
Bryan Lawrence, Laurent Lefort, Brian Matthews, Tim
Osborn, Clinton Rakich, Will Rogers, Arif Shaon, Jeremy
Tandy, Kerry Taylor, Blair Trewin
Dr Andrew Woolf
a.woolf@bom.gov.au