by Albert Meroño, presented at the 3rd PRELIDA Consolidation and Dissemination Workshop, Riva, Italy, October, 17, 2014. More information about the workshop at: prelida.eu
Approaches to representing and delivering geospatial data in the semantic Web...Paul Box
To address many real world challenges, multi-disciplinary data is required. In the earth sciences, EO together with feature based geospatial representations of spatial objects, such as administrative reporting geographies, topographic features and in-situ sensor assets, typically need to be accessed and integrated to support analysis. Currently, coordinates are the primary linga franca for operating across both types of data. However, human users refer to spatial objects using spatial identifiers such as place names or well know codes such as the postcodes, Local Government Area (LGA) codes or hydrological catchment codes. These identifiers provide a critical data integration mechanism as they are used to reference and access observations and measurement data about spatial objects, such as time series water levels for a catchment or demographic data for a Local Government Area (LGA) held in numerous systems.
With the emergence of the semantic Web, current geometry and coordinate oriented approaches to geospatial data need to change. More non-GIS users need to access reliable identifiers for places (spatial identifiers), rather than their geometries. Consequently, there is a need to develop approaches that enable us to manage and share the identity of spatial object e.g. Lake Arthur and link spatial identities to available geometric representations for use when required.
Emerging approaches to representing and using geospatial data in the semantic Web have some implications for earth observation data. These include the potential use of spatial identifier sets as vocabularies for the spatial dimension of EO data cubes, enabling users to reliably query EO data using these identifiers. If identifiers sets are applied across data cubes, consistent identifier based queries are possible across distributed information sources.
This presentation will highlight some emerging approaches to delivering geospatial data into the semantic web, together with implications for improved integration of EO and feature based representations of spatial objects.
Open Data of the past
Open History Map (https://www.openhistorymap.org) is a non-profit based in Bologna, Italy, aiming at the creation of an open map of the past, reconciling and interconnecting data from Digital Humanities, Public History and Digital Archaeology sources. The presentation will cover the philosophy, the technology and the libraries used in this endeavor, explaining how users can reuse, and integrate the data from the platform into their applications or visualizations.
A Talk on the Graph Database with tutorials
Introduction to the Graph databases and Cypher Query Language
Comparison of the SQL and the Cypher implementations
PPT on competition conducted by Avito on Demand Prediction of an AD on Kaggle. The model is trained on Tensorflow (DNNRegrssor) and SKLearn (Linear Regreesion). Project made by Codrega under Eckovation.
Presentation given at the CBS (Central Bureau of Statistics) by CEDAR members on 06-11-2014 for the Studiemiddag "Digitalisering historische CBS-collectie" (digitisation of the CBS historical collection). All things on converting Excel spreadsheets to RDF Data Cube, harmonisation, and using Linked Data for standardizing statistical data on the Web.
LSD Dimensions: Use and Reuse of Linked Statistical Data as RDF Data CubeAlbert Meroño-Peñuela
Governments, public agencies and institutions, and companies produce a great amount of statistical data every year. Much of these data are released as Open Data and published on the Web, although usually as documents, not as Linked Data. In this talk I'll introduce RDF Data Cube (QB), a W3C standard for publishing multidimensional data, such as statistics, on the Web in such a way that they can be linked to other datasets and concepts. However, QB is pretty open towards how users should model dimensions and codes (variables and values in QB jargon), which hampers reusability of existing ones. To this end, I'll show you LSD Dimensions, a web based application that monitors the usage of dimensions and codes over five hundred public SPARQL endpoints.
Albert Merono-Penuela: Understanding Change in Versioned Web-Knowledge Organi...COST Action TD1210
Albert Merono-Penuela (DANS, VU Amsterdam) "Understanding Change in Versioned Web-Knowledge Organisation Systems (KOS), Presentation at the KnoweScape workshop "Evolution and variation of classification systems" March 4-5, 2015 Amsterdam
ePOM - Intro to Ocean Data Science - Raster and Vector Data FormatsGiuseppe Masetti
E-learning Python for Ocean Mapping (ePOM) project.
Complementary slides to the Raster and Vector Data Formats module (part of the Introduction to Ocean Data Science training).
More details at https://www.hydroffice.org/epom
Dec'2013 webinar from the EUCLID project on managing large volumes of Linked Data
webinar recording at https://vimeo.com/84126769 and https://vimeo.com/84126770
more info on EUCLID: http://euclid-project.eu/
Approaches to representing and delivering geospatial data in the semantic Web...Paul Box
To address many real world challenges, multi-disciplinary data is required. In the earth sciences, EO together with feature based geospatial representations of spatial objects, such as administrative reporting geographies, topographic features and in-situ sensor assets, typically need to be accessed and integrated to support analysis. Currently, coordinates are the primary linga franca for operating across both types of data. However, human users refer to spatial objects using spatial identifiers such as place names or well know codes such as the postcodes, Local Government Area (LGA) codes or hydrological catchment codes. These identifiers provide a critical data integration mechanism as they are used to reference and access observations and measurement data about spatial objects, such as time series water levels for a catchment or demographic data for a Local Government Area (LGA) held in numerous systems.
With the emergence of the semantic Web, current geometry and coordinate oriented approaches to geospatial data need to change. More non-GIS users need to access reliable identifiers for places (spatial identifiers), rather than their geometries. Consequently, there is a need to develop approaches that enable us to manage and share the identity of spatial object e.g. Lake Arthur and link spatial identities to available geometric representations for use when required.
Emerging approaches to representing and using geospatial data in the semantic Web have some implications for earth observation data. These include the potential use of spatial identifier sets as vocabularies for the spatial dimension of EO data cubes, enabling users to reliably query EO data using these identifiers. If identifiers sets are applied across data cubes, consistent identifier based queries are possible across distributed information sources.
This presentation will highlight some emerging approaches to delivering geospatial data into the semantic web, together with implications for improved integration of EO and feature based representations of spatial objects.
Open Data of the past
Open History Map (https://www.openhistorymap.org) is a non-profit based in Bologna, Italy, aiming at the creation of an open map of the past, reconciling and interconnecting data from Digital Humanities, Public History and Digital Archaeology sources. The presentation will cover the philosophy, the technology and the libraries used in this endeavor, explaining how users can reuse, and integrate the data from the platform into their applications or visualizations.
A Talk on the Graph Database with tutorials
Introduction to the Graph databases and Cypher Query Language
Comparison of the SQL and the Cypher implementations
PPT on competition conducted by Avito on Demand Prediction of an AD on Kaggle. The model is trained on Tensorflow (DNNRegrssor) and SKLearn (Linear Regreesion). Project made by Codrega under Eckovation.
Presentation given at the CBS (Central Bureau of Statistics) by CEDAR members on 06-11-2014 for the Studiemiddag "Digitalisering historische CBS-collectie" (digitisation of the CBS historical collection). All things on converting Excel spreadsheets to RDF Data Cube, harmonisation, and using Linked Data for standardizing statistical data on the Web.
LSD Dimensions: Use and Reuse of Linked Statistical Data as RDF Data CubeAlbert Meroño-Peñuela
Governments, public agencies and institutions, and companies produce a great amount of statistical data every year. Much of these data are released as Open Data and published on the Web, although usually as documents, not as Linked Data. In this talk I'll introduce RDF Data Cube (QB), a W3C standard for publishing multidimensional data, such as statistics, on the Web in such a way that they can be linked to other datasets and concepts. However, QB is pretty open towards how users should model dimensions and codes (variables and values in QB jargon), which hampers reusability of existing ones. To this end, I'll show you LSD Dimensions, a web based application that monitors the usage of dimensions and codes over five hundred public SPARQL endpoints.
Albert Merono-Penuela: Understanding Change in Versioned Web-Knowledge Organi...COST Action TD1210
Albert Merono-Penuela (DANS, VU Amsterdam) "Understanding Change in Versioned Web-Knowledge Organisation Systems (KOS), Presentation at the KnoweScape workshop "Evolution and variation of classification systems" March 4-5, 2015 Amsterdam
ePOM - Intro to Ocean Data Science - Raster and Vector Data FormatsGiuseppe Masetti
E-learning Python for Ocean Mapping (ePOM) project.
Complementary slides to the Raster and Vector Data Formats module (part of the Introduction to Ocean Data Science training).
More details at https://www.hydroffice.org/epom
Dec'2013 webinar from the EUCLID project on managing large volumes of Linked Data
webinar recording at https://vimeo.com/84126769 and https://vimeo.com/84126770
more info on EUCLID: http://euclid-project.eu/
The Statistics of Stairway to Heaven: A Semantic Story About Digital HumanitiesAlbert Meroño-Peñuela
Key fields in the Humanities such as History and Musicology are central in the major transformation carried by the Digital Humanities (DH). A fundamental question in DH is how humanities datasets can be represented digitally, in such a way machines can process them, understand their meaning, facilitate their inquiry, and exchange them on the Web. In this talk, I will motivate that humanities scholars and computer scientists interact further, by surveying our current work using Semantic Web technology to represent DH objects in Quantitative History and Symbolic Music. Importantly, I will also argue that the technical knowledge gap between the Semantic Web community and many of its application domains, DH among them, is currently too wide, and thus these domains face issues on accessing and consuming semantically-enabled humanities data. To address these, I will demo our current work on automatic Linked Data API construction (heavily inspired by work done at KMi), historical statistics preprocessing and publishing, and music linkage on the Web.
Maintaining scholarly standards in the digital age: Publishing historical gaz...Humphrey Southall
This presentation: (1( Discusses why providing detailed attributions of individual contributions is essential to large scale sharing of historical research data; (2) Provides a short introduction to Open Linked Data; (3) Introduces the PastPlace Gazetteer API (Applications Programming Interface), explaining components of the RDF it generates using the example of Oxford, UK; (4) Notes that most open data projects use the Creative Commons -- Must Ackowledge license (CC-BY) while not actually acknowledging contributors within their RDF, then shows how we do it; (5) Introduces the separate PastPlace Datafeed API, which implements the W3C Datacube Vocabulary.
Citing and understanding spatial references for eResearch: Spatial Identifie...Paul Box
Geography represents a critical unifying dimension enabling disparate information to be linked to geospatial representations of spatial objects. Well-structured, semantically explicit and reliable geospatial information is required in the Web of Data by tools and applications beyond those used by traditional geospatial practitioners. However, providing reliable identifiers for spatial objects is problematic for a number of reasons. Firstly, spatial objects are typically referenced using codes or names which are ambiguous. Use of names is particularly problematic since a place may have many names and the same name may refer to many places. Codes are reliable only in the context of the system in which they are defined and as there is no standardized way of declaring this context, these codes cannot be reliable used to reference spatial objects across systems. Secondly, data systems may have multiple representations of the same spatial object each with a different geometric representation and identifying code. Finally spatial objects change over time, requiring new versions of data with changes to name, code, geometry and other attributes that characterize the spatial object.
Despite this complexity, there is a need to be able reliably identify, cite and obtain information about spatial objects. It may be necessary to cite either a concept or a specific versions of a representation. Such representations may record different geometries (e.g. point and polygon) at different scales available in different formats, each of which may be valid for a particular use case.
The Spatial Identifier Reference Framework (SIRF) described in this presentation, addresses the spatial identifier challenges articulated above.
SIRF allows spatial objects to be discovered and interrogated using web addresses (URIs) and can show where identifiers are in use in the Web of Data. It can be used as underlying infrastructure to join related information using common locations, without the uncertainties associated with spatial matching.
Slide show for the webinar on "Spatial Data Science with R" organized for the GeoDevelopers.org community. The video of the webinar and all the related materials including source code and sample data can be downloaded from this link: http://amsantac.co/blog/en/2016/08/07/spatial-data-science-r.html
In this webinar I talked about Data Science in the context of its application to spatial data and explained how we can use the R language for the analysis of geographic information within the different stages of a data science workflow, from the import and processing of spatial data to visualization and publication of results.
Similar to CEDAR & PRELIDA Preservation of Linked Socio-Historical Data (20)
Preserving linked data: sustainability and organizational infrastructurePRELIDA Project
by Mariella Guercio (Sapienza Università di Roma), presented at the 3rd PRELIDA Consolidation and Dissemination Workshop, Riva, Italy, October, 17, 2014. More information about the workshop at: prelida.eu
Organizational and Economic Issues in Linked Data PreservationPRELIDA Project
by Jose Maria Garcia (UIBK/STI Innsbruck), presented at the 3rd PRELIDA Consolidation and Dissemination Workshop, Riva, Italy, October, 17, 2014. More information about the workshop at: prelida.eu
CEDAR: From Fragment to Fabric - Dutch Census Data in a Web of Global Cultura...PRELIDA Project
by Ashkan Ashkpour, Albert Meroño-Peñuela, Christophe Gueret (http://cedar-project.nl/), presented at the 3rd PRELIDA Consolidation and Dissemination Workshop, Riva, Italy, October, 17, 2014. More information about the workshop at: prelida.eu
by Sławek Staworko, (joint work with Peter Buneman), University of Edinburgh, presented at the 3rd PRELIDA Consolidation and Dissemination Workshop, Riva, Italy, October, 17, 2014. More information about the workshop at: prelida.eu
by Mark Williams (Department of Film and Media Studies
Dartmouth College), presented at the 3rd PRELIDA Consolidation and Dissemination Workshop, Riva, Italy, October, 17, 2014. More information about the workshop at: prelida.eu
HIBERLINK: Reference Rot and Linked Data: Threat and RemedyPRELIDA Project
Peter Burnhill (EDINA, University of Edinburgh), presented at the 3rd PRELIDA Consolidation and Dissemination Workshop, Riva, Italy, October, 17, 2014. More information about the workshop at: prelida.eu
DIACHRON Preservation: Evolution Management for PreservationPRELIDA Project
by Giorgos Flouris (FORTH), presented at the 3rd PRELIDA Consolidation and Dissemination Workshop, Riva, Italy, October, 17, 2014. More information about the workshop at: prelida.eu
by Yannis Stavrakas (“Athena” Research Center
), presented at the 3rd PRELIDA Consolidation and Dissemination Workshop, Riva, Italy, October, 17, 2014. More information about the workshop at: prelida.eu
by Sotiris Batsakis & Grigoris Antoniou, presented at the 3rd PRELIDA Consolidation and Dissemination Workshop, Riva, Italy, October, 17, 2014. More information about the workshop at: prelida.eu
D.3.1: State of the Art - Linked Data and Digital PreservationPRELIDA Project
by D. Giaretta (APARSEN), presented at the 3rd PRELIDA Consolidation and Dissemination Workshop, Riva, Italy, October, 17, 2014. More information about the workshop at: prelida.eu
Introduction to PRELIDA Consolidation and Dissemination WorkshopPRELIDA Project
by Carlo Meghini (ISTI CNR, Pisa), presented at the 3rd PRELIDA Consolidation and Dissemination Workshop, Riva, Italy, October, 17, 2014. More information about the workshop at: prelida.eu
D3.1 State of the art assessment on Linked Data and Digital PreservationPRELIDA Project
The presentation was given by René van Horik from Data Archiving & Networked Services, The Netherlands, at the PRELIDA Midterm Workshop in Catania, April 2014.
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...DanBrown980551
Do you want to learn how to model and simulate an electrical network from scratch in under an hour?
Then welcome to this PowSyBl workshop, hosted by Rte, the French Transmission System Operator (TSO)!
During the webinar, you will discover the PowSyBl ecosystem as well as handle and study an electrical network through an interactive Python notebook.
PowSyBl is an open source project hosted by LF Energy, which offers a comprehensive set of features for electrical grid modelling and simulation. Among other advanced features, PowSyBl provides:
- A fully editable and extendable library for grid component modelling;
- Visualization tools to display your network;
- Grid simulation tools, such as power flows, security analyses (with or without remedial actions) and sensitivity analyses;
The framework is mostly written in Java, with a Python binding so that Python developers can access PowSyBl functionalities as well.
What you will learn during the webinar:
- For beginners: discover PowSyBl's functionalities through a quick general presentation and the notebook, without needing any expert coding skills;
- For advanced developers: master the skills to efficiently apply PowSyBl functionalities to your real-world scenarios.
The Art of the Pitch: WordPress Relationships and SalesLaura Byrne
Clients don’t know what they don’t know. What web solutions are right for them? How does WordPress come into the picture? How do you make sure you understand scope and timeline? What do you do if sometime changes?
All these questions and more will be explored as we talk about matching clients’ needs with what your agency offers without pulling teeth or pulling your hair out. Practical tips, and strategies for successful relationship building that leads to closing the deal.
UiPath Test Automation using UiPath Test Suite series, part 3DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 3. In this session, we will cover desktop automation along with UI automation.
Topics covered:
UI automation Introduction,
UI automation Sample
Desktop automation flow
Pradeep Chinnala, Senior Consultant Automation Developer @WonderBotz and UiPath MVP
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
DevOps and Testing slides at DASA ConnectKari Kakkonen
My and Rik Marselis slides at 30.5.2024 DASA Connect conference. We discuss about what is testing, then what is agile testing and finally what is Testing in DevOps. Finally we had lovely workshop with the participants trying to find out different ways to think about quality and testing in different parts of the DevOps infinity loop.
"Impact of front-end architecture on development cost", Viktor TurskyiFwdays
I have heard many times that architecture is not important for the front-end. Also, many times I have seen how developers implement features on the front-end just following the standard rules for a framework and think that this is enough to successfully launch the project, and then the project fails. How to prevent this and what approach to choose? I have launched dozens of complex projects and during the talk we will analyze which approaches have worked for me and which have not.
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Ramesh Iyer
In today's fast-changing business world, Companies that adapt and embrace new ideas often need help to keep up with the competition. However, fostering a culture of innovation takes much work. It takes vision, leadership and willingness to take risks in the right proportion. Sachin Dev Duggal, co-founder of Builder.ai, has perfected the art of this balance, creating a company culture where creativity and growth are nurtured at each stage.
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf91mobiles
91mobiles recently conducted a Smart TV Buyer Insights Survey in which we asked over 3,000 respondents about the TV they own, aspects they look at on a new TV, and their TV buying preferences.
Key Trends Shaping the Future of Infrastructure.pdfCheryl Hung
Keynote at DIGIT West Expo, Glasgow on 29 May 2024.
Cheryl Hung, ochery.com
Sr Director, Infrastructure Ecosystem, Arm.
The key trends across hardware, cloud and open-source; exploring how these areas are likely to mature and develop over the short and long-term, and then considering how organisations can position themselves to adapt and thrive.
Let's dive deeper into the world of ODC! Ricardo Alves (OutSystems) will join us to tell all about the new Data Fabric. After that, Sezen de Bruijn (OutSystems) will get into the details on how to best design a sturdy architecture within ODC.
Connector Corner: Automate dynamic content and events by pushing a buttonDianaGray10
Here is something new! In our next Connector Corner webinar, we will demonstrate how you can use a single workflow to:
Create a campaign using Mailchimp with merge tags/fields
Send an interactive Slack channel message (using buttons)
Have the message received by managers and peers along with a test email for review
But there’s more:
In a second workflow supporting the same use case, you’ll see:
Your campaign sent to target colleagues for approval
If the “Approve” button is clicked, a Jira/Zendesk ticket is created for the marketing design team
But—if the “Reject” button is pushed, colleagues will be alerted via Slack message
Join us to learn more about this new, human-in-the-loop capability, brought to you by Integration Service connectors.
And...
Speakers:
Akshay Agnihotri, Product Manager
Charlie Greenberg, Host
Connector Corner: Automate dynamic content and events by pushing a button
CEDAR & PRELIDA Preservation of Linked Socio-Historical Data
1. CEDAR & PRELIDA
Preservation of Linked Socio-
Historical Data
Albert Meroño-Peñuela
@albertmeronyo
PRELIDA consolidation workshop @ ISWC, 17-10-2014
9. • Web publishable
• Machine processable
• Dynamic schema
• Easily link with other
datasets
10. Why with semantic technology?
• Web publishable, human & machine readable
• Finer granularity level (cell level)
• Statistical comparability by leveraging
semantic descriptions
• Provenance
• Harmonization through linkage to other
datasets (the 5th star)
11. RDF Data Cube
“There are many situations where it would be useful to
be able to publish multi-dimensional data, such as
statistics, on the web in such a way that they can be
linked to related data sets and concepts.”
12.
13.
14. RDF Data Cube vocabulary (QB)
• SDMX compatible
• Defines cubes as a set of observations that consist of
dimensions, measures and attributes
• Dimensions: time period, region, sex (qb:DimensionProperty)
• Measure: population life expectancy (qb:MeasureProperty)
• Attribute: unit of measure = years, metadata status =
measured (qb:AttributeProperty)
Observation: “the measured life expectancy of males in
Newport in the period 2004-2006 is 76.7 years”
19. Classification Systems and
Concept Schemes
• Some missing harmonized dimensions!
• Encode all variables and their values using concept
schemes
• Some already exist
– Which ones? How many of them?
– Where?
– By whom?
– Are they used at all? Can I reuse them?
• Some need to be created
– Manual and expert knowledge based
– Can we do it automatically? Or assist the process?
23. Existing LSD dimensions
• P1: Discoverability? How to discover
dimensions created by others?
• P2: Reusability? How often are dimensions
reused? Can we reuse dimensions created by
others?
• P3: Relevance? What’s the size of LSD?
29. Existing LSD dimensions
• P1: Discoverability? How to discover
dimensions created by others? LSD
Dimensions
• P2: Reusability? How often are dimensions
reused? Can we reuse dimensions created by
others? Logarithmic law / probably yes
• P3: Relevance? What’s the size of LSD? ~7.9%
of the LOD cloud
30. Creating new LSD Dimensions
• CEDAR needs concept schemes for
– Historical religious denominations (i.e. religions in
the NL in 18th-20th c.)
– Historical occupations (id.)
– Historical building types (id.)
41. Preserving CEDAR
• DANS-EASY as backend (http://easy.dans.knaw.nl/)
• Archived objects: Turtle snapshots
– 20Go uncompressed, 200Mo compressed (per
snapshot)
– Versioning (stats on current release)
• Users still need to
– SPARQL the data => bring up the endpoint on demand
– Run analytics on the data => outsource statistical
analysis
42. Thank you
Questions, suggestions, comments most
welcome
@albertmeronyo
http://www.cedar-project.nl
http://krr.cs.vu.nl/
http://easy.dans.knaw.nl/
http://lsd-dimensions.org/
43. Me in 6 tweets
http://www.albertmeronyo.org
• Background: Computer Science, Web hacker, AI & Law
• PhD candidate at the VU University Amsterdam, DANS,
and eHumanities group (KNAW)
• Topic: Semantic Web for the Humanities
• CEDAR project (2012-2015): harmonized historical
Dutch censuses in the Semantic Web
• Problem: statistical data publishing, concept drift and
dynamics of meaning
• Last paper: What is Linked Historical Data? (EKAW
2014)
Editor's Notes
Good afternoon everybody. I’m Albert Meroño. It’s a great pleasure to be here today, thanks to the organisers for the invitation…
Today I’m gonna talk a bit about preservation of linked socio-historical data.
And the work that we’ve been doing at the CEDAR project to publish socio-historical data on the SW. And we study the pros and cons of using semantic technologies to enhance the research methodologies of historians and social scientists.
The interesting thing about preservation and CEDAR is a double angle:
What we do is to re-publish PRESERVED data (from the 18th c.)
At the same time we think on how to PRESERVE that re-publication (preserve the Linked Data)
These things are in the archive
The things in the archive change.
Availability of new technology forces us opening the archive, taking the data out of it, doing something to it, store the new version.
2 problems: layout interpretation, and semantic alignment
We like 5 star datasets. Historians also like 5 star datasets. HOWEVER, they still want their non-standard formats for data diving. Data diving guides their research and suggests new research questions.
This is super cool. NOW, how do we connect with the archive to produce it?....
From the ARCHIVE to RDF Data Cube TURTLE
Work on progress on
Interesting – they explain change explicitly, linking together metadata from different periods of time and map shapes.
To what extent can we build these classifications automatically?
………………… BUT ALL DONE?
Archiving the serialization of such semantic-statistic relationships?