This presentation was provided by Tim McGeary of Duke University during the NISO virtual conference, Open Data Projects, held on Wednesday, June 13, 2018.
An update on the latest BioSharing work; including work with ELIXIR and NIH BD2K, also our survey to assess user needs (530 replies) and the work on the recommender tool
This presentation was provided by Chris Erdmann of Library Carpentries and by Judy Ruttenberg of ARL during the NISO virtual conference, Open Data Projects, held on Wednesday, June 13, 2018.
This presentation was provided by Scott Ziegler of Louisiana State University during the NISO Virtual Conference, Open Data Projects, held on Wednesday, June 13, 2018.
This presentation was provided by Tim McGeary of Duke University during the NISO virtual conference, Open Data Projects, held on Wednesday, June 13, 2018.
An update on the latest BioSharing work; including work with ELIXIR and NIH BD2K, also our survey to assess user needs (530 replies) and the work on the recommender tool
This presentation was provided by Chris Erdmann of Library Carpentries and by Judy Ruttenberg of ARL during the NISO virtual conference, Open Data Projects, held on Wednesday, June 13, 2018.
This presentation was provided by Scott Ziegler of Louisiana State University during the NISO Virtual Conference, Open Data Projects, held on Wednesday, June 13, 2018.
Should We Expect a Bang or a Whimper? Will Linked Data Revolutionize Scholar Authoring and Workflow Tools?
Jeff Baer, Senior Director of Product Management, Research Development Services, Proquest
Presentation given at the British Library Turing workshop on Software Citation, considering what lessons could be learned from the world of data citation
Lesson 8 in a set of 10 created by DataONE on Best Practices for Data Management. The full module can be downloaded from the DataONE.org website at: http://www.dataone.org/educaiton-modules. Released under a CC0 license, attribution and citation requested.
bioCADDIE Webinar: The NIDDK Information Network (dkNET) - A Community Resear...dkNET
The NIDDK Information Network (dkNET; http://dknet.org) is a open community resource for basic and clinical investigators in metabolic, digestive and kidney disease. dkNET’s portal facilitates access to a collection of diverse research resources (i.e. the multitude of data, software tools, materials, services, projects and organizations available to researchers in the public domain) that advance the mission of the National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK). This webinar was presented by dkNET principle investigator Dr. Jeffrey Grethe.
DataTags: Sharing Privacy Sensitive Data by Michael Bar-sinaidatascienceiqss
The DataTags framework makes it easy for data producers to deposit, data publishers to store and distribute, and data users to access and use datasets containing confidential information, in a standardized and responsible way. The talk will first introduce the concepts and tools behind DataTags, and then focus on the user-facing component of the system - Tagging Server (available today at datatags.org). We will conclude by describing how future versions of Dataverse will use DataTags to automatically handle sensitive datasets, that can only be shared under some restrictions.
The Nuclear Receptor Signaling Atlas (NURSA) is partnering with dkNET (NIDDK Information Network) to host a dataset challenge, and we invite you to join! Everyone is talking about Big Data. How can we ensure that the impact of individual scientists working on a myriad of small and focused studies that discover and probe new phenomena - is not lost in the Big Data world. In fact, there is more than one way to generate big data and we would like your help in creating and expanding “big data” for NIDDK! In this 30-minute webinar, dkNET team will give a presentation about the overview of challenge task, how to use dkNET to find research resources, and top tips!
Lesson 2 in a set of 10 created by DataONE on Best Practices fo Data Management. The full module can be downloaded from the DataONE.org website at: http://www.dataone.org/educaiton-modules. Released under a CC0 license, attribution and citation requested.
In this webinar, we gave a general introduction of the dkNET portal and showed how dkNET can be used to address a variety of use cases, including:
1) Find funding sources for your research of interest
2) Determine what study section have reviewed this type of research
3) Help with new NIH guidelines for rigor and reproducibility
morning session talk at the second Keystone Training School "Keyword search in Big Linked Data" held in Santiago de Compostela.
https://eventos.citius.usc.es/keystone.school/
Exploration, visualization and querying of linked open data sourcesLaura Po
afternoon hands-on session talk at the second Keystone Training School "Keyword search in Big Linked Data" held in Santiago de Compostela.
https://eventos.citius.usc.es/keystone.school/
Link resolver failures, erroneous URLs, EZproxy
configuration errors and inaccurate metadata in e-resource
records are commonplace problems reported by users in
pursuit of e-resource access. This presentation describes
the categorisation and analysis of data generated from the
troubleshooting process over the period of an academic
year. The process is designed to be pre-emptive, seeking to
anticipate e-resource problems that users may encounter,
and productive, providing insight to inform user instruction
and trigger mechanisms to create enhanced electronic
access for users.
Geraldine O Beirn, Queen’s University Belfast
Talk delivered at YOW! Developer Conferences in Melbourne, Brisbane and Sydney Australia on 1-9 December 2016.
Abstract: Governments collect a lot of data. Data on air quality, toxic chemicals, laws and regulations, public health, and the census are intended to be widely distributed. Some data is not for public consumption. This talk focuses on open government data — the information that is meant to be made available for benefit of policy makers, researchers, scientists, industry, community organisers, journalists and members of civil society.
We’ll cover the evolution of Linked Data, which is now being used by Google, Apple, IBM Watson, federal governments worldwide, non-profits including CSIRO and OpenPHACTS, and thousands of others worldwide.
Next we’ll delve into the evolution of the U.S. Environmental Protection Agency’s Open Data service that we implemented using Linked Data and an Open Source Data Platform. Highlights include how we connected to hundreds of billions of open data facts in the world’s largest, open chemical molecules database PubChem and DBpedia.
WHO SHOULD ATTEND
Data scientists, software engineers, data analysts, DBAs, technical leaders and anyone interested in utilising linked data and open government data.
The webinar will be based on LODE-BD Recommendations - Linked Open Data (LOD)-enabled bibliographical data- which aims at providing bibliographic data providers of open repositories with a set of recommendations that will support the selection of appropriate encoding strategies for producing meaningful Linked Open Data (LOD)-enabled bibliographical data (LODE-BD).
In search of lost knowledge: joining the dots with Linked Datajonblower
These slides are from my seminar to the University of Reading Department of Meteorology, November 2013. They contain a (hopefully not very technical) introduction to the concepts of Linked Data and how we are applying them in the CHARMe project (http://www.charme.org.uk). In CHARMe we are using Open Annotation to connect users of climate data with community-generated "commentary information" that helps them to understand a dataset's strengths and weaknesses.
The slide notes contain some helpful context, so you might like to download the PPT file!
The slides are licensed as "Creative Commons Attribution 3.0", meaning that you can do what you like with these slides provided that you credit the University of Reading for their creation. See http://creativecommons.org/licenses/by/3.0/.
Should We Expect a Bang or a Whimper? Will Linked Data Revolutionize Scholar Authoring and Workflow Tools?
Jeff Baer, Senior Director of Product Management, Research Development Services, Proquest
Presentation given at the British Library Turing workshop on Software Citation, considering what lessons could be learned from the world of data citation
Lesson 8 in a set of 10 created by DataONE on Best Practices for Data Management. The full module can be downloaded from the DataONE.org website at: http://www.dataone.org/educaiton-modules. Released under a CC0 license, attribution and citation requested.
bioCADDIE Webinar: The NIDDK Information Network (dkNET) - A Community Resear...dkNET
The NIDDK Information Network (dkNET; http://dknet.org) is a open community resource for basic and clinical investigators in metabolic, digestive and kidney disease. dkNET’s portal facilitates access to a collection of diverse research resources (i.e. the multitude of data, software tools, materials, services, projects and organizations available to researchers in the public domain) that advance the mission of the National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK). This webinar was presented by dkNET principle investigator Dr. Jeffrey Grethe.
DataTags: Sharing Privacy Sensitive Data by Michael Bar-sinaidatascienceiqss
The DataTags framework makes it easy for data producers to deposit, data publishers to store and distribute, and data users to access and use datasets containing confidential information, in a standardized and responsible way. The talk will first introduce the concepts and tools behind DataTags, and then focus on the user-facing component of the system - Tagging Server (available today at datatags.org). We will conclude by describing how future versions of Dataverse will use DataTags to automatically handle sensitive datasets, that can only be shared under some restrictions.
The Nuclear Receptor Signaling Atlas (NURSA) is partnering with dkNET (NIDDK Information Network) to host a dataset challenge, and we invite you to join! Everyone is talking about Big Data. How can we ensure that the impact of individual scientists working on a myriad of small and focused studies that discover and probe new phenomena - is not lost in the Big Data world. In fact, there is more than one way to generate big data and we would like your help in creating and expanding “big data” for NIDDK! In this 30-minute webinar, dkNET team will give a presentation about the overview of challenge task, how to use dkNET to find research resources, and top tips!
Lesson 2 in a set of 10 created by DataONE on Best Practices fo Data Management. The full module can be downloaded from the DataONE.org website at: http://www.dataone.org/educaiton-modules. Released under a CC0 license, attribution and citation requested.
In this webinar, we gave a general introduction of the dkNET portal and showed how dkNET can be used to address a variety of use cases, including:
1) Find funding sources for your research of interest
2) Determine what study section have reviewed this type of research
3) Help with new NIH guidelines for rigor and reproducibility
morning session talk at the second Keystone Training School "Keyword search in Big Linked Data" held in Santiago de Compostela.
https://eventos.citius.usc.es/keystone.school/
Exploration, visualization and querying of linked open data sourcesLaura Po
afternoon hands-on session talk at the second Keystone Training School "Keyword search in Big Linked Data" held in Santiago de Compostela.
https://eventos.citius.usc.es/keystone.school/
Link resolver failures, erroneous URLs, EZproxy
configuration errors and inaccurate metadata in e-resource
records are commonplace problems reported by users in
pursuit of e-resource access. This presentation describes
the categorisation and analysis of data generated from the
troubleshooting process over the period of an academic
year. The process is designed to be pre-emptive, seeking to
anticipate e-resource problems that users may encounter,
and productive, providing insight to inform user instruction
and trigger mechanisms to create enhanced electronic
access for users.
Geraldine O Beirn, Queen’s University Belfast
Talk delivered at YOW! Developer Conferences in Melbourne, Brisbane and Sydney Australia on 1-9 December 2016.
Abstract: Governments collect a lot of data. Data on air quality, toxic chemicals, laws and regulations, public health, and the census are intended to be widely distributed. Some data is not for public consumption. This talk focuses on open government data — the information that is meant to be made available for benefit of policy makers, researchers, scientists, industry, community organisers, journalists and members of civil society.
We’ll cover the evolution of Linked Data, which is now being used by Google, Apple, IBM Watson, federal governments worldwide, non-profits including CSIRO and OpenPHACTS, and thousands of others worldwide.
Next we’ll delve into the evolution of the U.S. Environmental Protection Agency’s Open Data service that we implemented using Linked Data and an Open Source Data Platform. Highlights include how we connected to hundreds of billions of open data facts in the world’s largest, open chemical molecules database PubChem and DBpedia.
WHO SHOULD ATTEND
Data scientists, software engineers, data analysts, DBAs, technical leaders and anyone interested in utilising linked data and open government data.
The webinar will be based on LODE-BD Recommendations - Linked Open Data (LOD)-enabled bibliographical data- which aims at providing bibliographic data providers of open repositories with a set of recommendations that will support the selection of appropriate encoding strategies for producing meaningful Linked Open Data (LOD)-enabled bibliographical data (LODE-BD).
In search of lost knowledge: joining the dots with Linked Datajonblower
These slides are from my seminar to the University of Reading Department of Meteorology, November 2013. They contain a (hopefully not very technical) introduction to the concepts of Linked Data and how we are applying them in the CHARMe project (http://www.charme.org.uk). In CHARMe we are using Open Annotation to connect users of climate data with community-generated "commentary information" that helps them to understand a dataset's strengths and weaknesses.
The slide notes contain some helpful context, so you might like to download the PPT file!
The slides are licensed as "Creative Commons Attribution 3.0", meaning that you can do what you like with these slides provided that you credit the University of Reading for their creation. See http://creativecommons.org/licenses/by/3.0/.
Slides from our tutorial on Linked Data generation in the energy domain, presented at the Sustainable Places 2014 conference on October 2nd in Nice, France
Engaging Information Professionals in the Process of Authoritative Interlinki...Lucy McKenna
Through the use of Linked Data (LD), Libraries, Archives and Museums (LAMs) have the potential to expose their collections to a larger audience and to allow for more efficient user searches. Despite this, relatively few LAMs have invested in LD projects and the majority of these display limited interlinking across datasets and institutions. A survey was conducted to understand Information Professionals' (IPs') position with regards to LD, with a particular focus on the interlinking problem. The survey was completed by 185 librarians, archivists, metadata cataloguers and researchers. Results indicated that, when interlinking, IPs find the process of ontology and property selection to be particularly challenging, and LD tooling to be technologically complex and unsuitable for their needs.
Our research is focused on developing an authoritative interlinking framework for LAMs with a view to increasing IP engagement in the linking process. Our framework will provide a set of standards to facilitate IPs in the selection of link types, specifically when linking local resources to authorities. The framework will include guidelines for authority, ontology and property selection, and for adding provenance data. A user-interface will be developed which will direct IPs through the resource interlinking process as per our framework. Although there are existing tools in this domain, our framework differs in that it will be designed with the needs and expertise of IPs in mind. This will be achieved by involving IPs in the design and evaluation of the framework. A mock-up of the interface has already been tested and adjustments have been made based on results. We are currently working on developing a minimal viable product so as to allow for further testing of the framework. We will present our updated framework, interface, and proposed interlinking solutions.
Crediting informatics and data folks in life science teamsCarole Goble
Science Europe LEGS Committee: Career Pathways in Multidisciplinary Research: How to Assess the Contributions of Single Authors in Large Teams, 1-2 Dec 2015, Brussels
The People Behind Research Software crediting from the informatics, technical point of view
Keynote presentation delivered at ELAG 2013 in Gent, Belgium, on May 29 2013. Discusses Research Objects and the relationship to work my team has been involved in during the past couple of years: OAI-ORE, Open Annotation, Memento.
Establishing the Connection: Creating a Linked Data Version of the BNBnw13
Presentation for Talis Linked Data in Libraries event July 14 2011
Describes some of the choices made and lessons learned in migrating from traditional bibliographic metadata to linked open data.
Structured data and metadata evaluation methodology for organizations looking...Emily Kolvitz
The current state of findability on the web for many organizations is incipient. Search Engine Optimization (SEO) techniques change frequently and remain much a mystery to many companies. The one variable in the equation of web findability that remains a staple is good quality metadata under the hood of the website.
This research methodology will allow for :
An assessment of findability maturity on the web from an image-centric viewpoint
Help improve findability on the web by establishing a baseline for where your organization is at in terms of structured data content and visualize gaps or areas for improvement from a search engine neutral perspective
Slides from Friday 3rd August - Data in the Scholarly Communications Life Cycle Course which is part of the FORCE11 Scholarly Communications Institute.
Presenter - Natasha Simons
Data Citation Implementation Guidelines By Tim Clarkdatascienceiqss
This talk presents a set of detailed technical recommendations for operationalizing the Joint Declaration of Data Citation Principles (JDDCP) - the most widely agreed set of principle-based recommendations for direct scholarly data citation.
We will provide initial recommendations on identifier schemes, identifier resolution behavior, required metadata elements, and best practices for realizing programmatic machine actionability of cited data.
We hope that these recommendations along with the new NISO JATS document schema revision, developed in parallel, will help accelerate the wide adoption of data citation in scholarly literature. We believe their adoption will enable open data transparency for validation, reuse and extension of scientific results; and will significantly counteract the problem of false positives in the literature.
Linked Data Love: research representation, discovery, and assessment
#ALAAC15
The explosion of linked data platforms and data stores over the last five years has been profound – both in terms of quantity of data as well as its potential impact. Research information systems such as VIVO (www.vivoweb.org) play a significant role in enabling this work. VIVO is an open source, Semantic Web-based application that provides an integrated, searchable view of the scholarly activities of an organization. The uniform semantic structure of VIVO-ISF data enables a new class of tools to advance science. This presentation will provide a brief introduction and update to VIVO and present ways that this semantically-rich data can enable visualizations, reporting and assessment, next-generation collaboration and team building, and enhanced multi-site search. Libraries are uniquely positioned to facilitate the open representation of research information and its subsequent use to spur collaboration, discovery, and assessment. The talk will conclude with a description of ways librarians are engaged in this work – including visioning, metadata and ontology creation, policy creation, data curation and management, technical, and engagement activities.
Kristi Holmes, PhD
Director, Galter Health Sciences Library
Director of Evaluation, NUCATS
Associate Professor, Preventive Medicine-Health and Biomedical Informatics
Northwestern University Feinberg School of Medicine
Similar to Fox-Keynote-Now and Now of Data Publishing-nfdp13 (20)
Closing address by John Wood on the role of the Research Data Alliance given at the Now and Future of Data Publishing Symposium, 22 May 2013, Oxford, UK
Presentation by Eefke Smit asking whether publishers should scrap supplementary materials given as a 'provocation' in the final panel session at the Now and Future of Data Publishing Symposium, 22 May 2013, Oxford, UK
Michener-institutional and subject-specific data repositories-nfdp13DataDryad
Presentation by Bill Michener asking whether Institutional and Subject-Specific Data Repositories can Co-Exist given as a 'provocation' in the final panel session at the Now and Future of Data Publishing Symposium, 22 May 2013, Oxford, UK
Presentation by Brian Hole on the role of data journals in incentivising data publication and open scholarship given as a 'provocation' in the final panel session at the Now and Future of Data Publishing Symposium, 22 May 2013, Oxford, UK
Presentation by David Shotton on Force11 and the Amsterdam Manifesto on data citation and then introducing the final panel session at the Now and Future of Data Publishing Symposium, 22 May 2013, Oxford, UK
Coles partnerships quality and trust-nfdp13DataDryad
Presentation by Simon Coles on issues of partnerships, quality and trust in data publishing given at the Now and Future of Data Publishing Symposium, 22 May 2013, Oxford, UK
Irving-TeraData: data and science driven big industry-nfdp13DataDryad
Presentation by Duncan Irving on TeraData's approach to data management and data publishing in science driven big industry given at the Now and Future of Data Publishing Symposium, 22 May 2013, Oxford, UK
Image used by Ross Mounce to illustrate his talk on incentives and researchers' reluctance to publish data given at the Now and Future of Data Publishing Symposium, 22 May 2013, Oxford, UK
Pfeiffenberger-Data Policies and Sustainability-NFDP13DataDryad
Presentation by Hans Pfeiffenberger on challenges presented by data availability policies and issues of sustainability given at the Now and Future of Data Publishing Symposium, 22 May 2013, Oxford, UK
Presentation by Liz Lyon of DCC on data publishing challenges for HEIs and for research libraries given at the Now and Future of Data Publishing Symposium, 22 May 2013, Oxford, UK
Presentation by Rodrigo Costas on research into data metrics and data sharing given at the Now and Future of Data Publishing Symposium, 22 May 2013, Oxford, UK
Presentation by Tom Mowlam on Ubiquity Press's initiatives in semantic publishing given at the Now and Future of Data Publishing Symposium, 22 May 2013, Oxford, UK
Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13DataDryad
Presentation by Elena Zudilova-Seinstra on Elsevier's work on data and the article of the future and open data given at the Now and Future of Data Publishing Symposium, 22 May 2013, Oxford, UK
Presentation by Ruth Wilson on Nature Publishing Group's Scientific Data journal given at the Now and Future of Data Publishing Symposium, 22 May 2013, Oxford, UK
Presentation by Bernd Pulverer on EMBO's 'Source Data' and the next generation of open access given at the Now and Future of Data Publishing Symposium, 22 May 2013, Oxford, UK
Lawrence-f1000-publishing with data-nfdp13DataDryad
Presentation by Rebecca Lawrence on F1000's initiatives for publishing with data given at the Now and Future of Data Publishing Symposium, 22 May 2013, Oxford, UK
Karunkara-Keynote-msf and open data-nfdp2013DataDryad
Keynote given by Unni Karunkara on Médecins Sans Frontières and open data given to the Now and Future of Data Publishing Symposium, 22 May 2013, Oxford, UK
The French Revolution, which began in 1789, was a period of radical social and political upheaval in France. It marked the decline of absolute monarchies, the rise of secular and democratic republics, and the eventual rise of Napoleon Bonaparte. This revolutionary period is crucial in understanding the transition from feudalism to modernity in Europe.
For more information, visit-www.vavaclasses.com
How to Split Bills in the Odoo 17 POS ModuleCeline George
Bills have a main role in point of sale procedure. It will help to track sales, handling payments and giving receipts to customers. Bill splitting also has an important role in POS. For example, If some friends come together for dinner and if they want to divide the bill then it is possible by POS bill splitting. This slide will show how to split bills in odoo 17 POS.
How to Create Map Views in the Odoo 17 ERPCeline George
The map views are useful for providing a geographical representation of data. They allow users to visualize and analyze the data in a more intuitive manner.
The Indian economy is classified into different sectors to simplify the analysis and understanding of economic activities. For Class 10, it's essential to grasp the sectors of the Indian economy, understand their characteristics, and recognize their importance. This guide will provide detailed notes on the Sectors of the Indian Economy Class 10, using specific long-tail keywords to enhance comprehension.
For more information, visit-www.vavaclasses.com
This is a presentation by Dada Robert in a Your Skill Boost masterclass organised by the Excellence Foundation for South Sudan (EFSS) on Saturday, the 25th and Sunday, the 26th of May 2024.
He discussed the concept of quality improvement, emphasizing its applicability to various aspects of life, including personal, project, and program improvements. He defined quality as doing the right thing at the right time in the right way to achieve the best possible results and discussed the concept of the "gap" between what we know and what we do, and how this gap represents the areas we need to improve. He explained the scientific approach to quality improvement, which involves systematic performance analysis, testing and learning, and implementing change ideas. He also highlighted the importance of client focus and a team approach to quality improvement.
We all have good and bad thoughts from time to time and situation to situation. We are bombarded daily with spiraling thoughts(both negative and positive) creating all-consuming feel , making us difficult to manage with associated suffering. Good thoughts are like our Mob Signal (Positive thought) amidst noise(negative thought) in the atmosphere. Negative thoughts like noise outweigh positive thoughts. These thoughts often create unwanted confusion, trouble, stress and frustration in our mind as well as chaos in our physical world. Negative thoughts are also known as “distorted thinking”.
Model Attribute Check Company Auto PropertyCeline George
In Odoo, the multi-company feature allows you to manage multiple companies within a single Odoo database instance. Each company can have its own configurations while still sharing common resources such as products, customers, and suppliers.
Palestine last event orientationfvgnh .pptxRaedMohamed3
An EFL lesson about the current events in Palestine. It is intended to be for intermediate students who wish to increase their listening skills through a short lesson in power point.
The Roman Empire A Historical Colossus.pdfkaushalkr1407
The Roman Empire, a vast and enduring power, stands as one of history's most remarkable civilizations, leaving an indelible imprint on the world. It emerged from the Roman Republic, transitioning into an imperial powerhouse under the leadership of Augustus Caesar in 27 BCE. This transformation marked the beginning of an era defined by unprecedented territorial expansion, architectural marvels, and profound cultural influence.
The empire's roots lie in the city of Rome, founded, according to legend, by Romulus in 753 BCE. Over centuries, Rome evolved from a small settlement to a formidable republic, characterized by a complex political system with elected officials and checks on power. However, internal strife, class conflicts, and military ambitions paved the way for the end of the Republic. Julius Caesar’s dictatorship and subsequent assassination in 44 BCE created a power vacuum, leading to a civil war. Octavian, later Augustus, emerged victorious, heralding the Roman Empire’s birth.
Under Augustus, the empire experienced the Pax Romana, a 200-year period of relative peace and stability. Augustus reformed the military, established efficient administrative systems, and initiated grand construction projects. The empire's borders expanded, encompassing territories from Britain to Egypt and from Spain to the Euphrates. Roman legions, renowned for their discipline and engineering prowess, secured and maintained these vast territories, building roads, fortifications, and cities that facilitated control and integration.
The Roman Empire’s society was hierarchical, with a rigid class system. At the top were the patricians, wealthy elites who held significant political power. Below them were the plebeians, free citizens with limited political influence, and the vast numbers of slaves who formed the backbone of the economy. The family unit was central, governed by the paterfamilias, the male head who held absolute authority.
Culturally, the Romans were eclectic, absorbing and adapting elements from the civilizations they encountered, particularly the Greeks. Roman art, literature, and philosophy reflected this synthesis, creating a rich cultural tapestry. Latin, the Roman language, became the lingua franca of the Western world, influencing numerous modern languages.
Roman architecture and engineering achievements were monumental. They perfected the arch, vault, and dome, constructing enduring structures like the Colosseum, Pantheon, and aqueducts. These engineering marvels not only showcased Roman ingenuity but also served practical purposes, from public entertainment to water supply.
5. International Council for Science – Strategic Coordinating
Committee on Information and Data - recommendation
http://eloquentscience.com/wp-
content/uploads/2011/04/open_access.jpg
http://www.icsu.org/publications/reports-and-reviews/strategic-coordinating-committee-on-information-and-data-report
OECD guidelines
= data access and
sharing policies
http://bernews.com/wp-content/uploads/2011/02/oecd-logo.jpg
6. ICSU SCCID recommendation
•Engage actively
– publishers of all kinds together
– library community
– scientific researchers
•To
– Document and promote community
best practice in the handling of
supplemental material, publication of
data and appropriate data citation.
http://www.leebullen.com/Fini
shed%20Pics/Scientists.jpg
?
7. Goal?
• Data as a first class object
• As a subject of
conversation (v. discourse)
• Metaphors to achieve this
abound and indicate a
particular stakeholder
perspective (worldview,
bias, edict, etc…)
8. It seems we are not quite there yet
• We*
are having
conversations (like the
one today) about data+x
(x=citation, publication,
integration, integrity,
ownership, trust, …)
• *
= ./ ../ // and / (unixtm
)
12. Metaphor!
12
Data Information Knowledge
Producers Consumers
Context
Presentation
Organization
Integration
Conversation
Creation
Gathering
Experience
• Ecosystem
• A framework
for talking
about data,
and …
13. Data perspective under some metaphors
13
Producers Consumers
Quality Control
Fitness for Purpose Fitness for Use
Quality Assessment
Trustee Trustor
14. For others: Is this separation good or not?
14
Producers Consumers
Quality Control
Fitness for Purpose Fitness for Use
Quality Assessment
Trustee Trustor
Publisher “Reader”
This may be us, or others
16. Global Change Information System (GCIS)
16
Vision:
A unified web based source of
authoritative, accessible, usable, and
timely information about climate and
global change for use by scientists,
decision makers, and the public.
17. Prototype Use Case
Name Discover and visit data center website of dataset used to generate report figure.
Goal The NCA Report reader sees a figure and wants to know where the data came from.
Summary A reader of the NCA is browsing the content via the website. He/she sees a figure and wants to know where the data came from. A reference
to the publication in which the figure originated appears in the figure caption. Selecting the link to the source publication displays a page of
information about the publication including, if available, the publication DOI. The page also includes references to the datasets cited in the
publication. Following each of dataset reference links presents a page of information about the dataset, including links back to the agency/data
center webpage describing the dataset in more detail and making the actual data available for order or download.
Actors Primary Actor - reader of the NCA
Preconditions Reader is viewing the NCA online report
Post Conditions Reader visits the data center dataset website
Normal Flow 1) System is presenting the NCA report to the reader in a web site. Presentation includes report figure with caption that includes reference to
source publication.
2) Reader selects publication reference in figure caption
3) System displays information about publication, including DOI (if available).
4) Publication information includes publication dataset citations.
5) Reader selects a dataset cited by the publication.
6) System displays information about dataset including links to agency / data center webpages where more information and (potentially) data
download links are available.
7) Reader selects the data center link and is redirected to data center dataset webpage.
Discover and visit data center website of dataset used to generate report figure.
19. Non-specialist Use Case
Name Find Latest Datasets by Keyword
Goal Search for datasets associated with the keyword “snow”, list search results by recentness of publication.
Summary User story:
I want to look for information concerning “snow.” I don’t know if it is a CLEAN word or a GCMD word or don’t even know what GCMD
or CLEAN is. How would I do it, and what would I see on my monitor during the process?
Assumptions The reader is not assumed to have knowledge regarding the GCMD Keywords (or other) vocabulary.
Actors Primary Actor - reader of the NCA
Preconditions TBD
Post Conditions Reader is presented with a list of datasets associated with the keyword “snow” sorted by dataset publication date.
Normal Flow TBD
Notes We are looking into two user interface options for dataset selection by keyword
1)As a free-text search where the user inputs “snow”.
2)Present the user a faceted browse interface with a vocabulary faceted which presents the user with terms from a structured vocabulary. The
user can manually select the term(s) which match or contain “snow”.
We intend to implement prototypes of both.
Search for datasets with the keyword “snow”, ….
25. From my Research Data Alliance talk; #5
• Please all SNAP your fingers (1, 2, 3,
NOW)
• <snap> the culture around data has to
change, as well as how we think about
paradigms (metaphors)
26. Call to discussion
• Multiple metaphors, many considerations
• An ecosystem approach allows multiple solutions in a
complex socio-technical system – transactions among
providers and consumers
– Significant opportunities for under-served data generators to get
their data ‘out there’ perhaps publication (still a metaphor!)
• Data Review !== Peer Review and more role disconnects
• <discuss>
• Please read our Data Science Journal essay and respond!
• Thanks for your attention - pfox@cs.rpi.edu , http://tw.rpi.edu
28. Pros/Cons - Data Centres (‘big iron’)
• Volume
• Streamlined
• Automation
• Auditable
• Reprocessing capability
• Central authority
• Funded
• Over-reliance on automation
• Weak documentation
• Use is assumed
• Roles ill-defined, reputation?
• Does not handle heterogeneity
• Preservation ?
• Overly focused on generation
• …
29. Pros/Cons - Publishers
• Simple
• Tested
• Disseminated
• Shifted burden
• Imprimatur
• De-facto preservation
• Citable
• Based on science norms
• Locked
• Static/
• Not machine
accessible
• Cost?
• Not scalable
• Cannot verify use
30. Pros/Cons - Release (software)
• Many stages (alpha, beta,
release candidate, release)
• Versioned
• Documented and change
notified
• Intends to couple user
feedback to developers
• Packaged
• Licensing well thought out
• …
• Provenance implicit
• Preservation poorly dealt with
• Quality may be difficult to
determine
• Attribution not part of the mind-
set
• Derivative or embedded use
not always well defined
• …
31. Pros/Cons - Linked data
• Scales
• Built on web
• Simple model design
• Tested
• Disseminated
• Machine processable
• No central authority
• Heterogeneous
• Use not assumed
• Flexible evolution
• Supports encapsulation
• Poor versioning
• Poor auditing
• No imprimatur
• No preservation/ stewardship
• Not human friendly
• Heterogeneous vocab.
• Changes data model
• Unknown evolution
• …
32.
33. 33
.. Data has Lots of Audiences
From “Why EPO?”, a NASA internal
report on science education, 2005
More Strategic
Less Strategic
Science too!
Editor's Notes
http://brianwhitworth.com/STS/WordleCover.png Conversations about data. 1 st class v discourse
http://gbeaubouef.files.wordpress.com/2012/01/slide1.jpg http://eksouth.weebly.com/uploads/6/5/2/3/6523461/8933221.jpg Or lecture you, or cheer lead.
ICSU should establish a forum for the exploration and eventual agreement in relation to science of all the terms used under the broad umbrella of Open Access .
ICSU should engage actively with publishers of all kinds together with the library community and with scientific researchers to document and promote community best practice in the handling of supplemental material, publication of data and appropriate data citation. http://www.einstruction.com/files/default/files/publishers.jpg
There are lots of different kinds of audiences interested in data. While we are talking about using data in the classroom today, several other audiences of are importance to Virtual observatories. In particular, on the more strategic end are groups that, while smaller, have great impact on the public ’s and the government’s perception of the value of the data and its providers. In this category, I would place both science policy specialists and the media. Policy specialists and decision makers have a tremendous impact on budgets, but also feel, at least at some level, beholden to the tax payers. They want to see the impact that data has on people’s lives. They are also looking for information that will help them made an informed decision. In addition, the media plays a critical role, providing about 85% of the science content to the general public. A third group that is worth considering is the educated general public (the science-attentive public). They take science very seriously and can be a vocal advocate for a scinetific resource -- look at the Hubble scenario as an example.