Curating the Scholarly Record: Data Management and Research LibrariesKeith Webster
Presentation at the National Data Service Conference "New Frontiers in Data Discovery: Collaboration with Research Libraries.", Pittsburgh, 20 October 2016
Summit on Olive Project software emulation and curation serviceKeith Webster
Opening remarks by Keith Webster to a summit held at Carnegie Mellon University on the Olive Project. The technology underpinning Olive was developed by Mahadev Satyanarayanan to emulate executable content, allowing for its execution in contemporary software environments. Webster positions Olive's potential as part of a suite of digital preservation services operated by research libraries, seeking to preserve all forms of digital scholarship.
Immersive informatics - research data management at Pitt iSchool and Carnegie...Keith Webster
A joint presentation by Liz Lyon and Keith Webster on providing education for librarians engaged in research data management. This was delivered at Library Research Seminar VI, at the University of Illinois Urbana Champaign in September 2014. The presentation looks at a class delivered by Lyon at the University of Pittsburgh's iSchool in 2014, and the related needs for immersive training opportunities amongst experienced practicing librarians, using Carnegie Mellon University's library, led by Webster, as a case study.
The changing landscape of scholarly communication: presentation to the NFAIS ...Keith Webster
Presentation on the changing relationships between research libraries, publishers, researchers and technology, and the impact of government policy on scholarly publishing and open access.
It appears highly probable that immediate open access publishing
will become the default mode for scholarly publishing – for the
biosciences first, other sectors later. ‘Immediate’ open access
means unfettered publication as soon as a scholarly work is
ready, with no embargo period. The costs of making a scholarly
artefact available can be reduced without sacrificing quality. This
interactive session will sketch the argument for these claims and
will present several value-added services that publishers could
develop to thrive in an open access world.
Putting Research Data into Context: A Scholarly Approach to Curating Data for...OCLC
This was one of three presentations for the panel Putting Research Data into Context: Scholarly, Professional, and Educational Approaches to Curating Data for Reuse at the 77th Annual Meeting of the Association of Information Science and Technology (ASIS&T).
Learning Commons & Subject Librarians @ the University of Massachusetts-Amherstyouthelectronix
On Saturday March 15, 2008 at the Simmons College GSLIS West Campus in South Hadley, MA Anne C. Moore,
Associate Director for User Services at the University of Massachusetts at Amherst discussed the phenomenon of the Learning Commons at UMass and the changing role of the specialist librarian.
Curating the Scholarly Record: Data Management and Research LibrariesKeith Webster
Presentation at the National Data Service Conference "New Frontiers in Data Discovery: Collaboration with Research Libraries.", Pittsburgh, 20 October 2016
Summit on Olive Project software emulation and curation serviceKeith Webster
Opening remarks by Keith Webster to a summit held at Carnegie Mellon University on the Olive Project. The technology underpinning Olive was developed by Mahadev Satyanarayanan to emulate executable content, allowing for its execution in contemporary software environments. Webster positions Olive's potential as part of a suite of digital preservation services operated by research libraries, seeking to preserve all forms of digital scholarship.
Immersive informatics - research data management at Pitt iSchool and Carnegie...Keith Webster
A joint presentation by Liz Lyon and Keith Webster on providing education for librarians engaged in research data management. This was delivered at Library Research Seminar VI, at the University of Illinois Urbana Champaign in September 2014. The presentation looks at a class delivered by Lyon at the University of Pittsburgh's iSchool in 2014, and the related needs for immersive training opportunities amongst experienced practicing librarians, using Carnegie Mellon University's library, led by Webster, as a case study.
The changing landscape of scholarly communication: presentation to the NFAIS ...Keith Webster
Presentation on the changing relationships between research libraries, publishers, researchers and technology, and the impact of government policy on scholarly publishing and open access.
It appears highly probable that immediate open access publishing
will become the default mode for scholarly publishing – for the
biosciences first, other sectors later. ‘Immediate’ open access
means unfettered publication as soon as a scholarly work is
ready, with no embargo period. The costs of making a scholarly
artefact available can be reduced without sacrificing quality. This
interactive session will sketch the argument for these claims and
will present several value-added services that publishers could
develop to thrive in an open access world.
Putting Research Data into Context: A Scholarly Approach to Curating Data for...OCLC
This was one of three presentations for the panel Putting Research Data into Context: Scholarly, Professional, and Educational Approaches to Curating Data for Reuse at the 77th Annual Meeting of the Association of Information Science and Technology (ASIS&T).
Learning Commons & Subject Librarians @ the University of Massachusetts-Amherstyouthelectronix
On Saturday March 15, 2008 at the Simmons College GSLIS West Campus in South Hadley, MA Anne C. Moore,
Associate Director for User Services at the University of Massachusetts at Amherst discussed the phenomenon of the Learning Commons at UMass and the changing role of the specialist librarian.
Meeting the Research Data Management Challenge - Rachel Bruce, Kevin Ashley, ...Jisc
Universities and researchers need to be able to manage research data effectively to fulfil research funders requirements and ultimately to contribute to research excellence. UK universities are comparatively well advanced in what is a global challenge, but none the less there needs to be further advances in university policy, technical and support services. This session will share best practice in research data management and information about key tools that can help to develop university solutions; and it will also inform participants about the latest Jisc initiatives to help build university research data services and shared services.
Sitations are the way that researchers communicate how
their work builds on and relates to the work of others and
they can be used to trace how a discovery spreads and is
used by researchers in different disciplines and countries.
Creating a truly comprehensive map of scholarship,
however, relies on having a curated machine-readable
database of citation information, where the provenance of
every citation is clear and reusable. The Initiative for Open
Citations (I4OC), a campaign launched on 6 April 2017,
sought to make publisher members of Crossref aware that
they could open up the citation metadata they already give
to Crossref simply by asking them. With the support of
major publishers and the endorsement of funders and other
organisations, more than 50% of citation data in Crossref
is now freely available, up from less than 1% before the
campaign. This provides the foundation of a well-structured,
open database of literally millions of datapoints that anyone
can query, mine, consume and explore. The presenter will
discuss the aims of the campaign, the new innovative
services that are already using the data, what more still
needs to be done and how you can support the initiative.
Catriona J MacCallum, Hindawi
Incentives for sharing research data – Veerle Van den Eynden, UK Data Service
Incentives to innovate – Joe Marshall, NCUB
Incentives in university collaboration - Tim Lance, NYSERNET
Giving researchers credit for their data – Neil Jefferies, The Bodleian Digital Library Systems and Services (BDLSS)
Jisc and CNI conference, 6 July 2016
Open access, universities as publishers - Jisc Digital Festival 2015Jisc
This session focussed on areas where universities are (re)discovering roles, especially in the area of book publishing. Participants will be provided with evidence to help them consider this role for universities as publishers and its implications for them.
The vision for ‘the Research Paper of the Future’ promises
to make scholarship more discoverable, transparent,
inspectable, reusable and sustainable. Yet new forms
of scientific output also challenge authors, librarians,
publishers and service providers to register, validate,
disseminate and preserve them as elements of the scholarly
record. What constitutes authorship in a collaborative
process of GitHub pull requests and commits? When to
capture, reference and preserve dynamic data sets that
change over time? How to package and render complex
executable collections for review and delivery? This session
considers key challenges in operationalising the Research
Paper of the Future from the perspectives of a publisher,
a library administrator and a scientist/developer of a
collaborative authoring platform.
This presentation was provided by Kenning Arlitsch of Montana State University during a NISO Webinar entitled "Supporting Research on Your Campus", held on May 4, 2016
NISO Two Part Webinar:
Is Granularity the Next Discovery Frontier?
Part 1: Supporting Direct Access to Increasingly Granular Chunks of Content
Working with Metadata Challenges to Support Granular Levels of Access and Descriptions
Myung-Ja (MJ) Han, Metadata Librarian University of Illinois at Urbana-Champaign Urbana, Illinois
Granular Discovery: User Experience Challenges and Opportunities
Tito Sierra, Director of Product Management, EBSCO Information Services
From Unstructured Content to Granular Insights
Daniel Mayer, Vice President of Product & Marketing, TEMIS
Open access (OA) to research publications brings with it significant benefits for UK institutions, researchers and research funders.
After several years of concerted effort to implement OA following the Finch report in 2012, we have learned, and continue to learn, a great deal about what works well, and what works less well. In this workshop we’ll present examples of good practice to support implementation from our nine pathfinder projects.
The Challenges of Making Data Travel, by Sabina LeonelliLEARN Project
1st LEARN Workshop. Embedding Research Data as part of the research cycle. 29 Jan 2016. Presentation by Sabina Leonelli, Exeter Centre for the Study of Life Sciences (Egenis) & Department of Sociology, Philosophy and Anthropology, University of Exeter
The Future is a Moving Goal Post: Change Management in Academic LibrariesIFLAAcademicandResea
IFLA ARL Webinar Series | Held online on August 1, 2019
This presentation focuses on Change Management in Academic Libraries, presented by Gulcin Cribb, University Librarian, Singapore Management University.
Keynote talk to LEARN (LERU/H2020 project) for research data management. Emphasizes that problems are cultural not technical. Promotes modern approaches such as Git / continuousIntegration, announces DAT. Asserts that the Right to Read in the Right to Mine. Calls for widespread development of contentmining (TDM)
RDMkit, a Research Data Management Toolkit. Built by the Community for the ...Carole Goble
https://datascience.nih.gov/news/march-data-sharing-and-reuse-seminar 11 March 2022
Starting in 2023, the US National Institutes of Health (NIH) will require institutes and researchers receiving funding to include a Data Management Plan (DMP) in their grant applications, including the making their data publicly available. Similar mandates are already in place in Europe, for example a DMP is mandatory in Horizon Europe projects involving data.
Policy is one thing - practice is quite another. How do we provide the necessary information, guidance and advice for our bioscientists, researchers, data stewards and project managers? There are numerous repositories and standards. Which is best? What are the challenges at each step of the data lifecycle? How should different types of data? What tools are available? Research Data Management advice is often too general to be useful and specific information is fragmented and hard to find.
ELIXIR, the pan-national European Research Infrastructure for Life Science data, aims to enable research projects to operate “FAIR data first”. ELIXIR supports researchers across their whole RDM lifecycle, navigating the complexity of a data ecosystem that bridges from local cyberinfrastructures to pan-national archives and across bio-domains.
The ELIXIR RDMkit (https://rdmkit.elixir-europe.org (link is external)) is a toolkit built by the biosciences community, for the biosciences community to provide the RDM information they need. It is a framework for advice and best practice for RDM and acts as a hub of RDM information, with links to tool registries, training materials, standards, and databases, and to services that offer deeper knowledge for DMP planning and FAIR-ification practices.
Launched in March 2021, over 120 contributors have provided nearly 100 pages of content and links to more than 300 tools. Content covers the data lifecycle and specialized domains in biology, national considerations and examples of “tool assemblies” developed to support RDM. It has been accessed by over 123 countries, and the top of the access list is … the United States.
The RDMkit is already a recommended resource of the European Commission. The platform, editorial, and contributor methods helped build a specialized sister toolkit for infectious diseases as part of the recently launched BY-COVID project. The toolkit’s platform is the simplest we could manage - built on plain GitHub - and the whole development and contribution approach tailored to be as lightweight and sustainable as possible.
In this talk, Carole and Frederik will present the RDMkit; aims and context, content, community management, how folks can contribute, and our future plans and potential prospects for trans-Atlantic cooperation.
Data policy must be partnered with data practice. Our researchers need to be the best informed in order to meet these new data management and data sharing mandates.
Meeting the Research Data Management Challenge - Rachel Bruce, Kevin Ashley, ...Jisc
Universities and researchers need to be able to manage research data effectively to fulfil research funders requirements and ultimately to contribute to research excellence. UK universities are comparatively well advanced in what is a global challenge, but none the less there needs to be further advances in university policy, technical and support services. This session will share best practice in research data management and information about key tools that can help to develop university solutions; and it will also inform participants about the latest Jisc initiatives to help build university research data services and shared services.
Sitations are the way that researchers communicate how
their work builds on and relates to the work of others and
they can be used to trace how a discovery spreads and is
used by researchers in different disciplines and countries.
Creating a truly comprehensive map of scholarship,
however, relies on having a curated machine-readable
database of citation information, where the provenance of
every citation is clear and reusable. The Initiative for Open
Citations (I4OC), a campaign launched on 6 April 2017,
sought to make publisher members of Crossref aware that
they could open up the citation metadata they already give
to Crossref simply by asking them. With the support of
major publishers and the endorsement of funders and other
organisations, more than 50% of citation data in Crossref
is now freely available, up from less than 1% before the
campaign. This provides the foundation of a well-structured,
open database of literally millions of datapoints that anyone
can query, mine, consume and explore. The presenter will
discuss the aims of the campaign, the new innovative
services that are already using the data, what more still
needs to be done and how you can support the initiative.
Catriona J MacCallum, Hindawi
Incentives for sharing research data – Veerle Van den Eynden, UK Data Service
Incentives to innovate – Joe Marshall, NCUB
Incentives in university collaboration - Tim Lance, NYSERNET
Giving researchers credit for their data – Neil Jefferies, The Bodleian Digital Library Systems and Services (BDLSS)
Jisc and CNI conference, 6 July 2016
Open access, universities as publishers - Jisc Digital Festival 2015Jisc
This session focussed on areas where universities are (re)discovering roles, especially in the area of book publishing. Participants will be provided with evidence to help them consider this role for universities as publishers and its implications for them.
The vision for ‘the Research Paper of the Future’ promises
to make scholarship more discoverable, transparent,
inspectable, reusable and sustainable. Yet new forms
of scientific output also challenge authors, librarians,
publishers and service providers to register, validate,
disseminate and preserve them as elements of the scholarly
record. What constitutes authorship in a collaborative
process of GitHub pull requests and commits? When to
capture, reference and preserve dynamic data sets that
change over time? How to package and render complex
executable collections for review and delivery? This session
considers key challenges in operationalising the Research
Paper of the Future from the perspectives of a publisher,
a library administrator and a scientist/developer of a
collaborative authoring platform.
This presentation was provided by Kenning Arlitsch of Montana State University during a NISO Webinar entitled "Supporting Research on Your Campus", held on May 4, 2016
NISO Two Part Webinar:
Is Granularity the Next Discovery Frontier?
Part 1: Supporting Direct Access to Increasingly Granular Chunks of Content
Working with Metadata Challenges to Support Granular Levels of Access and Descriptions
Myung-Ja (MJ) Han, Metadata Librarian University of Illinois at Urbana-Champaign Urbana, Illinois
Granular Discovery: User Experience Challenges and Opportunities
Tito Sierra, Director of Product Management, EBSCO Information Services
From Unstructured Content to Granular Insights
Daniel Mayer, Vice President of Product & Marketing, TEMIS
Open access (OA) to research publications brings with it significant benefits for UK institutions, researchers and research funders.
After several years of concerted effort to implement OA following the Finch report in 2012, we have learned, and continue to learn, a great deal about what works well, and what works less well. In this workshop we’ll present examples of good practice to support implementation from our nine pathfinder projects.
The Challenges of Making Data Travel, by Sabina LeonelliLEARN Project
1st LEARN Workshop. Embedding Research Data as part of the research cycle. 29 Jan 2016. Presentation by Sabina Leonelli, Exeter Centre for the Study of Life Sciences (Egenis) & Department of Sociology, Philosophy and Anthropology, University of Exeter
The Future is a Moving Goal Post: Change Management in Academic LibrariesIFLAAcademicandResea
IFLA ARL Webinar Series | Held online on August 1, 2019
This presentation focuses on Change Management in Academic Libraries, presented by Gulcin Cribb, University Librarian, Singapore Management University.
Keynote talk to LEARN (LERU/H2020 project) for research data management. Emphasizes that problems are cultural not technical. Promotes modern approaches such as Git / continuousIntegration, announces DAT. Asserts that the Right to Read in the Right to Mine. Calls for widespread development of contentmining (TDM)
RDMkit, a Research Data Management Toolkit. Built by the Community for the ...Carole Goble
https://datascience.nih.gov/news/march-data-sharing-and-reuse-seminar 11 March 2022
Starting in 2023, the US National Institutes of Health (NIH) will require institutes and researchers receiving funding to include a Data Management Plan (DMP) in their grant applications, including the making their data publicly available. Similar mandates are already in place in Europe, for example a DMP is mandatory in Horizon Europe projects involving data.
Policy is one thing - practice is quite another. How do we provide the necessary information, guidance and advice for our bioscientists, researchers, data stewards and project managers? There are numerous repositories and standards. Which is best? What are the challenges at each step of the data lifecycle? How should different types of data? What tools are available? Research Data Management advice is often too general to be useful and specific information is fragmented and hard to find.
ELIXIR, the pan-national European Research Infrastructure for Life Science data, aims to enable research projects to operate “FAIR data first”. ELIXIR supports researchers across their whole RDM lifecycle, navigating the complexity of a data ecosystem that bridges from local cyberinfrastructures to pan-national archives and across bio-domains.
The ELIXIR RDMkit (https://rdmkit.elixir-europe.org (link is external)) is a toolkit built by the biosciences community, for the biosciences community to provide the RDM information they need. It is a framework for advice and best practice for RDM and acts as a hub of RDM information, with links to tool registries, training materials, standards, and databases, and to services that offer deeper knowledge for DMP planning and FAIR-ification practices.
Launched in March 2021, over 120 contributors have provided nearly 100 pages of content and links to more than 300 tools. Content covers the data lifecycle and specialized domains in biology, national considerations and examples of “tool assemblies” developed to support RDM. It has been accessed by over 123 countries, and the top of the access list is … the United States.
The RDMkit is already a recommended resource of the European Commission. The platform, editorial, and contributor methods helped build a specialized sister toolkit for infectious diseases as part of the recently launched BY-COVID project. The toolkit’s platform is the simplest we could manage - built on plain GitHub - and the whole development and contribution approach tailored to be as lightweight and sustainable as possible.
In this talk, Carole and Frederik will present the RDMkit; aims and context, content, community management, how folks can contribute, and our future plans and potential prospects for trans-Atlantic cooperation.
Data policy must be partnered with data practice. Our researchers need to be the best informed in order to meet these new data management and data sharing mandates.
The Evolving Role of the Library in Institutional and Faculty AssessmentState Of Innovation
A Discussion of Research Metrics - June 2016
Kim Powel, Life Sciences Informationist Emory University
Holly Miller, Associate Dean Scholarly Content and Faculty Engagement, Florida International University
Joey Figueroa, Solutions Specialist Thomson Reuters
Supporting Libraries in Leading the Way in Research Data ManagementMarieke Guy
Marieke Guy, Institutional Support Officer, Digital Curation Centre, UKOLN, University of Bath, UK presents on Supporting Libraries in Leading the Way in Research Data Management at Online Information, London 20th -21st November 2012
Software curation as a digital preservation serviceKeith Webster
Presentation to the Coalition for Networked Information Spring Conference, Seattle, April 2015 by Keith Webster of Carnegie Mellon University and Euan Cochrane of Yale. Describes need for software curation services, and offers two examples, one from each of our universities, of library engagement.
Pros and Cons of Open Data: A Global South PerspectiveMichelle Willmers
Presentation by ROER4D Curation & Dissemination Manager Michelle Willmers on open data practice in the Global South to the Committee of Plenipotentiary Representatives of the International Committee for Scientific and Technical Information (ICSTI).
OpenAIRE-COAR conference 2014: Re-imagining the role of institutional reposit...OpenAIRE
Presentation at the OpenAIRE-COAR Conference: "Open Access Movement to Reality: Putting the Pieces Together", Athens - May 21-22, 2014.
Re-imagining the role of institutional repositories in open scholarship, by Leslie Chan - Senior Lecturer in the Department of Social Sciences at the University of Toronto Scarborough.
Looking to the Future: What’s the Mindset for a Successful Information Organization? by Keith
Webster, Dean of the Libraries, Carnegie Mellon for the October 16, 2013 NISO Virtual Conference: Revolution or Evolution: The Organizational Impact of Electronic Content.
Opening Keynote: From where we are to where we want to be: The future of resource discovery from a UK perspective
Neil Grindley, Head of Resource Discovery, Jisc
Re-imagining the role of Institutional Repository in Open ScholarshipLeslie Chan
Keynote at the OpenAIRE and COAR Joint Conference Open Access: Movement to Reality
Putting the Pieces Together. Acropolis Museum, Athens, Greece, May 21-13, 2014
Revitalizing the Library in the University Knowledge CommunityKaren S Calhoun
Covers some important studies on the future of the academic research library at Pitt and elsewhere. Discusses collaborative processes to build a new vision of library services and immerse the library more fully in research, teaching and learning at the university.
Remapping the Global and Local in Knowledge Production: Roles of Open AccessLeslie Chan
It is generally acknowledged that researchers and institutions in the Global South suffer from knowledge isolation because of poor infrastructure and lack of access to key resources, including the current literature. The remedy is therefore capacity building and the transfer of not only knowledge, but also the institutional framework of knowledge creation from the North to the South. In this context, Open Access to the scholarly literature is seen as a means of bridging the global knowledge gap.
In this presentation, I argue that a key contributor to the continual knowledge divide and the invisibility of knowledge from the Global South is the persistence and dominance of Northern frameworks of research evaluation and quality metrics, coupled with outmoded national and international innovation policies based on exclusion and competitiveness. These narrow measures have tended to skew international research agenda and undermine locally relevant research.
A great opportunity that Open Access provides is the means to develop alternative metrics of research uptake and impact that are more inclusive of knowledge from the South, particularly those with development outcomes. In particular, it is important to re-conceptualize and re-design the metrics of research impact to reflect new scholarly practices and the diverse means of engagement enabled by OA and the new wave of social media tools. At the same time, appropriate policies need to be developed to reward open scholarship and to encourage research sharing — issues of particular importance for ending knowledge isolation. Examples of the new kinds of “invisible college” enabled by networking tools and OA will be presented, and particular attention will be paid to innovations emanating from the periphery.
Day 3: Introduction to Information LiteracyBuffy Hamilton
Objectives: 1. To explore and evaluate traditional and uthoritative database information sources. 2. To explore and utilize strategies to effectively use traditional and emerging search engines for information. 3. To explore and evaluate how emerging Web 2.0 tools can be used as sources of information. 4. To explore the merits and drawbacks to collaboratively created open sources of information such as Wikipedia.
Globlal Perspective on Open Research: A Bird's Eye ViewLeslie Chan
Presentation at the University of Cape Town, Aug. 5, 2011. This talk was part of the OpenUCT initiative and the Scholarly Communication in Africa Programme. It provides an overview of the changing research landscape and the particular importance of open access and other forms of open collaboration for solving some of the pressing problems of development research. The presentation argues for the importance of policy development in support of research collaboration and the development of enriched metrics for evaluating the development impact of research.
Digital Academic Content and the Future of Libraries: International Cooperati...UBC Library
International Library Cooperation Symposium presentation May 14, 2010 in Tokyo, Japan.
Presentation by Ingrid Parent, President elect of IFLA, and University Librarian at the University of British Columbia
As Europe's leading economic powerhouse and the fourth-largest hashtag#economy globally, Germany stands at the forefront of innovation and industrial might. Renowned for its precision engineering and high-tech sectors, Germany's economic structure is heavily supported by a robust service industry, accounting for approximately 68% of its GDP. This economic clout and strategic geopolitical stance position Germany as a focal point in the global cyber threat landscape.
In the face of escalating global tensions, particularly those emanating from geopolitical disputes with nations like hashtag#Russia and hashtag#China, hashtag#Germany has witnessed a significant uptick in targeted cyber operations. Our analysis indicates a marked increase in hashtag#cyberattack sophistication aimed at critical infrastructure and key industrial sectors. These attacks range from ransomware campaigns to hashtag#AdvancedPersistentThreats (hashtag#APTs), threatening national security and business integrity.
🔑 Key findings include:
🔍 Increased frequency and complexity of cyber threats.
🔍 Escalation of state-sponsored and criminally motivated cyber operations.
🔍 Active dark web exchanges of malicious tools and tactics.
Our comprehensive report delves into these challenges, using a blend of open-source and proprietary data collection techniques. By monitoring activity on critical networks and analyzing attack patterns, our team provides a detailed overview of the threats facing German entities.
This report aims to equip stakeholders across public and private sectors with the knowledge to enhance their defensive strategies, reduce exposure to cyber risks, and reinforce Germany's resilience against cyber threats.
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Subhajit Sahu
Abstract — Levelwise PageRank is an alternative method of PageRank computation which decomposes the input graph into a directed acyclic block-graph of strongly connected components, and processes them in topological order, one level at a time. This enables calculation for ranks in a distributed fashion without per-iteration communication, unlike the standard method where all vertices are processed in each iteration. It however comes with a precondition of the absence of dead ends in the input graph. Here, the native non-distributed performance of Levelwise PageRank was compared against Monolithic PageRank on a CPU as well as a GPU. To ensure a fair comparison, Monolithic PageRank was also performed on a graph where vertices were split by components. Results indicate that Levelwise PageRank is about as fast as Monolithic PageRank on the CPU, but quite a bit slower on the GPU. Slowdown on the GPU is likely caused by a large submission of small workloads, and expected to be non-issue when the computation is performed on massive graphs.
Opendatabay - Open Data Marketplace.pptxOpendatabay
Opendatabay.com unlocks the power of data for everyone. Open Data Marketplace fosters a collaborative hub for data enthusiasts to explore, share, and contribute to a vast collection of datasets.
First ever open hub for data enthusiasts to collaborate and innovate. A platform to explore, share, and contribute to a vast collection of datasets. Through robust quality control and innovative technologies like blockchain verification, opendatabay ensures the authenticity and reliability of datasets, empowering users to make data-driven decisions with confidence. Leverage cutting-edge AI technologies to enhance the data exploration, analysis, and discovery experience.
From intelligent search and recommendations to automated data productisation and quotation, Opendatabay AI-driven features streamline the data workflow. Finding the data you need shouldn't be a complex. Opendatabay simplifies the data acquisition process with an intuitive interface and robust search tools. Effortlessly explore, discover, and access the data you need, allowing you to focus on extracting valuable insights. Opendatabay breaks new ground with a dedicated, AI-generated, synthetic datasets.
Leverage these privacy-preserving datasets for training and testing AI models without compromising sensitive information. Opendatabay prioritizes transparency by providing detailed metadata, provenance information, and usage guidelines for each dataset, ensuring users have a comprehensive understanding of the data they're working with. By leveraging a powerful combination of distributed ledger technology and rigorous third-party audits Opendatabay ensures the authenticity and reliability of every dataset. Security is at the core of Opendatabay. Marketplace implements stringent security measures, including encryption, access controls, and regular vulnerability assessments, to safeguard your data and protect your privacy.
1. Library Data Management Services:
A Strategic Framework for Development and
Implementation
NN/LM MAR Research Data Management Symposium
28 April 2014
Keith Webster
Dean of University Libraries
2. Our
professional
future
The genealogy of the
contemporary research
library
The risk of invisibility
The emergence of open
science
Our core professional
skills
An overview of data
management
The policy context
Data management
activities
The
UQ
experience
DM service philosophy
Partnerships
Skills development
The Australian model
Emerging services at CMU
Pu6ng
it
into
prac8ce
The
data
management
impera8ve
16. Where do library clients go?
Specific e-resource
General search engine
Library catalogue
Library building
1
18
38
47
13
28
21
37
2003 2012
Search engine
Wikipedia
SNS
Email
Online database
Virtual reference
Library website 0
0
1
1
2
7
83
Where do student start a search? Where do academics begin research?
Perceptions of libraries 2010,
OCLC
Faculty study 2012: key insights for
libraries and publishers, Ithaka
19. What is happening in the world
is bypassing university libraries
Peter Murray-Rust
The scientist’s view
JISC Libraries of the future debate, April 2009
20. “…contact
with
librarians
and
informa8on
professionals
is
rare”
“…researchers
are
generally
confident
in
their
[self-‐
taught]
abili8es..,
librarians
see
them
as..rela8vely
unsophis8cated”
“…librarians
see
it
as
a
problem
that
they
are
not
reaching
all
researchers
with
formal
training,
whereas
most
researchers
don’t
think
they
need
it”
21. • The
part
that
academic
librarians
should
play
remains
unclear
• Raise
awareness
of
eResearch
amongst
library
staff
• Provide
advice
on
data
management
to
eResearchers
• Data
cura8on
is
vast,
complex
and
requires
subject
input
22. • “The
bad
news
is
that
I’m
not
sure
they
understand
what
goes
on
in
the
library
other
than
taking
out
books.”
Benton
Founda8on,
1996
• “User
percep8ons
nega8vely
affect
the
ability
of
librarians
to
meet
informa8on
needs
simply
because
a
profession
cannot
serve
those
who
do
not
understand
its
purpose
and
exper8se.”
Durrance,
1988
23. The worst thing about
the stereotype is that it
impacts on the psyche of
librarians who really
begin to believe that they
don't deserve the kingpin
role
US Congress, 2001
25. • It is likely that the way that researchers publish, assess
impact, communicate, and collaborate will change more
within the next 20 years than it did in the past 200 years.
http://book.openingscience.org/
26. • Driven by end-users!
• Interdisciplinary knowledge!
• Collaborative across sectors!
• Transitory research teams!
• Accountability (social and
economic) to range of
stakeholders!
• Quality control (academic merit,
cost effectiveness, economic and
social relevance)
(Gibbons [et al], 1994)
• Driven by academic discipline!
• Knowledge framed by
disciplinary norms!
• Deeply institutionalised!
• Accountability to peers!
• Scientist is expert!
• Quality control by peer
review and contribution to
discipline
Mode 1 Mode 2
Modes of knowledge production
27. Funding structures and requirements
• External funding!
• Diverse source of funding!
• Government!
• Not-for-profit!
• Industry!
• Economic outcomes!
• increase wealth creation & prosperity!
• improve nation’s health, environment & quality of life!
• Innovation!
• Improved competitiveness!
• “Commercialisation” of research!
• Less “curiosity-driven” activity
28. • Fund the best research to
meet the needs of the
country!
• Develop leaders and
researchers who can meet
national and global priorities!
• Foster public engagement
with research!
• Funding international
collaboration
Aims of research funders
29.
30.
31. Open access,
open data,
open science
!
!
Increasingly, the “private” nature of academic science
is being displaced by a culture of openness - ideas,
approaches and observations are shared at the
earliest opportunity with colleagues - and sometimes
the world at large.!
!
Whilst the ‘version of record’ approach to journal
article creation retains validity, this is increasingly
seen as a compliance matter - required to meet
career objectives and funder/government
requirements!
32. !
!
Traditional enquiry-driven research
has been supplanted by reflexive
research, driven by the increasingly
necessary flow of external research
funding into universities. Largely, this
comes from government agencies, but
charities (such as the Wellcome Trust)
and industry are also powerful
sponsors of high-quality activity.!
!
This state has led to the notion of the
triple-helix of research - academe,
industry and government.!
!
In turn, these inter-relationships have
spawned a major industry around
assessing and evaluating the impact of
research. Initially, the aim was to
drive up standards; this is now shifting
to a culture of openness, and a desire
to foster public engagement.!
!
!
35. About 35 percent of scientists are using things like blogs to consume and
produce content.There is an explosion of online tools and platforms
available to scientists, ranging from Web 2.0 tools modified or created for
the scientific world to Web sites that are doing amazing things with video,
lab notebooks, and social networking.!
!
The next generation of PIs is already establishing new behaviors.They feel
comfortable blogging, using social media tools, and using wikis to advance
their research. It will take the big institutions to support open-access
journals, for example.And it will take technological innovation in the form
of software that is purpose-built for this unique community and its set of
challenges.!
!
We’re talking about something as fundamental and important as
modernizing the architecture of science.
Adam Bly
http://seedmagazine.com/content/article/science_2.0_pioneers/
36. There are a billion people connected to the, the Web. At least one of them has a smarter
idea about what to do with your data than you do.
James Boyle
44. How
do
we
add
value?
• Bri8sh
Library
adds
£419m
of
value
to
the
economy
each
year
hp://www.bl.uk/aboutus/
stratpolprog/increasingvalue/
bri8shlibrary_economicevalua8on.pdf
45. Making
a
difference
Adverse event avoided Percent
Hospital admission 11.5
Hospital acquired infection 8.2
Surgery 21.2
Additional tests/procedures 49.0
Additional out-patient visits 26.4
Patient mortality 19.2
Marshall (1994) The impact of information services on decision making
51. Current priorities in
academic libraries
1. Continue and complete migration from print to
electronic and realign service operations
2. Retire legacy collections
3. Continue to repurpose library as primary
learning space
4. Reposition library expertise and resources to be
more closely embedded in research and teaching
enterprise outside library
5. Extend focus of collection development from
external purchase to local curation
Lewis (2007);Webster (2010, 2012)
53. Our
professional
future
The genealogy of the
contemporary research
library
The risk of invisibility
The emergence of open
science
Our core professional
skills
An overview of data
management
The policy context
Data management
activities
The
UQ
experience
DM service philosophy
Partnerships
Skills development
The Australian model
Emerging services at CMU
Pu6ng
it
into
prac8ce
The
data
management
impera8ve
55. Why Data Management Services?
"The Board believes that timely attention
to digital research data sharing and
management is fundamental to supporting
U.S. science and engineering in the twenty-
first century.
...strong and sustainable data sharing and
management policies [are] a critical
national need."
Digital Research Data Sharing and Management
December 2011
Task Force on Data Policies
Committee on Strategy and Budget
National Science Board
56. More
data
will
be
created
in
the
next
five
years
than
has
been
collected
in
the
whole
of
human
history.
Properly
managed,
this
data
will
form
a
major
resource
for
Australian
researchers.
58. Research
collaboration is
associated with high
academic and wider
impact
International
collaboration is
associated with high
academic impact
Data can be shared
easily across borders
59. Sharing
data?
• Create
opportuni8es
–For
re-‐analysis
and
re-‐use
–To
facilitate
collabora8on
• Solve
problems
–Waste
of
money,
people
and
effort
–Loss
of
irretrievable
data
–Inability
to
verify
research
• Issues
and
challenges
–Pa8ent
confiden8ality
–IP
and
discovery
protec8on
• Promote
cura8on
rather
than
sharing?
62. • The rapid development in computing
technology and the Internet have opened up
new applications for the basic sources of
research — the base material of research data
— which has given a major impetus to
scientific work in recent years.
• Access to research data increases the returns
from public investment in this area; reinforces
open scientific inquiry; encourages diversity of
studies and opinion; promotes new areas of
work and enables the exploration of topics
not envisioned by the initial investigators.
• The value of data lies in their use. Full and
open access to scientific data should be
adopted as the international norm for the
exchange of scientific data derived from
publicly funded research.
63. • Builds upon work
in Fort
Lauderdale
biological data
sharing principles
http://www.nature.com/nature/journal/v461/n7261/pdf/461168a.pdf
66. Key points
• Publicly funded research data are a public good,
produced in the public interest, which should be
made openly available with as few restrictions as
possible in a timely and responsible manner that
does not harm intellectual property.
• To ensure that the research process is not
damaged by inappropriate release of data, research
organisation policies and practices should ensure
that these are considered at all stages in the
research process.
74. 74
Ins8tu8ons
are
to
retain
research
data,
provide
secure
data
storage,
iden8fy
ownership,
and
ensure
security
and
confiden8ality
of
research
data
Researchers
are
to
retain
research
data
and
primary
materials,
manage
storage
of
research
data
and
primary
materials,
maintain
confiden8ality
of
research
data
and
primary
materials.
75. Australian requirements
1.Intellectual property
2.Data management, including:
◦ Storage
◦ RetentionDisposal
◦ Access, publication, description
3. Conflict of interest — do all parties have the same
understanding about the use of the data?
3.Collaboration and contractual agreements
4.Ethics and privacy Compliance
79. “The Holdren Memo”
To achieve the Administration’s
commitment to increase access to
federally funded published
research and digital scientific data,
Federal agencies investing in
research and development must have
clear and coordinated policies for
increasing such access.
Memo on Increasing Access to the Results of
Federally Funded Scientific Research
White House Office of Science and Technology
Policy
February 22, 2012
83. What do we mean by RDM?
Data Retention
Policy
Repository Data Policy
Data
Visualization
Data
Management
Planning
File
Formatting
Metadata
Discovery
Grant Writing
Registry
Intellectual
Property
Issues
86. Data Services Program
Data
Management
Planning
Data &
Institutional
Repositories
Operational
DMP &
Compliance
Checklist
Check
Up
Visits
Compliance
Assessment
Data
Management
Training
Data
Consult.
&
Staging
3+ Years
87. Our
professional
future
The genealogy of the
contemporary research
library
The risk of invisibility
The emergence of open
science
Our core professional
skills
An overview of data
management
The policy context
Data management
activities
The
UQ
experience
DM service philosophy
Partnerships
Skills development
The Australian model
Emerging services at CMU
Pu6ng
it
into
prac8ce
The
data
management
impera8ve
88. Pu6ng
it
into
place
Data management service philosophy
89. What might our service
offer?
• Teaching or doing?
• Compliance or support?
• Storage or registering?
• Policy advice vs policy development
• Institution-wide or in response to requests?
• Advising on data re-use (sources, analysis etc)
93. Likely partners
• Office of Research
• Ethics/privacy/legal experts
• Computing specialists
• High performance computing
94.
95. Other sources of help
• National data services
• Data archives
• Research funding agencies
• Other libraries
• Growing number of books and reports
• Specialist advice
99. Collec8ons
grid
high low
lowhigh
stewardship
uniquenessBooks
Journals
Newspapers
Gov. docs
CD, DVD
Maps
Scores
Special
collections
Rare books
Local/Historical
newspapers
Local history materials
Archives & Manuscripts,
Theses & dissertations
Research, learning and
administrative
materials,
•ePrints/tech reports
•Learning objects
•Courseware
•E-portfolios
•Research data
•Institutional records
•Reports, newsletters, etc
Freely-accessible web
resources
Open source software
Newsgroup archives
hp://www.slideshare.net/lisld/collec8ons-‐grid
100. Librarians’
competencies
profile
for
RDM
Key
roles
• Providing
access
to
data
–Iden8fica8on
of
data
sets;
discovery
and
analy8c
tools;
advice
on
informa8cs
• Advocacy
and
support
for
managing
data
–Policy
development;
ar8cula8ng
benefits;
promo8ng
data
sharing
and
reuse;
educa8on
and
training;
data
audits
• Managing
data
collec8ons
–Preparing
for
data
deposit;
appraisal;
selec8on;
inges8on;
cura8on;
preserva8on;
storage
and
backup
Based on ARL draft distributed at CNI conference, St Louis, April 2014
101. Librarians’
competencies
profile
for
RDM
Core
competencies
• Providing
access
to
data
–Data
centres
and
repositories;
organiza8on
and
structure
of
data;
licensing
and
IP;
manipula8on
and
analysis
• Advocacy
and
support
for
managing
data
–Research
funder
mandates;
DMP;
research
workflows;
disciplinary
norms;
journal
requirements;
data
audit
and
assessment
tools
• Managing
data
collec8ons
–Metadata;
discovery
tools
and
indexing;
database
design;
data
linking;
forensic
procedures
in
data
cura8on
102. LIS2975
@
Pi
iSchool
• The
Data
Landscape
• Universi8es
and
Data
• Data
Requirements
and
Capability
• RDM
Roadmaps,
Strategy
and
Planning
• Data
Management
Plans
• Disciplinary
Data
1
• Legal
and
Ethical
Data
Issues
• Disciplinary
Data
2
• Data
Centres
• Data
Advocacy,
Skills
and
Training
• Data
Sustainability
and
Cost
109. Early
progress
• Lead
ins8tu8on
in
APSR
• Development
of
eSpace
• ANDS
at
UQ
• Seminars
and
workshops
from
2007
onwards
• Partnering
with
eScience
and
HPC
ins8tutes
• Strong
involvement
across
all
disciplines
110.
111. Service model
• Data management interview and planning
• Consultancy
• Legal advice
• Pointers to other resources - eg for storage
• Data description and publication
• Long-term preservation
• Feeds to Research Data Australia
117. CMU
Faculty
Senate
• WHEREAS
• Researchers
in
all
disciplines
are
faced
with
a
range
of
data
management
needs
as
research
becomes
more
collabora8ve,
data-‐intensive,
and
computa8onal,
• And
the
Office
of
Science
and
Technology
Policy
direc8ve
issued
February
22,
2013,
requires
federal
agencies
that
fund
research
to
mandate
public
access
and
re-‐use
rights
to
peer-‐reviewed
publica8ons
and
digital
data
arising
from
that
funding,
• And
the
federal
Open
Data
Policy
issued
May
9,
2013,
s8pulates
the
requirements
for
sharing
and
enabling
re-‐use
of
digital
data,
• And
data
sharing
and
re-‐use
increase
the
accountability,
verifica8on,
impact,
and
return
on
investment
in
research,
• And
technical
exper8se
and
support
services
are
required
to
meet
researcher
needs,
funding
impera8ves,
and
public
policy
goals,
• And
an
ins8tu8onal
commitment
to
effec8ve
data
management
is
required
for
faculty
to
par8cipate
118. THEREFORE
BE
IT
RESOLVED
THAT
CARNEGIE
MELLON
UNIVERSITY
• Charge
the
University
Libraries,
Office
of
Sponsored
Programs,
Office
of
Research
Integrity
and
Compliance,
and
Compu8ng
Services
to
collaborate
and
provide
the
community
with
core
services
and
tools
for
managing
data
throughout
the
data
life
cycle.
• Promote
these
services
and
tools
and
encourage
faculty
to
use
them
to
manage
and
share
their
data.
• Study
means
by
which
faculty
can
par8cipate
effec8vely.
• Establish
incen8ves
and
community
norms
for
effec8ve
data
management
and
sharing.
• Provide
ongoing
financial
support
to
the
units
providing
services
and
tools,
including
support
for
the
infrastructure,
personnel,
educa8on
and
training
needed
to
sustain
long-‐term
data
management
and
cura8on.
• Develop
a
research
data
management
policy,
establishing
the
University’s
commitment
to
long
term
data
management,
and
aligned
with
federal
agency
requirements
and
open
data
ini8a8ves.
This
policy
and
progress
towards
its
implementa8on
will
be
posted
on
relevant
web
pages.
119. A.
Research
Data
must
be
created,
maintained,
protected,
and
shared
in
accordance
with
contractual,
legisla8ve,
regulatory,
ethical
and
other
relevant
requirements.
B.
Where
permied,
management
and
sharing
of
Research
Data
should
be
supported
through
the
alloca8on
of
the
funding
that
supported
the
research.
C.
Rights
assigned
to
Research
Data
should
not
unnecessarily
restrict
its
management,
sharing,
or
reuse.
D.
A
Data
Management
Plan
(DMP)
should
be
documented
for
all
research
projects
that
will
produce
Research
Data,
with
excep8ons
noted.
E.
Following
comple8on
of
a
research
project,
the
Research
Data
to
be
shared
should
be
deposited
in
one
or
more
Trusted
Data
Repositories
for
access
and
preserva8on.
F.
Research
Data
shared
by
University
Researchers
should
be
registered
with
the
University
Libraries,
regardless
of
whether
access
to
the
Data
is
hosted
by
the
University
or
a
third
party.
G.
Shared
Research
Data
should
be
made
available
for
access
and
reuse
in
a
8mely
manner,
in
compliance
with
funding
or
other
requirements.
H.
Shared
Research
Data
should
be
curated
and
preserved
in
sufficient
detail
for
the
full
Period
of
Reten8on,
in
conformance
with
this
Policy
or
with
legisla8ve,
regulatory,
or
contractual
obliga8ons.
I.
Shared
Research
Data
produced
or
used
during
research
should
be
cited
in
all
research
outputs
following
accepted
or
emerging
data
cita8on
prac8ces.
119
124. Our
professional
future
The genealogy of the
contemporary research
library
The risk of invisibility
The emergence of open
science
Our core professional
skills
An overview of data
management
The policy context
Data management
activities
The
UQ
experience
DM service philosophy
Partnerships
Skills development
The Australian model
Emerging services at CMU
Pu6ng
it
into
prac8ce
The
data
management
impera8ve