The Johns Hopkins University Data Management Services conducted an investigation into its first year of operation to better understand how to develop services from a social science perspective. They found that (1) data curation needs were emerging as a "social movement" in response to digital data production, (2) policies from funders like the NSF were helping to formalize data sharing requirements, and (3) academic libraries were starting to expand services to support data management and curation. Based on these trends and investigating university cultures, the JHU Data Management Services developed consulting, archiving, and planning support services tailored to researchers' data practices.
Data Curation Models JHU Barbara Pralle RDAP12ASIS&T
Data Curation Services Models: John Hopkins University
Barbara Pralle
Curation Service Models panel
Presentation at Research Data Access & Preservation Summit
22 March 2012
Data Curation Models JHU Barbara Pralle RDAP12ASIS&T
Data Curation Services Models: John Hopkins University
Barbara Pralle
Curation Service Models panel
Presentation at Research Data Access & Preservation Summit
22 March 2012
These are the slides for Robert H. McDonald for the Future Trends Panel Presentation at the the Inter-institutional Approaches to Supporting Scholarly Communication Symposium held on August 16, 2012 at the Georgia Institute of Technology.
ESI Supplemental 1 E-research Support SlidesDuraSpace
E-Research Support at
Johns Hopkins University & Purdue University
Supplemental Webinar
Wednesday, October 17, 2012
Presented by Sayeed Choudhurry & James Mullins
DuraSpace is OPEN presented by:
Debra Hanken Kurtz, CEO Jonathan Markow, CSO at the
11th Annual International Conference on Open Repositories 2016, Dublin
This presentation introduced participants to the DC 101 course and was given at the Digital Curation and Preservation Outreach and Capacity Building Workshop in Belfast on September 14-15 2009.
http://www.dcc.ac.uk/events/workshops/digital-curation-and-preservation-outreach-and-capacity-building-workshop
Going Full Circle: Research Data Management @ University of PretoriaJohann van Wyk
Presentation delivered at the eResearch Africa Conference, held 23-27 November 2014, at the University of Cape Town, Cape Town, South Africa. Various approaches to Research Data Management at Higher Education Institutions focus on an aspect or two of the research data cycle. At the University of Pretoria the approach has been to support researchers throughout the research process covering the whole research data cycle. The idea is to facilitate/capture the research data throughout the research cycle. This will give context to the data and will add provenance to the data. The University of Pretoria uses the UK Data Archive’s research data cycle model, to align its Research Data Management project-development. This model identifies the stages of a research data cycle as: creating data, processing data, analysing data, preserving data, giving access to data, and reusing data. This paper will give a short overview of the chronological development of research data management at the University of Pretoria. The overview will also highlight findings of two surveys done at the University, one in 2009 and one in 2013. This will be followed by a discussion of a number of pilot projects at the University, and how the needs of researchers involved in these projects are being addressed in a number of the stages of the research data cycle. The discussion will also give a short overview of how the University plans to support those stages not currently being addressed. The second part of the presentation will focus on the projects and technology (software and hardware) used. The University of Pretoria has adopted an Enterprise Content Management (ECM) approach to manage its Research Data. ECM is not a singular platform or system but rather a set of strategies, tools and methodologies that interoperate with each other to create a comprehensive management tool. These sets create an all-encompassing process addressing document, web, records and digital asset management. At the University of Pretoria we address all these processes with different software suites and tools to create a complete management system. Each process presented its own technical challenges. These had to be addressed, while keeping in mind the end objective of supporting researchers throughout the whole research process and data life cycle. Various platforms and standards have been adopted to meet the University of Pretoria’s criteria. To date three processes have been addressed namely, the capturing of data during the research process, the dissemination of data and the preservation of data.
This presentation was delivered at the Elsevier Library Connect Seminar on 6 October 2014 in Johannesburg, 7 October 2014 in Durban and 9 October 2014 in Cape Town and gives an overview of the potential role that librarians can play in research data management
I shall provide a summary of JISC work in the area of ‘Big Data’. My primary focus will be on how to manage the huge amount of research data produced in UK Universities. I shall cover the history of JISC interventions to improve research data management and look at next steps. I shall touch on some other areas of work like ‘Digging into Data’ and web archiving which also deal with ‘big data’.
RDAP14: Maryann Martone, Keynote, The Neuroscience Information FrameworkASIS&T
Research Data Access and Preservation Summit, 2014
San Diego, CA
March 26-28, 2014
Maryann Martone, Principal Investigator, Neuroscience Information Framework, University of California, San Diego
arXiv Sustainability Initiative RDAP12
Oya Rieger, Cornell University Library, arXiv
Digital Libraries Sustainability panel
Presentation at Research Data Access & Preservation Summit
22 March 2012
These are the slides for Robert H. McDonald for the Future Trends Panel Presentation at the the Inter-institutional Approaches to Supporting Scholarly Communication Symposium held on August 16, 2012 at the Georgia Institute of Technology.
ESI Supplemental 1 E-research Support SlidesDuraSpace
E-Research Support at
Johns Hopkins University & Purdue University
Supplemental Webinar
Wednesday, October 17, 2012
Presented by Sayeed Choudhurry & James Mullins
DuraSpace is OPEN presented by:
Debra Hanken Kurtz, CEO Jonathan Markow, CSO at the
11th Annual International Conference on Open Repositories 2016, Dublin
This presentation introduced participants to the DC 101 course and was given at the Digital Curation and Preservation Outreach and Capacity Building Workshop in Belfast on September 14-15 2009.
http://www.dcc.ac.uk/events/workshops/digital-curation-and-preservation-outreach-and-capacity-building-workshop
Going Full Circle: Research Data Management @ University of PretoriaJohann van Wyk
Presentation delivered at the eResearch Africa Conference, held 23-27 November 2014, at the University of Cape Town, Cape Town, South Africa. Various approaches to Research Data Management at Higher Education Institutions focus on an aspect or two of the research data cycle. At the University of Pretoria the approach has been to support researchers throughout the research process covering the whole research data cycle. The idea is to facilitate/capture the research data throughout the research cycle. This will give context to the data and will add provenance to the data. The University of Pretoria uses the UK Data Archive’s research data cycle model, to align its Research Data Management project-development. This model identifies the stages of a research data cycle as: creating data, processing data, analysing data, preserving data, giving access to data, and reusing data. This paper will give a short overview of the chronological development of research data management at the University of Pretoria. The overview will also highlight findings of two surveys done at the University, one in 2009 and one in 2013. This will be followed by a discussion of a number of pilot projects at the University, and how the needs of researchers involved in these projects are being addressed in a number of the stages of the research data cycle. The discussion will also give a short overview of how the University plans to support those stages not currently being addressed. The second part of the presentation will focus on the projects and technology (software and hardware) used. The University of Pretoria has adopted an Enterprise Content Management (ECM) approach to manage its Research Data. ECM is not a singular platform or system but rather a set of strategies, tools and methodologies that interoperate with each other to create a comprehensive management tool. These sets create an all-encompassing process addressing document, web, records and digital asset management. At the University of Pretoria we address all these processes with different software suites and tools to create a complete management system. Each process presented its own technical challenges. These had to be addressed, while keeping in mind the end objective of supporting researchers throughout the whole research process and data life cycle. Various platforms and standards have been adopted to meet the University of Pretoria’s criteria. To date three processes have been addressed namely, the capturing of data during the research process, the dissemination of data and the preservation of data.
This presentation was delivered at the Elsevier Library Connect Seminar on 6 October 2014 in Johannesburg, 7 October 2014 in Durban and 9 October 2014 in Cape Town and gives an overview of the potential role that librarians can play in research data management
I shall provide a summary of JISC work in the area of ‘Big Data’. My primary focus will be on how to manage the huge amount of research data produced in UK Universities. I shall cover the history of JISC interventions to improve research data management and look at next steps. I shall touch on some other areas of work like ‘Digging into Data’ and web archiving which also deal with ‘big data’.
RDAP14: Maryann Martone, Keynote, The Neuroscience Information FrameworkASIS&T
Research Data Access and Preservation Summit, 2014
San Diego, CA
March 26-28, 2014
Maryann Martone, Principal Investigator, Neuroscience Information Framework, University of California, San Diego
arXiv Sustainability Initiative RDAP12
Oya Rieger, Cornell University Library, arXiv
Digital Libraries Sustainability panel
Presentation at Research Data Access & Preservation Summit
22 March 2012
Curriculum Development at the Tetherless World Constellation - Peter Fox - RD...ASIS&T
Curriculum development at the Tetherless World Constellation – the days after the “Day One” initiative
Peter Fox (RPI and WHOI)
Tetherless World Constellation
Training Data Management Practitioners panel
Presentation at Research Data Access & Preservation Summit 23 March 2012
RDAP 16: Sustainability of data infrastructure: The history of science scienc...ASIS&T
Research Data Access and Preservation Summit, 2016
Atlanta, GA
May 4-7, 2016
Part of Panel 2, Sustainability
Presenter:
Kristin Eschenfelder, University of Wisconsin-Madison
Panel Leads:
Kristin Briney, University of Wisconsin-Milwaukee & Erica Johns, Cornell University
RDAP 16: Sustaining Research Data Services (Panel 2: Sustainability)ASIS&T
Research Data Access and Preservation Summit, 2016
Atlanta, GA
May 4-7, 2016
Part of Panel 2, Sustainability
Presenter:
Margaret Henderson, Virginia Commonwealth University
Panel Leads:
Kristin Briney, University of Wisconsin-Milwaukee & Erica Johns, Cornell University
Supporting Libraries in Leading the Way in Research Data ManagementMarieke Guy
Marieke Guy, Institutional Support Officer, Digital Curation Centre, UKOLN, University of Bath, UK presents on Supporting Libraries in Leading the Way in Research Data Management at Online Information, London 20th -21st November 2012
Carmen O'Dell and Barbara Sen JIBS-RLUK event July 2012sherif user group
RDM Rose by Carmen O'Dell and Barbara Sen, (University of Sheffield). Presentation at Demystifying Research Data: don’t be scared be prepared: A joint JIBS/RLUK event, Tuesday 17th July 17th July 2012, Brunei Gallery at SOAS (School of Oriental and African Studies), London.
DataCite and Campus Data Services
Paul Bracke, Associate Dean for Digital Programs and Information Services, Purdue University
Research libraries are increasingly interested in developing data services for their campuses. There are many perspectives, however, on how to develop services that are responsive to the many needs of scientists; sensitive to the concerns of scientists who are not always accustomed to sharing their data; and that are attractive to campus administrators. This presentation will discuss the development of campus-based data services programs, the centrality of data citation to these efforts, and the ways in which engagement with DataCite can enhance local programs.
This presentation was provided by Maria Praetzellis of California Digital Library, during the NISO hot topic virtual conference "Effective Data Management," which was held on September 29, 2021.
Meeting the NSF DMP Requirement June 13, 2012IUPUI
June 13 version of the IUPUI workshop Meeting the NSF Data Management Plan Requirement: What you need to know. This workshop is co-sponsored by the Office of the Vice Chancellor for Research and the University Library.
Libraries and Research Data Management – What Works? Lessons Learned from the...LIBER Europe
This presentation by Dr Birgit Schmidt was given at the Scholarly Communication and Research Infrastructures Steering Committee Workshop. The workshop title was Libraries and Research Data Management – What Works?
Presentation from a University of York Library workshop on research data management. The workshop provides an introduction to research data management, covering best practice for the successful organisation, storage, documentation, archiving, and sharing of research data.
The Art of the Pitch: WordPress Relationships and SalesLaura Byrne
Clients don’t know what they don’t know. What web solutions are right for them? How does WordPress come into the picture? How do you make sure you understand scope and timeline? What do you do if sometime changes?
All these questions and more will be explored as we talk about matching clients’ needs with what your agency offers without pulling teeth or pulling your hair out. Practical tips, and strategies for successful relationship building that leads to closing the deal.
Securing your Kubernetes cluster_ a step-by-step guide to success !KatiaHIMEUR1
Today, after several years of existence, an extremely active community and an ultra-dynamic ecosystem, Kubernetes has established itself as the de facto standard in container orchestration. Thanks to a wide range of managed services, it has never been so easy to set up a ready-to-use Kubernetes cluster.
However, this ease of use means that the subject of security in Kubernetes is often left for later, or even neglected. This exposes companies to significant risks.
In this talk, I'll show you step-by-step how to secure your Kubernetes cluster for greater peace of mind and reliability.
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...UiPathCommunity
💥 Speed, accuracy, and scaling – discover the superpowers of GenAI in action with UiPath Document Understanding and Communications Mining™:
See how to accelerate model training and optimize model performance with active learning
Learn about the latest enhancements to out-of-the-box document processing – with little to no training required
Get an exclusive demo of the new family of UiPath LLMs – GenAI models specialized for processing different types of documents and messages
This is a hands-on session specifically designed for automation developers and AI enthusiasts seeking to enhance their knowledge in leveraging the latest intelligent document processing capabilities offered by UiPath.
Speakers:
👨🏫 Andras Palfi, Senior Product Manager, UiPath
👩🏫 Lenka Dulovicova, Product Program Manager, UiPath
Generating a custom Ruby SDK for your web service or Rails API using Smithyg2nightmarescribd
Have you ever wanted a Ruby client API to communicate with your web service? Smithy is a protocol-agnostic language for defining services and SDKs. Smithy Ruby is an implementation of Smithy that generates a Ruby SDK using a Smithy model. In this talk, we will explore Smithy and Smithy Ruby to learn how to generate custom feature-rich SDKs that can communicate with any web service, such as a Rails JSON API.
Connector Corner: Automate dynamic content and events by pushing a buttonDianaGray10
Here is something new! In our next Connector Corner webinar, we will demonstrate how you can use a single workflow to:
Create a campaign using Mailchimp with merge tags/fields
Send an interactive Slack channel message (using buttons)
Have the message received by managers and peers along with a test email for review
But there’s more:
In a second workflow supporting the same use case, you’ll see:
Your campaign sent to target colleagues for approval
If the “Approve” button is clicked, a Jira/Zendesk ticket is created for the marketing design team
But—if the “Reject” button is pushed, colleagues will be alerted via Slack message
Join us to learn more about this new, human-in-the-loop capability, brought to you by Integration Service connectors.
And...
Speakers:
Akshay Agnihotri, Product Manager
Charlie Greenberg, Host
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Tobias Schneck
As AI technology is pushing into IT I was wondering myself, as an “infrastructure container kubernetes guy”, how get this fancy AI technology get managed from an infrastructure operational view? Is it possible to apply our lovely cloud native principals as well? What benefit’s both technologies could bring to each other?
Let me take this questions and provide you a short journey through existing deployment models and use cases for AI software. On practical examples, we discuss what cloud/on-premise strategy we may need for applying it to our own infrastructure to get it to work from an enterprise perspective. I want to give an overview about infrastructure requirements and technologies, what could be beneficial or limiting your AI use cases in an enterprise environment. An interactive Demo will give you some insides, what approaches I got already working for real.
GraphRAG is All You need? LLM & Knowledge GraphGuy Korland
Guy Korland, CEO and Co-founder of FalkorDB, will review two articles on the integration of language models with knowledge graphs.
1. Unifying Large Language Models and Knowledge Graphs: A Roadmap.
https://arxiv.org/abs/2306.08302
2. Microsoft Research's GraphRAG paper and a review paper on various uses of knowledge graphs:
https://www.microsoft.com/en-us/research/blog/graphrag-unlocking-llm-discovery-on-narrative-private-data/
UiPath Test Automation using UiPath Test Suite series, part 4DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 4. In this session, we will cover Test Manager overview along with SAP heatmap.
The UiPath Test Manager overview with SAP heatmap webinar offers a concise yet comprehensive exploration of the role of a Test Manager within SAP environments, coupled with the utilization of heatmaps for effective testing strategies.
Participants will gain insights into the responsibilities, challenges, and best practices associated with test management in SAP projects. Additionally, the webinar delves into the significance of heatmaps as a visual aid for identifying testing priorities, areas of risk, and resource allocation within SAP landscapes. Through this session, attendees can expect to enhance their understanding of test management principles while learning practical approaches to optimize testing processes in SAP environments using heatmap visualization techniques
What will you get from this session?
1. Insights into SAP testing best practices
2. Heatmap utilization for testing
3. Optimization of testing processes
4. Demo
Topics covered:
Execution from the test manager
Orchestrator execution result
Defect reporting
SAP heatmap example with demo
Speaker:
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
UiPath Test Automation using UiPath Test Suite series, part 4
Iassist 2012 dms public version
1. Johns Hopkins University Data
Management Services:
Reviewing Our First Year
David Fearon
Betsy Gunia
IASSIST June 7, 2012
1
2. Our service development:
a social science analogy
Developing new services for data management support required
understanding social factors, not just technical.
• Services emerging from data curation
“social movement”
• Investigating
– University culture for outreach
– Data practices for consulting & curation support
– Trends in public funding, data sharing & curation
2
3. Data curation “social movement”
Emergence Recognizing the need for curation & access
• eScience: digital data production outpaces sharing &
preservation
Coalescence Policy & requirements
• Jan 2011: NSF Data Management Plan requirement
Institutionalization Building Services & Infrastructure
• 2009: NSF DataNet funds Data Conservancy at JHU
3
4. Data Management Services at
Academic Libraries
• Academic libraries in early stages of expanding services to
support data curation & management
Data Management Planning Support
Assist in data
Provide tools Consult with
archiving &
for planning researchers
sharing
JHU Data Management Services
4
5. JHU Data Management Services
Preparing Data Archiving Research Data
Management Plans
Assist in implementing plan,
Assistance with data
Services management plans
preserve & share research
through JHU Data Archive
Funded by JHU schools
Cost using our service
2% of grant’s total direct cost
JHU Data Archive features (in development):
Data Conservancy architecture Accepts data from all disciplines
Feature-level discovery Interoperable data access
Persistent identifiers Supports long-term preservation
.org
5
6. Investigating JHU Cultures to introduce
Data Management Services
Outreach Goals Approach
Building awareness Finding appropriate
of services communication channels
Research
Projects Dept.
Deans
Admin Admins
Library
Targeted
Colleagues Researchers workshops
6
7. Investigating JHU Cultures to introduce
Data Management Services
Outreach Goals Approach
Building awareness Finding appropriate
of services communication channels
Demonstrating Consulting and
Relevance of customized support
services
Do DMPs
Investigating data practices archive
matter to Can’t I copy Why
NSF? a template? my data?
7
9. Consulting on data management plans
Contact from
Final Plans Principal Info on PI
Investigator (PI)
Standards
JHU DMS
Background
Comments on
Plan Research
Data
Repositories
PI Drafts Data
In-Person Questionnaire
Management
Consultation
Plan
9
10. In-Person Consultation Tool:
Questionnaire (sample questions)
Section I: Data Products and Standards
What naming conventions/schema will be used for your
data, if any?
Section II: Data Storing and Long-Term Preservation
During the project, how (i.e. media) and where (i.e.
location(s)) will data be stored and who is responsible
for it?
Section III: Data Sharing
Are there any data with privacy concerns to sharing
(e.g., human subjects)? If so, what policies need to be
adhered to and how will policies be enforced? 10
11. Social vs. Physical Science:
Observations from data plans
• Mixture of digital and physical material:
Data Type/ • digital (documents, audio recordings, photos, database)
Qty • non digital (newspaper, interview notes, artifacts)
• Datasets on the scale of MBs and GBs, not TBs
• Use of human subjects does not exempt one from sharing
Personal data
Identifiers oGraduate student received grant but was asked to modify
data management plan and find something to share
• More familiar with term “codebook” than “metadata”
Metadata
• Metadata is more often created manually by the PI
11
12. Archiving of Data:
An Evolving Process
Map how data flows through research
project
Evaluate existing data and identify
missing information
Decide what and when to archive
12
13. Investigating trends in data curation
Funders new requirements & policy
•Other funders adopting data management plan
requirements
Data curation archiving, repositories, standards
• Expanding our expertise and knowledgebase
Data sharing Collaboration, data-intensive science
• E.g., academic credit for sharing data collections?
13
14. Thank you
David Fearon
Betsy Gunia
datamangement@jhu.edu
JHU Data Management Services
The Sheridan Libraries
http://dmp.data.jhu.edu/
(410) 516-0713
14
Editor's Notes
Joint effort between us and scientistNot perfectly linear- can archive at any point, but this is our ideal processInterview-hardly anyone thinks their data is useful to others