A presentation given by Julie Allinson at the UK Archivematica group meeting on 6th November 2015 in Leeds. It describes work underway in the "Filling the Digital Preservation Gap" project using Archivematica to preserve research data
Digital Heritage 2015: International TAG CLOUD Project Workshop
Presentation by Holly Wright, Archaeology Data Service, United Kingdom\
Granada, Spain
29 September 2015
Presentation to the PREMIS Implementation Fair at iPRES 2016, about how PREMIS in METS metadata is implemented in the Archivematica digital preservation system.
Big data for bay area big data developer19scottmiller
Global Big Data Conference is offering 3 day extensive Developer conference on Big Data. This is a fast paced, vendor agnostic, technical overview of the Big Data landscape.
Over the past year the University of New Mexico (UNM) Libraries instituted a new digital preservation initiative that was literally built from the ground up. Initially conceived as a means to preserve the libraries' digital collections, the project involved developing program structure, improving tools and working with vendors. As the project developed, the digital preservation needs of a broader community than originally planned became vividly apparent, and it evolved into a much larger endeavor that includes preservation of research data, university archives and digital cultural heritage collections from partner institutions around the state. The presenters will discuss their experiences implementing digital preservation at UNM, and talk about how the initiative is starting to encompass the preservation needs of partner organizations.
Slides accompanying a brief talk given as part of the Archivematica User Group meeting at #SAA2016, the Society of American Archivists 2016 conference in Atlanta, GA. The user group meeting was held on August 3rd Room 309/310 in the Hilton Atlanta.
These slides offer Archivematica users a brief update on the features included in the current 1.5 release and what's on the roadmap for future releases, as well as discussion of related events and resources such as the first ArchivematiCamp in August, screencasts, and more.
Digital Heritage 2015: International TAG CLOUD Project Workshop
Presentation by Holly Wright, Archaeology Data Service, United Kingdom\
Granada, Spain
29 September 2015
Presentation to the PREMIS Implementation Fair at iPRES 2016, about how PREMIS in METS metadata is implemented in the Archivematica digital preservation system.
Big data for bay area big data developer19scottmiller
Global Big Data Conference is offering 3 day extensive Developer conference on Big Data. This is a fast paced, vendor agnostic, technical overview of the Big Data landscape.
Over the past year the University of New Mexico (UNM) Libraries instituted a new digital preservation initiative that was literally built from the ground up. Initially conceived as a means to preserve the libraries' digital collections, the project involved developing program structure, improving tools and working with vendors. As the project developed, the digital preservation needs of a broader community than originally planned became vividly apparent, and it evolved into a much larger endeavor that includes preservation of research data, university archives and digital cultural heritage collections from partner institutions around the state. The presenters will discuss their experiences implementing digital preservation at UNM, and talk about how the initiative is starting to encompass the preservation needs of partner organizations.
Slides accompanying a brief talk given as part of the Archivematica User Group meeting at #SAA2016, the Society of American Archivists 2016 conference in Atlanta, GA. The user group meeting was held on August 3rd Room 309/310 in the Hilton Atlanta.
These slides offer Archivematica users a brief update on the features included in the current 1.5 release and what's on the roadmap for future releases, as well as discussion of related events and resources such as the first ArchivematiCamp in August, screencasts, and more.
Extending DSpace 7: DSpace-CRIS and DSpace-GLAM for empowered repositories an...4Science
Presentation given at OR2019 in Hamburg, Germany
In recent years there has been an increasing need to position institutional repositories in a broader context that enhances research opportunities and facilitates the discovery of resources. This presentation is about DSpace-CRIS and DSpace-GLAM, in their new version compatible with DSpace 7, with renewed features built with the updated technology stack of DSpace 7: Angular and REST API, their characteristics and novelties, and how their adoption can empower the role of repositories within academic, research, and cultural heritage institutions. The migration process for both DSpace-CRIS/GLAM and DSpace users that want to enhance their repository with the additional features and capabilities provided by version 7 will be presented. DSpace-CRIS and GLAM are continuously being aligned with DSpace versions and support is provided through the same community channels. Finally, the future roadmap of the project will be discussed, in the same way as in the last ten years when ideas and features blossomed in DSpace-CRIS were later adopted by the standard DSpace distribution. The community is numerous and growing and the exchange of experiences is beneficial for all organizations.
In April 2014, the Bentley Historical Library received a $355,000 grant from the Mellon Foundation to integrate ArchivesSpace, Archivematica and DSpace into an end-to-end digital archives workflow. This presentation will identify key project goals and outcomes and demonstrate features and functionality of Archivematica’s new “Appraisal and Arrangement” tab.
These slides accompany a 1.5 hour webinar sponsored by the Western New York Library Resources Council, presented by Dan Gillean of Artefactual Systems on February 15th, 2017.
The session was intended to introduce participants to some of the key standards, services, and tools available to support digital preservation planning and activities. Part 1 focused on DP101, and how to begin tackling digital preservation in your institution. Part 2 introduced the Archivematica project's history, philosophy, and aims, while Part 3 was a live demonstration of Archivematica in action.
Thank you to WNYLRC for sponsoring this event!
Provenance as a building block for an open science infrastructureAndreas Schreiber
International Symposium on Grids & Clouds 2018 (ISGC 2018)
Taipei, Taiwan
March 23, 2018
http://indico4.twgrid.org/indico/event/4/session/17/contribution/46
This presentation was provided by Scott Anderson of Millersville University during a NISO Webinar, Engineering Access Under the Hood, Part Two, held on November 15, 2017
DSpace-CRIS slides presented at ORCID's Better Together webinar on 19.09.2019, full slide deck with ORCID introduction at https://doi.org/10.23640/07243.9884033.v2.
Video Recording available at https://vimeo.com/361523018
2010 EGITF Amsterdam - Gap between GRID and HumanitiesDirk Roorda
How useful/relevant is GRID and High Performance Computing in its current form for the Humanities, especially within the European Infrastructure projects CLARIN, DARIAH and CESSDA? We need virtual use cases!
DSpace-CRIS: an open source solution - Cineca euroCRIS membership meeting Por...Andrea Bollini
The idea of DSpace-CRIS has its origin in 2009 when the Hong Kong University decided to extend the information exposed in their DSpace IR adding information (people/projects) coming from other systems already in use (mainly) for administrative purpose: a CRIS.
One year ago, November 2012, DSpace-CRIS was released as an open source solution to enrich DSpace (1.8.2). After highlighting the important steps made by the DSpace Community in 2013, that will bring to the final release of DSpace 4.0 in December, Cineca focused its presentation on what DSpace-CRIS is today.
The most important announcement was that DSpace-CRIS is now compatible and compliant with the CERIF standard and that an export feature in CERIF XML will be available in the DSpace-CRIS 4.0 version. Indeed the key components of the CERIF data model are supported natively: UUID, timestamped relation, semantic characterization.
In addition to that, the dynamic, flexible and not hardcoded approach of DSpace-CRIS data model makes it very easy to create new entities (besides the few predefined ones) and configure instances compliant with CERIF.
There are several advantages that DSpace-CRIS brings to Institutional Repositories and to the DSpace community overall:
- CRIS entities as authority for Item metadata values;
- DSpace Items can be linked and displayed in the detail page of any CRIS entities;
- Ability to display selected publications (or any other related entities) in the researcher profile;
- It is possible to create lists of selected publications (or any other related entities);
- CRIS entity detailed page visit;
- Global & Top related CERIF Entity views & downloads referencing the CRIS entity (projects for researchers, researchers for OrgUnits, etc.);
- Global & Top item views & downloads referencing the CRIS entity;
- email and RSS alerts;
- Article level metrics for PubMed (extensible):
- Cited-by count in the item page
- Number of articles for researcher
- Total citations for researcher (only items in local DSpace database will be counted)
How to enhance your DSpace repository: use cases for DSpace-CRIS, DSpace-RDM,...4Science
Presented by Susanna Mornati at the 2019 DSpace North American User Group Meeting September 23 & 24, 2019 at the University of Minnesota in Minneapolis.
Abstract: DSpace-CRIS is a free open-source platform based on DSpace for Research Data and Information Management, adopted by a wide international community of universities and research centers: DSpace-CRIS Home. It complies with recommendations, open standards and technologies such as the OAI-PMH, SignPosting, and ResourceSync (recommended by the COAR Next Generation Repositories WG), it features complete ORCID integration, compliance with the CERIF model, the IIIF framework, and with the OpenAIRE Guidelines for Literature Repositories, Data Archives, CRIS Managers, to improve findability, accessibility, interoperability, and reuse of digital assets for research and cultural heritage. DSpace-CRIS collects and disseminates information about researchers' profiles, organizations, publications, patents, grants, awards, and all entities that populate the research domain and their relationships, besides storing and exposing full-text publications, datasets, and other relevant digital objects, providing persistent identifiers and long-term preservation capabilities. DSpace-RDM exposes datasets to visual exploration and M2M streaming for analysis thanks to the integration with CKAN. DSpace-GLAM enhances the fruition of the cultural heritage through the (crowd-funded) IIIF image viewer, providing remote fruition of cultural heritage and offering a great user experience. These flavors of DSpace allow to expose and share open data, open information, and open digital objects in a collaborative, interoperable, and sustainable way. The use cases of a variety of institutions in different countries and continents will be shared to show the use of this powerful technology.
"Filling the Digital Preservation Gap" with ArchivematicaJenny Mitcham
A webinar given by Jenny Mitcham and Simon Wilson to Digital Preservation Coalition members on 25th November 2015. It describes work underway in the "Filling the Digital Preservation Gap" project using Archivematica to preserve research data
"Filling the digital preservation gap" with ArchivematicaJenny Mitcham
A presentation given by Jenny Mitcham at the iPRES conference on 6th November 2015 in Chapel Hill, North Carolina. It describes work underway in the "Filling the Digital Preservation Gap" project using Archivematica to preserve research data
Extending DSpace 7: DSpace-CRIS and DSpace-GLAM for empowered repositories an...4Science
Presentation given at OR2019 in Hamburg, Germany
In recent years there has been an increasing need to position institutional repositories in a broader context that enhances research opportunities and facilitates the discovery of resources. This presentation is about DSpace-CRIS and DSpace-GLAM, in their new version compatible with DSpace 7, with renewed features built with the updated technology stack of DSpace 7: Angular and REST API, their characteristics and novelties, and how their adoption can empower the role of repositories within academic, research, and cultural heritage institutions. The migration process for both DSpace-CRIS/GLAM and DSpace users that want to enhance their repository with the additional features and capabilities provided by version 7 will be presented. DSpace-CRIS and GLAM are continuously being aligned with DSpace versions and support is provided through the same community channels. Finally, the future roadmap of the project will be discussed, in the same way as in the last ten years when ideas and features blossomed in DSpace-CRIS were later adopted by the standard DSpace distribution. The community is numerous and growing and the exchange of experiences is beneficial for all organizations.
In April 2014, the Bentley Historical Library received a $355,000 grant from the Mellon Foundation to integrate ArchivesSpace, Archivematica and DSpace into an end-to-end digital archives workflow. This presentation will identify key project goals and outcomes and demonstrate features and functionality of Archivematica’s new “Appraisal and Arrangement” tab.
These slides accompany a 1.5 hour webinar sponsored by the Western New York Library Resources Council, presented by Dan Gillean of Artefactual Systems on February 15th, 2017.
The session was intended to introduce participants to some of the key standards, services, and tools available to support digital preservation planning and activities. Part 1 focused on DP101, and how to begin tackling digital preservation in your institution. Part 2 introduced the Archivematica project's history, philosophy, and aims, while Part 3 was a live demonstration of Archivematica in action.
Thank you to WNYLRC for sponsoring this event!
Provenance as a building block for an open science infrastructureAndreas Schreiber
International Symposium on Grids & Clouds 2018 (ISGC 2018)
Taipei, Taiwan
March 23, 2018
http://indico4.twgrid.org/indico/event/4/session/17/contribution/46
This presentation was provided by Scott Anderson of Millersville University during a NISO Webinar, Engineering Access Under the Hood, Part Two, held on November 15, 2017
DSpace-CRIS slides presented at ORCID's Better Together webinar on 19.09.2019, full slide deck with ORCID introduction at https://doi.org/10.23640/07243.9884033.v2.
Video Recording available at https://vimeo.com/361523018
2010 EGITF Amsterdam - Gap between GRID and HumanitiesDirk Roorda
How useful/relevant is GRID and High Performance Computing in its current form for the Humanities, especially within the European Infrastructure projects CLARIN, DARIAH and CESSDA? We need virtual use cases!
DSpace-CRIS: an open source solution - Cineca euroCRIS membership meeting Por...Andrea Bollini
The idea of DSpace-CRIS has its origin in 2009 when the Hong Kong University decided to extend the information exposed in their DSpace IR adding information (people/projects) coming from other systems already in use (mainly) for administrative purpose: a CRIS.
One year ago, November 2012, DSpace-CRIS was released as an open source solution to enrich DSpace (1.8.2). After highlighting the important steps made by the DSpace Community in 2013, that will bring to the final release of DSpace 4.0 in December, Cineca focused its presentation on what DSpace-CRIS is today.
The most important announcement was that DSpace-CRIS is now compatible and compliant with the CERIF standard and that an export feature in CERIF XML will be available in the DSpace-CRIS 4.0 version. Indeed the key components of the CERIF data model are supported natively: UUID, timestamped relation, semantic characterization.
In addition to that, the dynamic, flexible and not hardcoded approach of DSpace-CRIS data model makes it very easy to create new entities (besides the few predefined ones) and configure instances compliant with CERIF.
There are several advantages that DSpace-CRIS brings to Institutional Repositories and to the DSpace community overall:
- CRIS entities as authority for Item metadata values;
- DSpace Items can be linked and displayed in the detail page of any CRIS entities;
- Ability to display selected publications (or any other related entities) in the researcher profile;
- It is possible to create lists of selected publications (or any other related entities);
- CRIS entity detailed page visit;
- Global & Top related CERIF Entity views & downloads referencing the CRIS entity (projects for researchers, researchers for OrgUnits, etc.);
- Global & Top item views & downloads referencing the CRIS entity;
- email and RSS alerts;
- Article level metrics for PubMed (extensible):
- Cited-by count in the item page
- Number of articles for researcher
- Total citations for researcher (only items in local DSpace database will be counted)
How to enhance your DSpace repository: use cases for DSpace-CRIS, DSpace-RDM,...4Science
Presented by Susanna Mornati at the 2019 DSpace North American User Group Meeting September 23 & 24, 2019 at the University of Minnesota in Minneapolis.
Abstract: DSpace-CRIS is a free open-source platform based on DSpace for Research Data and Information Management, adopted by a wide international community of universities and research centers: DSpace-CRIS Home. It complies with recommendations, open standards and technologies such as the OAI-PMH, SignPosting, and ResourceSync (recommended by the COAR Next Generation Repositories WG), it features complete ORCID integration, compliance with the CERIF model, the IIIF framework, and with the OpenAIRE Guidelines for Literature Repositories, Data Archives, CRIS Managers, to improve findability, accessibility, interoperability, and reuse of digital assets for research and cultural heritage. DSpace-CRIS collects and disseminates information about researchers' profiles, organizations, publications, patents, grants, awards, and all entities that populate the research domain and their relationships, besides storing and exposing full-text publications, datasets, and other relevant digital objects, providing persistent identifiers and long-term preservation capabilities. DSpace-RDM exposes datasets to visual exploration and M2M streaming for analysis thanks to the integration with CKAN. DSpace-GLAM enhances the fruition of the cultural heritage through the (crowd-funded) IIIF image viewer, providing remote fruition of cultural heritage and offering a great user experience. These flavors of DSpace allow to expose and share open data, open information, and open digital objects in a collaborative, interoperable, and sustainable way. The use cases of a variety of institutions in different countries and continents will be shared to show the use of this powerful technology.
"Filling the Digital Preservation Gap" with ArchivematicaJenny Mitcham
A webinar given by Jenny Mitcham and Simon Wilson to Digital Preservation Coalition members on 25th November 2015. It describes work underway in the "Filling the Digital Preservation Gap" project using Archivematica to preserve research data
"Filling the digital preservation gap" with ArchivematicaJenny Mitcham
A presentation given by Jenny Mitcham at the iPRES conference on 6th November 2015 in Chapel Hill, North Carolina. It describes work underway in the "Filling the Digital Preservation Gap" project using Archivematica to preserve research data
A collaborative approach to "filling the digital preservation gap" for Resear...Jenny Mitcham
A presentation given by Jenny Mitcham at the Northern Collaboration Conference on 10th September 2015 at Leeds. It describes work underway in the "Filling the Digital Preservation Gap" project using Archivematica to preserve research data
A collaborative approach to "filling the digital preservation gap" for Resear...Jenny Mitcham
A presentation given by Chris Awre, Jenny Mitcham and Sarah Romkey at RDMF14 (the DCC's Research Data Management Forum) on 9th November 2015 in York. It describes work underway in the "Filling the Digital Preservation Gap" project using Archivematica to preserve research data
Jisc Shared Service requirements presentation - 18th November 2015Jenny Mitcham
A presentation by Chris Awre and Jenny Mitcham about our requirements gathering exercise for the "Filling the Digital Preservation Gap" project to inform the requirements of Jisc's proposed shared service for RDM. The presentation was delivered on the 18th November 2015 at Jisc's shared services workshop at Aston University
“Filling the digital preservation gap”an update from the Jisc Research Data ...Jenny Mitcham
Presentation given to the Hydra Preservation Interest Group by Jenny Mitcham on the Jisc Research Data Spring project "Filling the Digital Preservation Gap"
Implementing figshare, research data networkJisc RDM
Implementing figshare and engaging researchers,
Research data network, September 2016, Georgina Parsons, Cranfield University and Megan Hardeman, figshare.
Jiří Pospíšil
Palacký University in Olomouc,
Czech Republic
Education, Research & Development
6th International Conference
4–8 September 2015
Elenite Holiday Village, Bulgaria
www.sciencebg.net
Dan Roger Sträng
Østfold University College
Norway
Education, Research & Development
5th International Conference
4–8 September 2014
Elenite Holiday Village, Bulgaria
www.sciencebg.net
Jenny Mitcham from the University of York and Chris Awre from the University of Hull share lessons learned from their project to explore the potential of the digital preservation solution Archivematica to help manage research data that academics within the University produce. The project 'Filling the Digital Preservation Gap' has been carried out with funding from Jisc as part of their Research Data Spring program and was a collaboration of the University of York and the University of Hull. The project did not only explore Archivematica as a possible solution but also how it could integrate with the repositories and other systems for the management of research data.
The Series is jointly sponsored by ANDS and CAUL.
Research Data (and Software) Management at Imperial: (Everything you need to ...Sarah Anna Stewart
A presentation on research data management tools, workflows and best practices at Imperial College London with a focus on software management. Presented at the 2017 session of the HPC Summer School (Dept. of Computing).
This presentation was provided by
Priscilla Caplan of The Florida Center for Library Automation and Jeremy York of The University of Michigan Library, during the NISO Webinar "What It Takes To Make It Last: E-Resources Preservation" held on February 10, 2011.
This presentation will provide an overview of issues in digital preservation. Presentation was delivered during the joint DPE/Planets/CAPAR/nestor training event, ‘The Preservation challenge: basic concepts and practical applications’ (Barcelona, March 2009)
UBC Library's Digital Preservation StrategyUBC Library
Presented by Bronwen Sprout & Sarah Romkey, UBC Library.
In early 2011, UBC Library began work on creating a digital preservation strategy in collaboration with Vancouver-based Artefactual Systems. Based on the results of a number of pilot projects, the strategy developed for UBC Library consists of using the open-source Archivematica digital preservation system to provide preservation functionality for the Library’s digitized and born-digital holdings. In addition, the strategy identifies the software requirements, existing and new system components, staffing and business processes that can be implemented to establish operational digital preservation systems and processes. They will discuss the strategy generally and cover three areas of implementation in greater detail: UBC Library’s Rare Books and Special Collections, cIRcle, a DSpace-based institutional repository, and CONTENTdm, UBC Library’s access system for digitized objects.
Open source glam tools for building sustainable cultural heritage and digital...LIBIS
Op donderdag 16 december 2016 gaf Roxanne Wyns van LIBIS een gastles voor het vak Online Publishing in de MA Cultural Studies / Ma Digital Humanities aan de KU Leuven met als thema ‘Open source GLAM tools for building sustainable Cultural Heritage and Digital Humanities infrastructures’.
Beschrijving:
This session focuses on a number of Cultural Heritage and Humanities infrastructure projects in which gallery, library, archival and museum (GLAM) tools have been used in combination with other open source and proprietary systems to provide sustainable and innovative environments for the management and research of diverse cultural heritage collections. The session will introduce software’s such as CollectiveAccess, Omeka, and the IIIF Mirador high-resolution viewer. Attention will also be payed to the opportunities and challenges of open source projects and best practices in standards, data interoperability and safe data storage to achieve good data management.
Ross King, Project Director of SCAPE, gave a short presentation of the EU funded project SCAPE, including descriptions of tools for planning and monitoring digital preservation, scalable computation and repositories, SCAPE Testbeds and where to learn more.
The presentation was given at the workshop ‘Preservation at Scale’ http://bit.ly/17ppAln in connection with the iPres2013 conference in Lissabon, Portugal, in September 2013.
Preservation of Research Data: Dataverse / Archivematica Integration by Allan...datascienceiqss
Scholars Portal, a program of the Ontario Council of University Libraries (OCUL), provides the technical infrastructure to store, preserve, and provide access to shared digital library collections in Ontario - including hosting a local instance of Dataverse since 2011. As part of a national project known as Portage (a project of the Canadian Association of Research Libraries), Scholars Portal is partnering with Artefactual Systems, Dataverse, the University of British Columbia, the University of Alberta, and others, to integrate Dataverse with preservation software Archivematica. When completed, this project will facilitate the long-term preservation of research data according to the Open Archival Information System (OAIS) Reference Model.
A brief overview of the development and current workflows for Research Data Management at Imperial College London, presented to colleagues at the University of Copenhagen and Roskilde University in Denmark.
In this session, we’ll focus exclusively on OpenStack Swift, OpenStack’s object store capability. We’ll review the architecture, use cases, deployment strategies and common obstacles as we “open up the covers” on this exciting element of the OpenStack architecture.
Show drafts
volume_up
Empowering the Data Analytics Ecosystem: A Laser Focus on Value
The data analytics ecosystem thrives when every component functions at its peak, unlocking the true potential of data. Here's a laser focus on key areas for an empowered ecosystem:
1. Democratize Access, Not Data:
Granular Access Controls: Provide users with self-service tools tailored to their specific needs, preventing data overload and misuse.
Data Catalogs: Implement robust data catalogs for easy discovery and understanding of available data sources.
2. Foster Collaboration with Clear Roles:
Data Mesh Architecture: Break down data silos by creating a distributed data ownership model with clear ownership and responsibilities.
Collaborative Workspaces: Utilize interactive platforms where data scientists, analysts, and domain experts can work seamlessly together.
3. Leverage Advanced Analytics Strategically:
AI-powered Automation: Automate repetitive tasks like data cleaning and feature engineering, freeing up data talent for higher-level analysis.
Right-Tool Selection: Strategically choose the most effective advanced analytics techniques (e.g., AI, ML) based on specific business problems.
4. Prioritize Data Quality with Automation:
Automated Data Validation: Implement automated data quality checks to identify and rectify errors at the source, minimizing downstream issues.
Data Lineage Tracking: Track the flow of data throughout the ecosystem, ensuring transparency and facilitating root cause analysis for errors.
5. Cultivate a Data-Driven Mindset:
Metrics-Driven Performance Management: Align KPIs and performance metrics with data-driven insights to ensure actionable decision making.
Data Storytelling Workshops: Equip stakeholders with the skills to translate complex data findings into compelling narratives that drive action.
Benefits of a Precise Ecosystem:
Sharpened Focus: Precise access and clear roles ensure everyone works with the most relevant data, maximizing efficiency.
Actionable Insights: Strategic analytics and automated quality checks lead to more reliable and actionable data insights.
Continuous Improvement: Data-driven performance management fosters a culture of learning and continuous improvement.
Sustainable Growth: Empowered by data, organizations can make informed decisions to drive sustainable growth and innovation.
By focusing on these precise actions, organizations can create an empowered data analytics ecosystem that delivers real value by driving data-driven decisions and maximizing the return on their data investment.
Adjusting primitives for graph : SHORT REPORT / NOTESSubhajit Sahu
Graph algorithms, like PageRank Compressed Sparse Row (CSR) is an adjacency-list based graph representation that is
Multiply with different modes (map)
1. Performance of sequential execution based vs OpenMP based vector multiply.
2. Comparing various launch configs for CUDA based vector multiply.
Sum with different storage types (reduce)
1. Performance of vector element sum using float vs bfloat16 as the storage type.
Sum with different modes (reduce)
1. Performance of sequential execution based vs OpenMP based vector element sum.
2. Performance of memcpy vs in-place based CUDA based vector element sum.
3. Comparing various launch configs for CUDA based vector element sum (memcpy).
4. Comparing various launch configs for CUDA based vector element sum (in-place).
Sum with in-place strategies of CUDA mode (reduce)
1. Comparing various launch configs for CUDA based vector element sum (in-place).
Explore our comprehensive data analysis project presentation on predicting product ad campaign performance. Learn how data-driven insights can optimize your marketing strategies and enhance campaign effectiveness. Perfect for professionals and students looking to understand the power of data analysis in advertising. for more details visit: https://bostoninstituteofanalytics.org/data-science-and-artificial-intelligence/
Project update: A collaborative approach to "filling the digital preservation gap" for Research Data Management
1. Project update:
A collaborative approach to
“filling the digital preservation
gap” for Research Data
Management
Julie Allinson
Technology Development Manager
Library & Archives
University of York
6 November 2015
2. Filling the digital preservation gap:
Project aim
“…to investigate
Archivematica and explore
how it might be used to
provide digital preservation
functionality within a wider
infrastructure for Research
Data Management.”
3. This is a collaboration
University of Hull:
• Chris Awre – Head of Information Services, Library and
Learning Innovation
• Richard Green – Independent Consultant
• Simon Wilson – University Archivist
University of York:
• Julie Allinson – Technology Development Manager
• Jen Mitcham – Digital Archivist
Artefactual Systems
Jisc
4. Project structure
• Phase 1 – explore: testing, research,
thinking -produce a report (3 months)
• Phase 2 – develop: make
Archivematica better for RDM, plan
implementation (4 months)
• Phase 3 – implement: set up proof of
concepts at York and Hull (6 months)
5. Phase 1: Read all about it!
http://digital-archiving.blogspot.co.uk/
6. Why do we need digital preservation
for research data?
• There is a digital preservation gap in current
RDM infrastructures
• We can’t ignore digital preservation – moving
targets for data retention mean we need to take
this seriously
• Funder requirements around retention
7. University of York RDM questionnaire 2013
• Which data management issues have you come
across in your research over the last five years?
– “Inability to read files in old software formats on old
media or because of expired software licences”
– 24% of 181 researchers who answered this question
admitted this had been a problem for them
Why do we need digital preservation
for research data?
8. Why Archivematica?
“The goal of the Archivematica project is to
give archivists and librarians with limited
technical and financial capacity the tools,
methodology and confidence to begin
preserving digital information today.”
9. Why Archivematica?
• Standards-based
• Open Source
• Flexible and customisable
• Compatible with hundreds of file formats
• Advanced search and storage management
• Integrated with third-party systems
From https://ww.archivematica.org/en/
10. Archivematica for RDM?
• Flexible - can support different institutional needs and
workflows
• Automates many digital preservation tasks
• Can be integrated with other systems
• Good for those with limited resources
• Enhancements driven by and for the digital preservation
community
11. Archivematica for RDM?
It gives institutions greater confidence
that they will be able to continue to
provide access to usable copies of
research data over time
13. Deliverable One
✓Research Data needs to be kept,
but we don’t know if anyone will ever want it
and it might be *massive*
The Solution: enable the DIP to be generated ‘on
request’ and not as part of the initial ingest
14. Deliverable Two
✓We want to be able to grab the DIP, and
metadata about it for pulling into our
repository
The Solution: a library to help with parsing and
creating METS files
https://github.com/artefactual-labs/mets-reader-writer
15. Deliverable Three
✓We want to be able to report on what we
have
The Solution: a search API to answer basic
questions about the number of files in storage,
their formats, date of ingest, etc.*
* we’re working with DMAOnline @lancaster
16. Deliverable Four
✓With large datasets, the current checksum
mechanism in Archivematica could be a
bottleneck
The Solution: support for multiple checksum
algorithms
17. Deliverable Five
✓What about all those file formats that
Archivematica can’t identify?
The Solution: mechanism for running file
identification with multiple tools and a report of
unidentified formats, working with PRONOM to
improve their coveraage
18. Deliverable Six
✓We want to make it easier for Institutions to
adopt archivematica
The Solution: documentation and screencasts
for Archivematica automation tools, eg.
https://wiki.archivematica.org/Getting_started#Installation
19. All of these new features will become
part of the core Archivematica code in
2016
20. Phase 3
• The plan is to run a third phase of the project to:
✓implement prototype RDM workflows with preservation
using the new Archivematica features at York and Hull
✓use the search API to populate DMAOnline with stats
about datasets
✓do more community outreach
• We will be pitching to Jisc in December for phase
three #fingerscrossed
21. How do York plan to use Archivematica?
Pure RDMonitor Archivematica
AIP
AIP Store
PURE
Web Services
Archivematica
REST API
DIPRepository
Data
Catalogue
Key:
human to human
machine to machine
human to machine
22.
23. Where to find out more
http://www.york.ac.uk/borthwick/
24. The Bigger Picture
•Jisc are looking at building shared services for
RDM
• Our project is inputting into the specification
and discussion
• One area we’d be interested to find out more
about is the appetite for ‘above campus’
options - discussion planned for later.
25. How could you use Archivematica?
• Host it in-house and link it to an existing
repository/access system (for example DSpace,
CONTENTdm, Fedora/Hydra ...or a CRIS)
• Host it in-house and use as a standalone system
(you would need to have a storage system in place and establish a
way of facilitating access to the data)
• Sign up for a hosted instance of Archivematica
with archivesDIRECT (combines Archivematica with
DuraCloud storage)
• Sign up for a hosted instance of Archivematica
with Arkivum (combines Archivematica with Arkivum storage)