Academic libraries have a big data problem: Where can we put big licensed and open datasets so that researchers can easily access and analyze them? How do we broker access to the data our researchers need without prohibitively expensive investments in infrastructure, staffing, and updates? Why isn’t there a sustainable, affordable, and standardized library solution for large datasets? Meet CADRE, the open-source Collaborative Archive & Data Research Environment developed in collaboration with nine university libraries, eight non-profit and industry partners, and the IMLS. CADRE is a cloud-based platform solution for making licensed, big data sets & open and non-consumptive data sets accessible with appropriate security, stewardship, and storage in place. The CADRE model offers a new perspective on using shared tools to enable inexpensive, reliable, hands-off access to big data resources.
LUNULARIA -features, morphology, anatomy ,reproduction etc.
Computational Research for Everyone: A New Model for Shared Big Data Infrastructure in Academic Libraries
1. COMPUTATIONAL
RESEARCH FOR
EVERYONE
A N E W M O D E L F O R S H A R E D
B I G D A T A I N F R A S T R U C T U R E
I N A C A D E M I C L I B R A R I E S
J A M I E W I T T E N B E R G
I N D I A N A U N I V E R S I T Y L I B R A R I E S
@ J A M I E V I V A
3. THE
DILEMMA
It is cost-prohibitive for most
individual libraries to develop
and implement infrastructure
to provide access to licensed
big data sets and large or
unwieldy open data sets
4. THE
DILEMMA
Many researchers who could
benefit from text and data
mining library-acquired &
large open resources are only
be able to do so via a
graphical user interface
6. PROJECT PARTNERS
Indiana University Libraries
IU Network Science Institute Big Ten Academic Alliance
This project was made possible in part by the Institute of Museum and Library Services LG-70-18-0202.
7. THE
SOLUTION
CADRE is a cloud-based
platform that provides secure
access to library-licensed
datasets and open, non-
consumptive datasets
8. THE
DATASETS
Web of Science
Leading commercial dataset with 63
million documents & 1.2 billion
citations
Microsoft Academic Graph
Open bibliometric dataset with 208
million documents & 1.4 billion
citations
9. THE
SOLUTION
By sharing the cost of this
solution across a large
number of academic libraries,
we are able to provide a
superior solution at a lower
cost to members, as well as a
free service tier for non-
members
10. THE
SOLUTION
CADRE will feature a
graphical user interface;
custom computational
resources; and a space to
share and store queries,
algorithms, derived data,
results, workflows, and
visualizations.
13. CONTACT US
These slides are licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License
@CADRE_Project
cadre@iu.edu
www.cadre.iu.edu