Advancing the fourth paradigm of research: Assimilating repositories into active research phases
Title Here Advancing the fourth paradigm of research: Assimilating repositories into active research phases Tyler Walters Dean, University Libraries, Virginia TechSPARC Conference, Kansas City, March 12, 2012 Title Here, Optional or Unit Identifier
The Rise of Virtual Environments Repositories are being woven into “virtual ecosystems,” they are holistic and support communities of practice • Early stages / deposit: raw/early phase data, notes, etc. • Annotating, sharing within research groups, commenting, etc. • Research proposal writing, project planning, etc. • Tools: • Discovery, analysis, visualization, and text/data/image mining are being used in concert with repositories • Virtual communities and their communication tools • e.g., social media and community networking capabilities2
Which projects are highlighted? TARDIS Purdue University Research Repository | PURR3
e-Research Version 0.9 Protein Crystallography Research Data and Metadata Workﬂow 3/6/2011 Computer Researchers Computer Cluster Research Admin RM4 Repository SynchrotronMX1 / MX2 Metaman MyTardis MyTardisBeamline Monash Raw Data Metadata Protein Crystallography The Australian Repositories Extraction for Diffraction ImageS Research Data Management Platform (for raw, processed, reﬁned, and published data) Australian ORCA Research Data Commons From Capture Metadata Harvester to Publication MyTardis MyTardis LEGEND Institutional Data & Metadata Research Institutional MyTardis Metadata Data Research Future?: Registry Data Proposed Registry Institutional Virtual lab Research Data system Registry
Early Stage/Deposit • Move curation upstream in the data/information life cycle • Automatically capture metadata, defined by the data producers • Provide facilities for annotation and mark-up of data5
Early Stage: The Active Curation ModelActive Curation Social MediaWorkflows Data Review Rating Commenting Metadata
Tools and Toolkits • A Critical Intersection in the ‘Virtual Ecosystem’ is: Tools + Repositories • Developing toolkits for discovery, analysis, visualization, and text/data/image mining… all are being used with repositories • Leveraging existing tools (open source and proprietary) • Incorporating custom, discipline-specific tools7
Tools & Toolkits Functionality: • By data type • Search • Visualization • Subsetting • Analysis • Services8
Working Group Support • File share • Wikis • To do lists • Blogs • Calendars • Forums • Project notes • Commenting • Tagging • Proposal writing11
How do IRs and “papers” fit in? IRs are being leveraged in these new developments• Services over an Network of Data Producers active content layer that is backed by/harvested into a Web User Interface federated archive infrastructure based Active Content Repository Services Provided on institutional Content Curation Archival data Other Mining Decisions services resources generation Virtual Archives Institutional Repositories(slide from SEAD) Data RPI UIUC UM IU ICPSR Conservancy User Network
Linked Data and Repositories• Tag and annotate data• Overlay it with reference data• Organize it in domain terminology• Link it to people, papers, projects, conversations…(slide from SEAD)
Thank you… Tyler Walters email@example.com tywalters1 = Skype / Twitter Acknowledgements for slides and conversations: • Robert McDonald (Indiana), SEAD • William Michener (New Mexico), DataONE • Antohny Bietz and Steve Androulakis (Monash), TARDIS • Michael Witt (Purdue), PURR • Suzie Allard (Tennessee), ORNL DAAC • Sayeed Chourdhury (Johns Hopkins), Data Conservancy14
A particular slide catching your eye?
Clipping is a handy way to collect important slides you want to go back to later.