SEAD: A system to support social and active data curation
Poster presented at Data2012: Coming together around data, a PI meeting for Datanet/Interop

Published in: Technology, Education
SEAD: A system to support active and social data curation in sustainability science
SEAD: Sustainable Environment - Actionable Data

Jude Yew, Ann Zimmerman, Margaret Hedstrom, Praveen Kumar, Robert McDonald, James Myers, Beth Plale
University of Michigan; University of Illinois at Urbana-Champaign; Indiana University; Rensselaer Polytechnic Institute

1. Problem/Domain
- Sustainability science is a data-intensive area that focuses on the complex interactions between nature and human activities.
- Sustainability research requires access to data from the physical and social sciences.
- But data are difficult to find, obtain and use because different disciplines collect, describe and store their data in different ways.

2. Data Challenges in Sustainability Science
The long tail of scientific research data:
- Small and derived data sets
- Heterogeneous data
- Multiple sources of data
- Short-lived data with long-term value
- Value of data grows when combined & integrated

3. SEAD Goals
SEAD will address the needs of sustainability researchers to search for, aggregate, and maintain valuable data for the long term. To do this, the project seeks to build a prototype that:
- Applies existing tools and services to sustainability research
- Integrates these services into a generalizable "Active and Social Curation" infrastructure
- Enables researchers to collaborate and share data during active projects
- Packages and migrates data valued by the users to a federated repository for long-term preservation

4. SEAD Strategy
Networked data, Active curation, Social curation, Long-term archive solution
- Tag and annotate data
- Overlay it with reference data
- Organize it in domain terminology
- Link it to people, papers, projects, conversations
- Leverage social media for discovery of data, interest & expertise
- Move data curation upstream in the data life-cycle
- Support annotation of data by users of data
- Record metadata at ingest
- Record conversations and comments surrounding the data
- Make connections between data & researchers through social networking
- Take advantage of existing infrastructures (Institutional Repositories, ICPSR) for long-term preservation

5. SEAD Use Cases
i) Able to ingest a variety of data types
ii) Support data discovery
iii) Add value to existing data
iv) Create new data
v) Community curation of data
- Users can store, manage and share heterogeneous data types (e.g. images, geo-spatial images, sensor data etc.)
- Users of data can provide additional metadata and annotations
- Provide links between data, people & publications
- Combine data from multiple sources and contribute derived data back to SEAD
- The community identifies and curates data of value
- These valued data will be moved to existing institutional repositories for long-term storage

- The SEAD team will work closely with the community of sustainability scientists to evolve these use cases.
- In the first two years of the project, SEAD will collaborate with scientists studying the Upper Great Lakes and Upper Mississippi River Basin.
- Through this collaboration, SEAD will prototype a system that helps researchers manage their data and motivates them to share data and information about their data with others.

SEAD is funded by the National Science Foundation under cooperative agreement #OCI0940824.