Data 2012 -- Presentation by Margaret Hedstrom (Jan 2012)

SEAD presentation by Margaret Hedstrom at the DataNet Interop Meeting in Indianapolis January 26, 2012.

  • We will build usable and useful tools that scientists can take advantage of as they collect, generate, and organize data in their active projects. This Active Curation approach will be designed with a great deal of user input to make sure that the tools are lightweight, easy to learn, easy to use, and more effective than the painstaking, hand-crafted approach that many sustainability scientists use today. The Active Curation approach will make data management easier for data producers and lower the curation costs to SEAD.

    Another part of our strategy is to deploy a variety of social networking and social-media-inspired tools to engage the community of data producers and users. These include tools for annotation, rating, and commentary on data sets; visualizations of publication and citation networks that map the invisible college of sustainability science researchers; and social networking tools that help build network effects.

    We have designed our program with multiple mechanisms to encourage participation in SEAD and adoption of its approach. These include holding domain engagement workshops to surface needs and requirements and to ensure usability of tools, and enlisting key leaders in sustainability as early adopters and promoters of SEAD. These strategies, along with support for centralized curation services, education, outreach, and training, will create a model for sustainable access and preservation of heterogeneous data for sustainability science and other small science disciplines in the long tail.

Transcript

  • 1. SEAD: Sustainable Environment – Actionable Data
      Margaret Hedstrom, PI (Michigan)
      Praveen Kumar, co-PI (Illinois)
      Jim Myers, co-PI (RPI)
      Beth Plale, co-PI (Indiana)
      Ann Zimmerman, co-PI (Michigan)
  • 2. SEAD’s Goals
      • Provide data services that address the needs of researchers in sustainability science
      • Integrate these services into a generalizable “Active and Social Curation” infrastructure suited to data in the “long tail”
      • Develop capabilities to package and migrate the most valuable datasets to a federated repository infrastructure for long-term preservation
  • 3. Sustainability Science
      [Diagram labels: Science, Cooperation, Technology, Policy, Economics, Poverty & Justice]
  • 4. Data challenges
      • Small and derived data sets
      • Heterogeneous data
      • Multiple sources of data
      • Short-lived data with long-term value
      • Value of data grows when combined & integrated
  • 5. SEAD’s Strategy
      • Leverage social media for discovery of data, interest, and expertise
      • Move data curation upstream in the data life cycle
      • Involve domain scientists in setting priorities for evolution of data and services
      • Take advantage of existing infrastructures (Institutional Repositories, ICPSR) for long-term preservation
  • 6. Active Curation Model
      [Diagram labels, in extraction order: Active Curation, Social Media, Workflows, Data Review, Rating, Commenting, Metadata]
  • 7. SEAD: Leveraging Existing Resources
      • Cyberinfrastructure
          – IU Data Capacitor/HPC Capabilities
          – UIUC/NCSA HPC Capabilities
          – Rensselaer CCNI Capabilities
      • Repositories
          – UM Deep Blue
          – IU ScholarWorks
          – ICPSR Repository
          – UIUC IDEALS
  • 8. SEAD 18 Month Prototype Targets for Cyberinfrastructure
      • Domain Engagement
          – Requirements derived from researchers
          – Use Cases
      • Active and Social Content Curation
          – Pilot Active Content Repository, VIVO deployments
          – Exemplar services for Data Ingest, Discovery, Re-use, Curation
      • CI for Long-term Access
          – Data model, protocol design/development
          – Pilot Federated Repository infrastructure
  • 9. SEAD TEAM
      University of Michigan: Margaret Hedstrom (UM PI), Ann Zimmerman (Co-PI and Project Manager), George Alter, Bryan Beecher, Charles Severance, Karen Woollams, Jude Yew.
      Indiana University: Beth Plale (IU PI), Katy Borner, Robert H. McDonald, Kavitha Chandrasekar, Robert Ping, Stacy Kowalczyk, Robert Light.
      University of Illinois: Praveen Kumar (UIUC PI), Rob Kooper, Luigi Marini, Terry McLaren, Zaman Aktaruzzaman.
      Rensselaer Polytechnic Institute: Jim Myers (RPI PI), Ram Prasanna Govind Krishnan, Lindsay Todd, Adam Wilson.
  • 10. Acknowledgments
      SEAD is funded by the National Science Foundation under cooperative agreement #OCI0940824.
      http://sead-data.net