Not just collaboration in workflows, but collaborating with sharing workflows Over 400 taverna workflows publicly available. Combine different formalisms in one system? E.g. a dataflow Kahn network and a central- clock based calculus Kepler logo
In particular a platform for research into curation practices As in the panel today Expert – Is library like Suppliers and crowd are the web side Automated is Expert curators : bioinformaticians who understand the services and workflows whose job it is to annotate and set up the curation pipelines, for services and workflows that are not of their own making. Self-curation: Some registries are closed – the myGrid registry is only curated by experts from the myGrid project itself. Others encourage service developers to self-curate, emphasising the use of plug-ins to service development environments such as Eclipse; examples include BioMoby’s jMoby plugin and SAWSDL4J, Lumina and Radiant toolkits for SAWSDL and WSMO Studio (21). Workflow repositories such as myExperiment rely on self-curation by the workflow developers and community curation by their users. Challenges include (a) the enforcement of controlled vocabularies by self-curators, particularly if the vocabularies are also managed by the developers as they can quickly become unruly and (b) incentivising people to contribute their services and workflows for the good of the community. Community Curators : The trend is to follow in the footsteps of popular Web 2.0 social computing sites and encourage community curation through user feedback, blogging, e-tracking, recommendations and folksonomy based tagging. Community approach to services development and use being tried by Seekda and BioMoby and for workflows by myExperiment. Community and self-curation requires built-in incentive models for people to contribute such as credit and attribution, but can be made to work for example iCapture successfully pioneered community curation of ontologies (Wilkinson PSB). Automated Curators : Automated scavengers and crawlers identify candidates for submission and extract as much metadata as possible. Functional metadata is hard to auto-curate, requiring: specialist metadata extraction tools ; software plug-ins that incidentally gather metadata from services as they are used in applications; or smart reasoning over seeded service descriptions and workflows . Operational and usage metadata is ripe for automation, generated from monitoring services, application diagnostics, customer reports and Social Network Analysis. Workflow analytics is the term used for processing workflow collections to identify, for example, service co-use patterns and service popularity. Automated curation needs excellent infrastructure.
myExperiment – Defining the Social Virtual Research Environment David De Roure, Carole Goble, Jiten Bhagat, Don Cruickshank, Antoon Goderis, Danius Michaelides and David Newman
scientists Graduate Students Undergraduate Students experimentation Data, Metadata Provenance Workflows Ontologies Digital Libraries The social process of Science 2.0 Local Web Repositories Virtual Learning Environment Technical Reports Reprints Peer-Reviewed Journal & Conference Papers Preprints & Metadata Certified Experimental Results & Analyses
HTML For Developers mySQL Search Engine reviews ratings groups friendships tags Enactor files workflows ` RDF Store SPARQL endpoint Managed REST API facebook iGoogle android XML API config profiles packs credits
Workflow Monitoring Event Logging Social Metadata Annotation Service Search User Registration Distributed Data Query Job Execution Naming and Identity Anonymisation Text Mining Research Object Management Assembling e-Laboratories Example Core Services
An e-Lab is a set of components and resources
An open system, not a software monolith
Utility of components transcends their immediate application
We envisage an ecosystem of cooperating e-Laboratories
Content Capture and Curation Workflows and Services Experts Social by User Community refine validate refine validate Self by Service Providers seed seed refine validate seed Automated refine validate seed Reuse and Symbiosis
It should facilitate the management and sharing of Research Objects – these are the digital commodities that are used and reused by researchers, ranging from data and methods to scholarly publications.
It should support the social model : producers of research objects should have incentives to make them available; consumers need to be able to discover and reuse them; all will benefit from self- and community-curation.
It should provide an open, extensible environment to permit ease of integration with other software, tools and services, and benefit from participative contribution of software.
It should provide a platform to action research , for example to deliver research objects to remote services and software. It should be straightforward to create customised, task specific tools and environments.