Cwa Sustainability May8 Final


Professor Carole Goble's slides, presented at the inaugural meeting of the Concept Web Alliance, New York Hall of Science, May 8, 2009

Published in: Technology, Education

    1. Concept Web Alliance Sustainability & Governance
       Professor Carole Goble, University of Manchester, UK, [email_address]
       Concept Web Alliance, New York Hall of Science, 8th May 2009
    2. What is an Alliance?
        • An Alliance is a group of organizations working together to solve problems that cannot be solved alone.
        • Consensus
        • Interoperability
        • Pooling
    3. What is CWA?
        • A Forum: to unite stakeholders to share complex Life Science data in a new way, through triples.
        • A Facilitator: to promote the development of triple content services and triple professional services.
        • A Facility: a “warehouse”, distributor and agent for triples on behalf of contributors and users. Develop our own services.
    4. What is CWA. A Forum.
        For commons-based sharing:
        • Bringing together commercial and public stakeholders in life science data and scholarly publication
        • The definition and adoption of a common, interoperable model of rich annotated triples and the mechanisms to generate and use them
        • The specification of best practice, policies and use cases
        • The specification of compliance obligations to the model and the rights to and governance of the content.
    5. What is CWA. A Forum.
        For commons-based sharing:
        • The promotion of widespread adoption of the model and the triples described by it.
        • The identification and promotion of the services needed to enable widespread adoption
        • The discussion of legal, access and reuse issues
        • Ensuring the adoption of current technologies, standards and practices
    6. What is CWA. A Facilitator.
        Work with established entities to identify, develop, promote and oversee:
        • Triple Content services
            • Base: identity, mapping, provenance tracking…
            • Tools: capture, browsing, distribution, access…
            • Added value: reasoning…
        • Triple Professional services
            • Directory, content aggregation, service aggregation, quality control, guarantees and certification, governance of content suitability and compliance to model, stewardship, content access negotiation
    7. What is CWA: A Facility on behalf of the community…
        • A “warehouse” for deposited triples.
        • A distributor for deposited triples.
            • Infrastructure framework
            • Governance and legal framework
            • Financial and sustainability framework
        • An agent for triple contributors and consumers.
            • Infrastructure framework
            • Governance and legal framework
            • Incentive models for contribution
            • Negotiate to access data services
            • Financial and sustainability framework
    8. Linked Open Data
    9. Linked Open Data
    10. Start Small. Incremental Value. One thing well. Triples. Triples. Triples. Jam today. More Jam Tomorrow.
    11. Roadmap and timetable: Year 1 - Proposal
        • August 2009
            • Set up, incorporate the CWA, define governance
            • Get started on Working Groups (see next)
        • January 2010
            • Define the triple model, exchange format and core services
            • Define the technical framework for contribution, distribution and access to a trustworthy triple pool
            • Define the operational and governance framework
            • Build a partnership with a small number of key triple suppliers, triple consumers, tool providers and users
    12. Roadmap and timetable: Year 1 - Proposal
        • April 2010
            • Pilot: demonstrate CWA in operation and issue a grand challenge
            • 1st CWA Conference
            • Secure funding and resource commitments
        • May 2010
            • Membership with key organisations
            • BioIT Alliance, Pistoia Alliance, PRISM, ELIXIR, SAGE, CODATA, Shared Names…
            • W3C, LarKC, CrossRef, Creative Commons…
            • User groups in libraries, e.g. CNI
    13. Organisational Structure - Proposal
        • Executive Committee / Secretariat
        • Governing Board (Magnet Group)
        • Operations Core team
        • Operations Working Groups
            • O1. Technical
            • O2. Services
            • O3. Content
            • O4. Operation practice
            • Championed by Core CWA team
        • Members: charter, regular, sponsor
        • Invited advisors and specialists
    14. Operations Working Groups
        • O1. Technical – identity, provenance, federation, discrimination, triple boundaries…
        • O2. Services – distribution, directory, warehouse, citation tracking, multi-linguality…
        • O3. Content – capture, quality, packaging (walled gardens vs jungles), mixed licensing on triples, negotiation to content…
        • O4. Operation practice – incentive models, micro-attribution, distribution, legal, claims arbitration, penalty models, policy, triple licenses…
        • Are these the right groups?
        • Would you commit to joining in?
    15. User oriented
        • Take-up depends on user need
        • No user ghettos! Users blended into Working Groups
        • User-driven demonstrator
        • No jargon - Triples? What triples?
        • Invisibility – incorporated in normal tools and normal work practice
        • User groups: pick out early adopters and triple-ready users
        • Clients of researchers and scholars: libraries, data centres, publishers
    16. For a Facility
        • ‘Guarantor’ of interoperability (quality control)
        • Spam protection
        • Aggregator (many-to-one-to-many)
        • Partnering with inferrers, disambiguators, removers of redundancy (adding value)
        • (Re-)seller
        • Commission/royalty arrangements with sales agents and triple proprietors
        • Revenue stream.
        • Long term persistence
        • Management company?
    17. Governance of Alliance
        • Purpose and scope
        • Administrative and management structures
        • Powers and Procedures
        • Relationship to other organisations
        • Conduct, obligations and benefits
        • Membership
            • Charter Members
                • major contribution, benefits, governing votes
            • Regular Members
                • Institutional (academic), Personal, Corporate
            • Sponsors
    18. Why Sustainability for CWA?
        • Alliance operation
            • Organisation
            • Conferences / WGroups
            • Partnerships and advocacy
            • Legal advice
        • Content and professional services operation
            • Small team
            • Define the guidelines and rule book
            • Marshal and seed a wider movement
    19. Triples (diagram)
        • Triple structure: <Source concept> (node 1, unique ID), <Relations (edge)>, <Target concept> (node 2, unique ID)
        • Triple attributes: class, date, value, owner, condition, etc.
        • Diagram labels: Curated; Observational; Inferred; constructed; Smart Triples; Knowledge Space; Remove Ambiguity and Redundancy
        • In these areas significant value is added to the triples
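The triple structure named on slide 19 (a source concept and a target concept, each a node with a unique ID, joined by a relation edge and carrying attributes such as class, date, value, owner and condition) can be sketched as a small data structure. This is only an illustrative sketch, not the CWA model: the class names, field types, the `same_statement` helper and the example IDs are all assumptions made for the example.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Concept:
    """A node in the concept web, identified by a unique ID (hypothetical type)."""
    unique_id: str
    label: str


@dataclass(frozen=True)
class Triple:
    """Source concept, relation (edge) and target concept, plus annotations."""
    source: Concept
    relation: str
    target: Concept
    # Attribute names come from the slide; their types are assumptions.
    triple_class: Optional[str] = None  # e.g. "curated", "observational", "inferred"
    date: Optional[str] = None
    value: Optional[float] = None
    owner: Optional[str] = None
    condition: Optional[str] = None


def same_statement(a: Triple, b: Triple) -> bool:
    """Treat two triples as redundant if they assert the same edge
    between the same pair of concept IDs, whatever their labels."""
    key_a = (a.source.unique_id, a.relation, a.target.unique_id)
    key_b = (b.source.unique_id, b.relation, b.target.unique_id)
    return key_a == key_b


# Hypothetical example: two contributors describe the same edge.
curated = Triple(Concept("C1", "gene A"), "interacts_with",
                 Concept("C2", "protein B"),
                 triple_class="curated", owner="lab X")
inferred = Triple(Concept("C1", "Gene-A"), "interacts_with",
                  Concept("C2", "Protein B"),
                  triple_class="inferred")
redundant = same_statement(curated, inferred)
```

Keying the comparison on unique IDs rather than labels is one simple way to read "remove ambiguity and redundancy": differently spelled labels that resolve to the same IDs collapse to one statement, while the per-triple attributes (class, owner) remain available for attribution or charging.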
    20. Where significant value is added… suggestions
        • … triples represent an economic value and can be charged for
        • Curated triples:
            • charges at the discretion of the curator
        • Inferred triples:
            • charges at the discretion of the inferrer
        • Disambiguated triples:
            • charges at the discretion of the disambiguator
        • Redundancy-removed triples:
            • charges at the discretion of the redundancy remover
        • Observed triples:
            • can be charged for if they are taken from proprietary sources – by or on behalf of the proprietors
            • peer-reviewed literature: charges at the discretion of the rights-holder
    21. Includes edges from:
        • PubMed (400,000,000 sentences, 5,000,000,000 concept co-occurrences) (from public data)
        • Protein databases (UniProt, IntAct, PDB, HPRD – 75,000 human curated PPIs) (from public data)
        • Private expression data (3000 extra edges, by Merck) (from proprietary data)
        • InWeb edges (240,000 unique edges from 17 species) (from proprietary data)
        • Plectix edges (5,000 extra edges, PPI modeling) (from proprietary data)
        • Gene co-expression databases (GEO, Express… – 25 square genes) (from public data)
        • STRING edges (200,000 gene-gene edges) (from semi public data)
        • Reactome edges (240,000 unique edges from 17 species) (from proprietary data)
        • ChemSpider edges (25,000,000 chemicals) (from semi public data)
        • Wiki edges (WikEdge = WikiPathways, WikiProfessionals, Omegawiki, Wikigene)
        • Et cetera
        Diagram labels: Revenue generating; Download Concept Web triples
    22. Triple sales and Open Data/Open Access
        • Single triples:
            • free and open, via web search interface
            • revenue from advertising?
        • Collections:
            • at the discretion of owners
            • e.g. free if used publicly on web, with possibility of advertising revenue
            • paid-for if used behind firewall
    23. Benjamin Nowack Blog
    24. Revenue Streams
        • SUBSCRIPTIONS
        • SPONSORSHIP
        • FEES
        • ADDED VALUE SERVICES
        • Dual model: Public and commercial
    25.
        • SUBSCRIPTIONS AND SPONSORSHIP
            • Subscriptions from public organisations and research grants
            • Public sponsorship - by a major initiative, e.g. BioBanking or research organisation
            • Corporate sponsorship
        • FEES
            • Membership fees and donations
            • Contributors: certification fees, contribution fees, tensioned to incentives
            • Consumers: content subscription
            • Third party service providers
        • ADDED VALUE SERVICES
            • DOI assignment, certification
            • Royalties from charges for services and access (à la iPhone)
            • Content services, e.g. virtual market bundles
        • Dual model: Public and commercial
    26. Next steps
        • Set in motion the CWA 1st year roadmap
        • Incorporate and pump-prime
        • Working groups
        • Demonstrators
        • Sign up
        • Recruit members
        • Recruit partners
        • Recruit volunteers
        • Sustain
    27. Participation on the journey…
        • Is the CWA useful to your organisation?
        • Would you sign up to the declaration?
        • Would you become a member of CWA? What needs to be in place for you to commit?
        • Would you help us develop the Alliance?
        • Would you participate in one of the working groups?
        • Where are the low hanging fruit?
        • Who should we partner with?
        • How would you like to participate further? Would you sponsor an event?