Ssp Collexis Overview 2009


When Metadata Is the Content.... How Publishers Are Adding Value to Their Content to Increase Discoverability

  • If you as an STM publisher want to offer expert-centered solutions, the first thing you need is to know your experts. Without clear profiles that overcome the author homonym problem, this is not possible. Relying on user involvement to build a clean database is not very promising: it takes forever and is very unlikely to go beyond 10-20% completeness. Since we at Collexis wanted to implement a pre-populated social network using PubMed as a content source, we developed an author disambiguation process with very high precision. Applying it to PubMed, we were able to create 1.8 million disambiguated profiles with more than 30 million connections between the authors.
  • Ssp Collexis Overview 2009

    1. 1. Darrell W. Gunter, EVP / Chief Marketing Officer, Collexis Holdings, Inc. May 28, 2009. When Metadata Is the Content... How Publishers Are Adding Value to Their Content to Increase Discoverability
    2. 2. Creating fingerprints from documents using semantic technologies
    3. 3. Resolves linguistic ‘challenges’
       • Stemming, lemmatization and normalizing
           • Plural → singular, e.g. Diseases → disease
           • Past → present, e.g. Treated → treat
       • Multi-lingual
           • Gripe → Griep → Influenza
       • Synonyms
           • Earth → Globe
       • Disambiguation of homonyms
           • Jaguar: car or animal?
       • Abbreviations
           • SARS → Severe Acute Respiratory Syndrome (SARS)
           • Long Form → Short Form
       • Phrases: meaningful & coherent utterances
           • e.g. blackwater fever
    4. 4. Collexis Fingerprinting Technology
       • Expert Profiles & Networks
       • Knowledge Discovery Search
       • Hypothesis Generation
    5. 5. Expertise...
    6. 6. Explore instead of Searching!
    7. 7. Potential Co-occurrence of Concepts Hypothesis Generation!
    8. 8. Conceptual Search & Visualization!
    9. 9. Concept & Author View
    10. 10. Science as a collaborative effort! As an STM Publisher...
       • How to support a researcher in the area of collaboration and networking?
       • How to support a researcher in finding potential collaborators in interdisciplinary areas?
       • How to allow the researcher to effectively manage his network of collaborators?
       • How to support the researcher in reaching out to the group of researchers working on the same research question?
    11. 11. Publishers, Societies and traditional networking I
       • In the past, networking between researchers has taken place mostly
           • at conferences
           • in the context of societies
       • The new social networking wave using semantic technologies is about to cannibalize this form of networking as new generations of scientists enter the scene
    12. 12. Publishers, Societies and traditional networking II
       • Are scientists well served with platforms like LinkedIn, Facebook and MySpace?
       • Not really, as these platforms are not specific to their work
       • However, the content / STM products can be used to establish pre-populated social networks that support the specific needs of the target group
    13. 13. Smith, Taylor and Foster: the disambiguation challenge
       • Before you can suggest collaborators etc., you need to know your experts, which is sometimes challenging
       • Disambiguation of authors is a crucial success factor for pre-populated social networks
       • Collexis has developed a sophisticated process to achieve this and has applied it to PubMed
       • Using PubMed as a source, Collexis has created 1.8 million disambiguated expert profiles in a free social networking site
    14. 14. Ingredients for a pre-populated STM Network
       • Users / experts
       • Relations between users
       • Common interest area to make it interesting to join
       • Possibility to solve technical challenges like author disambiguation
       • That's it!
       Authors... Co-authorship, affiliation... Society, journal... Problem solved...
       Pre-populated social networks: turning content into communities!
    15. 16. Co-author based networks
    16. 17. Geographical mapping of the co-author network
    17. 18. Same data for sub networks
    18. 19. Government Research Institutions
    19. 20. Consortiums, e.g. NIH CTSA
    20. 21. ...or for peer reviewer identification
    21. 22. Creating STM sub-networks
       • Should a researcher have to join 25 different networks?
       • Should the user-related information be spread out over 25 different social networks?
       • With BiomedExperts an "Umbrella Network" is available, where complete and disambiguated profiles are maintained, ready to be integrated into customized special interest networks at the topic, society or publisher level
       • One profile, one login, many networks!
    22. 23. STM Subnetworks: ready to go today!
       • Sub network for a society
       • Sub network for a journal
       • Reviewer assignment
       • Sub network of all authors of one publisher
       • Virtual conferences: network of participants
       • Network of funded researchers
    23. 24. The Use of Semantic Technologies
       • ... is making these networks and services possible, not theoretically but already in reality... Which means that the
       • "This is where the metadata has become the content!"
    24. 25. Thank you for your attention!
       • Darrell W. Gunter, EVP / Chief Marketing Officer, [email_address], cell +1-973-454-3475
       • Christian Herzog, MD, Managing Director STM, [email_address], cell +49-171-5678917
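
The linguistic steps listed on slide 3 (stemming, multilingual variants, synonyms, abbreviations, phrases) can be illustrated with a minimal Python sketch. It is not Collexis's actual fingerprinting implementation, which the slides do not describe. The `THESAURUS` table, the `stem`, `lookup` and `fingerprint` functions, and the naive suffix rules are all hypothetical, chosen only so that the slide's own examples each resolve to a single concept. Homonym disambiguation (Jaguar) is left out because it needs context that a lookup table cannot supply.

```python
import re
from collections import Counter

# Hypothetical mini-thesaurus: surface form -> preferred concept label.
# The entries come from the slide's examples; a real thesaurus is far larger.
THESAURUS = {
    "disease": "disease",
    "treat": "treat",
    "gripe": "influenza",   # multilingual variant
    "griep": "influenza",   # multilingual variant
    "influenza": "influenza",
    "earth": "earth",
    "globe": "earth",       # synonym
    "sars": "severe acute respiratory syndrome",  # short form
    "severe acute respiratory syndrome": "severe acute respiratory syndrome",
    "blackwater fever": "blackwater fever",       # multi-word phrase
}

# Longest phrase (in words) that the thesaurus contains.
MAX_PHRASE_LEN = max(len(key.split()) for key in THESAURUS)


def stem(token):
    """Naive suffix stripping, just enough for the slide's examples."""
    if token.endswith("ed") and len(token) > 4:
        return token[:-2]            # treated -> treat
    if token.endswith("s") and not token.endswith("ss") and len(token) > 3:
        return token[:-1]            # diseases -> disease
    return token


def lookup(words):
    """Return the concept for a word sequence, or None if it is unknown."""
    exact = " ".join(words)
    if exact in THESAURUS:           # exact match first, so "sars" is not stemmed
        return THESAURUS[exact]
    stemmed = " ".join(stem(word) for word in words)
    return THESAURUS.get(stemmed)


def fingerprint(text):
    """Count the concepts found in text, preferring the longest phrase match."""
    tokens = re.findall(r"[a-z]+", text.lower())
    concepts = Counter()
    i = 0
    while i < len(tokens):
        for n in range(min(MAX_PHRASE_LEN, len(tokens) - i), 0, -1):
            concept = lookup(tokens[i:i + n])
            if concept is not None:
                concepts[concept] += 1
                i += n               # skip past the matched phrase
                break
        else:
            i += 1                   # no concept starts at this token
    return concepts


if __name__ == "__main__":
    sample = "Diseases treated: gripe, griep and SARS. Blackwater fever on the globe."
    for concept, count in sorted(fingerprint(sample).items()):
        print(concept, count)
```

In this sketch a "fingerprint" is simply a count of concepts per document. Any weighting or ranking of concepts would be a further assumption and is not shown.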