Executive Summary: Natural Language Processing of Twitter #swineflu (H1N1) Posts using Semantic MEDLINE Prototype

Loading...

Flash Player 9 (or above) is needed to view presentations.
We have detected that you do not have it on your computer. To install it, go here.

0 comments

Post a comment

    Post a comment
    Embed Video
    Edit your comment Cancel

    Notes on slide 1

    Influenza A(H1N1)Executive Summary:Natural Language Processing of Twitter #swineflu Posts using the Semantic MEDLINE PrototypeDr. AllaKeselman, Dr. Thomas Rindflesch, David HaleNational Library of Medicine, National Institutes of Health,Department of Health and Human ServicesMay 2009

    CDCemergency page on Twitter showing posts during the initial H1N1 outbreak, April 2009http://twitter.com/CDCemergency

    H1N1 information via Twitter:Communication issuesInformation receiversInformation overload>12,000 #swineflu (H1N1) posts/hour @ peakSignal:Noise ratioQuality?Authority?Twitter accounts impersonating CDCInformation providersEffective information provisionBiosurveillance

    (un)Controlled VocabularyFolksonomyHashtags (#)GrammarAbbreviationsSRSLY IMO ROI 4 RT? YMMVHigh context

    Examples of #swineflu Tweets

    Acquisition ChallengesTwitter timelineStorage requirementsPrivacyTwitter APILimited search functionalityTemporal and range limitationsRange definition limited to midnight1500 posts from limit

    Semantic MEDLINE PrototypeSummarizes MEDLINE citations returned by PubMed searchNatural Language Processing (MetaMap, SemRep) used to analyze salient content in titles and abstractsInformation presented in graph that has links to the MEDLINE text processedVisualize relationships, such as:A is a process of BX treats Y

    Semantic MEDLINE Prototype Search pageBreast Cancer is highlighted in a list of available MEDLINE searches.

    Semantic MEDLINE Prototype Search pageSummarize page for term: breast cancer.“Malignant neoplasm of breast” is highlighted in a list of topics on which to summarize.

    Semantic MEDLINE Prototype Search pageSemantic MEDLINE Visualization. The term “Malignant neoplasm of breast” is in the center of the page. Dark blue arrows point into the center showing that terms such as “Endocrine therapy” and “Operative Surgical Procedures” TREATS the center term. Medications such as “trastuzumab” and “Tamoxifen” point to the center term. A brown arrow points from the center to the term “human” showing that the center term is a PROCESS OF “human.”

    Semantic processing of#swineflu TweetsSample - 1267 TweetsAfternoon of April 27, 2009No adjustments made to NLP software (MetaMap, SemRep)No additional vocabulary, abbreviations, etc.

    Preliminary Processing of #swineflu TweetsSample page from SemRep report, showing SemRep processing, Concepts, and Filter concepts.

    Preliminary Processing of #swineflu TweetsSample page from SemRep report, showing Filter concepts by semantic type, Predications, and PROCESS_OF terms.

    Concepts in Tweets Isolated by Semantic ProcessingDisease: influenzaDisease symptom: coughingGeographic area: MexicoAnimal: family suidaeHealth care organization: Centers for Disease Control and Prevention (U.S.)Medical device: mask

    Next StepsProcessing of larger datasetinclude non-H1N1-related TweetsAdditional vocabularyFolksonomy, abbreviations, etc.Visualization of semantic processing results

    OpportunitiesBiosurveillanceMonitoring of wide-spread sentimentTargeted information provisionRespond to misinformation trendsEvaluation of accuracy/authenticity

    LinksSemantic MEDLINE Prototypehttp://skr3.nlm.nih.gov/SemMedDemo/Semantic Medline: Multi-Document Summarization and Visualizationhttp://www.nlm.nih.gov/pubs/techbull/mj07/theater_ppt/semantic.pptNational Library of Medicinehttp://www.nlm.nih.govNational Institutes of Healthhttp://nih.govDepartment of Health and Human Serviceshttp://hhs.gov

    Dr. AllaKeselmankeselmana AT mail DOT nlm DOT nih DOT govDr. Thomas Rindfleschtrindflesch AT mail DOT nih DOT govDavid Haledavid DOT hale ATnih DOT gov

    1 Favorite

    Executive Summary: Natural Language Processing of Twitter #swineflu (H1N1) Posts using Semantic MEDLINE Prototype - Presentation Transcript

    1. Influenza A(H1N1) Executive Summary: Natural Language Processing of Twitter #swineflu Posts using the Semantic MEDLINE Prototype Dr. AllaKeselman, Dr. Thomas Rindflesch, David Hale National Library of Medicine, National Institutes of Health, Department of Health and Human Services May 2009
    2. http://twitter.com/CDCemergency
    3. H1N1 information via Twitter: Communication issues • Information receivers – Information overload • >12,000 #swineflu (H1N1) posts/hour @ peak – Signal:Noise ratio • Quality? • Authority? – Twitter accounts impersonating CDC • Information providers – Effective information provision – Biosurveillance
    4. (un)ControlledVocabulary • Folksonomy • Hashtags(#) • Grammar • Abbreviations – SRSLY IMO ROI 4 RT? YMMV • High context
    5. #swineflu Tweets
    6. Acquisition Challenges • Twitter timeline – Storage requirements – Privacy • Twitter API – Limited search functionality • Temporal and range limitations – Range definition limited to midnight – 1500 posts from limit
    7. Semantic MEDLINE Prototype • Summarizes MEDLINE citations returned by PubMed search • Natural Language Processing (MetaMap, SemRep) used to analyze salient content in titles and abstracts • Information presented in graph that has links to the MEDLINE text processed • Visualize relationships, such as: – A is a process of B – X treats Y
    8. http://skr3.nlm.nih.gov/SemMedDemo/
    9. http://skr3.nlm.nih.gov/SemMedDemo/
    10. http://skr3.nlm.nih.gov/SemMedDemo/
    11. Semantic processing of #swineflu Tweets • Sample - 1267 Tweets – Afternoon of April 27, 2009 • No adjustments made to NLP software (MetaMap, SemRep) – No additional vocabulary, abbreviations, etc.
    12. Preliminary Processing of #swineflu Tweets
    13. Preliminary Processing of #swineflu Tweets
    14. Concepts in Tweets Isolated by Semantic Processing • Disease: influenza • Disease symptom: coughing • Geographic area: Mexico • Animal: family suidae • Health care organization: Centers for Disease Control and Prevention (U.S.) • Medical device: mask
    15. Next Steps • Processing of larger dataset – include non-H1N1-related Tweets • Additional vocabulary – Folksonomy, abbreviations, etc. • Visualization of semantic processing results
    16. Opportunities • Biosurveillance • Monitoring of wide-spread sentiment • Targeted information provision – Respond to misinformation trends • Evaluation of accuracy/authenticity
    17. Links • Semantic MEDLINE Prototype – http://skr3.nlm.nih.gov/SemMedDemo/ • Semantic Medline: Multi-Document Summarization and Visualization – http://www.nlm.nih.gov/pubs/techbull/mj07/theater_ppt/ semantic.ppt • National Library of Medicine – http://www.nlm.nih.gov • National Institutes of Health – http://nih.gov • Department of Health and Human Services – http://hhs.gov
    18. Dr. AllaKeselman keselmana AT mail DOT nlm DOT nih DOT gov Dr. Thomas Rindflesch trindflesch AT mail DOT nih DOT gov David Hale davidDOT hale AT nih DOT gov

    + Specialized Information Services, U.S. National Library of MedicineSpecialized Information Services, U.S. National Library of Medicine, 5 months ago

    custom

    865 views, 1 favs, 1 embeds more stats

    Natural Language Processing of Twitter #swineflu Po more

    More info about this document

    CC Attribution License

    Go to text version

    • Total Views 865
      • 842 on SlideShare
      • 23 from embeds
    • Comments 0
    • Favorites 1
    • Downloads 0
    Most viewed embeds
    • 23 views on http://www.scienceforseo.com

    more

    All embeds
    • 23 views on http://www.scienceforseo.com

    less

    Flagged as inappropriate Flag as inappropriate
    Flag as inappropriate

    Select your reason for flagging this presentation as inappropriate. If needed, use the feedback form to let us know more details.

    Cancel
    File a copyright complaint
    Having problems? Go to our helpdesk?

    Categories