Slideshow transcript
Slide 1: Semantic Analysis in IA SMS Management & Technology Matthew Hodgson ACT regional-lead, Web and Information Management 23 Sept 2007 1
Slide 2: 2 SMS Management & Technology
Slide 3: 3 SMS Management & Technology
Slide 4: Jeffrey Veen on analysing content “a mind-numbingly detailed odyssey through your web site... SMS Management & Technology …this process…is a relatively straightforward process of clicking through your web site and recording what you find.” Source: http://www.adaptivepath.com/ideas/essays/archives/000040.php 4
Slide 5: 5 SMS Management & Technology
Slide 6: Content overview – first take Medical restrictions text Free-text built in Word and hand-crafted (*grrr*) Unclassified Varied consistency within and between texts Highly complex sentence structures in pseudo-legalese SMS Management & Technology Style reflects the author rather than the meaning in the communication Content needed for re-use Content output was needed for reuse by others Multiple audiences Multiple purposes for re-use Codification Codification (after authoring) takes too long Need to reduce timeframes! 6
Slide 7: The task . . .analyse and codify Concept 1 SMS Management & Technology Concept 3 Concept 2 Concept 5 Concept 4 Concept 5 7
Slide 8: Linguistics SMS Management & Technology …a whole discipline devoted to the study of language 8
Slide 9: “You’re joking!?” All language has structure – even someone’s pseudo-legal English SMS Management & Technology Analysing language is actually easier than you might think 9
Slide 10: The approach Analyse semantics of content There is a predicable structure It’s all just Lego™ building blocks (nouns, verbs, SMS Management & Technology adjectives, etc) Implied meaning can be made overt New tools for IAs to play with! Understand semantics, the structure of sentences, and you can analyse, categorise and codify English! 10
Slide 11: Language as Lego™ Building blocks Subject (S) Verb (V) SMS Management & Technology Object (O) Order of blocks Differs depending on the language 11
Slide 12: Order from chaos SVO languages English, French, Chinese, Bulgarian, Swahili SOV SMS Management & Technology Japanese, Turkish, Korean VSO Classical Arabic, Celtic and Hawaiian VOS Fijian, Yoda’s amusing phrases 12
Slide 13: Subjects, verbs and objects Sometimes, though, the SVO structure is hidden: The apple is red or The apple is a red apple? SMS Management & Technology Subject Verb Object The apple is a red apple Uncovering the hidden structure helps to differentiate between the subject and the object and identify the who and what 13
Slide 14: Sentences as (apple) trees Root Noun Verb Phrase Phrase SMS Management & Technology Noun Phrase Verb Det Noun Det Adj Noun (be) The apple is a red apple SUBJECT VERB OBJECT 14
Slide 15: Semantic analysis Medical restrictions wording: Restricted benefit SMS Management & Technology Gastro-oesophageal reflux disease; Scleroderma oesophagus; Authority required Peptic ulcer 15
Slide 16: Semantic analysis (cont.) Actual sentence Peptic ulcer SMS Management & Technology Implied sentence The prescription of medicine is restricted to the initial treatment of patients with peptic ulcer 16
Slide 17: Root Verb Phrase Noun Phrase Prepos Phrase SMS Management & Technology Noun Phrase Noun Phrase Prepos Phrase Noun Phrase N AUX DET N P V AUX DET ADJ N P N P ADJ N (SUBJECT) V the prescription of medicine is restricted to the initial treatment of patients with peptic ulcer 17
Slide 18: WHO WHAT ACTION TREATED? CONDITION? REQUIRED Subject Object Verb Condition being treated Practical aspects + = Measures/descriptors of Condition severity Patient descriptors (population/group) continuing Initial or (ADJ) the prescription of medicine is restricted to the initial treatment of 70 year old patients with peptic ulcer sign form female record details of doctor continuing male with malignant tumor record date SMS Management & Technology Authority mother complete action sheet whole body pregnant with scleroderma oesphagus include area diagrams other ADJ with advanced psoriasis treat for period of time nausea and with provide preivous history vomiting with chronic pain prescribe number repeats with partial seizures contact Medicare hormone Authority with dependent breast cancer obtain PBS- dBMARD number not receiving cytotoxic chemotherapy previously receiving treated by clinical immunologist metastatic subsidised treatment PBS- dBMARD receiving radiotherapy not previously receiving treated by dermatologist subsidised treatment A 5HT3 KEY receiving antagonist PBS subsidised specialist group Limitation of Prescribing to a specific receiving 2 years treatment ADJ having ADJ/PP surgery incomplete resolution of NOUN no indication of PREP not responding to ADJ anelgesics other not responding to drugs anti-epileptic VERB unable take solid form of topiramate descriptors of population Existing treatment 18
Slide 19: “Who Treated” semantic model [DRUG] Sex [ADJECTIVE] All [TREATMENT] Pregnant Male [THERAPY] Ethnicity [ETHNICITY] Breastfeeding Female Surgery [TYPE] Over [NUMBER] Diet Vocation Veteran Under Exercise [LIST] Exactly Between Hours Entitlement [?] Veteran At least Days [LIST] Weeks Months That meet a specific definition /criteria as set out in [LIST of references] Years General schedule of Lipid-lowering Drugs Age and Starts new prepositional-phrase in the same text- block Within timeframe of SMS Management & Technology Over a period of Treatments In conjunction with Initiation Treatment with PBS subsidised Not in conjunction with Stabilisation Treatment of PBS non-subsidised Patient Following Effective Treatment for Group At a dose of Preceeding Ineffective Starts new Received Inappropriate Starts new prepositional-phrase prepositional-phrase in the same text- Has not received Initial [mg ...etc] Hourly ? in the same text- block Can not receive Continuing block Daily Not responding Maintenance Weekly Responding Co-administered with Fortnightly Failed to qualify for Monthly Qualified for Yearly Trials Not indicated Indicated Has had Has not had Can have Can not have Diagnosis confirmed by Starts new prepositional-phrase = As evidenced by in the same text- As measured by? block Treated by [CLINICIAN] Requiring special expertise in [EXPERTISE] Requiring no special expertise Documented Evidence of [SEVERITY] [CONDITION] history [DISORDER] in Clinical findings [MEASURED AS]? Symptoms? [PROCEDURE] [DEFINED BY] Disease progression Disease regression 19
Slide 20: “Authority Action” semantic model Initial By telephone To [AUTHORITY] Ongoing In writing Medicare Subsequent Electronically ...etc... Remaining Initial Supply By telephone To [AUTHORITY] Ongoing Therapy In writing Medicare Subsequent Treatment Electronically ...etc... Remaining SMS Management & Technology [AMOUNT] Repeats Authority Action To complete Starts new prepositional-phrase in the same text- (allow) Maximum [AMOUNT] [TIME] Initial [TIMEFRAME] Supply block (allow) Minimum days Ongoing Therapy weeks Subsequent Treatment months Remaining [AMOUNT] Repeats Followed by By telephone Within timeframe of [TIME] In writing days Electronically weeks Starts new prepositional-phrase months in the same text- block Where Initial approval [AMOUNT] Repeats By telephone To [AUTHORITY] Ongoing In writing Medicare Starts new Subsequent Electronically ...etc... prepositional-phrase in the same text- Remaining block 20
Slide 21: High-level semantic overview WHO WHAT HOW TREATED CONDITION AUTHORISED Clinical initiation or SMS Management & Technology Prior Age continuation Definitions treatments limitations Severity Contact information criteria + + + + = Notes and Foreword Definitions Patient Group Condition Authority Action Cautions Grandfathering Prescribing Prescribing advice clauses clinicians Patient Condition groups 21
Slide 22: How did the ‘trees’ help? Inferred How people think about and structure content Described Business processes that produce content SMS Management & Technology Identified Where content quality is poor so it can be improved Critical components of the sentence for codification Designed Taxonomies and describe folk taxonomies Built Systems to help bring some structure to content authoring 22
Slide 23: How can I do this stuff too?! (a side-step) Theory is important An understanding of semantics - sentence trees and grammar SMS Management & Technology Text books by authors like Fromkin and Rodman can help through the tricky bits Need good tools Conexor: www.conexor.fi/demo/syntax Big sheets of paper (and an electronic whiteboard) Visio (not PowerPoint!) 23
Slide 24: Demo Connexor www.conexor.fi/demo/syntax SMS Management & Technology 24
Slide 25: Introducing ways to codify restrictions How are we actually going to codify the stuff?! Give people Lego™ or ‘fridge-magnets’ to build sentences Build a prototype to explore and demonstrate conceptual design Communicate SMS Management & Technology Talk about ideas with business owners Explore possibilities with end-users Build-in ‘no surprises’ into change management Iterate Iterate and refine concepts and design before it was built Inform Developers of intent and requirements The building of an ‘tool’ for codifying content (hooray for Axure!) 25
Slide 26: Demo Protyotyping with Axure SMS Management & Technology 26
Slide 27: SMS Management & Technology 27
Slide 28: SMS Management & Technology 28
Slide 29: SMS Management & Technology 29
Slide 30: SMS Management & Technology 30
Slide 31: SMS Management & Technology 31
Slide 32: SMS Management & Technology 32
Slide 33: SMS Management & Technology 33
Slide 34: SMS Management & Technology 34
Slide 35: Why should I care about this? Google uses semantic analysis to index content Translation software uses semantic analysis to SMS Management & Technology identify ‘components’ for translation Good sentence structure equals: Accurate indexing Higher rank relevance of content Happy people (they find what they’re looking for) 35
Slide 36: Summing up Content is still king, but: Is it’s quality any good? Does it match your website’s categories? Is your metadata ok? Can people find the content they need? SMS Management & Technology Do you need to understand your content better? Semantic analysis can: Make your content audits more objective Inform processes to improve the quality of the content Inform processes to improve search engine indexing Inform metadata creation Improve website navigation design 36
Slide 37: Please Sir, can I have some more…? email: mhodgson@smsmt.com web: www.smsmt.com SMS Management & Technology blog: magia3e.wordpress.com twitter: magia3e community: iacanberra.org cartoons: © Garry Larson 37
Slide 38: SMS Management & Technology 38 Fin






Add a comment on Slide 1
If you have a SlideShare account, login to comment; else you can comment as a guest- Favorites & Groups
Showing 1-50 of 6 (more)