IAs, Language and Lego -- an introduction to Semantic Analysis

Loading...

Flash Player 9 (or above) is needed to view presentations.
We have detected that you do not have it on your computer. To install it, go here.

1 comments

Comments 1 - 1 of 1 previous next Post a comment

  • + aidankenyon aidankenyon 2 years ago
    Great preso Matthew. I really enjoyed your talk and it's always refreshing to add something new to my IA toolbox. Keep up the good work.
Post a comment
Embed Video
Edit your comment Cancel

7 Favorites & 1 Event

IAs, Language and Lego -- an introduction to Semantic Analysis - Presentation Transcript

  1. IAs, Language and Lego™ – an Introduction to Semantic Analysis Matthew Hodgson Regional-lead, Web and Information Management, Canberra Australia 12 April 2008
  2.  
  3.  
  4. IA Tools for understanding content
  5. Content analysis…
    • We all:
    • Think about information in different ways
    • Write about information in different ways
    Information: we all think differently …
  6. … we all even write differently …
  7. Jeffrey Veen on analysing content
    • “ a mind-numbingly detailed odyssey through your web site...
    • … this process…is a relatively straightforward process of clicking through your web site and recording what you find.”
    Source: http://www.adaptivepath.com/ideas/essays/archives/000040.php
  8. When analysing content …
  9. An extract of medical restrictions text
  10. What is this content?!
    • Medical restrictions text
    • Free-text built in Word and hand-crafted (*grrr*)
    • Unclassified
    • Varied consistency within and between texts
    • Highly complex sentence structures in pseudo-legalese
    • Style reflects the author rather than the meaning in the communication
    • Content needed for re-use
    • Content output was needed for reuse by others
    • Multiple audiences
    • Multiple purposes for re-use
    • Codification
    • Codification by 3 rd parties (after authoring) takes too long
    • Need to reduce timeframes!
  11. The task . . .analyse and codify
  12. What tools would be appropriate?
    • ?
    • Linguistics
    • … a whole discipline devoted to the
    • study of language…
    preposition verb adjective noun determiner subject object conjunction semantics sentence structure all language has structure
  13. Language is like Lego™
    • Building blocks
    • Subject (S)
    • Verb (V)
    • Object (O)
    • Order of blocks
    • Differs depending on the language
  14. Language is like Lego™
    • SVO languages
    • English, French, Chinese, Bulgarian, Swahili
    • SOV
    • Japanese, Turkish, Korean
    • VSO
    • Classical Arabic, Celtic and Hawaiian
    • VOS
    • Fijian, Yoda’s amusing phrases
  15. Lego bricks: subjects, verbs and objects
    • Sometimes, though, the SVO structure is hidden:
    • “ The Lego is red” or
    • “ Those Lego bricks are [some] red Lego bricks” ?
    • Uncovering the hidden structure helps to differentiate between the subject and the object and identify the who and what
  16. Uncovering hidden meaning
    • If the LEGO trademark is used at all, it should always be used as an adjective , not as a noun .
    • For example, say
    • "MODELS BUILT OF LEGO BRICKS".
    • Never say
    • "MODELS BUILT OF LEGOs".
    • Source: http://everything2.com/title/legOS
  17. Lego trees…
  18. Semantic analysis
    • Medical restrictions wording:
    • Restricted benefit Gastro-oesophageal reflux disease; Scleroderma oesophagus;
    • Authority required Peptic ulcer
  19. Semantic analysis (cont.)
    • Actual sentence
    • Peptic ulcer
    • Implied sentence
    • The prescription of medicine is restricted to the initial treatment of patients with peptic ulcer
  20. Semantic structure of ‘peptic ulcer’
  21. Semantic model for restrictions text
  22. Semantics describing “Who Treated”
  23. Semantics describing “Authority Action”
  24. High-level semantic overview
  25. Yes, it can be codified!
    • Medical restrictions:
    • Did have structure
    • Did have underlying logic
    • Were based on repeatable business processes
    • Could be codified
    • Could we make a ‘system’ to reinforce the structure at the point of authoring?
  26. Demo
    • Putting it together in a system:
    • Supporting building of content restrictions in a codified way
    • Protyotyping with Axure
  27.  
  28.  
  29.  
  30.  
  31.  
  32.  
  33.  
  34.  
  35. The semantic analysis advantage vs
    • Identifies:
    • Themes in content
    • Identifies:
    • Themes in content
    • Work processes
    • Folk taxonomies used
    • ‘ Things’ written about
  36. What else could you use it for?
    • When you need to understand:
    • Business processes that create content
    • When you want to disassemble content for:
    • FAQs
    • A-Z indexes
    • Help files
  37. How can I add this to my toolbox??!
    • Theory is important
    • An understanding of semantics - sentence trees and grammar
    • Text books by authors like Fromkin and Rodman can help through the tricky bits
    • Need good tools
    • Connexor : http://www.connexor.eu/technology/machinese/demo/
    • Big sheets of paper (and an electronic whiteboard)
    • Visio (not PowerPoint!)
  38. Demo
    • Connexor:
    • http://www.connexor.eu/technology/machinese/demo/
  39. Connexor
  40. Connexor – machine tagger
  41. Connexor – machine syntax
  42. Why should I care about this?
    • Google uses semantic analysis to index content
    • Translation software uses semantic analysis to identify ‘components’ for translation
    • Good sentence structure equals:
      • Accurate indexing
      • Higher rank relevance of content
      • Happy people (they find what they’re looking for)
  43. Why should I care about this?
  44. ‘ Calais’ by Reuters
  45. Summing up
    • Content is still king!
    • But how can you tell if your content:
    • Is of good quality?
    • Matches your website’s categories?
    • Accurately reflects your metadata?
    • Can be found by people?
    • Semantic analysis can:
    • Make your content audits more objective
    • Inform processes to improve the quality of the content
    • Inform processes to improve search engine indexing
    • Inform metadata creation
    • Inform choice of taxonomy
  46. Take-home message
    • Semantic analysis can help IAs:
    • Infer
    • How people think about, and structure, their information
    • Describe
    • Business processes that produce content
    • Identify
    • Where content quality is poor so it can be improved
    • Critical components of the sentence for codification
    • Design
    • Taxonomies and describe folk taxonomies
    • Build
    • Systems to help bring some structure to content authoring
    • Fin
  47. IAs, Language and Lego™ an Introduction to Semantic Analysis
  48. by Matthew Hodgson Regional-lead, Web and Information Management SMS Management & Technology Canberra Australia
  49. by Matthew Hodgson Email [email_address] Blog magia3e.wordpress.com Slideshare www.slideshare.net/magia3e Twitter magia3e

+ Matthew HodgsonMatthew Hodgson, 2 years ago

custom

2358 views, 7 favs, 3 embeds more stats

This presentation will introduce Semantic Analysis more

More Info

© All Rights Reserved

Go to text version
  • Total Views 2358
    • 2271 on SlideShare
    • 87 from embeds
  • Comments 1
  • Favorites 7
  • Downloads 86
Most viewed embeds
  • 72 views on http://www.iasummit.org
  • 14 views on http://magia3e.wordpress.com
  • 1 views on http://magia3e.wordpress.com.

more

All embeds
  • 72 views on http://www.iasummit.org
  • 14 views on http://magia3e.wordpress.com
  • 1 views on http://magia3e.wordpress.com.

less

Flagged as inappropriate Flag as inappropriate
Flag as innappropriate

Select your reason for flagging this presentation as inappropriate. If needed, use the feedback form to let us know more details.

Cancel

Categories

Groups / Events