Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.

Text Pattern Formation For Information Extraction

717 views

Published on

  • ⇒ www.WritePaper.info ⇐ This service will write as best as they can. So you do not need to waste the time on rewritings.
       Reply 
    Are you sure you want to  Yes  No
    Your message goes here
  • I can advise you this service - ⇒ www.WritePaper.info ⇐ Bought essay here. No problem.
       Reply 
    Are you sure you want to  Yes  No
    Your message goes here
  • Be the first to like this

Text Pattern Formation For Information Extraction

  1. 1. Lidia M. Pivovarova Saint-Petersburg State University The Ph.D. advisor: prof. Valery Sh. Rubashkin NLDB 2008
  2. 2. FACTORS - - the system designed to monitor underling characteristics of a subject domain
  3. 3. General System Description The Ontology TEXTS Lemmatization, part-of-speech tagging, semantic mark-up Morph. analyzer Semantic analyzer Situation State Search Patterns
  4. 4. The Factors <ul><li>Factors – the required information aspects. </li></ul><ul><li>~ 100 factors </li></ul><ul><li>Factors: </li></ul><ul><li>- qualitative </li></ul><ul><li>e.g. social tension , investment attractiveness, </li></ul><ul><li>level of sovereignty, human rights activity </li></ul><ul><li>- quantitative </li></ul><ul><li>e.g. the number of unemployed, an average salary, </li></ul><ul><li>the inflation level, the ammount of import </li></ul>
  5. 5. Numerical values <ul><li>Qualitative factors: </li></ul><ul><li>very small , small , less than average , average, more than average , large , very large . </li></ul><ul><li>Quantitative factors: </li></ul><ul><li> the number + <unit> </li></ul><ul><li> e. g. </li></ul><ul><li> an average salary –> monetary unit (ruble, $, … ) </li></ul><ul><li> the number of unemployed -> no units </li></ul><ul><li> </li></ul>
  6. 6. The Patterns <ul><li>Qualitative factors ->“factor + numerical value” patterns. </li></ul><ul><li>e. g. Social tension <-- spontaneous meeting (large) </li></ul><ul><li>Quantitative factors -> “only factor” patterns. </li></ul><ul><li>e. g. The number of unemployed <-- become unemployed </li></ul><ul><li>Search algorithm </li></ul><ul><li>1) find a pattern </li></ul><ul><li>2) find a number + unit </li></ul><ul><li>if not </li></ul><ul><li>3) find words large, small, increase, decrease etc. </li></ul>
  7. 7. Pattern Formation Process <ul><li>Pattern is a set of words and ontology concepts. </li></ul><ul><li>Ontology provides: </li></ul><ul><li>- pattern generalization </li></ul><ul><li>- synonym accumulation </li></ul><ul><li>- information about units </li></ul><ul><li>Pattern formation: user marks relevant fragment in a text or chooses concept from the ontology. </li></ul>
  8. 8. Example <ul><li>As is known, European Union strictly demanded Latvia to close the both generating units of Ignalinskaya nuclear power station. It is also promised to remit 3 billions euro for this goal. </li></ul><ul><li>Factors: </li></ul><ul><li>The EU pressure to Latvia. </li></ul><ul><li>The financial aid of EU to Latvia. </li></ul>

×