SlideShare a Scribd company logo
Selection and Aggregation of Sentences
in the Knowledge Formation Process
M.S. Shibut, V.S. Yakovishin
The Academy of Public Administration under the aegis of the President of the Republic of Belarus,
17, Moskovskaya Str., 220007, Minsk, Republic of Belarus, m_shibut@pac.by,
http://pac.by/en
Let S , S , S , S , S be sentences, expressed in terms of formal language, as shown in the figure below,1 2 3 4 5
where a, in, o are signs of the secondary sentence parts, p, pt, pPs are signs of the different predicates (for
thepresent,pastindefinite,andpresentsimplepassive,respectively).
According to the selection rule, the first sentence must be eliminated because of intensional superiority of
the second sentence (S Н S ). The sentences S , S , S , S can be integrated in compliance with the1 2 2 3 4 5
aggregation rule. Let “man”, “young man”, “library” be the subjects contained in user's request. Then, as a
result of integration on the given subjects, the following three subject knowledge descriptions can be
obtained:s({man})={S ,S ,S },s({man,man_a.young})={S ,S },s({library})={S ,S }.2 3 5 2 5 2 4
Knowledge-based text adaptation.The subject knowledge formation can be used as a basis for automatic
creation(compiling)ofadapted(user-oriented)textmaterials,suchas
-variousinformation-analyticalreviews;
-individualelectronictextbooks;
-anyotheradaptedtextmaterials.
Knowledge-based information search. The information search can be realized as a two-stage process
(thatresemblestheoreprocessing):
- data search: the usual information retrieval is realized to draw information (as full as possible) from a
numberofsources;
- knowledge search (“ore dressing”): the obtained results are processed to extract only the important
information(“valuableelements”).
Knowledge-based machine translation. In the translation of the source text from one natural language to
another, the subject knowledge base (where the lexical compatibility is fixed) can be used as a supporting
interlingua, that plays the role of an effective filter for screening all the misplaced meanings of polysemous
words.
The knowledge formation is presented as the process of selection and aggregation of input sentences. In
this process, the text sentences are at first transformed into the formal language, and then they are
integrated into the knowledge representation. The integration of the sentences that have one and the same
subject is considered as a subject knowledge representation, and any collection of the subject knowledge
representations, produced in the knowledge formation process, is considered as a user-oriented (“highly
tailored”) description of subject field. It is supposed that the subject (usually characterized as “the
something or someone that the sentence is about”, “the thing being talked about”) is expressed by a
grammatically separated noun phrase that represents either the absolutely independent part of sentence
(the formal subject of the division subject-predicate) or the general determinative part, i.e. the attribute that
relates to the whole sentence (the actual subject of the division theme-rheme, also known as topic-
comment,representingthe“reflectionofthespeaker'sattitudetowardswhatissaid”).
The presented here knowledge formation method is based on the using of the special formal language. In
the formal language, input text sentences are expressed in the set-theoretical (parenthesis-free, “discrete”)
form as sets of their syntactic elements (syntagmes), which allows us to reduce the semantic identification
ofsentencestotheusingofstandardset-theoreticalrelationofinclusion.
Subject knowledge formation is a growth process in which two formation rules, namely the rules of
selectionandaggregationofsentences,mustrealize.
Selectionrule:o sentencesS andS mustbeeliminated,ifitisasubset ofanothersentence,i.e.1 2
{S , S }® S , if S КS .1 2 1 1 2
Aggregation rule realizes the integration of already selected sentences: if S , S , ... are sentences that1 2
havethesamesubjectN, theywilluniteinasubjectknowledgerepresentation,i.e.
{S ,S , ...}® s(N).1 2
neof the
Subject knowledge representation is a set s(N) of sentences S , S , ... with the common subject,1 2
representedbyanounphraseN(containedinuser's request),i.e.
s(N){S | К N, i і 1}.i
Subject field representation is any collection s(N , N , ...) of subject knowledge representation produced1 2
intheknowledgeformationprocess,i.e.
s(N , N , ...) = {s(N ), s(N ), ... },1 2 1 2
where N , N , are noun phrases that play the role of subjects in the division “subject-predicate” or in the1 2
actualdivision“theme-rheme”.
Si
Stepwise subordination:
Syntagme:
(as in The book of the new author)
(as in The new book)
(X ∆ (X X ))={X ∆ X , X ∆ X }1 1 2 2 3 1 1 2 2 2 3∆
(X ∆X )={X ∆X }1 2 1 2
Collateral subordination:
(as in The new book of the author)
((X ∆ X )∆ X )={X ∆ X , X ∆ X }1 1 2 2 3 1 1 2 1 2 3
Multisyntagme:
(as in The new and old books)
(X ∆(X СX ))={X ∆X , X ∆X }1 2 3 1 2 1 3
Subject (absolutely independent part):
(as in The man reads a book)
((X ∆ X )∆ X )={X , X ∆ X , X ∆ X }1 1 2 2 3 1 1 1 2 1 2 3
Theme (topic):
(as in In the evening, the man reads a book)
((X ∆ (X ∆ X ))∆ X )={∆ X , X , X ∆ X , X ∆ X }1 1 2 2 3 3 4 3 4 1 1 1 2 2 2 3
The book
The book
The man
The man
of the author
reads
reads
new
new
a book
a book in the evening
dependent
member
dependent
member
dependent
members
homogeneous
parts
subject
subjecttheme
head
member
head
member
head
members
The book
The book
of the authornew
new and old
Input sentences
1. The young man reads a book.
2. The young man reads a book in the library.
3. The man walked in the park.
4. The library is situated in a graceful street.
5. The young man kicked the ball.
…
Knowledge representation
1. man, man_a.young, man_p.read, read_o.book
2. man, man_a.young, man_p.read, read_o.book, read_in.library
3. man, man_pt.walk, walk_ in.park
4. library, library_pPs. situate, situate_in.street, street_a. graceful
…
Knowledge representation
2. man, man_a.young, man_p.read, read_o.book, read_in.library
3. man, man_pt.walk, walk_ in.park
4. library, library_pPs. situate, situate_in.street, street_a. graceful
…
Knowledge representation for “library”
__________________________
…
4. library,
library_pPs. situate,
situate_in.street,
street_a. graceful
2.man,
man_a.young,
man_p.read,
read_o.book,read_in.library
Knowledge representation for “man”
2. man,
man_a.young,
man_p.read,
read_o.book, read_in.library
3. man_pt.walk,
walk_ in.park
__________________________
…
User-oriented description of subject field
2. The library is situated in a graceful street.
4. The young man reads a book in the library.
User-oriented description of subject field
2. The man walked in the park.
3. The young man reads a book in the library.
4. The young man kicked the ball.
Selection rule
Aggregation
rule Query “man”Query “library”
Id14
The described research was supported by research program on the Development of the State System of
Scientific and Technical Information of the Republic of Belarus for 2009-2010, task No 3.3, sponsored by
theStateCommitteeforScienceandTechnologyoftheRepublicofBelarus.
We are pleased to thank prof. Rauf Sadykhov and prof.Anatoly Sachenko for their assistance.We are also
verygratefultodr.IrynaTurchenkoforthepresentationofourpaper.
Transformation into the
formal language
Knowledge
formation

More Related Content

Similar to Shibut poster i11 168

An Outline Of Type-Theoretical Approaches To Lexical Semantics
An Outline Of Type-Theoretical Approaches To Lexical SemanticsAn Outline Of Type-Theoretical Approaches To Lexical Semantics
An Outline Of Type-Theoretical Approaches To Lexical Semantics
Tye Rausch
 
Procedural Pragmatics and the studyof discourse
Procedural Pragmatics and the studyof discourseProcedural Pragmatics and the studyof discourse
Procedural Pragmatics and the studyof discourse
Louis de Saussure
 
Procedural pragmatics suncorrectedproofs
Procedural pragmatics suncorrectedproofsProcedural pragmatics suncorrectedproofs
Procedural pragmatics suncorrectedproofsLouis de Saussure
 
A corpus driven comparative analysis of modal verbs in pakistani and british ...
A corpus driven comparative analysis of modal verbs in pakistani and british ...A corpus driven comparative analysis of modal verbs in pakistani and british ...
A corpus driven comparative analysis of modal verbs in pakistani and british ...
Alexander Decker
 
15 unit 4
15 unit 415 unit 4
A Statistical Model for Morphology Inspired by the Amis Language
A Statistical Model for Morphology Inspired by the Amis LanguageA Statistical Model for Morphology Inspired by the Amis Language
A Statistical Model for Morphology Inspired by the Amis Language
dannyijwest
 
A statistical model for morphology inspired by the Amis language
A statistical model for morphology inspired by the Amis languageA statistical model for morphology inspired by the Amis language
A statistical model for morphology inspired by the Amis language
IJwest
 
Volodymyr Getmanskyi - “First steps from NLP to NLU” AI&BigDataDay 2017
Volodymyr Getmanskyi - “First steps from NLP to NLU” AI&BigDataDay 2017Volodymyr Getmanskyi - “First steps from NLP to NLU” AI&BigDataDay 2017
Volodymyr Getmanskyi - “First steps from NLP to NLU” AI&BigDataDay 2017
Lviv Startup Club
 
B211120
B211120B211120
The Search For Irony: A Textual Analysis of the Lyrics of Ironic by Alanis Mo...
The Search For Irony: A Textual Analysis of the Lyrics of Ironic by Alanis Mo...The Search For Irony: A Textual Analysis of the Lyrics of Ironic by Alanis Mo...
The Search For Irony: A Textual Analysis of the Lyrics of Ironic by Alanis Mo...
Andy Boon
 
Sfl and lexical cohesion (gianna, maira)
Sfl and lexical cohesion (gianna, maira)Sfl and lexical cohesion (gianna, maira)
Sfl and lexical cohesion (gianna, maira)rominacheme
 
Tensor-based Models of Natural Language Semantics
Tensor-based Models of Natural Language SemanticsTensor-based Models of Natural Language Semantics
Tensor-based Models of Natural Language Semantics
Dimitrios Kartsaklis
 
Full Articles (Volume Two) - The Seventh International Conference on Language...
Full Articles (Volume Two) - The Seventh International Conference on Language...Full Articles (Volume Two) - The Seventh International Conference on Language...
Full Articles (Volume Two) - The Seventh International Conference on Language...
The Annual International Conference on Languages, Linguistics, Translation and Literature
 
Listening comprehension in efl teaching
Listening comprehension in efl teachingListening comprehension in efl teaching
Listening comprehension in efl teachingmora-deyanira
 
Listening Comprehension in EFL Teaching
Listening Comprehension in EFL TeachingListening Comprehension in EFL Teaching
Listening Comprehension in EFL Teachingmora-deyanira
 
Some principles of the formation and development of ethical terms in the Engl...
Some principles of the formation and development of ethical terms in the Engl...Some principles of the formation and development of ethical terms in the Engl...
Some principles of the formation and development of ethical terms in the Engl...
SubmissionResearchpa
 
Macedonian se constructions and their equivalents in english
Macedonian se constructions and their equivalents in englishMacedonian se constructions and their equivalents in english
Macedonian se constructions and their equivalents in english
Croslinguistic
 
Theoretical_grammar_of_english.doc
Theoretical_grammar_of_english.docTheoretical_grammar_of_english.doc
Theoretical_grammar_of_english.doc
ssuser67b22b1
 

Similar to Shibut poster i11 168 (20)

An Outline Of Type-Theoretical Approaches To Lexical Semantics
An Outline Of Type-Theoretical Approaches To Lexical SemanticsAn Outline Of Type-Theoretical Approaches To Lexical Semantics
An Outline Of Type-Theoretical Approaches To Lexical Semantics
 
The sentence and the utterance
The sentence and the utteranceThe sentence and the utterance
The sentence and the utterance
 
Procedural Pragmatics and the studyof discourse
Procedural Pragmatics and the studyof discourseProcedural Pragmatics and the studyof discourse
Procedural Pragmatics and the studyof discourse
 
Procedural pragmatics suncorrectedproofs
Procedural pragmatics suncorrectedproofsProcedural pragmatics suncorrectedproofs
Procedural pragmatics suncorrectedproofs
 
A corpus driven comparative analysis of modal verbs in pakistani and british ...
A corpus driven comparative analysis of modal verbs in pakistani and british ...A corpus driven comparative analysis of modal verbs in pakistani and british ...
A corpus driven comparative analysis of modal verbs in pakistani and british ...
 
15 unit 4
15 unit 415 unit 4
15 unit 4
 
A Statistical Model for Morphology Inspired by the Amis Language
A Statistical Model for Morphology Inspired by the Amis LanguageA Statistical Model for Morphology Inspired by the Amis Language
A Statistical Model for Morphology Inspired by the Amis Language
 
A statistical model for morphology inspired by the Amis language
A statistical model for morphology inspired by the Amis languageA statistical model for morphology inspired by the Amis language
A statistical model for morphology inspired by the Amis language
 
Volodymyr Getmanskyi - “First steps from NLP to NLU” AI&BigDataDay 2017
Volodymyr Getmanskyi - “First steps from NLP to NLU” AI&BigDataDay 2017Volodymyr Getmanskyi - “First steps from NLP to NLU” AI&BigDataDay 2017
Volodymyr Getmanskyi - “First steps from NLP to NLU” AI&BigDataDay 2017
 
B211120
B211120B211120
B211120
 
The Search For Irony: A Textual Analysis of the Lyrics of Ironic by Alanis Mo...
The Search For Irony: A Textual Analysis of the Lyrics of Ironic by Alanis Mo...The Search For Irony: A Textual Analysis of the Lyrics of Ironic by Alanis Mo...
The Search For Irony: A Textual Analysis of the Lyrics of Ironic by Alanis Mo...
 
Sfl and lexical cohesion (gianna, maira)
Sfl and lexical cohesion (gianna, maira)Sfl and lexical cohesion (gianna, maira)
Sfl and lexical cohesion (gianna, maira)
 
Tensor-based Models of Natural Language Semantics
Tensor-based Models of Natural Language SemanticsTensor-based Models of Natural Language Semantics
Tensor-based Models of Natural Language Semantics
 
Full Articles (Volume Two) - The Seventh International Conference on Language...
Full Articles (Volume Two) - The Seventh International Conference on Language...Full Articles (Volume Two) - The Seventh International Conference on Language...
Full Articles (Volume Two) - The Seventh International Conference on Language...
 
Listening comprehension in efl teaching
Listening comprehension in efl teachingListening comprehension in efl teaching
Listening comprehension in efl teaching
 
Listening Comprehension in EFL Teaching
Listening Comprehension in EFL TeachingListening Comprehension in EFL Teaching
Listening Comprehension in EFL Teaching
 
Some principles of the formation and development of ethical terms in the Engl...
Some principles of the formation and development of ethical terms in the Engl...Some principles of the formation and development of ethical terms in the Engl...
Some principles of the formation and development of ethical terms in the Engl...
 
Macedonian se constructions and their equivalents in english
Macedonian se constructions and their equivalents in englishMacedonian se constructions and their equivalents in english
Macedonian se constructions and their equivalents in english
 
Scopos theory
Scopos theoryScopos theory
Scopos theory
 
Theoretical_grammar_of_english.doc
Theoretical_grammar_of_english.docTheoretical_grammar_of_english.doc
Theoretical_grammar_of_english.doc
 

Recently uploaded

如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
yqqaatn0
 
BREEDING METHODS FOR DISEASE RESISTANCE.pptx
BREEDING METHODS FOR DISEASE RESISTANCE.pptxBREEDING METHODS FOR DISEASE RESISTANCE.pptx
BREEDING METHODS FOR DISEASE RESISTANCE.pptx
RASHMI M G
 
ESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptxESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptx
PRIYANKA PATEL
 
What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.
moosaasad1975
 
20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx
Sharon Liu
 
Chapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisisChapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisis
tonzsalvador2222
 
Phenomics assisted breeding in crop improvement
Phenomics assisted breeding in crop improvementPhenomics assisted breeding in crop improvement
Phenomics assisted breeding in crop improvement
IshaGoswami9
 
Toxic effects of heavy metals : Lead and Arsenic
Toxic effects of heavy metals : Lead and ArsenicToxic effects of heavy metals : Lead and Arsenic
Toxic effects of heavy metals : Lead and Arsenic
sanjana502982
 
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdfTopic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
TinyAnderson
 
Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.
Nistarini College, Purulia (W.B) India
 
Nucleophilic Addition of carbonyl compounds.pptx
Nucleophilic Addition of carbonyl  compounds.pptxNucleophilic Addition of carbonyl  compounds.pptx
Nucleophilic Addition of carbonyl compounds.pptx
SSR02
 
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
Wasswaderrick3
 
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
David Osipyan
 
Eukaryotic Transcription Presentation.pptx
Eukaryotic Transcription Presentation.pptxEukaryotic Transcription Presentation.pptx
Eukaryotic Transcription Presentation.pptx
RitabrataSarkar3
 
Deep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless ReproducibilityDeep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless Reproducibility
University of Rennes, INSA Rennes, Inria/IRISA, CNRS
 
The Evolution of Science Education PraxiLabs’ Vision- Presentation (2).pdf
The Evolution of Science Education PraxiLabs’ Vision- Presentation (2).pdfThe Evolution of Science Education PraxiLabs’ Vision- Presentation (2).pdf
The Evolution of Science Education PraxiLabs’ Vision- Presentation (2).pdf
mediapraxi
 
Oedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptxOedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptx
muralinath2
 
ANAMOLOUS SECONDARY GROWTH IN DICOT ROOTS.pptx
ANAMOLOUS SECONDARY GROWTH IN DICOT ROOTS.pptxANAMOLOUS SECONDARY GROWTH IN DICOT ROOTS.pptx
ANAMOLOUS SECONDARY GROWTH IN DICOT ROOTS.pptx
RASHMI M G
 
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdfUnveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Erdal Coalmaker
 
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
yqqaatn0
 

Recently uploaded (20)

如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
 
BREEDING METHODS FOR DISEASE RESISTANCE.pptx
BREEDING METHODS FOR DISEASE RESISTANCE.pptxBREEDING METHODS FOR DISEASE RESISTANCE.pptx
BREEDING METHODS FOR DISEASE RESISTANCE.pptx
 
ESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptxESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptx
 
What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.
 
20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx
 
Chapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisisChapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisis
 
Phenomics assisted breeding in crop improvement
Phenomics assisted breeding in crop improvementPhenomics assisted breeding in crop improvement
Phenomics assisted breeding in crop improvement
 
Toxic effects of heavy metals : Lead and Arsenic
Toxic effects of heavy metals : Lead and ArsenicToxic effects of heavy metals : Lead and Arsenic
Toxic effects of heavy metals : Lead and Arsenic
 
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdfTopic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
 
Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.
 
Nucleophilic Addition of carbonyl compounds.pptx
Nucleophilic Addition of carbonyl  compounds.pptxNucleophilic Addition of carbonyl  compounds.pptx
Nucleophilic Addition of carbonyl compounds.pptx
 
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
 
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
 
Eukaryotic Transcription Presentation.pptx
Eukaryotic Transcription Presentation.pptxEukaryotic Transcription Presentation.pptx
Eukaryotic Transcription Presentation.pptx
 
Deep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless ReproducibilityDeep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless Reproducibility
 
The Evolution of Science Education PraxiLabs’ Vision- Presentation (2).pdf
The Evolution of Science Education PraxiLabs’ Vision- Presentation (2).pdfThe Evolution of Science Education PraxiLabs’ Vision- Presentation (2).pdf
The Evolution of Science Education PraxiLabs’ Vision- Presentation (2).pdf
 
Oedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptxOedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptx
 
ANAMOLOUS SECONDARY GROWTH IN DICOT ROOTS.pptx
ANAMOLOUS SECONDARY GROWTH IN DICOT ROOTS.pptxANAMOLOUS SECONDARY GROWTH IN DICOT ROOTS.pptx
ANAMOLOUS SECONDARY GROWTH IN DICOT ROOTS.pptx
 
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdfUnveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdf
 
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
 

Shibut poster i11 168

  • 1. Selection and Aggregation of Sentences in the Knowledge Formation Process M.S. Shibut, V.S. Yakovishin The Academy of Public Administration under the aegis of the President of the Republic of Belarus, 17, Moskovskaya Str., 220007, Minsk, Republic of Belarus, m_shibut@pac.by, http://pac.by/en Let S , S , S , S , S be sentences, expressed in terms of formal language, as shown in the figure below,1 2 3 4 5 where a, in, o are signs of the secondary sentence parts, p, pt, pPs are signs of the different predicates (for thepresent,pastindefinite,andpresentsimplepassive,respectively). According to the selection rule, the first sentence must be eliminated because of intensional superiority of the second sentence (S Н S ). The sentences S , S , S , S can be integrated in compliance with the1 2 2 3 4 5 aggregation rule. Let “man”, “young man”, “library” be the subjects contained in user's request. Then, as a result of integration on the given subjects, the following three subject knowledge descriptions can be obtained:s({man})={S ,S ,S },s({man,man_a.young})={S ,S },s({library})={S ,S }.2 3 5 2 5 2 4 Knowledge-based text adaptation.The subject knowledge formation can be used as a basis for automatic creation(compiling)ofadapted(user-oriented)textmaterials,suchas -variousinformation-analyticalreviews; -individualelectronictextbooks; -anyotheradaptedtextmaterials. Knowledge-based information search. The information search can be realized as a two-stage process (thatresemblestheoreprocessing): - data search: the usual information retrieval is realized to draw information (as full as possible) from a numberofsources; - knowledge search (“ore dressing”): the obtained results are processed to extract only the important information(“valuableelements”). Knowledge-based machine translation. In the translation of the source text from one natural language to another, the subject knowledge base (where the lexical compatibility is fixed) can be used as a supporting interlingua, that plays the role of an effective filter for screening all the misplaced meanings of polysemous words. The knowledge formation is presented as the process of selection and aggregation of input sentences. In this process, the text sentences are at first transformed into the formal language, and then they are integrated into the knowledge representation. The integration of the sentences that have one and the same subject is considered as a subject knowledge representation, and any collection of the subject knowledge representations, produced in the knowledge formation process, is considered as a user-oriented (“highly tailored”) description of subject field. It is supposed that the subject (usually characterized as “the something or someone that the sentence is about”, “the thing being talked about”) is expressed by a grammatically separated noun phrase that represents either the absolutely independent part of sentence (the formal subject of the division subject-predicate) or the general determinative part, i.e. the attribute that relates to the whole sentence (the actual subject of the division theme-rheme, also known as topic- comment,representingthe“reflectionofthespeaker'sattitudetowardswhatissaid”). The presented here knowledge formation method is based on the using of the special formal language. In the formal language, input text sentences are expressed in the set-theoretical (parenthesis-free, “discrete”) form as sets of their syntactic elements (syntagmes), which allows us to reduce the semantic identification ofsentencestotheusingofstandardset-theoreticalrelationofinclusion. Subject knowledge formation is a growth process in which two formation rules, namely the rules of selectionandaggregationofsentences,mustrealize. Selectionrule:o sentencesS andS mustbeeliminated,ifitisasubset ofanothersentence,i.e.1 2 {S , S }® S , if S КS .1 2 1 1 2 Aggregation rule realizes the integration of already selected sentences: if S , S , ... are sentences that1 2 havethesamesubjectN, theywilluniteinasubjectknowledgerepresentation,i.e. {S ,S , ...}® s(N).1 2 neof the Subject knowledge representation is a set s(N) of sentences S , S , ... with the common subject,1 2 representedbyanounphraseN(containedinuser's request),i.e. s(N){S | К N, i і 1}.i Subject field representation is any collection s(N , N , ...) of subject knowledge representation produced1 2 intheknowledgeformationprocess,i.e. s(N , N , ...) = {s(N ), s(N ), ... },1 2 1 2 where N , N , are noun phrases that play the role of subjects in the division “subject-predicate” or in the1 2 actualdivision“theme-rheme”. Si Stepwise subordination: Syntagme: (as in The book of the new author) (as in The new book) (X ∆ (X X ))={X ∆ X , X ∆ X }1 1 2 2 3 1 1 2 2 2 3∆ (X ∆X )={X ∆X }1 2 1 2 Collateral subordination: (as in The new book of the author) ((X ∆ X )∆ X )={X ∆ X , X ∆ X }1 1 2 2 3 1 1 2 1 2 3 Multisyntagme: (as in The new and old books) (X ∆(X СX ))={X ∆X , X ∆X }1 2 3 1 2 1 3 Subject (absolutely independent part): (as in The man reads a book) ((X ∆ X )∆ X )={X , X ∆ X , X ∆ X }1 1 2 2 3 1 1 1 2 1 2 3 Theme (topic): (as in In the evening, the man reads a book) ((X ∆ (X ∆ X ))∆ X )={∆ X , X , X ∆ X , X ∆ X }1 1 2 2 3 3 4 3 4 1 1 1 2 2 2 3 The book The book The man The man of the author reads reads new new a book a book in the evening dependent member dependent member dependent members homogeneous parts subject subjecttheme head member head member head members The book The book of the authornew new and old Input sentences 1. The young man reads a book. 2. The young man reads a book in the library. 3. The man walked in the park. 4. The library is situated in a graceful street. 5. The young man kicked the ball. … Knowledge representation 1. man, man_a.young, man_p.read, read_o.book 2. man, man_a.young, man_p.read, read_o.book, read_in.library 3. man, man_pt.walk, walk_ in.park 4. library, library_pPs. situate, situate_in.street, street_a. graceful … Knowledge representation 2. man, man_a.young, man_p.read, read_o.book, read_in.library 3. man, man_pt.walk, walk_ in.park 4. library, library_pPs. situate, situate_in.street, street_a. graceful … Knowledge representation for “library” __________________________ … 4. library, library_pPs. situate, situate_in.street, street_a. graceful 2.man, man_a.young, man_p.read, read_o.book,read_in.library Knowledge representation for “man” 2. man, man_a.young, man_p.read, read_o.book, read_in.library 3. man_pt.walk, walk_ in.park __________________________ … User-oriented description of subject field 2. The library is situated in a graceful street. 4. The young man reads a book in the library. User-oriented description of subject field 2. The man walked in the park. 3. The young man reads a book in the library. 4. The young man kicked the ball. Selection rule Aggregation rule Query “man”Query “library” Id14 The described research was supported by research program on the Development of the State System of Scientific and Technical Information of the Republic of Belarus for 2009-2010, task No 3.3, sponsored by theStateCommitteeforScienceandTechnologyoftheRepublicofBelarus. We are pleased to thank prof. Rauf Sadykhov and prof.Anatoly Sachenko for their assistance.We are also verygratefultodr.IrynaTurchenkoforthepresentationofourpaper. Transformation into the formal language Knowledge formation