Varvara Krayvanova, AltSTU, Barnaul
krayvanova@gmail.com
Problem and tasks
What do we want to have?
Long scientific text -> MAGIC -> wikification; ontology; illustrations (IDEF0, use case, activity); slides for lectures.
What do we need to do?
Long scientific text -> split into fragments (definitions, nature process, methodology) -> special algorithms -> ontology, IDEF0, use case, nature process.
Model of text fragmentation
How can we split a text into semantic fragments?
$T = \langle s_k \rangle$ is a natural language text,
where $s_k$ is the $k$-th sentence of the text.
The window $W_{i,j} = \langle s_i, \ldots, s_j \rangle$ is a contiguous
sequence of sentences of the text, where
• $i$ is the number of the first sentence,
• $j$ is the number of the last sentence,
• $L = j - i$ is the window size.
We take all windows of size $L$.
For each window we calculate:
(1) Total count of nouns, $P_{Noun}$
(2) Count of different nouns, $P_{DiffNoun}$
(3) Total count of verbs, $P_{Verb}$
(4) Count of different verbs, $P_{DiffVerb}$
(5) Total count of adjectives, $P_{Adj}$
(6) Count of different adjectives, $P_{DiffAdj}$
We then cluster the set of windows
using these parameters.
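The six window parameters can be sketched in a few lines of Python. This is only an illustration under assumptions that are not in the slides: each sentence is already lemmatized and tagged as a list of (lemma, part of speech) pairs, the tag names NOUN, VERB and ADJ are placeholders, and the function name window_features is hypothetical. Following the definition L = j - i, a window of size L spans L + 1 sentences.

```python
# Tag names are assumed; substitute the tag set of the tagger in use.
POS_TAGS = ("NOUN", "VERB", "ADJ")

def window_features(sentences, size):
    """Return one 6-number tuple per window W_{i,j} with j - i = size.

    The tuple order is (PNoun, PDiffNoun, PVerb, PDiffVerb, PAdj, PDiffAdj).
    `sentences` is a list of sentences, each a list of (lemma, pos) pairs.
    """
    features = []
    for i in range(len(sentences) - size):
        window = sentences[i:i + size + 1]  # sentences s_i .. s_j
        row = []
        for tag in POS_TAGS:
            lemmas = [lemma for sent in window
                      for lemma, pos in sent if pos == tag]
            row.append(len(lemmas))       # total count
            row.append(len(set(lemmas)))  # count of different lemmas
        features.append(tuple(row))
    return features

# Hypothetical three-sentence text, already tagged.
example = [
    [("snow", "NOUN"), ("fall", "VERB")],
    [("snow", "NOUN"), ("white", "ADJ"), ("be", "VERB")],
    [("crystal", "NOUN"), ("form", "VERB"), ("form", "VERB")],
]
vectors = window_features(example, 1)
```

The resulting vectors can be passed to any clustering algorithm; the slides do not say which one was used.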
OK, we have this:
I can read it, but I don't want to.
What about
this fragment?
Each sentence $s_k$ is assigned to some cluster $c$
from a finite set of clusters $C$.
Verb nest
How can we detect the fragment type?
$V_k$ is the set of verbs in the sentence $s_k$.
$E_v = \langle s_k \mid v \in V_k \rangle$ is the ordered list of
sentences that contain the verb $v$.
• $V_{unic}$: rare verbs, for which $|E_v|$ is below a threshold.
• $V_{common}$: common verbs.
• $V_{marker}$: verbs-markers.
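As a rough sketch, the list of sentences for each verb is an inverted index from verbs to sentence numbers, and the rare verbs are those whose list is shorter than a threshold. The function names and the parameter min_count are hypothetical, and sentences are represented by their indices rather than their text.

```python
from collections import defaultdict

def verb_occurrences(verb_sets):
    """Map each verb v to E_v, the ordered list of indices k with v in V_k.

    `verb_sets[k]` is V_k, the set of verbs of sentence s_k.
    """
    occurrences = defaultdict(list)
    for k, verbs in enumerate(verb_sets):
        for v in verbs:
            occurrences[v].append(k)  # k only grows, so each list stays ordered
    return dict(occurrences)

def rare_verbs(occurrences, min_count):
    """Verbs with |E_v| < min_count (min_count is an assumed threshold name)."""
    return {v for v, e_v in occurrences.items() if len(e_v) < min_count}

# Hypothetical verb sets for four sentences.
verb_sets = [{"be", "have"}, {"be"}, {"calculate"}, {"be", "calculate"}]
occ = verb_occurrences(verb_sets)
rare = rare_verbs(occ, 2)
```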
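$T_v$ is the textual neighborhood of the verb $v$: the set of sentences $s_i$
for which some $s_k \in E_v$ satisfies a bound on the distance $|k - i|$.
The part of the neighborhood that falls in a cluster $c$ is $c_v = T_v \cap c$.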
The verb $v_m$ is a marker of cluster $c$ if:
• $|c_{v_m}| / |T_{v_m}|$ exceeds a threshold,
• $\forall a \in C$: $|a_{v_m}| / |T_{v_m}|$ does not exceed a threshold.
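The two marker conditions can be sketched as a share test over the neighborhood. Several things here are assumptions rather than the author's stated method: the threshold names min_share and max_other_share, the reading that the second condition applies to the clusters other than c, and the function name marker_cluster.

```python
from collections import Counter

def marker_cluster(t_vm, labels, min_share, max_other_share):
    """Return the cluster that the verb marks, or None if there is none.

    t_vm: sentence indices in the verb's textual neighborhood.
    labels[i]: cluster of sentence i.
    """
    if not t_vm:
        return None
    counts = Counter(labels[i] for i in t_vm)
    total = len(t_vm)
    for c, n in counts.items():
        # Assumed reading: every cluster other than c must stay at or
        # below the second threshold.
        others_ok = all(m / total <= max_other_share
                        for a, m in counts.items() if a != c)
        if n / total > min_share and others_ok:
            return c
    return None

# Hypothetical labels for six sentences.
labels = [0, 0, 0, 1, 2, 2]
marked = marker_cluster([0, 1, 2, 3], labels, 0.6, 0.3)
unmarked = marker_cluster([2, 3, 4, 5], labels, 0.6, 0.3)
```

In the first call three of the four neighborhood sentences lie in cluster 0, so the verb marks that cluster; in the second call no cluster reaches the assumed 0.6 share.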
$N = \{ v \mid E_v \subseteq T_{v_m} \}$ is the text nest of the verb-marker $v_m$.
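A short sketch of the nest, assuming it collects the verbs whose sentence lists lie entirely inside the neighborhood of the verb-marker. The inclusion reading and the function name text_nest are assumptions; sentences are again identified by index.

```python
def text_nest(occurrences, t_vm):
    """Verbs whose every occurrence lies inside the marker's neighborhood.

    occurrences: dict mapping each verb v to its index list E_v.
    t_vm: indices of the textual neighborhood of the verb-marker.
    """
    inside = set(t_vm)
    return {v for v, e_v in occurrences.items()
            if e_v and set(e_v) <= inside}

# Hypothetical occurrence lists and neighborhood.
occurrences = {"calculate": [2, 5], "register": [3], "be": [0, 3, 9]}
nest = text_nest(occurrences, [1, 2, 3, 4, 5, 6])
```

Under this reading the verb-marker itself always belongs to its own nest, since each of its sentences is in its own neighborhood.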
Cluster annotation (expert) and verbs-markers (automatic extraction)

Cluster 1. Description of the research objects: introduction, definitions and process of snow formation.
Verbs-markers: СЛУЖИТЬ, СМОТРЕТЬ, ЗАВИСЕТЬ, ЯВЛЯТЬСЯ, ОПРЕДЕЛЯТЬ, ИМЕТЬ, ПРОИСХОДИТЬ
(TO SERVE, TO WATCH, TO DEPEND, TO BE, TO DEFINE, TO HAVE, TO HAPPEN)

Cluster 2. Chapter on calculations and laboratory processing of research results, various classification tables, fragments on parameter measurement.
Verbs-markers: ВЫЧИСЛЯТЬ, ВЫЧИСЛЯТЬСЯ, ЗАПИСЫВАТЬСЯ
(TO CALCULATE, TO BE CALCULATED, TO REGISTER)

Cluster 3. Observation methodology: marking of observation areas, equipment and recommendations.
Verbs-markers: СОСТОЯТЬ, ПРИНИМАТЬСЯ, ИСПОЛЬЗОВАТЬ, РЕКОМЕНДОВАТЬ, БЫТЬ
(TO CONSIST, TO BE TAKEN, TO USE, TO RECOMMEND, TO BE as a linking verb)

Varvara Krayvanova - Automatic selection of verbs-markers for segmentation task of process descriptions in natural language texts
