SlideShare a Scribd company logo
STAT Requirement Analysis  2/24/09
New Goal: Avoiding “analysis paralysis” ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Flowchart and Definitions CorpusReader  reads text from a source into  Corpus . No processing is done and everything (label, metadata, etc.) stays in text format. FeatureExtrator  converts the text in  Document s to features* Annotator  transforms the corpus into another corpus by adding annotations Corpus  is a set of  Document s in text format. Dataset  is a set of  Instance s which are feature representation of  Document  text. Learner  uses the dataset to learn a model Model  is a set parameters* learned from the data by  Learner   * Not modeled Classifier  uses the model to predict classes in and produces a  Classification   Classification  contains predictions and information about them ClassificationEvaluator  computes the evaluation metrics for the  Classification Text Data Corpus Dataset Model Classification ClassificationEvaluation  contains evaluation metrics  Classification Evaluation
CorpusReader FeatureExtractor Dataset Learner Model Classifier Classification ClassificationEvaluator ClassificationEvaluation Document Instance contains contains processed-by produces learns-from produced-by used-by produces evaluated-by classified-by produces produces 1 * 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 * 1 1 STAT Domain Model – v5 1 evaluate-on 1 This object holds the evaluation results calculated by ClassificationEvaluator and also reference to all relevant objects to produce useful reports Annotator Corpus 1 modifies 0..1 * Notably absent from this diagram are classes needed by Corpus (e.g.,WordList, Split,  etc.) or by machine learning component s(e.g., DistanceMetric, ProbabilityDistribution, etc.) CorpusReader provides the protected variation for input sources (file, web, etc.) Instances are representation of documents needed for machine learning Classification hold the predictions produced by Classifier  Document could contain Labels, Annotations, MetaData, etc. Any class to mutates the corpus, e.g., POSTagger Any class that converts the document components to feature vector. Also includes feature selection, aggregation, etc.

More Related Content

What's hot

Mapping inheritance structures_mapping_class
Mapping inheritance structures_mapping_classMapping inheritance structures_mapping_class
Mapping inheritance structures_mapping_classTodor Kolev
 
SAP ABAP data dictionary
SAP ABAP data dictionarySAP ABAP data dictionary
SAP ABAP data dictionary
Revanth Nagaraju
 
Lecture03 abap on line
Lecture03 abap on lineLecture03 abap on line
Lecture03 abap on lineMilind Patil
 
Analytics machine learning in weka
Analytics machine learning in wekaAnalytics machine learning in weka
Analytics machine learning in weka
Sudhakar Chavan
 
Finding everything about findings about (fa)
Finding everything about findings about (fa)Finding everything about findings about (fa)
Finding everything about findings about (fa)Ram Gali
 
MS Sql Server: Reporting manipulating data
MS Sql Server: Reporting manipulating dataMS Sql Server: Reporting manipulating data
MS Sql Server: Reporting manipulating data
DataminingTools Inc
 
Dynamic Data Validation Lists
Dynamic Data Validation ListsDynamic Data Validation Lists
Dynamic Data Validation Lists
Marc Rivait, PMP
 
3rd june
3rd june3rd june
Entity relationship diagram_KSMS_BCA_2nd Sem_Saborni Pal_20th March 2020
Entity relationship diagram_KSMS_BCA_2nd Sem_Saborni Pal_20th March 2020Entity relationship diagram_KSMS_BCA_2nd Sem_Saborni Pal_20th March 2020
Entity relationship diagram_KSMS_BCA_2nd Sem_Saborni Pal_20th March 2020
PriyankaBhattacharya28
 
Acutate erd pro
Acutate erd proAcutate erd pro
Acutate erd pro
Aishwarya Savant
 
Three tier Architecture of ASP_Net
Three tier Architecture of ASP_NetThree tier Architecture of ASP_Net
Three tier Architecture of ASP_Net
Biswadip Goswami
 
Using an SQL coverage measurement for testing database application
Using an SQL coverage measurement for testing database applicationUsing an SQL coverage measurement for testing database application
Using an SQL coverage measurement for testing database application
KhalidKhan412
 

What's hot (17)

Mapping inheritance structures_mapping_class
Mapping inheritance structures_mapping_classMapping inheritance structures_mapping_class
Mapping inheritance structures_mapping_class
 
Cis145 Final Review
Cis145 Final ReviewCis145 Final Review
Cis145 Final Review
 
SAP ABAP data dictionary
SAP ABAP data dictionarySAP ABAP data dictionary
SAP ABAP data dictionary
 
Lecture03 abap on line
Lecture03 abap on lineLecture03 abap on line
Lecture03 abap on line
 
CIS145 Final Review
CIS145 Final ReviewCIS145 Final Review
CIS145 Final Review
 
Analytics machine learning in weka
Analytics machine learning in wekaAnalytics machine learning in weka
Analytics machine learning in weka
 
Cis145 Final Review
Cis145 Final ReviewCis145 Final Review
Cis145 Final Review
 
Finding everything about findings about (fa)
Finding everything about findings about (fa)Finding everything about findings about (fa)
Finding everything about findings about (fa)
 
Cis266 final review
Cis266 final reviewCis266 final review
Cis266 final review
 
MS Sql Server: Reporting manipulating data
MS Sql Server: Reporting manipulating dataMS Sql Server: Reporting manipulating data
MS Sql Server: Reporting manipulating data
 
Abap dictionary 1
Abap dictionary 1Abap dictionary 1
Abap dictionary 1
 
Dynamic Data Validation Lists
Dynamic Data Validation ListsDynamic Data Validation Lists
Dynamic Data Validation Lists
 
3rd june
3rd june3rd june
3rd june
 
Entity relationship diagram_KSMS_BCA_2nd Sem_Saborni Pal_20th March 2020
Entity relationship diagram_KSMS_BCA_2nd Sem_Saborni Pal_20th March 2020Entity relationship diagram_KSMS_BCA_2nd Sem_Saborni Pal_20th March 2020
Entity relationship diagram_KSMS_BCA_2nd Sem_Saborni Pal_20th March 2020
 
Acutate erd pro
Acutate erd proAcutate erd pro
Acutate erd pro
 
Three tier Architecture of ASP_Net
Three tier Architecture of ASP_NetThree tier Architecture of ASP_Net
Three tier Architecture of ASP_Net
 
Using an SQL coverage measurement for testing database application
Using an SQL coverage measurement for testing database applicationUsing an SQL coverage measurement for testing database application
Using an SQL coverage measurement for testing database application
 

Viewers also liked

Organi-Deviance Part I
Organi-Deviance Part IOrgani-Deviance Part I
Organi-Deviance Part I
cmhusted
 
Summary Of Dissertation Presentation
Summary Of Dissertation PresentationSummary Of Dissertation Presentation
Summary Of Dissertation Presentationcmhusted
 
Is A Corporate Criminal Profile Possible
Is A Corporate Criminal Profile PossibleIs A Corporate Criminal Profile Possible
Is A Corporate Criminal Profile Possiblecmhusted
 
Requirementv4
Requirementv4Requirementv4
Requirementv4stat
 
Op weg naar de grote wereld
Op weg naar de grote wereldOp weg naar de grote wereld
Op weg naar de grote wereldikknip
 
Stat Tech Reportv1
Stat Tech Reportv1Stat Tech Reportv1
Stat Tech Reportv1stat
 
STAT Requirement Analysis
STAT Requirement AnalysisSTAT Requirement Analysis
STAT Requirement Analysisstat
 
Sosialisasi Pemilu 2009
Sosialisasi Pemilu 2009Sosialisasi Pemilu 2009
Sosialisasi Pemilu 2009
Panda Friendster
 
Riesgos laborales
Riesgos laboralesRiesgos laborales
Riesgos laboralesasilva100
 
ИБЛИОТЕКА В ЦИФРОВОЙ СРЕДЕ: УПАДОК И РАЗРУШЕНИЕ ИЛИ НАЧАЛО НОВОЙ ИСТОРИИ?
ИБЛИОТЕКА В ЦИФРОВОЙ СРЕДЕ: УПАДОК И РАЗРУШЕНИЕ ИЛИ НАЧАЛО НОВОЙ ИСТОРИИ?ИБЛИОТЕКА В ЦИФРОВОЙ СРЕДЕ: УПАДОК И РАЗРУШЕНИЕ ИЛИ НАЧАЛО НОВОЙ ИСТОРИИ?
ИБЛИОТЕКА В ЦИФРОВОЙ СРЕДЕ: УПАДОК И РАЗРУШЕНИЕ ИЛИ НАЧАЛО НОВОЙ ИСТОРИИ?
Андрей Федоров
 
kammersieger2012.pdf
kammersieger2012.pdfkammersieger2012.pdf
kammersieger2012.pdf
unn | UNITED NEWS NETWORK GmbH
 
Asiaticos en el barrio de la Refinería
Asiaticos en el barrio de la RefineríaAsiaticos en el barrio de la Refinería
Asiaticos en el barrio de la Refinería
Soccorso Volpe
 
Koolituse e-õppekeskkond
Koolituse e-õppekeskkondKoolituse e-õppekeskkond
Koolituse e-õppekeskkond
Hans Põldoja
 
Blender
BlenderBlender
Blender
Mauro Fava
 
PARA MIS VERDADEROS AMIGOS
PARA MIS VERDADEROS AMIGOSPARA MIS VERDADEROS AMIGOS
PARA MIS VERDADEROS AMIGOS
guest521b81
 
Picture dictionary
Picture dictionaryPicture dictionary
Picture dictionaryJs Alonso
 

Viewers also liked (20)

Organi-Deviance Part I
Organi-Deviance Part IOrgani-Deviance Part I
Organi-Deviance Part I
 
我愛上攝影
我愛上攝影我愛上攝影
我愛上攝影
 
Summary Of Dissertation Presentation
Summary Of Dissertation PresentationSummary Of Dissertation Presentation
Summary Of Dissertation Presentation
 
Is A Corporate Criminal Profile Possible
Is A Corporate Criminal Profile PossibleIs A Corporate Criminal Profile Possible
Is A Corporate Criminal Profile Possible
 
Requirementv4
Requirementv4Requirementv4
Requirementv4
 
Op weg naar de grote wereld
Op weg naar de grote wereldOp weg naar de grote wereld
Op weg naar de grote wereld
 
Stat Tech Reportv1
Stat Tech Reportv1Stat Tech Reportv1
Stat Tech Reportv1
 
Sample
SampleSample
Sample
 
STAT Requirement Analysis
STAT Requirement AnalysisSTAT Requirement Analysis
STAT Requirement Analysis
 
Sosialisasi Pemilu 2009
Sosialisasi Pemilu 2009Sosialisasi Pemilu 2009
Sosialisasi Pemilu 2009
 
Riesgos laborales
Riesgos laboralesRiesgos laborales
Riesgos laborales
 
ИБЛИОТЕКА В ЦИФРОВОЙ СРЕДЕ: УПАДОК И РАЗРУШЕНИЕ ИЛИ НАЧАЛО НОВОЙ ИСТОРИИ?
ИБЛИОТЕКА В ЦИФРОВОЙ СРЕДЕ: УПАДОК И РАЗРУШЕНИЕ ИЛИ НАЧАЛО НОВОЙ ИСТОРИИ?ИБЛИОТЕКА В ЦИФРОВОЙ СРЕДЕ: УПАДОК И РАЗРУШЕНИЕ ИЛИ НАЧАЛО НОВОЙ ИСТОРИИ?
ИБЛИОТЕКА В ЦИФРОВОЙ СРЕДЕ: УПАДОК И РАЗРУШЕНИЕ ИЛИ НАЧАЛО НОВОЙ ИСТОРИИ?
 
kammersieger2012.pdf
kammersieger2012.pdfkammersieger2012.pdf
kammersieger2012.pdf
 
Asiaticos en el barrio de la Refinería
Asiaticos en el barrio de la RefineríaAsiaticos en el barrio de la Refinería
Asiaticos en el barrio de la Refinería
 
Prog
ProgProg
Prog
 
Koolituse e-õppekeskkond
Koolituse e-õppekeskkondKoolituse e-õppekeskkond
Koolituse e-õppekeskkond
 
Blender
BlenderBlender
Blender
 
PARA MIS VERDADEROS AMIGOS
PARA MIS VERDADEROS AMIGOSPARA MIS VERDADEROS AMIGOS
PARA MIS VERDADEROS AMIGOS
 
Picture dictionary
Picture dictionaryPicture dictionary
Picture dictionary
 
MỞ BÁN CĂN HỘ EMERALD TRUNG TÂM QUẬN THỦ ĐỨC
MỞ BÁN CĂN HỘ EMERALD TRUNG TÂM QUẬN THỦ ĐỨC MỞ BÁN CĂN HỘ EMERALD TRUNG TÂM QUẬN THỦ ĐỨC
MỞ BÁN CĂN HỘ EMERALD TRUNG TÂM QUẬN THỦ ĐỨC
 

Similar to Stat2 25 09

Spark SQL In Depth www.syedacademy.com
Spark SQL In Depth www.syedacademy.comSpark SQL In Depth www.syedacademy.com
Spark SQL In Depth www.syedacademy.com
Syed Hadoop
 
47468272 introduction-to-informatica
47468272 introduction-to-informatica47468272 introduction-to-informatica
47468272 introduction-to-informatica
Venkat485
 
Oracle report from ppt
Oracle report from pptOracle report from ppt
Oracle report from ppt
kingshuk_goswami
 
Reengineering PDF-Based Documents Targeting Complex Software Specifications
Reengineering PDF-Based Documents Targeting Complex Software SpecificationsReengineering PDF-Based Documents Targeting Complex Software Specifications
Reengineering PDF-Based Documents Targeting Complex Software Specifications
Moutasm Tamimi
 
Java Annotations
Java AnnotationsJava Annotations
Java Annotations
Serhii Kartashov
 
Machine Learning in GATE Valentin Tablan
Machine Learning in GATE Valentin TablanMachine Learning in GATE Valentin Tablan
Machine Learning in GATE Valentin Tablanbutest
 
Cognos framework manager
Cognos framework managerCognos framework manager
Cognos framework manager
maxonlinetr
 
IRJET- Resume Information Extraction Framework
IRJET- Resume Information Extraction FrameworkIRJET- Resume Information Extraction Framework
IRJET- Resume Information Extraction Framework
IRJET Journal
 
Creation of a Test Bed Environment for Core Java Applications using White Box...
Creation of a Test Bed Environment for Core Java Applications using White Box...Creation of a Test Bed Environment for Core Java Applications using White Box...
Creation of a Test Bed Environment for Core Java Applications using White Box...
cscpconf
 
Overview of query evaluation
Overview of query evaluationOverview of query evaluation
Overview of query evaluationavniS
 
Data Science Process.pptx
Data Science Process.pptxData Science Process.pptx
Data Science Process.pptx
WidsoulDevil
 
BERT QnA System for Airplane Flight Manual
BERT QnA System for Airplane Flight ManualBERT QnA System for Airplane Flight Manual
BERT QnA System for Airplane Flight Manual
ArkaGhosh65
 
PM Architecture
PM ArchitecturePM Architecture
PM Architecture
Traction Software
 
A Recommender System for Refining Ekeko/X Transformation
A Recommender System for Refining Ekeko/X TransformationA Recommender System for Refining Ekeko/X Transformation
A Recommender System for Refining Ekeko/X Transformation
Coen De Roover
 
Python programming
Python programmingPython programming
Python programming
Swetha544947
 

Similar to Stat2 25 09 (20)

 
Spark SQL In Depth www.syedacademy.com
Spark SQL In Depth www.syedacademy.comSpark SQL In Depth www.syedacademy.com
Spark SQL In Depth www.syedacademy.com
 
47468272 introduction-to-informatica
47468272 introduction-to-informatica47468272 introduction-to-informatica
47468272 introduction-to-informatica
 
Ch08
Ch08Ch08
Ch08
 
Ch08
Ch08Ch08
Ch08
 
Oracle report from ppt
Oracle report from pptOracle report from ppt
Oracle report from ppt
 
Reengineering PDF-Based Documents Targeting Complex Software Specifications
Reengineering PDF-Based Documents Targeting Complex Software SpecificationsReengineering PDF-Based Documents Targeting Complex Software Specifications
Reengineering PDF-Based Documents Targeting Complex Software Specifications
 
Java Annotations
Java AnnotationsJava Annotations
Java Annotations
 
Machine Learning in GATE Valentin Tablan
Machine Learning in GATE Valentin TablanMachine Learning in GATE Valentin Tablan
Machine Learning in GATE Valentin Tablan
 
Cognos framework manager
Cognos framework managerCognos framework manager
Cognos framework manager
 
IRJET- Resume Information Extraction Framework
IRJET- Resume Information Extraction FrameworkIRJET- Resume Information Extraction Framework
IRJET- Resume Information Extraction Framework
 
Ax
AxAx
Ax
 
Creation of a Test Bed Environment for Core Java Applications using White Box...
Creation of a Test Bed Environment for Core Java Applications using White Box...Creation of a Test Bed Environment for Core Java Applications using White Box...
Creation of a Test Bed Environment for Core Java Applications using White Box...
 
Overview of query evaluation
Overview of query evaluationOverview of query evaluation
Overview of query evaluation
 
Data Science Process.pptx
Data Science Process.pptxData Science Process.pptx
Data Science Process.pptx
 
BERT QnA System for Airplane Flight Manual
BERT QnA System for Airplane Flight ManualBERT QnA System for Airplane Flight Manual
BERT QnA System for Airplane Flight Manual
 
PM Architecture
PM ArchitecturePM Architecture
PM Architecture
 
A Recommender System for Refining Ekeko/X Transformation
A Recommender System for Refining Ekeko/X TransformationA Recommender System for Refining Ekeko/X Transformation
A Recommender System for Refining Ekeko/X Transformation
 
3 analysis and design overview
3 analysis and design overview3 analysis and design overview
3 analysis and design overview
 
Python programming
Python programmingPython programming
Python programming
 

Recently uploaded

The French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free downloadThe French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free download
Vivekanand Anglo Vedic Academy
 
Azure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHatAzure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHat
Scholarhat
 
Normal Labour/ Stages of Labour/ Mechanism of Labour
Normal Labour/ Stages of Labour/ Mechanism of LabourNormal Labour/ Stages of Labour/ Mechanism of Labour
Normal Labour/ Stages of Labour/ Mechanism of Labour
Wasim Ak
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
DeeptiGupta154
 
A Survey of Techniques for Maximizing LLM Performance.pptx
A Survey of Techniques for Maximizing LLM Performance.pptxA Survey of Techniques for Maximizing LLM Performance.pptx
A Survey of Techniques for Maximizing LLM Performance.pptx
thanhdowork
 
How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...
Jisc
 
STRAND 3 HYGIENIC PRACTICES.pptx GRADE 7 CBC
STRAND 3 HYGIENIC PRACTICES.pptx GRADE 7 CBCSTRAND 3 HYGIENIC PRACTICES.pptx GRADE 7 CBC
STRAND 3 HYGIENIC PRACTICES.pptx GRADE 7 CBC
kimdan468
 
S1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptxS1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptx
tarandeep35
 
The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
Delapenabediema
 
Group Presentation 2 Economics.Ariana Buscigliopptx
Group Presentation 2 Economics.Ariana BuscigliopptxGroup Presentation 2 Economics.Ariana Buscigliopptx
Group Presentation 2 Economics.Ariana Buscigliopptx
ArianaBusciglio
 
Language Across the Curriculm LAC B.Ed.
Language Across the  Curriculm LAC B.Ed.Language Across the  Curriculm LAC B.Ed.
Language Across the Curriculm LAC B.Ed.
Atul Kumar Singh
 
Digital Artifact 2 - Investigating Pavilion Designs
Digital Artifact 2 - Investigating Pavilion DesignsDigital Artifact 2 - Investigating Pavilion Designs
Digital Artifact 2 - Investigating Pavilion Designs
chanes7
 
CACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdfCACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdf
camakaiclarkmusic
 
Chapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptxChapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptx
Mohd Adib Abd Muin, Senior Lecturer at Universiti Utara Malaysia
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
Special education needs
 
Chapter -12, Antibiotics (One Page Notes).pdf
Chapter -12, Antibiotics (One Page Notes).pdfChapter -12, Antibiotics (One Page Notes).pdf
Chapter -12, Antibiotics (One Page Notes).pdf
Kartik Tiwari
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
siemaillard
 
Thesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.pptThesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.ppt
EverAndrsGuerraGuerr
 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
JosvitaDsouza2
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
Jisc
 

Recently uploaded (20)

The French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free downloadThe French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free download
 
Azure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHatAzure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHat
 
Normal Labour/ Stages of Labour/ Mechanism of Labour
Normal Labour/ Stages of Labour/ Mechanism of LabourNormal Labour/ Stages of Labour/ Mechanism of Labour
Normal Labour/ Stages of Labour/ Mechanism of Labour
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
 
A Survey of Techniques for Maximizing LLM Performance.pptx
A Survey of Techniques for Maximizing LLM Performance.pptxA Survey of Techniques for Maximizing LLM Performance.pptx
A Survey of Techniques for Maximizing LLM Performance.pptx
 
How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...
 
STRAND 3 HYGIENIC PRACTICES.pptx GRADE 7 CBC
STRAND 3 HYGIENIC PRACTICES.pptx GRADE 7 CBCSTRAND 3 HYGIENIC PRACTICES.pptx GRADE 7 CBC
STRAND 3 HYGIENIC PRACTICES.pptx GRADE 7 CBC
 
S1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptxS1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptx
 
The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
 
Group Presentation 2 Economics.Ariana Buscigliopptx
Group Presentation 2 Economics.Ariana BuscigliopptxGroup Presentation 2 Economics.Ariana Buscigliopptx
Group Presentation 2 Economics.Ariana Buscigliopptx
 
Language Across the Curriculm LAC B.Ed.
Language Across the  Curriculm LAC B.Ed.Language Across the  Curriculm LAC B.Ed.
Language Across the Curriculm LAC B.Ed.
 
Digital Artifact 2 - Investigating Pavilion Designs
Digital Artifact 2 - Investigating Pavilion DesignsDigital Artifact 2 - Investigating Pavilion Designs
Digital Artifact 2 - Investigating Pavilion Designs
 
CACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdfCACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdf
 
Chapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptxChapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptx
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
 
Chapter -12, Antibiotics (One Page Notes).pdf
Chapter -12, Antibiotics (One Page Notes).pdfChapter -12, Antibiotics (One Page Notes).pdf
Chapter -12, Antibiotics (One Page Notes).pdf
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
 
Thesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.pptThesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.ppt
 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
 

Stat2 25 09

  • 2.
  • 3. Flowchart and Definitions CorpusReader reads text from a source into Corpus . No processing is done and everything (label, metadata, etc.) stays in text format. FeatureExtrator converts the text in Document s to features* Annotator transforms the corpus into another corpus by adding annotations Corpus is a set of Document s in text format. Dataset is a set of Instance s which are feature representation of Document text. Learner uses the dataset to learn a model Model is a set parameters* learned from the data by Learner * Not modeled Classifier uses the model to predict classes in and produces a Classification Classification contains predictions and information about them ClassificationEvaluator computes the evaluation metrics for the Classification Text Data Corpus Dataset Model Classification ClassificationEvaluation contains evaluation metrics Classification Evaluation
  • 4. CorpusReader FeatureExtractor Dataset Learner Model Classifier Classification ClassificationEvaluator ClassificationEvaluation Document Instance contains contains processed-by produces learns-from produced-by used-by produces evaluated-by classified-by produces produces 1 * 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 * 1 1 STAT Domain Model – v5 1 evaluate-on 1 This object holds the evaluation results calculated by ClassificationEvaluator and also reference to all relevant objects to produce useful reports Annotator Corpus 1 modifies 0..1 * Notably absent from this diagram are classes needed by Corpus (e.g.,WordList, Split, etc.) or by machine learning component s(e.g., DistanceMetric, ProbabilityDistribution, etc.) CorpusReader provides the protected variation for input sources (file, web, etc.) Instances are representation of documents needed for machine learning Classification hold the predictions produced by Classifier Document could contain Labels, Annotations, MetaData, etc. Any class to mutates the corpus, e.g., POSTagger Any class that converts the document components to feature vector. Also includes feature selection, aggregation, etc.