SlideShare a Scribd company logo
1 of 10
TEXT MINING
NAME:- AKASH KUMAR SINHA
CLASS:- CSE-A
College Roll:- 192015028 [15]
Univ. Roll:- 12500119160
Contents
• Introduction
• Data Mining Vs. Text Mining
• I/O model for Text Mining
• Steps for Text Mining
• Application of Text Mining
• Demerits of Text Mining
• Refrences
Introduction
• Text mining is a Discovery
• It is also known as Text Data
Mining(TDM)
• It is used to extract relevant
information or knowledge from
different sources which are
unstructured or semi-structured.
Data Mining vs Text Mining
Data Mining Text Mining
Process to extract information from
datasets
It’s a part of Data Mining which
processes the text only.
Databases are used to gather data Text is used to gather data.
Data is Homogeneous Data is Heterogeneous
Data is easy to Retrieve Data is not easy to Retrieve
Data is stored in Structured Format Data is stored in Un-structed Format
I/O MODEL FOR
TEXT MINING
Text
Mining
Technique
Pattern
Connections
Trends
OUTPUT
INPUT
TEXT
DOCUMENT
Steps for Text
Mining
Pre-Processing the Text
Applying Text Mining Techniques
 Information Extraction
 Information Retrieval
 Categorization
 Clustering
 Summarization
Analysing the Text
Application of Text
Mining
Analysis of Market trends
 Classification Technique
 Information Extraction Technique
Analysis and Screening of Junk Emails
 Classification based on pre-defined frequently
occurring items.
Merits of Text
Mining
 Extraction of relevant information and
relationships from natural documents
 Extraction of Information from Unstructured or
Semi Structured Documents
 Database limits itself to storage of less
information whereas Text Mining overcomes this
Limitation.
Demerits of Text
Mining
× Requires Initial Learned Information System
for Initial Extraction
× Suitable programs are not been defined to
Analyze Text from Mining Knowledge or
Information
× High potential to gather garbled or false
results
THANK YOU

More Related Content

Similar to ML12_12500119160.pptx

UI introduction_to_data_mining YA.ppt
UI introduction_to_data_mining YA.pptUI introduction_to_data_mining YA.ppt
UI introduction_to_data_mining YA.pptmirbella
 
2019 DSA 105 Introduction to Data Science Week 5
2019 DSA 105 Introduction to Data Science Week 52019 DSA 105 Introduction to Data Science Week 5
2019 DSA 105 Introduction to Data Science Week 5Ferdin Joe John Joseph PhD
 
Web search-metrics-tutorial-www2010-section-1of7-introduction
Web search-metrics-tutorial-www2010-section-1of7-introductionWeb search-metrics-tutorial-www2010-section-1of7-introduction
Web search-metrics-tutorial-www2010-section-1of7-introductionAli Dasdan
 
Maintenance for university course out line.docx
Maintenance for university  course out line.docxMaintenance for university  course out line.docx
Maintenance for university course out line.docxamare lakew
 
SE(Computer Based Information Systems (CBIS)).pptx
SE(Computer Based Information Systems (CBIS)).pptxSE(Computer Based Information Systems (CBIS)).pptx
SE(Computer Based Information Systems (CBIS)).pptxAmitSingh770691
 
Data mining - Process, Techniques and Research Topics
Data mining - Process, Techniques and Research TopicsData mining - Process, Techniques and Research Topics
Data mining - Process, Techniques and Research TopicsTechsparks
 
Database management system
Database management systemDatabase management system
Database management systemAnujRana43
 
Behind the scenes of data science
Behind the scenes of data scienceBehind the scenes of data science
Behind the scenes of data scienceLoïc Lejoly
 
Data Mining and Data Warehousing
Data Mining and Data WarehousingData Mining and Data Warehousing
Data Mining and Data WarehousingAmdocs
 

Similar to ML12_12500119160.pptx (20)

Compilerpt
CompilerptCompilerpt
Compilerpt
 
UI introduction_to_data_mining YA.ppt
UI introduction_to_data_mining YA.pptUI introduction_to_data_mining YA.ppt
UI introduction_to_data_mining YA.ppt
 
2019 DSA 105 Introduction to Data Science Week 5
2019 DSA 105 Introduction to Data Science Week 52019 DSA 105 Introduction to Data Science Week 5
2019 DSA 105 Introduction to Data Science Week 5
 
Web search-metrics-tutorial-www2010-section-1of7-introduction
Web search-metrics-tutorial-www2010-section-1of7-introductionWeb search-metrics-tutorial-www2010-section-1of7-introduction
Web search-metrics-tutorial-www2010-section-1of7-introduction
 
23.database
23.database23.database
23.database
 
Maintenance for university course out line.docx
Maintenance for university  course out line.docxMaintenance for university  course out line.docx
Maintenance for university course out line.docx
 
SE(Computer Based Information Systems (CBIS)).pptx
SE(Computer Based Information Systems (CBIS)).pptxSE(Computer Based Information Systems (CBIS)).pptx
SE(Computer Based Information Systems (CBIS)).pptx
 
Data mining - Process, Techniques and Research Topics
Data mining - Process, Techniques and Research TopicsData mining - Process, Techniques and Research Topics
Data mining - Process, Techniques and Research Topics
 
Ch_2.pdf
Ch_2.pdfCh_2.pdf
Ch_2.pdf
 
Bi 5
Bi 5Bi 5
Bi 5
 
NCCT.pptx
NCCT.pptxNCCT.pptx
NCCT.pptx
 
Data mining
Data miningData mining
Data mining
 
Digital data
Digital dataDigital data
Digital data
 
Digital Types
Digital TypesDigital Types
Digital Types
 
DOWLD SLIDES.pptx
DOWLD SLIDES.pptxDOWLD SLIDES.pptx
DOWLD SLIDES.pptx
 
Data mining
Data mining Data mining
Data mining
 
Introduction to dbms.
Introduction to dbms.Introduction to dbms.
Introduction to dbms.
 
Database management system
Database management systemDatabase management system
Database management system
 
Behind the scenes of data science
Behind the scenes of data scienceBehind the scenes of data science
Behind the scenes of data science
 
Data Mining and Data Warehousing
Data Mining and Data WarehousingData Mining and Data Warehousing
Data Mining and Data Warehousing
 

Recently uploaded

S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptxS1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptxSCMS School of Architecture
 
5G and 6G refer to generations of mobile network technology, each representin...
5G and 6G refer to generations of mobile network technology, each representin...5G and 6G refer to generations of mobile network technology, each representin...
5G and 6G refer to generations of mobile network technology, each representin...archanaece3
 
Augmented Reality (AR) with Augin Software.pptx
Augmented Reality (AR) with Augin Software.pptxAugmented Reality (AR) with Augin Software.pptx
Augmented Reality (AR) with Augin Software.pptxMustafa Ahmed
 
Introduction to Robotics in Mechanical Engineering.pptx
Introduction to Robotics in Mechanical Engineering.pptxIntroduction to Robotics in Mechanical Engineering.pptx
Introduction to Robotics in Mechanical Engineering.pptxhublikarsn
 
Max. shear stress theory-Maximum Shear Stress Theory ​ Maximum Distortional ...
Max. shear stress theory-Maximum Shear Stress Theory ​  Maximum Distortional ...Max. shear stress theory-Maximum Shear Stress Theory ​  Maximum Distortional ...
Max. shear stress theory-Maximum Shear Stress Theory ​ Maximum Distortional ...ronahami
 
Working Principle of Echo Sounder and Doppler Effect.pdf
Working Principle of Echo Sounder and Doppler Effect.pdfWorking Principle of Echo Sounder and Doppler Effect.pdf
Working Principle of Echo Sounder and Doppler Effect.pdfSkNahidulIslamShrabo
 
Danikor Product Catalog- Screw Feeder.pdf
Danikor Product Catalog- Screw Feeder.pdfDanikor Product Catalog- Screw Feeder.pdf
Danikor Product Catalog- Screw Feeder.pdfthietkevietthinh
 
Presentation on Slab, Beam, Column, and Foundation/Footing
Presentation on Slab,  Beam, Column, and Foundation/FootingPresentation on Slab,  Beam, Column, and Foundation/Footing
Presentation on Slab, Beam, Column, and Foundation/FootingEr. Suman Jyoti
 
Dynamo Scripts for Task IDs and Space Naming.pptx
Dynamo Scripts for Task IDs and Space Naming.pptxDynamo Scripts for Task IDs and Space Naming.pptx
Dynamo Scripts for Task IDs and Space Naming.pptxMustafa Ahmed
 
Convergence of Robotics and Gen AI offers excellent opportunities for Entrepr...
Convergence of Robotics and Gen AI offers excellent opportunities for Entrepr...Convergence of Robotics and Gen AI offers excellent opportunities for Entrepr...
Convergence of Robotics and Gen AI offers excellent opportunities for Entrepr...ssuserdfc773
 
Path loss model, OKUMURA Model, Hata Model
Path loss model, OKUMURA Model, Hata ModelPath loss model, OKUMURA Model, Hata Model
Path loss model, OKUMURA Model, Hata ModelDrAjayKumarYadav4
 
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...HenryBriggs2
 
Passive Air Cooling System and Solar Water Heater.ppt
Passive Air Cooling System and Solar Water Heater.pptPassive Air Cooling System and Solar Water Heater.ppt
Passive Air Cooling System and Solar Water Heater.pptamrabdallah9
 
TMU_GDSC_20240509.pdfTMU_GDSC_20240509.pdf
TMU_GDSC_20240509.pdfTMU_GDSC_20240509.pdfTMU_GDSC_20240509.pdfTMU_GDSC_20240509.pdf
TMU_GDSC_20240509.pdfTMU_GDSC_20240509.pdfssuserded2d4
 
Call for Papers - Journal of Electrical Systems (JES), E-ISSN: 1112-5209, ind...
Call for Papers - Journal of Electrical Systems (JES), E-ISSN: 1112-5209, ind...Call for Papers - Journal of Electrical Systems (JES), E-ISSN: 1112-5209, ind...
Call for Papers - Journal of Electrical Systems (JES), E-ISSN: 1112-5209, ind...Christo Ananth
 
Computer Graphics Introduction To Curves
Computer Graphics Introduction To CurvesComputer Graphics Introduction To Curves
Computer Graphics Introduction To CurvesChandrakantDivate1
 
Ground Improvement Technique: Earth Reinforcement
Ground Improvement Technique: Earth ReinforcementGround Improvement Technique: Earth Reinforcement
Ground Improvement Technique: Earth ReinforcementDr. Deepak Mudgal
 
Introduction to Artificial Intelligence ( AI)
Introduction to Artificial Intelligence ( AI)Introduction to Artificial Intelligence ( AI)
Introduction to Artificial Intelligence ( AI)ChandrakantDivate1
 
Theory of Time 2024 (Universal Theory for Everything)
Theory of Time 2024 (Universal Theory for Everything)Theory of Time 2024 (Universal Theory for Everything)
Theory of Time 2024 (Universal Theory for Everything)Ramkumar k
 
PE 459 LECTURE 2- natural gas basic concepts and properties
PE 459 LECTURE 2- natural gas basic concepts and propertiesPE 459 LECTURE 2- natural gas basic concepts and properties
PE 459 LECTURE 2- natural gas basic concepts and propertiessarkmank1
 

Recently uploaded (20)

S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptxS1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
 
5G and 6G refer to generations of mobile network technology, each representin...
5G and 6G refer to generations of mobile network technology, each representin...5G and 6G refer to generations of mobile network technology, each representin...
5G and 6G refer to generations of mobile network technology, each representin...
 
Augmented Reality (AR) with Augin Software.pptx
Augmented Reality (AR) with Augin Software.pptxAugmented Reality (AR) with Augin Software.pptx
Augmented Reality (AR) with Augin Software.pptx
 
Introduction to Robotics in Mechanical Engineering.pptx
Introduction to Robotics in Mechanical Engineering.pptxIntroduction to Robotics in Mechanical Engineering.pptx
Introduction to Robotics in Mechanical Engineering.pptx
 
Max. shear stress theory-Maximum Shear Stress Theory ​ Maximum Distortional ...
Max. shear stress theory-Maximum Shear Stress Theory ​  Maximum Distortional ...Max. shear stress theory-Maximum Shear Stress Theory ​  Maximum Distortional ...
Max. shear stress theory-Maximum Shear Stress Theory ​ Maximum Distortional ...
 
Working Principle of Echo Sounder and Doppler Effect.pdf
Working Principle of Echo Sounder and Doppler Effect.pdfWorking Principle of Echo Sounder and Doppler Effect.pdf
Working Principle of Echo Sounder and Doppler Effect.pdf
 
Danikor Product Catalog- Screw Feeder.pdf
Danikor Product Catalog- Screw Feeder.pdfDanikor Product Catalog- Screw Feeder.pdf
Danikor Product Catalog- Screw Feeder.pdf
 
Presentation on Slab, Beam, Column, and Foundation/Footing
Presentation on Slab,  Beam, Column, and Foundation/FootingPresentation on Slab,  Beam, Column, and Foundation/Footing
Presentation on Slab, Beam, Column, and Foundation/Footing
 
Dynamo Scripts for Task IDs and Space Naming.pptx
Dynamo Scripts for Task IDs and Space Naming.pptxDynamo Scripts for Task IDs and Space Naming.pptx
Dynamo Scripts for Task IDs and Space Naming.pptx
 
Convergence of Robotics and Gen AI offers excellent opportunities for Entrepr...
Convergence of Robotics and Gen AI offers excellent opportunities for Entrepr...Convergence of Robotics and Gen AI offers excellent opportunities for Entrepr...
Convergence of Robotics and Gen AI offers excellent opportunities for Entrepr...
 
Path loss model, OKUMURA Model, Hata Model
Path loss model, OKUMURA Model, Hata ModelPath loss model, OKUMURA Model, Hata Model
Path loss model, OKUMURA Model, Hata Model
 
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...
 
Passive Air Cooling System and Solar Water Heater.ppt
Passive Air Cooling System and Solar Water Heater.pptPassive Air Cooling System and Solar Water Heater.ppt
Passive Air Cooling System and Solar Water Heater.ppt
 
TMU_GDSC_20240509.pdfTMU_GDSC_20240509.pdf
TMU_GDSC_20240509.pdfTMU_GDSC_20240509.pdfTMU_GDSC_20240509.pdfTMU_GDSC_20240509.pdf
TMU_GDSC_20240509.pdfTMU_GDSC_20240509.pdf
 
Call for Papers - Journal of Electrical Systems (JES), E-ISSN: 1112-5209, ind...
Call for Papers - Journal of Electrical Systems (JES), E-ISSN: 1112-5209, ind...Call for Papers - Journal of Electrical Systems (JES), E-ISSN: 1112-5209, ind...
Call for Papers - Journal of Electrical Systems (JES), E-ISSN: 1112-5209, ind...
 
Computer Graphics Introduction To Curves
Computer Graphics Introduction To CurvesComputer Graphics Introduction To Curves
Computer Graphics Introduction To Curves
 
Ground Improvement Technique: Earth Reinforcement
Ground Improvement Technique: Earth ReinforcementGround Improvement Technique: Earth Reinforcement
Ground Improvement Technique: Earth Reinforcement
 
Introduction to Artificial Intelligence ( AI)
Introduction to Artificial Intelligence ( AI)Introduction to Artificial Intelligence ( AI)
Introduction to Artificial Intelligence ( AI)
 
Theory of Time 2024 (Universal Theory for Everything)
Theory of Time 2024 (Universal Theory for Everything)Theory of Time 2024 (Universal Theory for Everything)
Theory of Time 2024 (Universal Theory for Everything)
 
PE 459 LECTURE 2- natural gas basic concepts and properties
PE 459 LECTURE 2- natural gas basic concepts and propertiesPE 459 LECTURE 2- natural gas basic concepts and properties
PE 459 LECTURE 2- natural gas basic concepts and properties
 

ML12_12500119160.pptx

  • 1. TEXT MINING NAME:- AKASH KUMAR SINHA CLASS:- CSE-A College Roll:- 192015028 [15] Univ. Roll:- 12500119160
  • 2. Contents • Introduction • Data Mining Vs. Text Mining • I/O model for Text Mining • Steps for Text Mining • Application of Text Mining • Demerits of Text Mining • Refrences
  • 3. Introduction • Text mining is a Discovery • It is also known as Text Data Mining(TDM) • It is used to extract relevant information or knowledge from different sources which are unstructured or semi-structured.
  • 4. Data Mining vs Text Mining Data Mining Text Mining Process to extract information from datasets It’s a part of Data Mining which processes the text only. Databases are used to gather data Text is used to gather data. Data is Homogeneous Data is Heterogeneous Data is easy to Retrieve Data is not easy to Retrieve Data is stored in Structured Format Data is stored in Un-structed Format
  • 5. I/O MODEL FOR TEXT MINING Text Mining Technique Pattern Connections Trends OUTPUT INPUT TEXT DOCUMENT
  • 6. Steps for Text Mining Pre-Processing the Text Applying Text Mining Techniques  Information Extraction  Information Retrieval  Categorization  Clustering  Summarization Analysing the Text
  • 7. Application of Text Mining Analysis of Market trends  Classification Technique  Information Extraction Technique Analysis and Screening of Junk Emails  Classification based on pre-defined frequently occurring items.
  • 8. Merits of Text Mining  Extraction of relevant information and relationships from natural documents  Extraction of Information from Unstructured or Semi Structured Documents  Database limits itself to storage of less information whereas Text Mining overcomes this Limitation.
  • 9. Demerits of Text Mining × Requires Initial Learned Information System for Initial Extraction × Suitable programs are not been defined to Analyze Text from Mining Knowledge or Information × High potential to gather garbled or false results