SlideShare a Scribd company logo
The next step in
Search Technology
OUTLINE
What is Text Mining?
What is unstructured text
Need for Text Mining?
Text Mining sub tasks
 Applications of text mining
 Barriers
 Today of text mining
 Tomorrow of text mining
Data Mining
Goals of Data Mining
What can Data mining do?
Text Mining v/s.
Data Mining
Web Mining
Information Retrieval
Text retrieval
Information is retrieved so as to fulfill the needs of
customers.
Does not discover anything new about the query.
IRS find the result from a large database by matching the
query.
E.g.: the search engines, which identify the relevant
documents according to a given set of words on www.
IE is the process of automatically extracting structured
data from unstructured machine readable codes.
It highly relies on Natural Language Processing systems.
Natural Language Processing
It converts samples of human language into formal
representation which can be understood by the computer.
Its types are:
Natural Language Generation System
Natural Language Understanding System
Information extraction
Spam filtering
• A spam filter is a program that is used to detect
unwanted email and prevent those
messages from getting to a user's inbox.
Sophisticated program, such as Bayesian filters ,
attempt to identify spam through suspicious word
patterns or word frequency .
• Bayesian spam filtering :It identifies spam e-mail through
suspicious word patterns or word frequency.
Applications of Text Mining
Creating suggestion and recommendations
• Text mining helps customers in providing suggestions for online stores such as
amazon, based on their interests. The prediction algorithms are of huge
importance to online stores -the more accurate they are, the more the online store
will sell.
• A large online store like Amazon may have millions of customers and millions
of items in stock. New customers will have limited information about their
preferences, while more established customers may have too much.
• The data on which these algorithms work is constantly updated and changed.
Customers are browsing the site and the prediction algorithm should take the
recently browsed items into consideration.
• Traditionally, these recommendation algorithms have worked by finding similar
customers in the database.
Barriers that we need to overcome to
make best use of text mining tools in the
future:
1) Text mining is a complex technical
process that requires skilled staff.
2) It requires unrestricted access to
information sources.
3) Copyright can be a barrier.
• Text mining is already producing efficiencies and new knowledge in areas as
diverse as biological science, particle physics, media and communications. It has
been used to hypothesise the causes of rare diseases and how pre-existing drugs
could be used to target different diseases.
• The technique was also used recently to analyse the vast amount of text
produced on websites, blogs and social media such as Twitter - where copyright
holders allowed - and showed that the messages exchanged on Twitter during
the English riots of 2011 were not to blame for inciting riots.
• The business benefit of text mining is in identifying emerging trends, and to
explore consumer preferences and competitor developments. Text mining is
particularly used in larger companies as part of their customer relationship
management strategy and in the pharmaceutical industry as part of their research
and development strategy.
Today of Text Mining
Text mining has been garnering a significant amount of
importance in recent years, creating a strong industrial
impact. Based on this observation, it is evident that the future
of text mining companies would be promising in the coming
years. The age of innovation for this is not over.
It is, therefore, unmistakable that in the years to come many
new doors and exciting opportunities will open up through
the advanced text mining services offered by various
professional text mining companies
DATA MINING
It is the process of discovering interesting knowledge, such as
patterns, associations, changes, anomalies and significant
structures from large amount of data stored in databases, data
warehouses or other information repositories.
Why Data mining?
Due to wide availability of huge amounts of data in electronic forms and the
imminent need to turn such data into useful information and knowledge for
broad applications including business management, decision report, market
analysis and decision report data mining has attracted a great deal of attention
in information industry in recent years.
 Prediction: how certain attributes within
the data will behave in the future.
 Identification : identify the existence of
an item, an event, an activity.
 Classification: partition the data into
categories.
 Optimization: optimize the use of limited
resources.
Goals of Data Mining
Application of Data Mining
Marketing:
 analysis of human behavior.
 advertising campaigns.
 targeted mailings
 segmentation of customers, stores or
products.
Finance:
 creditworthiness of clients.
 performance analysis of finance
investments.
 fraud detection
Manufacturing:
 optimization of resources.
 optimization of manufacturing processes.
 product design based on customer
requirements.
Healthcare:
 discovering patterns in X-ray images.
 analyzing the side effects of drugs.
 analyzing the effectiveness of treatments
Continued
References
1) http://en.wikipedia.org/wiki/Text_mining
2) http://www.cs.waikato.ac.nz/~ihw/papers/04-IHW-
Textmining.pdf
3)http://comminfo.rutgers.edu/~msharp/text_mining.htm
4)http://www.cs.sunysb.edu/~cse634/presentations/TextMining
.pdf
5)http://www.mpi-inf.mpg.de/yago-naga/yago/demo.html
6)http://searchbusinessanalytics.techtarget.com/definition/tex
t-mining
By:
Bhawana

More Related Content

What's hot

Introduction to Text Mining
Introduction to Text MiningIntroduction to Text Mining
Introduction to Text Mining
Minha Hwang
 
Introduction to Text Mining and Semantics
Introduction to Text Mining and SemanticsIntroduction to Text Mining and Semantics
Introduction to Text Mining and Semantics
Seth Grimes
 
Data Mining: Text and web mining
Data Mining: Text and web miningData Mining: Text and web mining
Data Mining: Text and web mining
DataminingTools Inc
 
Text mining presentation in Data mining Area
Text mining presentation in Data mining AreaText mining presentation in Data mining Area
Text mining presentation in Data mining Area
MahamudHasanCSE
 
4.4 text mining
4.4 text mining4.4 text mining
4.4 text mining
Krish_ver2
 
Textmining Information Extraction
Textmining Information ExtractionTextmining Information Extraction
Textmining Information Extraction
guest0edcaf
 
Role of Text Mining in Search Engine
Role of Text Mining in Search EngineRole of Text Mining in Search Engine
Role of Text Mining in Search Engine
Jay R Modi
 
Text mining
Text miningText mining
Text mining
Pankaj Thakur
 
SA2: Text Mining from User Generated Content
SA2: Text Mining from User Generated ContentSA2: Text Mining from User Generated Content
SA2: Text Mining from User Generated Content
John Breslin
 
Text Mining Framework
Text Mining FrameworkText Mining Framework
Text Mining Framework
Prakhyath Rai
 
Week12
Week12Week12
Week12
Esha Meher
 
3. introduction to text mining
3. introduction to text mining3. introduction to text mining
3. introduction to text miningLokesh Ramaswamy
 
Web_Mining_Overview_Nfaoui_El_Habib
Web_Mining_Overview_Nfaoui_El_HabibWeb_Mining_Overview_Nfaoui_El_Habib
Web_Mining_Overview_Nfaoui_El_Habib
El Habib NFAOUI
 
Conceptual foundations of text mining and preprocessing steps nfaoui el_habib
Conceptual foundations of text mining and preprocessing steps nfaoui el_habibConceptual foundations of text mining and preprocessing steps nfaoui el_habib
Conceptual foundations of text mining and preprocessing steps nfaoui el_habib
El Habib NFAOUI
 
Tovek Presentation by Livio Costantini
Tovek Presentation by Livio CostantiniTovek Presentation by Livio Costantini
Tovek Presentation by Livio Costantinimaxfalc
 
Model of information retrieval (3)
Model  of information retrieval (3)Model  of information retrieval (3)
Model of information retrieval (3)9866825059
 
Information_Retrieval_Models_Nfaoui_El_Habib
Information_Retrieval_Models_Nfaoui_El_HabibInformation_Retrieval_Models_Nfaoui_El_Habib
Information_Retrieval_Models_Nfaoui_El_Habib
El Habib NFAOUI
 
Ir 01
Ir   01Ir   01
CS6007 information retrieval - 5 units notes
CS6007   information retrieval - 5 units notesCS6007   information retrieval - 5 units notes
CS6007 information retrieval - 5 units notes
Anandh Arumugakan
 
Introduction to Information Retrieval
Introduction to Information RetrievalIntroduction to Information Retrieval
Introduction to Information Retrieval
Roi Blanco
 

What's hot (20)

Introduction to Text Mining
Introduction to Text MiningIntroduction to Text Mining
Introduction to Text Mining
 
Introduction to Text Mining and Semantics
Introduction to Text Mining and SemanticsIntroduction to Text Mining and Semantics
Introduction to Text Mining and Semantics
 
Data Mining: Text and web mining
Data Mining: Text and web miningData Mining: Text and web mining
Data Mining: Text and web mining
 
Text mining presentation in Data mining Area
Text mining presentation in Data mining AreaText mining presentation in Data mining Area
Text mining presentation in Data mining Area
 
4.4 text mining
4.4 text mining4.4 text mining
4.4 text mining
 
Textmining Information Extraction
Textmining Information ExtractionTextmining Information Extraction
Textmining Information Extraction
 
Role of Text Mining in Search Engine
Role of Text Mining in Search EngineRole of Text Mining in Search Engine
Role of Text Mining in Search Engine
 
Text mining
Text miningText mining
Text mining
 
SA2: Text Mining from User Generated Content
SA2: Text Mining from User Generated ContentSA2: Text Mining from User Generated Content
SA2: Text Mining from User Generated Content
 
Text Mining Framework
Text Mining FrameworkText Mining Framework
Text Mining Framework
 
Week12
Week12Week12
Week12
 
3. introduction to text mining
3. introduction to text mining3. introduction to text mining
3. introduction to text mining
 
Web_Mining_Overview_Nfaoui_El_Habib
Web_Mining_Overview_Nfaoui_El_HabibWeb_Mining_Overview_Nfaoui_El_Habib
Web_Mining_Overview_Nfaoui_El_Habib
 
Conceptual foundations of text mining and preprocessing steps nfaoui el_habib
Conceptual foundations of text mining and preprocessing steps nfaoui el_habibConceptual foundations of text mining and preprocessing steps nfaoui el_habib
Conceptual foundations of text mining and preprocessing steps nfaoui el_habib
 
Tovek Presentation by Livio Costantini
Tovek Presentation by Livio CostantiniTovek Presentation by Livio Costantini
Tovek Presentation by Livio Costantini
 
Model of information retrieval (3)
Model  of information retrieval (3)Model  of information retrieval (3)
Model of information retrieval (3)
 
Information_Retrieval_Models_Nfaoui_El_Habib
Information_Retrieval_Models_Nfaoui_El_HabibInformation_Retrieval_Models_Nfaoui_El_Habib
Information_Retrieval_Models_Nfaoui_El_Habib
 
Ir 01
Ir   01Ir   01
Ir 01
 
CS6007 information retrieval - 5 units notes
CS6007   information retrieval - 5 units notesCS6007   information retrieval - 5 units notes
CS6007 information retrieval - 5 units notes
 
Introduction to Information Retrieval
Introduction to Information RetrievalIntroduction to Information Retrieval
Introduction to Information Retrieval
 

Viewers also liked

Textometry and Information Discovery : A New Approach to Mining Textual Data ...
Textometry and Information Discovery : A New Approach to Mining Textual Data ...Textometry and Information Discovery : A New Approach to Mining Textual Data ...
Textometry and Information Discovery : A New Approach to Mining Textual Data ...
Marguerite Leenhardt
 
Tdm information retrieval
Tdm information retrievalTdm information retrieval
Tdm information retrievalKU Leuven
 
Data Mining and Text Mining in Educational Research
Data Mining and Text Mining in Educational ResearchData Mining and Text Mining in Educational Research
Data Mining and Text Mining in Educational Research
Qiang Hao
 
Textmining Introduction
Textmining IntroductionTextmining Introduction
Textmining Introduction
Datamining Tools
 
3. introduction to text mining
3. introduction to text mining3. introduction to text mining
3. introduction to text mining
Lokesh Ramaswamy
 
Machine Learning and Data Mining: 19 Mining Text And Web Data
Machine Learning and Data Mining: 19 Mining Text And Web DataMachine Learning and Data Mining: 19 Mining Text And Web Data
Machine Learning and Data Mining: 19 Mining Text And Web Data
Pier Luca Lanzi
 

Viewers also liked (7)

Textometry and Information Discovery : A New Approach to Mining Textual Data ...
Textometry and Information Discovery : A New Approach to Mining Textual Data ...Textometry and Information Discovery : A New Approach to Mining Textual Data ...
Textometry and Information Discovery : A New Approach to Mining Textual Data ...
 
Tdm information retrieval
Tdm information retrievalTdm information retrieval
Tdm information retrieval
 
Data Mining and Text Mining in Educational Research
Data Mining and Text Mining in Educational ResearchData Mining and Text Mining in Educational Research
Data Mining and Text Mining in Educational Research
 
Textmining Introduction
Textmining IntroductionTextmining Introduction
Textmining Introduction
 
Types of interviews
Types of interviewsTypes of interviews
Types of interviews
 
3. introduction to text mining
3. introduction to text mining3. introduction to text mining
3. introduction to text mining
 
Machine Learning and Data Mining: 19 Mining Text And Web Data
Machine Learning and Data Mining: 19 Mining Text And Web DataMachine Learning and Data Mining: 19 Mining Text And Web Data
Machine Learning and Data Mining: 19 Mining Text And Web Data
 

Similar to Text mining and data mining

Web Mining
Web MiningWeb Mining
Web Mining
Shobha Rani
 
A Survey on Data Mining
A Survey on Data MiningA Survey on Data Mining
A Survey on Data Mining
IOSR Journals
 
Data Mining & Applications
Data Mining & ApplicationsData Mining & Applications
Data Mining & Applications
Fazle Rabbi Ador
 
DATA, TEXT, AND WEB MINING FOR BUSINESS INTELLIGENCE: A SURVEY
DATA, TEXT, AND WEB MINING FOR BUSINESS INTELLIGENCE: A SURVEYDATA, TEXT, AND WEB MINING FOR BUSINESS INTELLIGENCE: A SURVEY
DATA, TEXT, AND WEB MINING FOR BUSINESS INTELLIGENCE: A SURVEY
ijdkp
 
An introduction to Data Mining
An introduction to Data MiningAn introduction to Data Mining
An introduction to Data Mining
Shobhita Dayal
 
Unit 1 (DSBDA) PD.pptx
Unit 1 (DSBDA)  PD.pptxUnit 1 (DSBDA)  PD.pptx
Unit 1 (DSBDA) PD.pptx
Samiksha880257
 
An introduction to Data Mining by Kurt Thearling
An introduction to Data Mining by Kurt ThearlingAn introduction to Data Mining by Kurt Thearling
An introduction to Data Mining by Kurt Thearling
Pim Piepers
 
Big Data & DS Analytics for PAARL
Big Data & DS Analytics for PAARLBig Data & DS Analytics for PAARL
Content analytics
Content analyticsContent analytics
Content analytics
Mayank Tyagi
 
Fundamentals of data mining and its applications
Fundamentals of data mining and its applicationsFundamentals of data mining and its applications
Fundamentals of data mining and its applicationsSubrat Swain
 
Odam an optimized distributed association rule mining algorithm (synopsis)
Odam an optimized distributed association rule mining algorithm (synopsis)Odam an optimized distributed association rule mining algorithm (synopsis)
Odam an optimized distributed association rule mining algorithm (synopsis)Mumbai Academisc
 
Real World Application of Big Data In Data Mining Tools
Real World Application of Big Data In Data Mining ToolsReal World Application of Big Data In Data Mining Tools
Real World Application of Big Data In Data Mining Tools
ijsrd.com
 
Big data
Big dataBig data
Big data
Abhishek Palo
 
Big data
Big dataBig data
Big data
Abhishek Palo
 
QuickView #3 - Big Data
QuickView #3 - Big DataQuickView #3 - Big Data
QuickView #3 - Big Data
Sonovate
 
Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate
Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate
Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate
Oomph! Recruitment
 
Big data
Big dataBig data
Secondary Research in Applied Marketing Research
Secondary Research in Applied Marketing ResearchSecondary Research in Applied Marketing Research
Secondary Research in Applied Marketing Research
Kelly Page
 

Similar to Text mining and data mining (20)

Web Mining
Web MiningWeb Mining
Web Mining
 
A Survey on Data Mining
A Survey on Data MiningA Survey on Data Mining
A Survey on Data Mining
 
Data Mining & Applications
Data Mining & ApplicationsData Mining & Applications
Data Mining & Applications
 
DATA, TEXT, AND WEB MINING FOR BUSINESS INTELLIGENCE: A SURVEY
DATA, TEXT, AND WEB MINING FOR BUSINESS INTELLIGENCE: A SURVEYDATA, TEXT, AND WEB MINING FOR BUSINESS INTELLIGENCE: A SURVEY
DATA, TEXT, AND WEB MINING FOR BUSINESS INTELLIGENCE: A SURVEY
 
An introduction to Data Mining
An introduction to Data MiningAn introduction to Data Mining
An introduction to Data Mining
 
Unit 1 (DSBDA) PD.pptx
Unit 1 (DSBDA)  PD.pptxUnit 1 (DSBDA)  PD.pptx
Unit 1 (DSBDA) PD.pptx
 
An introduction to Data Mining by Kurt Thearling
An introduction to Data Mining by Kurt ThearlingAn introduction to Data Mining by Kurt Thearling
An introduction to Data Mining by Kurt Thearling
 
Big Data & DS Analytics for PAARL
Big Data & DS Analytics for PAARLBig Data & DS Analytics for PAARL
Big Data & DS Analytics for PAARL
 
Content analytics
Content analyticsContent analytics
Content analytics
 
Fundamentals of data mining and its applications
Fundamentals of data mining and its applicationsFundamentals of data mining and its applications
Fundamentals of data mining and its applications
 
Odam an optimized distributed association rule mining algorithm (synopsis)
Odam an optimized distributed association rule mining algorithm (synopsis)Odam an optimized distributed association rule mining algorithm (synopsis)
Odam an optimized distributed association rule mining algorithm (synopsis)
 
Real World Application of Big Data In Data Mining Tools
Real World Application of Big Data In Data Mining ToolsReal World Application of Big Data In Data Mining Tools
Real World Application of Big Data In Data Mining Tools
 
Big data
Big dataBig data
Big data
 
Big data
Big dataBig data
Big data
 
Abstract
AbstractAbstract
Abstract
 
QuickView #3 - Big Data
QuickView #3 - Big DataQuickView #3 - Big Data
QuickView #3 - Big Data
 
Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate
Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate
Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate
 
An introduction to data mining
An introduction to data miningAn introduction to data mining
An introduction to data mining
 
Big data
Big dataBig data
Big data
 
Secondary Research in Applied Marketing Research
Secondary Research in Applied Marketing ResearchSecondary Research in Applied Marketing Research
Secondary Research in Applied Marketing Research
 

Recently uploaded

Generating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using SmithyGenerating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using Smithy
g2nightmarescribd
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
Product School
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
Thijs Feryn
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
Dorra BARTAGUIZ
 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........
Alison B. Lowndes
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Inflectra
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
Product School
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Tobias Schneck
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
KatiaHIMEUR1
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
DianaGray10
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
BookNet Canada
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
DanBrown980551
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
UiPathCommunity
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Ramesh Iyer
 
Connector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a buttonConnector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a button
DianaGray10
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
Elena Simperl
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
Elena Simperl
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
DianaGray10
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
Ana-Maria Mihalceanu
 
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Jeffrey Haguewood
 

Recently uploaded (20)

Generating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using SmithyGenerating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using Smithy
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
 
Connector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a buttonConnector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a button
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
 
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
 

Text mining and data mining

  • 1. The next step in Search Technology
  • 2. OUTLINE What is Text Mining? What is unstructured text Need for Text Mining? Text Mining sub tasks  Applications of text mining  Barriers  Today of text mining  Tomorrow of text mining Data Mining Goals of Data Mining What can Data mining do?
  • 3.
  • 4. Text Mining v/s. Data Mining Web Mining Information Retrieval
  • 5.
  • 6. Text retrieval Information is retrieved so as to fulfill the needs of customers. Does not discover anything new about the query. IRS find the result from a large database by matching the query. E.g.: the search engines, which identify the relevant documents according to a given set of words on www.
  • 7. IE is the process of automatically extracting structured data from unstructured machine readable codes. It highly relies on Natural Language Processing systems. Natural Language Processing It converts samples of human language into formal representation which can be understood by the computer. Its types are: Natural Language Generation System Natural Language Understanding System Information extraction
  • 8. Spam filtering • A spam filter is a program that is used to detect unwanted email and prevent those messages from getting to a user's inbox. Sophisticated program, such as Bayesian filters , attempt to identify spam through suspicious word patterns or word frequency . • Bayesian spam filtering :It identifies spam e-mail through suspicious word patterns or word frequency. Applications of Text Mining
  • 9. Creating suggestion and recommendations • Text mining helps customers in providing suggestions for online stores such as amazon, based on their interests. The prediction algorithms are of huge importance to online stores -the more accurate they are, the more the online store will sell. • A large online store like Amazon may have millions of customers and millions of items in stock. New customers will have limited information about their preferences, while more established customers may have too much. • The data on which these algorithms work is constantly updated and changed. Customers are browsing the site and the prediction algorithm should take the recently browsed items into consideration. • Traditionally, these recommendation algorithms have worked by finding similar customers in the database.
  • 10. Barriers that we need to overcome to make best use of text mining tools in the future: 1) Text mining is a complex technical process that requires skilled staff. 2) It requires unrestricted access to information sources. 3) Copyright can be a barrier.
  • 11. • Text mining is already producing efficiencies and new knowledge in areas as diverse as biological science, particle physics, media and communications. It has been used to hypothesise the causes of rare diseases and how pre-existing drugs could be used to target different diseases. • The technique was also used recently to analyse the vast amount of text produced on websites, blogs and social media such as Twitter - where copyright holders allowed - and showed that the messages exchanged on Twitter during the English riots of 2011 were not to blame for inciting riots. • The business benefit of text mining is in identifying emerging trends, and to explore consumer preferences and competitor developments. Text mining is particularly used in larger companies as part of their customer relationship management strategy and in the pharmaceutical industry as part of their research and development strategy. Today of Text Mining
  • 12. Text mining has been garnering a significant amount of importance in recent years, creating a strong industrial impact. Based on this observation, it is evident that the future of text mining companies would be promising in the coming years. The age of innovation for this is not over. It is, therefore, unmistakable that in the years to come many new doors and exciting opportunities will open up through the advanced text mining services offered by various professional text mining companies
  • 13. DATA MINING It is the process of discovering interesting knowledge, such as patterns, associations, changes, anomalies and significant structures from large amount of data stored in databases, data warehouses or other information repositories. Why Data mining? Due to wide availability of huge amounts of data in electronic forms and the imminent need to turn such data into useful information and knowledge for broad applications including business management, decision report, market analysis and decision report data mining has attracted a great deal of attention in information industry in recent years.
  • 14.  Prediction: how certain attributes within the data will behave in the future.  Identification : identify the existence of an item, an event, an activity.  Classification: partition the data into categories.  Optimization: optimize the use of limited resources. Goals of Data Mining
  • 15. Application of Data Mining Marketing:  analysis of human behavior.  advertising campaigns.  targeted mailings  segmentation of customers, stores or products. Finance:  creditworthiness of clients.  performance analysis of finance investments.  fraud detection
  • 16. Manufacturing:  optimization of resources.  optimization of manufacturing processes.  product design based on customer requirements. Healthcare:  discovering patterns in X-ray images.  analyzing the side effects of drugs.  analyzing the effectiveness of treatments Continued