SlideShare a Scribd company logo
1/32
About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM
Information retrieval (IR) and Data mining (DM)
By: Dr. LOUNNAS Bilal
Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
2/32
About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM
Slide 01: Introduction and contents - Course contents
Course outline
Introduction
of IR and DM
Data In-
dexation
Information
Retrieval IR
Data Min-
ing DM
Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
3/32
About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM
Slide 01: Introduction and contents - Textbook, website, and stuff
I m WebSite : https://sites.google.com/view/lounnasbilal
I Books : Introduction to information retrieval, Data mining
Concepts and technics.
I Other stuff : TP (prefered C#), Weka, R, SQL Server BI,....
others, Projects.
Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
4/32
About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM
Definition
Definition
Information retrieval (IR) is finding material (usually documents)
of an unstructured nature (usually text) that satisfies an informa-
tion need from within large collections (usually stored on com-
puters).
I Also
Information retrieval (IR) is the activity of obtaining infor-
mation resources relevant to an information need from a
collection of information resources
Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
5/32
About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM
History
I The idea of using
computers to search
for relevant pieces of
information and that
was popularized in the
article “As We May
Think” by Vannevar
Bush in 1945
Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
6/32
About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM
History
I Before 70 ies : Manual IR in libraries: manual indexing; manual
categorization.
I Between 70 and 80 ies : Automatic IR in libraries.
I After 90 ies : IR on the web and in digital libraries.
Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
7/32
About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM
Terminology
I General: Information Retrieval, Information Need, Query,
Retrieval Model,Retrieval Engine, Search Engine, Relevance,
Relevance Feedback, Evalua-tion, Information Seeking,
Human-Computer-Interaction, Browsing, Inter-faces, Ad-hoc
Retrieval, Filtering
I Related: Document Management, Knowledge Engineering
I Expert: term frequency, document frequency, inverse document
frequency,vector-space model, probabilistic model, BM25, DFR,
page rank, stemming,precision
Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
8/32
About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM
Terminology
A great glossary has been written by the Berkeley University titled by
The Modern Information Retrieval Glossary.
Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
9/32
About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM
Automated information retrieval
Information retrieval in computer science
Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
10/32
About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM
Topics of IR
I Retrieval models
I Text processing
I Efficiency, compression, MapReduce, Scalability
I Distributed IR
I Multimedia: image, video, sound, speech
I Web retrieval and social media search
I Cross-lingual IR (FIRE), Structured Data (XML),
I Digital libraries, Enterprise Search, Legal IR, Patent Search,
Genomics IR
Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
11/32
About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM
Conferences of IR
I SIGIR: Conference on Research and Development in Information
Retrieval
I ECIR: European Conference on Information Retrieval
I CIKM: Conference on Information and Knowledge Management
I WWW: International World Wide Web Conference
I WSDM: Conference on Web Search and Data Mining
I ICTIR: International Conference on Theory of Information
Retrieval
I TREC: Text REtrieval Conference
Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
12/32
About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM
Definition
In the past decad the evolution of data repositories has reach a huge
amount of data, and that make a difficult task to extract a useful
information to be work on.
What is Data Mining
DM is the process of discovering interesting knowledge from
large amounts of data stored in databases, data warehouses,
or other information repositories, and summarizing it into useful
information.
Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
13/32
About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM
Definition
Data mining as simply an essential step in the process of knowledge
discovery.
1 Data cleaning
2 Data integration
3 Data selection
Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
14/32
About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM
Definition
DM as a step of KDD
1 Data transformation
2 Data mining
3 Pattern evaluation
4 Knowledge presentation
Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
15/32
About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM
Why data mining is important?
Why DM is important?
Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
16/32
About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM
Why data mining is important?
Why DM is important?
Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
17/32
About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM
Data mining tasks
/ Data mining can be categorized into tasks, according to different
goals of a data mining practitioner. The two "high-level" primary goals
of data mining, in practice, are prediction and description
Prediction
Description
Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
18/32
About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM
Data mining tasks
Classification
Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
19/32
About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM
Data mining algorithms
Some DM algorithms
Algorithm Task
C4.0 Classification
K-Means Clustering
SVM Classification and regression
Apriori Association rules
EM Estimation
PageRank Classification
AdaBoost Classification and regression
kNN Clustering
Naïve Bayes Estimation
CART Classification
Table: Data mining most known algorithms and their classification
Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
20/32
About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM
Data mining algorithms
Some DM algorithms
1 C4.0 : Decision tree, very popular - TOP 10 algorithm 2008
springer LNCS.
2 K-Means : Clustering algorithm.
Clustering is the task of grouping a set of objects in such a way
that objects in the same group are more similar to each other.
3 SVM - Support vector machine : Classification.
Given a set of training examples, each marked as belonging to one
or the other of two categories, a classification algorithm builds a
model that assigns new examples to one category or the other
Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
21/32
About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM
Data mining algorithms
Some DM algorithms
1 Apriori : Association rule learning, used for frequent item set
mining.
Association rule is a method for discovering interesting relations
between variables in large databases.
Example: onions + potatoes = burger
2 EM - Expectation maximization : Estimation.
Example: Missing values exist among the data
Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
22/32
About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM
DM process
CRISP-DM Cross Industry Standard Process for Data Mining - early
90’s
Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
23/32
About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM
DM process
CRISP-DM This methodology should make large data mining
projects faster, cheaper, more reliable and more manageable.
The life cycle of a data mining project consists of six phases.
The sequence of the phases is not rigid. Moving back and forth
between theme is always required. It depends on the outcome
of each phase which phase or which particular task of a phase,
has to be performed next. The arrows indicate the most impor-
tant and frequent dependencies between phases.
Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
24/32
About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM
Kinds of data mining
1 Graph mining : circuits, chemical compounds, protein structures,
biological networks, social networks, workflows.
2 Spatial Data Mining : maps, preprocessed remote sensing or
medical imaging data.
3 Multimedia Data Mining : audio, video, image, graphics, speech.
4 Text Mining : unstructured data such as news articles, research
papers, books, digital libraries, e-mail messages, and Web pages.
5 Mining the World Wide Web : Web mining is a more challenging
task that searches for Web structures, ranks the importance of
Web contents, discovers the regularity and dynamics of Web
contents, and mines Web access patterns.
Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
25/32
About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM
Data mining application
1 Data Mining for Finance
2 Data mining for the Industry sectors
3 Data Mining for the Telecommunication Industry
4 Data Mining for Biology
5 Data mining for Intrusion Detection
6 Data mining for Education
Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
26/32
About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM
Data mining tools
1 SAS Enterprise Miner
Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
27/32
About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM
Data mining tools
1 Clementine, from SPSS
Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
28/32
About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM
Data mining tools
1 Statistica Data Miner from Statsoft
Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
29/32
About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM
Data mining tools
1 Oracle Data Mining (ODM)
Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
30/32
About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM
Data mining tools
1 Microsoft SQL Server 2008R2 - Analysis Services
Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
31/32
About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM
Data mining tools
1 Weka
Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
32/32
About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM
Data mining tools
1 RapidMiner
Information retrieval (IR) and Data mining (DM) Dr B. Lounnas

More Related Content

Similar to Slide 01.pdf

01Intro.ppt
01Intro.ppt01Intro.ppt
01Intro.ppt
AidaMustapha6
 
01Introduction to data mining chapter 1.ppt
01Introduction to data mining chapter 1.ppt01Introduction to data mining chapter 1.ppt
01Introduction to data mining chapter 1.ppt
admsoyadm4
 
data mining
data miningdata mining
data mining
AMITKUMAR202236
 
01Intro.ppt
01Intro.ppt01Intro.ppt
01Intro.ppt
VaibhavGupta447155
 
Unit 1 (Chapter-1) on data mining concepts.ppt
Unit 1 (Chapter-1) on data mining concepts.pptUnit 1 (Chapter-1) on data mining concepts.ppt
Unit 1 (Chapter-1) on data mining concepts.ppt
PadmajaLaksh
 
Upstate CSCI 525 Data Mining Chapter 1
Upstate CSCI 525 Data Mining Chapter 1Upstate CSCI 525 Data Mining Chapter 1
Upstate CSCI 525 Data Mining Chapter 1
DanWooster1
 
Data Management Planning at the DCC: a human factor
Data Management Planning at the DCC: a human factorData Management Planning at the DCC: a human factor
Data Management Planning at the DCC: a human factor
Martin Donnelly
 
Chapter 1. Introduction.ppt
Chapter 1. Introduction.pptChapter 1. Introduction.ppt
Chapter 1. Introduction.ppt
Subrata Kumer Paul
 
Chap 1 general introduction of information retrieval
Chap 1  general introduction of information retrievalChap 1  general introduction of information retrieval
Chap 1 general introduction of information retrieval
Malobe Lottin Cyrille Marcel
 
Semester V-converted.pdf
Semester V-converted.pdfSemester V-converted.pdf
Semester V-converted.pdf
GayathriRHICETCSESTA
 
Ci2004-10.doc
Ci2004-10.docCi2004-10.doc
Ci2004-10.doc
butest
 
Chapter 13Big Data AnalyticsBusiness Intelligence a.docx
Chapter 13Big Data AnalyticsBusiness Intelligence a.docxChapter 13Big Data AnalyticsBusiness Intelligence a.docx
Chapter 13Big Data AnalyticsBusiness Intelligence a.docx
keturahhazelhurst
 
Principles for proper data management and reuse--An RDA view
Principles for proper data management and reuse--An RDA viewPrinciples for proper data management and reuse--An RDA view
Principles for proper data management and reuse--An RDA view
Research Data Alliance
 
Ir 01
Ir   01Ir   01
Data Management Planning at the DCC
Data Management Planning at the DCCData Management Planning at the DCC
Data Management Planning at the DCC
Martin Donnelly
 
PowerPoint Template
PowerPoint TemplatePowerPoint Template
PowerPoint Template
butest
 
data mining
data miningdata mining
data mining
uoitc
 
iConference: Overview of data management planning
iConference: Overview of data management planningiConference: Overview of data management planning
iConference: Overview of data management planning
Carly Strasser
 
DBMS
DBMSDBMS
UNIT2-Data Mining.pdf
UNIT2-Data Mining.pdfUNIT2-Data Mining.pdf
UNIT2-Data Mining.pdf
Nancykumari47
 

Similar to Slide 01.pdf (20)

01Intro.ppt
01Intro.ppt01Intro.ppt
01Intro.ppt
 
01Introduction to data mining chapter 1.ppt
01Introduction to data mining chapter 1.ppt01Introduction to data mining chapter 1.ppt
01Introduction to data mining chapter 1.ppt
 
data mining
data miningdata mining
data mining
 
01Intro.ppt
01Intro.ppt01Intro.ppt
01Intro.ppt
 
Unit 1 (Chapter-1) on data mining concepts.ppt
Unit 1 (Chapter-1) on data mining concepts.pptUnit 1 (Chapter-1) on data mining concepts.ppt
Unit 1 (Chapter-1) on data mining concepts.ppt
 
Upstate CSCI 525 Data Mining Chapter 1
Upstate CSCI 525 Data Mining Chapter 1Upstate CSCI 525 Data Mining Chapter 1
Upstate CSCI 525 Data Mining Chapter 1
 
Data Management Planning at the DCC: a human factor
Data Management Planning at the DCC: a human factorData Management Planning at the DCC: a human factor
Data Management Planning at the DCC: a human factor
 
Chapter 1. Introduction.ppt
Chapter 1. Introduction.pptChapter 1. Introduction.ppt
Chapter 1. Introduction.ppt
 
Chap 1 general introduction of information retrieval
Chap 1  general introduction of information retrievalChap 1  general introduction of information retrieval
Chap 1 general introduction of information retrieval
 
Semester V-converted.pdf
Semester V-converted.pdfSemester V-converted.pdf
Semester V-converted.pdf
 
Ci2004-10.doc
Ci2004-10.docCi2004-10.doc
Ci2004-10.doc
 
Chapter 13Big Data AnalyticsBusiness Intelligence a.docx
Chapter 13Big Data AnalyticsBusiness Intelligence a.docxChapter 13Big Data AnalyticsBusiness Intelligence a.docx
Chapter 13Big Data AnalyticsBusiness Intelligence a.docx
 
Principles for proper data management and reuse--An RDA view
Principles for proper data management and reuse--An RDA viewPrinciples for proper data management and reuse--An RDA view
Principles for proper data management and reuse--An RDA view
 
Ir 01
Ir   01Ir   01
Ir 01
 
Data Management Planning at the DCC
Data Management Planning at the DCCData Management Planning at the DCC
Data Management Planning at the DCC
 
PowerPoint Template
PowerPoint TemplatePowerPoint Template
PowerPoint Template
 
data mining
data miningdata mining
data mining
 
iConference: Overview of data management planning
iConference: Overview of data management planningiConference: Overview of data management planning
iConference: Overview of data management planning
 
DBMS
DBMSDBMS
DBMS
 
UNIT2-Data Mining.pdf
UNIT2-Data Mining.pdfUNIT2-Data Mining.pdf
UNIT2-Data Mining.pdf
 

Recently uploaded

A Free 200-Page eBook ~ Brain and Mind Exercise.pptx
A Free 200-Page eBook ~ Brain and Mind Exercise.pptxA Free 200-Page eBook ~ Brain and Mind Exercise.pptx
A Free 200-Page eBook ~ Brain and Mind Exercise.pptx
OH TEIK BIN
 
Bonku-Babus-Friend by Sathyajith Ray (9)
Bonku-Babus-Friend by Sathyajith Ray  (9)Bonku-Babus-Friend by Sathyajith Ray  (9)
Bonku-Babus-Friend by Sathyajith Ray (9)
nitinpv4ai
 
How to Manage Reception Report in Odoo 17
How to Manage Reception Report in Odoo 17How to Manage Reception Report in Odoo 17
How to Manage Reception Report in Odoo 17
Celine George
 
Diversity Quiz Prelims by Quiz Club, IIT Kanpur
Diversity Quiz Prelims by Quiz Club, IIT KanpurDiversity Quiz Prelims by Quiz Club, IIT Kanpur
Diversity Quiz Prelims by Quiz Club, IIT Kanpur
Quiz Club IIT Kanpur
 
THE SACRIFICE HOW PRO-PALESTINE PROTESTS STUDENTS ARE SACRIFICING TO CHANGE T...
THE SACRIFICE HOW PRO-PALESTINE PROTESTS STUDENTS ARE SACRIFICING TO CHANGE T...THE SACRIFICE HOW PRO-PALESTINE PROTESTS STUDENTS ARE SACRIFICING TO CHANGE T...
THE SACRIFICE HOW PRO-PALESTINE PROTESTS STUDENTS ARE SACRIFICING TO CHANGE T...
indexPub
 
adjectives.ppt for class 1 to 6, grammar
adjectives.ppt for class 1 to 6, grammaradjectives.ppt for class 1 to 6, grammar
adjectives.ppt for class 1 to 6, grammar
7DFarhanaMohammed
 
مصحف القراءات العشر أعد أحرف الخلاف سمير بسيوني.pdf
مصحف القراءات العشر   أعد أحرف الخلاف سمير بسيوني.pdfمصحف القراءات العشر   أعد أحرف الخلاف سمير بسيوني.pdf
مصحف القراءات العشر أعد أحرف الخلاف سمير بسيوني.pdf
سمير بسيوني
 
CapTechTalks Webinar Slides June 2024 Donovan Wright.pptx
CapTechTalks Webinar Slides June 2024 Donovan Wright.pptxCapTechTalks Webinar Slides June 2024 Donovan Wright.pptx
CapTechTalks Webinar Slides June 2024 Donovan Wright.pptx
CapitolTechU
 
Pharmaceutics Pharmaceuticals best of brub
Pharmaceutics Pharmaceuticals best of brubPharmaceutics Pharmaceuticals best of brub
Pharmaceutics Pharmaceuticals best of brub
danielkiash986
 
78 Microsoft-Publisher - Sirin Sultana Bora.pptx
78 Microsoft-Publisher - Sirin Sultana Bora.pptx78 Microsoft-Publisher - Sirin Sultana Bora.pptx
78 Microsoft-Publisher - Sirin Sultana Bora.pptx
Kalna College
 
CIS 4200-02 Group 1 Final Project Report (1).pdf
CIS 4200-02 Group 1 Final Project Report (1).pdfCIS 4200-02 Group 1 Final Project Report (1).pdf
CIS 4200-02 Group 1 Final Project Report (1).pdf
blueshagoo1
 
Elevate Your Nonprofit's Online Presence_ A Guide to Effective SEO Strategies...
Elevate Your Nonprofit's Online Presence_ A Guide to Effective SEO Strategies...Elevate Your Nonprofit's Online Presence_ A Guide to Effective SEO Strategies...
Elevate Your Nonprofit's Online Presence_ A Guide to Effective SEO Strategies...
TechSoup
 
Contiguity Of Various Message Forms - Rupam Chandra.pptx
Contiguity Of Various Message Forms - Rupam Chandra.pptxContiguity Of Various Message Forms - Rupam Chandra.pptx
Contiguity Of Various Message Forms - Rupam Chandra.pptx
Kalna College
 
NIPER 2024 MEMORY BASED QUESTIONS.ANSWERS TO NIPER 2024 QUESTIONS.NIPER JEE 2...
NIPER 2024 MEMORY BASED QUESTIONS.ANSWERS TO NIPER 2024 QUESTIONS.NIPER JEE 2...NIPER 2024 MEMORY BASED QUESTIONS.ANSWERS TO NIPER 2024 QUESTIONS.NIPER JEE 2...
NIPER 2024 MEMORY BASED QUESTIONS.ANSWERS TO NIPER 2024 QUESTIONS.NIPER JEE 2...
Payaamvohra1
 
Information and Communication Technology in Education
Information and Communication Technology in EducationInformation and Communication Technology in Education
Information and Communication Technology in Education
MJDuyan
 
欧洲杯下注-欧洲杯下注押注官网-欧洲杯下注押注网站|【​网址​🎉ac44.net🎉​】
欧洲杯下注-欧洲杯下注押注官网-欧洲杯下注押注网站|【​网址​🎉ac44.net🎉​】欧洲杯下注-欧洲杯下注押注官网-欧洲杯下注押注网站|【​网址​🎉ac44.net🎉​】
欧洲杯下注-欧洲杯下注押注官网-欧洲杯下注押注网站|【​网址​🎉ac44.net🎉​】
andagarcia212
 
Skimbleshanks-The-Railway-Cat by T S Eliot
Skimbleshanks-The-Railway-Cat by T S EliotSkimbleshanks-The-Railway-Cat by T S Eliot
Skimbleshanks-The-Railway-Cat by T S Eliot
nitinpv4ai
 
KHUSWANT SINGH.pptx ALL YOU NEED TO KNOW ABOUT KHUSHWANT SINGH
KHUSWANT SINGH.pptx ALL YOU NEED TO KNOW ABOUT KHUSHWANT SINGHKHUSWANT SINGH.pptx ALL YOU NEED TO KNOW ABOUT KHUSHWANT SINGH
KHUSWANT SINGH.pptx ALL YOU NEED TO KNOW ABOUT KHUSHWANT SINGH
shreyassri1208
 
220711130083 SUBHASHREE RAKSHIT Internet resources for social science
220711130083 SUBHASHREE RAKSHIT  Internet resources for social science220711130083 SUBHASHREE RAKSHIT  Internet resources for social science
220711130083 SUBHASHREE RAKSHIT Internet resources for social science
Kalna College
 
Andreas Schleicher presents PISA 2022 Volume III - Creative Thinking - 18 Jun...
Andreas Schleicher presents PISA 2022 Volume III - Creative Thinking - 18 Jun...Andreas Schleicher presents PISA 2022 Volume III - Creative Thinking - 18 Jun...
Andreas Schleicher presents PISA 2022 Volume III - Creative Thinking - 18 Jun...
EduSkills OECD
 

Recently uploaded (20)

A Free 200-Page eBook ~ Brain and Mind Exercise.pptx
A Free 200-Page eBook ~ Brain and Mind Exercise.pptxA Free 200-Page eBook ~ Brain and Mind Exercise.pptx
A Free 200-Page eBook ~ Brain and Mind Exercise.pptx
 
Bonku-Babus-Friend by Sathyajith Ray (9)
Bonku-Babus-Friend by Sathyajith Ray  (9)Bonku-Babus-Friend by Sathyajith Ray  (9)
Bonku-Babus-Friend by Sathyajith Ray (9)
 
How to Manage Reception Report in Odoo 17
How to Manage Reception Report in Odoo 17How to Manage Reception Report in Odoo 17
How to Manage Reception Report in Odoo 17
 
Diversity Quiz Prelims by Quiz Club, IIT Kanpur
Diversity Quiz Prelims by Quiz Club, IIT KanpurDiversity Quiz Prelims by Quiz Club, IIT Kanpur
Diversity Quiz Prelims by Quiz Club, IIT Kanpur
 
THE SACRIFICE HOW PRO-PALESTINE PROTESTS STUDENTS ARE SACRIFICING TO CHANGE T...
THE SACRIFICE HOW PRO-PALESTINE PROTESTS STUDENTS ARE SACRIFICING TO CHANGE T...THE SACRIFICE HOW PRO-PALESTINE PROTESTS STUDENTS ARE SACRIFICING TO CHANGE T...
THE SACRIFICE HOW PRO-PALESTINE PROTESTS STUDENTS ARE SACRIFICING TO CHANGE T...
 
adjectives.ppt for class 1 to 6, grammar
adjectives.ppt for class 1 to 6, grammaradjectives.ppt for class 1 to 6, grammar
adjectives.ppt for class 1 to 6, grammar
 
مصحف القراءات العشر أعد أحرف الخلاف سمير بسيوني.pdf
مصحف القراءات العشر   أعد أحرف الخلاف سمير بسيوني.pdfمصحف القراءات العشر   أعد أحرف الخلاف سمير بسيوني.pdf
مصحف القراءات العشر أعد أحرف الخلاف سمير بسيوني.pdf
 
CapTechTalks Webinar Slides June 2024 Donovan Wright.pptx
CapTechTalks Webinar Slides June 2024 Donovan Wright.pptxCapTechTalks Webinar Slides June 2024 Donovan Wright.pptx
CapTechTalks Webinar Slides June 2024 Donovan Wright.pptx
 
Pharmaceutics Pharmaceuticals best of brub
Pharmaceutics Pharmaceuticals best of brubPharmaceutics Pharmaceuticals best of brub
Pharmaceutics Pharmaceuticals best of brub
 
78 Microsoft-Publisher - Sirin Sultana Bora.pptx
78 Microsoft-Publisher - Sirin Sultana Bora.pptx78 Microsoft-Publisher - Sirin Sultana Bora.pptx
78 Microsoft-Publisher - Sirin Sultana Bora.pptx
 
CIS 4200-02 Group 1 Final Project Report (1).pdf
CIS 4200-02 Group 1 Final Project Report (1).pdfCIS 4200-02 Group 1 Final Project Report (1).pdf
CIS 4200-02 Group 1 Final Project Report (1).pdf
 
Elevate Your Nonprofit's Online Presence_ A Guide to Effective SEO Strategies...
Elevate Your Nonprofit's Online Presence_ A Guide to Effective SEO Strategies...Elevate Your Nonprofit's Online Presence_ A Guide to Effective SEO Strategies...
Elevate Your Nonprofit's Online Presence_ A Guide to Effective SEO Strategies...
 
Contiguity Of Various Message Forms - Rupam Chandra.pptx
Contiguity Of Various Message Forms - Rupam Chandra.pptxContiguity Of Various Message Forms - Rupam Chandra.pptx
Contiguity Of Various Message Forms - Rupam Chandra.pptx
 
NIPER 2024 MEMORY BASED QUESTIONS.ANSWERS TO NIPER 2024 QUESTIONS.NIPER JEE 2...
NIPER 2024 MEMORY BASED QUESTIONS.ANSWERS TO NIPER 2024 QUESTIONS.NIPER JEE 2...NIPER 2024 MEMORY BASED QUESTIONS.ANSWERS TO NIPER 2024 QUESTIONS.NIPER JEE 2...
NIPER 2024 MEMORY BASED QUESTIONS.ANSWERS TO NIPER 2024 QUESTIONS.NIPER JEE 2...
 
Information and Communication Technology in Education
Information and Communication Technology in EducationInformation and Communication Technology in Education
Information and Communication Technology in Education
 
欧洲杯下注-欧洲杯下注押注官网-欧洲杯下注押注网站|【​网址​🎉ac44.net🎉​】
欧洲杯下注-欧洲杯下注押注官网-欧洲杯下注押注网站|【​网址​🎉ac44.net🎉​】欧洲杯下注-欧洲杯下注押注官网-欧洲杯下注押注网站|【​网址​🎉ac44.net🎉​】
欧洲杯下注-欧洲杯下注押注官网-欧洲杯下注押注网站|【​网址​🎉ac44.net🎉​】
 
Skimbleshanks-The-Railway-Cat by T S Eliot
Skimbleshanks-The-Railway-Cat by T S EliotSkimbleshanks-The-Railway-Cat by T S Eliot
Skimbleshanks-The-Railway-Cat by T S Eliot
 
KHUSWANT SINGH.pptx ALL YOU NEED TO KNOW ABOUT KHUSHWANT SINGH
KHUSWANT SINGH.pptx ALL YOU NEED TO KNOW ABOUT KHUSHWANT SINGHKHUSWANT SINGH.pptx ALL YOU NEED TO KNOW ABOUT KHUSHWANT SINGH
KHUSWANT SINGH.pptx ALL YOU NEED TO KNOW ABOUT KHUSHWANT SINGH
 
220711130083 SUBHASHREE RAKSHIT Internet resources for social science
220711130083 SUBHASHREE RAKSHIT  Internet resources for social science220711130083 SUBHASHREE RAKSHIT  Internet resources for social science
220711130083 SUBHASHREE RAKSHIT Internet resources for social science
 
Andreas Schleicher presents PISA 2022 Volume III - Creative Thinking - 18 Jun...
Andreas Schleicher presents PISA 2022 Volume III - Creative Thinking - 18 Jun...Andreas Schleicher presents PISA 2022 Volume III - Creative Thinking - 18 Jun...
Andreas Schleicher presents PISA 2022 Volume III - Creative Thinking - 18 Jun...
 

Slide 01.pdf

  • 1. 1/32 About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM Information retrieval (IR) and Data mining (DM) By: Dr. LOUNNAS Bilal Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
  • 2. 2/32 About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM Slide 01: Introduction and contents - Course contents Course outline Introduction of IR and DM Data In- dexation Information Retrieval IR Data Min- ing DM Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
  • 3. 3/32 About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM Slide 01: Introduction and contents - Textbook, website, and stuff I m WebSite : https://sites.google.com/view/lounnasbilal I Books : Introduction to information retrieval, Data mining Concepts and technics. I Other stuff : TP (prefered C#), Weka, R, SQL Server BI,.... others, Projects. Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
  • 4. 4/32 About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM Definition Definition Information retrieval (IR) is finding material (usually documents) of an unstructured nature (usually text) that satisfies an informa- tion need from within large collections (usually stored on com- puters). I Also Information retrieval (IR) is the activity of obtaining infor- mation resources relevant to an information need from a collection of information resources Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
  • 5. 5/32 About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM History I The idea of using computers to search for relevant pieces of information and that was popularized in the article “As We May Think” by Vannevar Bush in 1945 Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
  • 6. 6/32 About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM History I Before 70 ies : Manual IR in libraries: manual indexing; manual categorization. I Between 70 and 80 ies : Automatic IR in libraries. I After 90 ies : IR on the web and in digital libraries. Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
  • 7. 7/32 About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM Terminology I General: Information Retrieval, Information Need, Query, Retrieval Model,Retrieval Engine, Search Engine, Relevance, Relevance Feedback, Evalua-tion, Information Seeking, Human-Computer-Interaction, Browsing, Inter-faces, Ad-hoc Retrieval, Filtering I Related: Document Management, Knowledge Engineering I Expert: term frequency, document frequency, inverse document frequency,vector-space model, probabilistic model, BM25, DFR, page rank, stemming,precision Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
  • 8. 8/32 About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM Terminology A great glossary has been written by the Berkeley University titled by The Modern Information Retrieval Glossary. Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
  • 9. 9/32 About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM Automated information retrieval Information retrieval in computer science Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
  • 10. 10/32 About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM Topics of IR I Retrieval models I Text processing I Efficiency, compression, MapReduce, Scalability I Distributed IR I Multimedia: image, video, sound, speech I Web retrieval and social media search I Cross-lingual IR (FIRE), Structured Data (XML), I Digital libraries, Enterprise Search, Legal IR, Patent Search, Genomics IR Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
  • 11. 11/32 About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM Conferences of IR I SIGIR: Conference on Research and Development in Information Retrieval I ECIR: European Conference on Information Retrieval I CIKM: Conference on Information and Knowledge Management I WWW: International World Wide Web Conference I WSDM: Conference on Web Search and Data Mining I ICTIR: International Conference on Theory of Information Retrieval I TREC: Text REtrieval Conference Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
  • 12. 12/32 About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM Definition In the past decad the evolution of data repositories has reach a huge amount of data, and that make a difficult task to extract a useful information to be work on. What is Data Mining DM is the process of discovering interesting knowledge from large amounts of data stored in databases, data warehouses, or other information repositories, and summarizing it into useful information. Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
  • 13. 13/32 About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM Definition Data mining as simply an essential step in the process of knowledge discovery. 1 Data cleaning 2 Data integration 3 Data selection Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
  • 14. 14/32 About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM Definition DM as a step of KDD 1 Data transformation 2 Data mining 3 Pattern evaluation 4 Knowledge presentation Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
  • 15. 15/32 About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM Why data mining is important? Why DM is important? Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
  • 16. 16/32 About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM Why data mining is important? Why DM is important? Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
  • 17. 17/32 About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM Data mining tasks / Data mining can be categorized into tasks, according to different goals of a data mining practitioner. The two "high-level" primary goals of data mining, in practice, are prediction and description Prediction Description Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
  • 18. 18/32 About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM Data mining tasks Classification Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
  • 19. 19/32 About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM Data mining algorithms Some DM algorithms Algorithm Task C4.0 Classification K-Means Clustering SVM Classification and regression Apriori Association rules EM Estimation PageRank Classification AdaBoost Classification and regression kNN Clustering Naïve Bayes Estimation CART Classification Table: Data mining most known algorithms and their classification Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
  • 20. 20/32 About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM Data mining algorithms Some DM algorithms 1 C4.0 : Decision tree, very popular - TOP 10 algorithm 2008 springer LNCS. 2 K-Means : Clustering algorithm. Clustering is the task of grouping a set of objects in such a way that objects in the same group are more similar to each other. 3 SVM - Support vector machine : Classification. Given a set of training examples, each marked as belonging to one or the other of two categories, a classification algorithm builds a model that assigns new examples to one category or the other Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
  • 21. 21/32 About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM Data mining algorithms Some DM algorithms 1 Apriori : Association rule learning, used for frequent item set mining. Association rule is a method for discovering interesting relations between variables in large databases. Example: onions + potatoes = burger 2 EM - Expectation maximization : Estimation. Example: Missing values exist among the data Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
  • 22. 22/32 About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM DM process CRISP-DM Cross Industry Standard Process for Data Mining - early 90’s Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
  • 23. 23/32 About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM DM process CRISP-DM This methodology should make large data mining projects faster, cheaper, more reliable and more manageable. The life cycle of a data mining project consists of six phases. The sequence of the phases is not rigid. Moving back and forth between theme is always required. It depends on the outcome of each phase which phase or which particular task of a phase, has to be performed next. The arrows indicate the most impor- tant and frequent dependencies between phases. Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
  • 24. 24/32 About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM Kinds of data mining 1 Graph mining : circuits, chemical compounds, protein structures, biological networks, social networks, workflows. 2 Spatial Data Mining : maps, preprocessed remote sensing or medical imaging data. 3 Multimedia Data Mining : audio, video, image, graphics, speech. 4 Text Mining : unstructured data such as news articles, research papers, books, digital libraries, e-mail messages, and Web pages. 5 Mining the World Wide Web : Web mining is a more challenging task that searches for Web structures, ranks the importance of Web contents, discovers the regularity and dynamics of Web contents, and mines Web access patterns. Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
  • 25. 25/32 About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM Data mining application 1 Data Mining for Finance 2 Data mining for the Industry sectors 3 Data Mining for the Telecommunication Industry 4 Data Mining for Biology 5 Data mining for Intrusion Detection 6 Data mining for Education Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
  • 26. 26/32 About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM Data mining tools 1 SAS Enterprise Miner Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
  • 27. 27/32 About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM Data mining tools 1 Clementine, from SPSS Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
  • 28. 28/32 About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM Data mining tools 1 Statistica Data Miner from Statsoft Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
  • 29. 29/32 About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM Data mining tools 1 Oracle Data Mining (ODM) Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
  • 30. 30/32 About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM Data mining tools 1 Microsoft SQL Server 2008R2 - Analysis Services Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
  • 31. 31/32 About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM Data mining tools 1 Weka Information retrieval (IR) and Data mining (DM) Dr B. Lounnas
  • 32. 32/32 About the course Primitives of IR Topics and Conferences of IR Primitives of DM Advanced DM Data mining tools 1 RapidMiner Information retrieval (IR) and Data mining (DM) Dr B. Lounnas