Text mining (newsgroups, email, documents) and Web analysis
Intelligent query answering
Predict whether a patient, hospitalized due to a heart attack, will have a second heart attack, using demographic, diet, and clinical measurements for that patient
Predict the price of a stock six months from now, on the basis of company performance measures and economic data
Identify the numbers in a handwritten ZIP code, from a digitized image
Estimate the amount of glucose in the blood of a diabetic person from the infrared spectrum of the person’s blood
Identify the risk factors for prostate cancer, based on clinical and demographic variables
Identify significant financial news stories in a stream of news stories
Identify groups of web-customers with similar browsing behavior
Distinguish pornographic from non-pornographic web pages
Data Mining: A KDD Process
Data mining: the core of the knowledge discovery process
[Figure: the KDD process] Databases -> Data Cleaning and Data Integration -> Data Warehouse -> Selection -> Task-relevant Data -> Data Mining -> Pattern Evaluation -> Knowledge
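The stages named in the KDD process (cleaning, integration, selection, mining, pattern evaluation) can be sketched as a chain of small functions. This is a toy illustration only: the records, the field names (`region`, `rain_mm`), the 500 mm threshold, and the `min_support` cutoff are all assumptions made up for the example, and the "mining" step is a simple count rather than any particular algorithm.

```python
from collections import Counter

# Hypothetical raw records from two source "databases"; None marks a missing value.
source_a = [{"id": 1, "region": "north", "rain_mm": 820},
            {"id": 2, "region": "south", "rain_mm": None}]
source_b = [{"id": 3, "region": "south", "rain_mm": 210},
            {"id": 4, "region": "north", "rain_mm": 790}]

def clean(records):
    """Data cleaning: drop records that contain a missing value."""
    return [r for r in records if all(v is not None for v in r.values())]

def integrate(*sources):
    """Data integration: merge the cleaned sources into one 'warehouse' list."""
    warehouse = []
    for source in sources:
        warehouse.extend(clean(source))
    return warehouse

def select(warehouse, fields):
    """Selection: keep only the task-relevant fields."""
    return [{f: r[f] for f in fields} for r in warehouse]

def mine(data, threshold=500):
    """Data mining (toy): label each record wet or dry and count (region, label) pairs."""
    return Counter(
        (r["region"], "wet" if r["rain_mm"] >= threshold else "dry") for r in data
    )

def evaluate(patterns, min_support=2):
    """Pattern evaluation: keep only patterns seen at least min_support times."""
    return {p: n for p, n in patterns.items() if n >= min_support}

warehouse = integrate(source_a, source_b)
task_data = select(warehouse, ["region", "rain_mm"])
knowledge = evaluate(mine(task_data))
print(knowledge)  # {('north', 'wet'): 2}
```

Each function corresponds to one stage, so a stage can be replaced (for example, a real mining algorithm in place of `mine`) without touching the others.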
Architecture of a Typical Data Mining System
[Figure: system components, from data sources up to the user]
Databases and data warehouse (populated via data cleaning, data integration, and filtering)
Database or data warehouse server
Data mining engine
Pattern evaluation
Graphical user interface
Knowledge base
Data Mining: On What Kind of Data?
Advanced DB and information repositories
Object-oriented and object-relational databases
Time-series data and temporal data
Text databases and multimedia databases
Heterogeneous and legacy databases
Data Mining Functionalities (1)
Exploratory Data Analysis
Generalize, summarize, and contrast data characteristics, e.g., dry vs. wet regions
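As a rough illustration of summarizing and contrasting data characteristics, the sketch below computes per-class means for dry and wet regions and then takes their difference. The records and attribute names (`mean_temp_c`, `mean_yield`) are invented for the example and are not drawn from any real data set.

```python
from statistics import mean

# Hypothetical records: (region_class, average_temperature_c, crop_yield_t_per_ha)
records = [
    ("dry", 31.0, 1.2),
    ("dry", 29.0, 1.6),
    ("wet", 24.0, 3.1),
    ("wet", 26.0, 2.9),
]

def summarize(rows, label):
    """Summarize one class by its size and the mean of each attribute."""
    subset = [r for r in rows if r[0] == label]
    return {
        "count": len(subset),
        "mean_temp_c": mean(r[1] for r in subset),
        "mean_yield": mean(r[2] for r in subset),
    }

def contrast(a, b):
    """Contrast two class summaries: attribute-wise difference (a minus b)."""
    return {k: round(a[k] - b[k], 2) for k in a if k != "count"}

dry = summarize(records, "dry")
wet = summarize(records, "wet")
diff = contrast(dry, wet)
print(dry, wet, diff)
```

With these made-up numbers, the dry regions come out warmer and lower-yielding than the wet ones, which is the kind of contrast the slide refers to.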