SlideShare a Scribd company logo
1 of 10
Download to read offline
International Refereed Journal of Engineering and Science (IRJES) 
ISSN (Online) 2319-183X, (Print) 2319-1821 
Volume 3, Issue 8 (August 2014), PP.01-10 
Mining Materializations In Dissimilar Stages 
Sapram Choudaiah, 
Associate Professor, Department of MCA, Siddhartha Institute of Engineering and Technology, 
Puttur, Chittoor (Dist) , Andhra Pradesh, India. 
Abstract:-Teradata provide the foundation to give a company the authority to grow, to complete in today’s 
dynamic marketplace, to achieve the goal of ―transforming transactions into Relationships‖ and to progress the 
business by accomplishment answer to a new generation of questions. Teradata’s scalability allows the system 
to grow as the business grows, from gigabytes to terabytes and beyond. Teradata’s unique technology has been 
proven at customer sites across industries and around the world. Teradata is a large database server that 
accommodates multiple client applications making inquiries against it concurrently. Various client platforms 
access the database through network connection across an IBM mainframe channel connection. The ability to 
manage large amounts of data is accomplished using the concept of parallelism, where in many individual 
processors perform smaller tasks concurrently to accomplish an operation against a huge repository of data. In 
Teradata mining ―Teradata miner tool‖ analyze the data in efficient meaner, it produce the reports and graphs by 
using the Teradata operations. 
Keywords:- Scalability, parallelism, Teradata miner, Repositories, Terabytes of data 
I. INTRODUCTION 
The purpose is not to explain decision trees or other data mining algorithms but rather to focus on how 
to do a decision tree and association analysis data mining tasks using TWM. Before getting started, a brief 
overview of data mining is provided. 
II. SYSTEM STUDY 
Data mining has many definitions and may be called by other names such as knowledge discovery. It 
is generally considered to be a part of the umbrella of tasks, tools, techniques etc. within business Intelligence 
(BI). Many corporate managers consider BI to be the heart of all the processes that support decision making at 
all levels. A definition of data mining typically includes large datasets, discovering previously unknown 
knowledge and patterns and that this knowledge is actionable. That what is discovered is not trivial but can be 
usefully applied. BI and its Data Mining component are receiving considerable attention and fanfare as 
companies utilize BI for competitive advantage. 
Different authors may address the data mining tasks slightly different from each other but the following 
terminology provides a helpful and useful basis for discussing data mining. The data mining tasks are: 
www.irjes.org 1 | Page 
 Description 
 Estimation 
 Classification 
 Prediction 
 Association Analysis 
 Clustering 
2.1 Description 
Uses descriptive statistics to better understand and profile areas of interest. Thus a variety of well 
known statistical tools and methods are used for this task—including frequency charts and other graphical 
output, measures of central tendency and variation. 
2.2 Data Mining Tasks with a Target or Dependent Variable 
Estimation, classification and prediction are data mining tasks that have a target (dependent) 
variable. Sometimes these, are referred to as predictive analysis; however, many authors reserve the term 
Prediction to use of models for the future. The terms supervised and directed apply to these data mining tasks. 
Estimation data mining tasks have an interval level dependent target variable whereas classification data 
mining tasks have a categorical (symbolic) target variable. An example of an estimation data mining task 
would be estimating family income based on a number of attributes; whereas a model to place families into the 
three income brackets of Low, Medium or High would be an example of a classification data mining task. Thus,
Mining Materializations In Dissimilar Stages 
the difference between the two tasks is the type of target variable. When either an estimation data mining task or 
classification task is used to predict future outcomes, the data mining task becomes one of Prediction. Again, 
estimation and classification are referred to as predictive models because that would be the typical application of 
models built for these data mining tasks. 
In summary, the most important concept is that estimation and classification data mining tasks require a target 
variable. However, the difference lies in the data type of the target variable. 
2.3 Data Mining Algorithms for Directed/Supervised Data Mining Tasks 
Linear regression models are the most common data mining algorithms for estimation data mining 
tasks. Of course, linear regression is a very well known and familiar technique. A number of data mining 
algorithms can be used for classification data mining tasks including logistic regression, decision trees, neural 
networks, memory based reasoning (k-nearest neighbor), and Naïve Bayes. 
2.4 Data Mining Tasks without a Target or Dependent Variable 
Association Analysis and Clustering are data mining tasks that do not have a target (dependent) 
variable. Affinity analysis is another term that refers to association analysis and is typically used for 
market basket analysis (MBA) although association analysis can be used for other areas of study. 
MBA is essentially analyzing what purchases tend to be purchased together—that is what items tend to 
have an affinity with other items. Clustering, having no target variable, algorithms attempt to put 
records into groups based on the record’s attributes. The critical concept is that of similarity—those 
within a cluster are very similar to each other and not similar with those in another cluster. 
2.5 Data mining overview summary 
The following table summarizes the data mining tasks and associated data mining algorithms. 
Data mining 
tasks 
Target 
Variable 
Typical Data 
Mining 
Algorithm(s) 
Description No Statistics, 
including 
descriptive, & 
visualization 
Estimation 
Yes 
Interval 
Numeric 
Linear 
Regression 
Classification Yes 
Categoric 
al 
Logistic 
Regression, 
Decision Trees, 
Neural 
Networks, 
Memory Based 
Reasoning, 
Naïve Bayes 
Prediction Yes Estimation and 
Classification 
models for 
prediction 
Association 
Analysis 
No Affinity 
Analysis 
(Market Basket 
Analysis) 
Clustering No k-means, 
Kohonen Self 
Organizing 
Maps (SOM) 
Table 1.1: Data mining overview 
www.irjes.org 2 | Page
Mining Materializations In Dissimilar Stages 
III. TERADATA WAREHOUSE MINER 
To become a TUN member and details for access to the University of Arkansas Teradata resources, see 
the ―How to Become a TUN member…‖ document which includes a link to remote access details. Using the 
icon on the Desktop or the All Programs menu, to open Teradata Warehouse Miner 5.0. Note that TWM is 
Disconnected when initially opened. Click the Open Connection icon as shown to open the 
Fig 3.1: Teradata miner 
Select Data Source dialog. 
Fig 3.2: ODBC Connections 
The System Data Source tab should be the default tab; if not, click it. 
Then locate and select Walton College Teradata from the list of names—only name In this example. 
Click the OK button to open the logon dialog. 
Fig 3.3: Teradata Login 
Note that the grayed out icon to the right of the Open Connection icon is now yellow and the word 
Disconnected has been replaced with the name of the data source to which the system is now connected. Also 
displayed is the tooltip for the Add New Project icon on the toolbar. The File menu can also be used to create a 
new project. 
www.irjes.org 3 | Page
Mining Materializations In Dissimilar Stages 
Fig 3.4: warehouse miner project selection 
Also, note that there is a Project Explorer window to the right and an Execution Status window on the bottom. 
Open a new project by clicking the Add New Project icon—the default name of the new project is Project 1 and 
is in the Project Explorer window—you can change the project name by clicking it and then changing to a 
desired name. 
Fig 3.5: warehouse miner project operations 
Having a project selected in the Project Explorer window activates additional icons on the toolbar—the standard 
windows icons for a folder, saving a file and save all. The other two icons are the Add New Analysis and Add 
Existing Analysis icons which are shown below. 
Fig 3.6: warehouse analysis 
Fig 3.7: existing analysis 
Click the Add New Analysis icon to open its dialog 
Fig 3.8: Agenda selection 
www.irjes.org 4 | Page
Mining Materializations In Dissimilar Stages 
The types of analysis are listed in the Categories pane of the Add New Analysis dialog. Note the 
default Analysis name in the textbox near the bottom of the dialog. Because the type of analysis selected is 
ADS, the default name is BuildADS1—you can build an Analysis Data Set (ADS) if needed. Click the 
Descriptive Statistics category to view the types of analysis available in this category. 
Fig 3.9: new analysis 
DM is the goal of this illustration so click Analytics in the Categories pane. 
Fig 3.10: analysis wizard 
Six data mining techniques are available. Decision Trees, Linear Regression and Logistic Regression 
are directed or supervised data mining techniques. This means that they have a target variable (dependent 
variable) and thus are sometimes called predictive models. Association analysis and clustering are undirected or 
unsupervised data mining techniques and do not have a target or dependent variable. 
The data for this example will be from a table (WAREHOUSE) in the ua_samsclub database and 
consists of more than 4 million rows of data. This table was created by joining desired columns from a number 
of tables in the ua_samsclub data. The data mining problem is to build a model to predict the member status 
code based on other variables in the Warehouse table. 
Click the Decision Tree icon in the Analyses pane and provide an Analysis name. Use the naming convention 
of adding your initials to the end of the Analysis name of MemberStatusiii if you wish. For this example, the 
Analysis name will be Member Status DED. Click the OK button. 
Fig 3.11: miner operation 
The Input source as Table and enter the database in the Available Databases: drop down list box and 
then press the Enter key. It is important to press the Enter key as it populates the Available Tables: drop down 
list box with all the tables in the database. In this example, the desired table is WAREHOUSE. 
www.irjes.org 5 | Page
Mining Materializations In Dissimilar Stages 
Selecting the table WAREHOUSE from the Available Columns drop down list box will populate the 
Available Columns: pane with the columns of the table. The Selected Columns: pane is context dependent— 
in this case, it contains an area for the dependent column and an area for the Independent columns because the 
data mining technique is a decision tree. 
As shown below, WAREHOUSE has been selected as the table. 
Fig 3.12: Selection Window 
After clicking the arrow to move the selected column (MEMBER_STATUS_CD) from the Available 
Columns pane to the to the Dependent Column pane, the arrow is grayed out. This is because this data mining 
model can only have one target or dependent variable. 
The user will then need to click the Independent Columns bar so it will have the focus which allows 
moving selected columns from the Available Columns pane to the Independent Columns pane. 
Note that the transfer arrow will not activate until selection of one or more columns from the Available Columns 
pane is selected. Multiple columns can be selected and moved in a single action. Nine columns have been 
selected Independent Columns as shown. This completes the data selection process—clicks the analysis 
parameters tab. 
Fig 3.13: Column Selection 
The goal of decision tree analysis is to initially analyze each independent variable to determine which 
variable best predicts the target or dependent column (variable). Different algorithms are used to determine the 
predictive accuracy of variables—the default in this case is the Gain Ratio method. Click the drop down box to 
see that three other options are available—Gini Index, Chaid, and Regression Trees. For this example, accept 
the default splitting method of Gain Ratio. This process is recursive and at each node, all variables are again 
examined to determine which one provides the best predictive power – a variable may be split more than once at 
different nodes in the decision tree. 
www.irjes.org 6 | Page
Mining Materializations In Dissimilar Stages 
Fig 3.14: Decision Tree 
For this example, change the Minimum Split Count from 2 to 50 and the Maximum Depth from 100 to 50. 
Accept all the other defaults. Click the expert options just to view possibilities. In this case, the only option is 
the amount of data to be held in memory for processing. There is no need to change the default value. 
Check the Execution Status Window at the bottom. Note that there are more than 4 million rows in the 
WAREHOUSE table. Also, the status is Executing. 
When execution is completed, the Executing Status will change to Complete. Be patient as the model will take 
a few minutes to run—note that at each node, all the variables are evaluated as a candidate for splitting and there 
are more than 4 million rows for the 9 independent variables. 
After the model is complete, the RESULTS tab becomes accessible. Click the RESULTS tab. 
Note that the model has an accuracy of 92.31% and the default tab is the reports tab. On the left under the 
Decision Tree Report, you can click variables or the Confusion Matrix. 
Click Variables to notice only 6 independent variables are listed—recall that the original model included 9 
independent variables. Which variables have been dropped? 
www.irjes.org 7 | Page
Mining Materializations In Dissimilar Stages 
Fig 3.15: Variable window 
Click Confusion Matrix and review the results. 
Fig 3.16: Confusion Matrix 
www.irjes.org 8 | Page 
Click the graphs tab. 
Fig 3.17: Graph 
A high-level visual is provided without any detail in the upper window. Click any box in the upper 
window to get detail in the lower window or just browse the lower graph. Nodes are by default colored yellow 
and green (green being a leaf node which means no other nodes are split from it). 
Fig 3.18 Node Details 
Click the text tree tab and then click on one of the rules—a Node Details dialog appears. It is a leaf 
node green and is node 5 if you want to reviews the data from the tree browser. 
You can save the model by clicking the multiple disks on the toolbar. 
IV. ASSOCIATION ANALYSIS 
The following example is for association analysis that uses the warehouse table of ua_samsclub. The 
data format for Association Analysis is a transaction format—consisting of two columns; a customer and a 
product. Thus, if a customer purchases 4 items, then there would be 4 rows with the customer repeated. 
The equivalent of this for the warehouse table is visit_nbr and Primary_desc. See the setup below.
Mining Materializations In Dissimilar Stages 
Fig 4.1: association analysis 
Next, click the analysis parameters tab toward the upper left. Note that the default setting is for one antecedent 
and one consequent. 
Fig 4.2: parameter list 
Click the expert options tab. Enter an SQL statement – in this case, only store number 15. 
Click the run button. When the run is completed, click results—a load table button should appear. 
www.irjes.org 9 | Page
Mining Materializations In Dissimilar Stages 
Click the Load button and review the table. 
www.irjes.org 10 | Page 
Click graph. 
Fig 4.3: Graph 
V. CONCLUSION 
Teradata is the approaching of the Data Mining. In future the whole world we start using Teradata 
Database. Now it is costly, works are successful on to reduce its cost. So, it will reach to small professional 
people also. It is firm occupied location will change organization goal line. 
REFERENCES 
[1]. Teradata Warehouse Miner Model Manager User Guide, B035-2303-093A, September 2013 
[2]. Teradata Warehouse Miner Release Definition, B035-2494-093C, September 2013 
[3]. Teradata Warehouse Miner User Guide, Volume 1 Introduction and Profiling, B035-2300- 093A, 
September 2013 
[4]. Teradata Warehouse Miner User Guide, Volume 2 ADS Generation, B035-2301-093A, September 
2013 
[5]. Teradata Warehouse Miner User Guide, Volume 3 Analytic Functions, B035-2302-093A, September 
2013 
AUTHORS 
First Author- S.Choudaiah Received the MCA POST GRADUATE In 
computer applications from SVU, Tirupati. He is currently working as an 
Associate Professor, in Siddhartha institute of Engineering and Technology 
College, puttur, Andhra Pradesh, India. His field of interest is java 
Technologies and data mining and data ware house

More Related Content

What's hot

Data warehousing and online analytical processing
Data warehousing and online analytical processingData warehousing and online analytical processing
Data warehousing and online analytical processingVijayasankariS
 
Introduction to Dimesional Modelling
Introduction to Dimesional ModellingIntroduction to Dimesional Modelling
Introduction to Dimesional ModellingAshish Chandwani
 
Data Mining In Market Research
Data Mining In Market ResearchData Mining In Market Research
Data Mining In Market Researchkevinlan
 
Dimensional Modelling - Basic Concept
Dimensional Modelling - Basic ConceptDimensional Modelling - Basic Concept
Dimensional Modelling - Basic ConceptFolio3 Software
 
Data mining and data warehouse lab manual updated
Data mining and data warehouse lab manual updatedData mining and data warehouse lab manual updated
Data mining and data warehouse lab manual updatedYugal Kumar
 
DW DIMENSN MODELNG
DW DIMENSN MODELNGDW DIMENSN MODELNG
DW DIMENSN MODELNGDivya Tadi
 
Dimensional Modeling Basic Concept with Example
Dimensional Modeling Basic Concept with ExampleDimensional Modeling Basic Concept with Example
Dimensional Modeling Basic Concept with ExampleSajjad Zaheer
 
Dimensional modelling-mod-3
Dimensional modelling-mod-3Dimensional modelling-mod-3
Dimensional modelling-mod-3Malik Alig
 
Dimensional Modeling
Dimensional ModelingDimensional Modeling
Dimensional ModelingSunita Sahu
 
Research trends in data warehousing and data mining
Research trends in data warehousing and data miningResearch trends in data warehousing and data mining
Research trends in data warehousing and data miningEr. Nawaraj Bhandari
 
2 data warehouse life cycle golfarelli
2 data warehouse life cycle golfarelli2 data warehouse life cycle golfarelli
2 data warehouse life cycle golfarellitruongthuthuy47
 
Data warehousing and business intelligence project report
Data warehousing and business intelligence project reportData warehousing and business intelligence project report
Data warehousing and business intelligence project reportsonalighai
 
Dimensional modeling primer
Dimensional modeling primerDimensional modeling primer
Dimensional modeling primerTerry Bunio
 
On multi dimensional cubes of census data: designing and querying
On multi dimensional cubes of census data: designing and queryingOn multi dimensional cubes of census data: designing and querying
On multi dimensional cubes of census data: designing and queryingJaspreet Issaj
 

What's hot (20)

Data warehousing and online analytical processing
Data warehousing and online analytical processingData warehousing and online analytical processing
Data warehousing and online analytical processing
 
Introduction to Dimesional Modelling
Introduction to Dimesional ModellingIntroduction to Dimesional Modelling
Introduction to Dimesional Modelling
 
Database aggregation using metadata
Database aggregation using metadataDatabase aggregation using metadata
Database aggregation using metadata
 
Data Mining In Market Research
Data Mining In Market ResearchData Mining In Market Research
Data Mining In Market Research
 
Dimensional Modelling - Basic Concept
Dimensional Modelling - Basic ConceptDimensional Modelling - Basic Concept
Dimensional Modelling - Basic Concept
 
Dimensional Modeling
Dimensional ModelingDimensional Modeling
Dimensional Modeling
 
Data mining and data warehouse lab manual updated
Data mining and data warehouse lab manual updatedData mining and data warehouse lab manual updated
Data mining and data warehouse lab manual updated
 
DW DIMENSN MODELNG
DW DIMENSN MODELNGDW DIMENSN MODELNG
DW DIMENSN MODELNG
 
Dimensional Modeling Basic Concept with Example
Dimensional Modeling Basic Concept with ExampleDimensional Modeling Basic Concept with Example
Dimensional Modeling Basic Concept with Example
 
Bi assignment
Bi assignmentBi assignment
Bi assignment
 
Dimensional modelling-mod-3
Dimensional modelling-mod-3Dimensional modelling-mod-3
Dimensional modelling-mod-3
 
Z36149154
Z36149154Z36149154
Z36149154
 
Dimensional Modeling
Dimensional ModelingDimensional Modeling
Dimensional Modeling
 
Research trends in data warehousing and data mining
Research trends in data warehousing and data miningResearch trends in data warehousing and data mining
Research trends in data warehousing and data mining
 
Dimensional Modelling
Dimensional ModellingDimensional Modelling
Dimensional Modelling
 
2 data warehouse life cycle golfarelli
2 data warehouse life cycle golfarelli2 data warehouse life cycle golfarelli
2 data warehouse life cycle golfarelli
 
Data warehousing and business intelligence project report
Data warehousing and business intelligence project reportData warehousing and business intelligence project report
Data warehousing and business intelligence project report
 
Dimensional modeling primer
Dimensional modeling primerDimensional modeling primer
Dimensional modeling primer
 
On multi dimensional cubes of census data: designing and querying
On multi dimensional cubes of census data: designing and queryingOn multi dimensional cubes of census data: designing and querying
On multi dimensional cubes of census data: designing and querying
 
Metadata
MetadataMetadata
Metadata
 

Similar to International Refereed Journal of Engineering and Science (IRJES)

using big-data methods analyse the Cross platform aviation
 using big-data methods analyse the Cross platform aviation using big-data methods analyse the Cross platform aviation
using big-data methods analyse the Cross platform aviationranjit banshpal
 
Data Warehousing AWS 12345
Data Warehousing AWS 12345Data Warehousing AWS 12345
Data Warehousing AWS 12345AkhilSinghal21
 
data mining and data warehousing
data mining and data warehousingdata mining and data warehousing
data mining and data warehousingSunny Gandhi
 
Credit card fraud detection using python machine learning
Credit card fraud detection using python machine learningCredit card fraud detection using python machine learning
Credit card fraud detection using python machine learningSandeep Garg
 
Applying Classification Technique using DID3 Algorithm to improve Decision Su...
Applying Classification Technique using DID3 Algorithm to improve Decision Su...Applying Classification Technique using DID3 Algorithm to improve Decision Su...
Applying Classification Technique using DID3 Algorithm to improve Decision Su...IJMER
 
A SURVEY ON DATA MINING IN STEEL INDUSTRIES
A SURVEY ON DATA MINING IN STEEL INDUSTRIESA SURVEY ON DATA MINING IN STEEL INDUSTRIES
A SURVEY ON DATA MINING IN STEEL INDUSTRIESIJCSES Journal
 
UNIT 2: Part 2: Data Warehousing and Data Mining
UNIT 2: Part 2: Data Warehousing and Data MiningUNIT 2: Part 2: Data Warehousing and Data Mining
UNIT 2: Part 2: Data Warehousing and Data MiningNandakumar P
 
McKinsey Global Institute Big data The next frontier for innova.docx
McKinsey Global Institute Big data The next frontier for innova.docxMcKinsey Global Institute Big data The next frontier for innova.docx
McKinsey Global Institute Big data The next frontier for innova.docxandreecapon
 
GROUP PROJECT REPORT_FY6055_FX7378
GROUP PROJECT REPORT_FY6055_FX7378GROUP PROJECT REPORT_FY6055_FX7378
GROUP PROJECT REPORT_FY6055_FX7378Parag Kapile
 
Practical Machine Learning
Practical Machine LearningPractical Machine Learning
Practical Machine LearningLynn Langit
 
Unit-V-Introduction to Data Mining.pptx
Unit-V-Introduction to  Data Mining.pptxUnit-V-Introduction to  Data Mining.pptx
Unit-V-Introduction to Data Mining.pptxHarsha Patel
 
A Review on Classification of Data Imbalance using BigData
A Review on Classification of Data Imbalance using BigDataA Review on Classification of Data Imbalance using BigData
A Review on Classification of Data Imbalance using BigDataIJMIT JOURNAL
 
A REVIEW ON CLASSIFICATION OF DATA IMBALANCE USING BIGDATA
A REVIEW ON CLASSIFICATION OF DATA IMBALANCE USING BIGDATAA REVIEW ON CLASSIFICATION OF DATA IMBALANCE USING BIGDATA
A REVIEW ON CLASSIFICATION OF DATA IMBALANCE USING BIGDATAIJMIT JOURNAL
 
Top 30 Data Analyst Interview Questions.pdf
Top 30 Data Analyst Interview Questions.pdfTop 30 Data Analyst Interview Questions.pdf
Top 30 Data Analyst Interview Questions.pdfShaikSikindar1
 

Similar to International Refereed Journal of Engineering and Science (IRJES) (20)

Data mining
Data miningData mining
Data mining
 
Data mining
Data miningData mining
Data mining
 
using big-data methods analyse the Cross platform aviation
 using big-data methods analyse the Cross platform aviation using big-data methods analyse the Cross platform aviation
using big-data methods analyse the Cross platform aviation
 
Data Warehousing AWS 12345
Data Warehousing AWS 12345Data Warehousing AWS 12345
Data Warehousing AWS 12345
 
data mining and data warehousing
data mining and data warehousingdata mining and data warehousing
data mining and data warehousing
 
Credit card fraud detection using python machine learning
Credit card fraud detection using python machine learningCredit card fraud detection using python machine learning
Credit card fraud detection using python machine learning
 
Applying Classification Technique using DID3 Algorithm to improve Decision Su...
Applying Classification Technique using DID3 Algorithm to improve Decision Su...Applying Classification Technique using DID3 Algorithm to improve Decision Su...
Applying Classification Technique using DID3 Algorithm to improve Decision Su...
 
A SURVEY ON DATA MINING IN STEEL INDUSTRIES
A SURVEY ON DATA MINING IN STEEL INDUSTRIESA SURVEY ON DATA MINING IN STEEL INDUSTRIES
A SURVEY ON DATA MINING IN STEEL INDUSTRIES
 
Unit i
Unit iUnit i
Unit i
 
UNIT 2: Part 2: Data Warehousing and Data Mining
UNIT 2: Part 2: Data Warehousing and Data MiningUNIT 2: Part 2: Data Warehousing and Data Mining
UNIT 2: Part 2: Data Warehousing and Data Mining
 
Data Mining
Data MiningData Mining
Data Mining
 
Data Mining
Data MiningData Mining
Data Mining
 
McKinsey Global Institute Big data The next frontier for innova.docx
McKinsey Global Institute Big data The next frontier for innova.docxMcKinsey Global Institute Big data The next frontier for innova.docx
McKinsey Global Institute Big data The next frontier for innova.docx
 
data mining
data miningdata mining
data mining
 
GROUP PROJECT REPORT_FY6055_FX7378
GROUP PROJECT REPORT_FY6055_FX7378GROUP PROJECT REPORT_FY6055_FX7378
GROUP PROJECT REPORT_FY6055_FX7378
 
Practical Machine Learning
Practical Machine LearningPractical Machine Learning
Practical Machine Learning
 
Unit-V-Introduction to Data Mining.pptx
Unit-V-Introduction to  Data Mining.pptxUnit-V-Introduction to  Data Mining.pptx
Unit-V-Introduction to Data Mining.pptx
 
A Review on Classification of Data Imbalance using BigData
A Review on Classification of Data Imbalance using BigDataA Review on Classification of Data Imbalance using BigData
A Review on Classification of Data Imbalance using BigData
 
A REVIEW ON CLASSIFICATION OF DATA IMBALANCE USING BIGDATA
A REVIEW ON CLASSIFICATION OF DATA IMBALANCE USING BIGDATAA REVIEW ON CLASSIFICATION OF DATA IMBALANCE USING BIGDATA
A REVIEW ON CLASSIFICATION OF DATA IMBALANCE USING BIGDATA
 
Top 30 Data Analyst Interview Questions.pdf
Top 30 Data Analyst Interview Questions.pdfTop 30 Data Analyst Interview Questions.pdf
Top 30 Data Analyst Interview Questions.pdf
 

More from irjes

Automatic Safety Door Lock System for Car
Automatic Safety Door Lock System for CarAutomatic Safety Door Lock System for Car
Automatic Safety Door Lock System for Carirjes
 
Squared Multi-hole Extrusion Process: Experimentation & Optimization
Squared Multi-hole Extrusion Process: Experimentation & OptimizationSquared Multi-hole Extrusion Process: Experimentation & Optimization
Squared Multi-hole Extrusion Process: Experimentation & Optimizationirjes
 
Analysis of Agile and Multi-Agent Based Process Scheduling Model
Analysis of Agile and Multi-Agent Based Process Scheduling ModelAnalysis of Agile and Multi-Agent Based Process Scheduling Model
Analysis of Agile and Multi-Agent Based Process Scheduling Modelirjes
 
Effects of Cutting Tool Parameters on Surface Roughness
Effects of Cutting Tool Parameters on Surface RoughnessEffects of Cutting Tool Parameters on Surface Roughness
Effects of Cutting Tool Parameters on Surface Roughnessirjes
 
Possible limits of accuracy in measurement of fundamental physical constants
Possible limits of accuracy in measurement of fundamental physical constantsPossible limits of accuracy in measurement of fundamental physical constants
Possible limits of accuracy in measurement of fundamental physical constantsirjes
 
Performance Comparison of Energy Detection Based Spectrum Sensing for Cogniti...
Performance Comparison of Energy Detection Based Spectrum Sensing for Cogniti...Performance Comparison of Energy Detection Based Spectrum Sensing for Cogniti...
Performance Comparison of Energy Detection Based Spectrum Sensing for Cogniti...irjes
 
Comparative Study of Pre-Engineered and Conventional Steel Frames for Differe...
Comparative Study of Pre-Engineered and Conventional Steel Frames for Differe...Comparative Study of Pre-Engineered and Conventional Steel Frames for Differe...
Comparative Study of Pre-Engineered and Conventional Steel Frames for Differe...irjes
 
Flip bifurcation and chaos control in discrete-time Prey-predator model
Flip bifurcation and chaos control in discrete-time Prey-predator model Flip bifurcation and chaos control in discrete-time Prey-predator model
Flip bifurcation and chaos control in discrete-time Prey-predator model irjes
 
Energy Awareness and the Role of “Critical Mass” In Smart Cities
Energy Awareness and the Role of “Critical Mass” In Smart CitiesEnergy Awareness and the Role of “Critical Mass” In Smart Cities
Energy Awareness and the Role of “Critical Mass” In Smart Citiesirjes
 
A Firefly Algorithm for Optimizing Spur Gear Parameters Under Non-Lubricated ...
A Firefly Algorithm for Optimizing Spur Gear Parameters Under Non-Lubricated ...A Firefly Algorithm for Optimizing Spur Gear Parameters Under Non-Lubricated ...
A Firefly Algorithm for Optimizing Spur Gear Parameters Under Non-Lubricated ...irjes
 
The Effect of Orientation of Vortex Generators on Aerodynamic Drag Reduction ...
The Effect of Orientation of Vortex Generators on Aerodynamic Drag Reduction ...The Effect of Orientation of Vortex Generators on Aerodynamic Drag Reduction ...
The Effect of Orientation of Vortex Generators on Aerodynamic Drag Reduction ...irjes
 
An Assessment of The Relationship Between The Availability of Financial Resou...
An Assessment of The Relationship Between The Availability of Financial Resou...An Assessment of The Relationship Between The Availability of Financial Resou...
An Assessment of The Relationship Between The Availability of Financial Resou...irjes
 
The Choice of Antenatal Care and Delivery Place in Surabaya (Based on Prefere...
The Choice of Antenatal Care and Delivery Place in Surabaya (Based on Prefere...The Choice of Antenatal Care and Delivery Place in Surabaya (Based on Prefere...
The Choice of Antenatal Care and Delivery Place in Surabaya (Based on Prefere...irjes
 
Prediction of the daily global solar irradiance received on a horizontal surf...
Prediction of the daily global solar irradiance received on a horizontal surf...Prediction of the daily global solar irradiance received on a horizontal surf...
Prediction of the daily global solar irradiance received on a horizontal surf...irjes
 
HARMONIC ANALYSIS ASSOCIATED WITH A GENERALIZED BESSEL-STRUVE OPERATOR ON THE...
HARMONIC ANALYSIS ASSOCIATED WITH A GENERALIZED BESSEL-STRUVE OPERATOR ON THE...HARMONIC ANALYSIS ASSOCIATED WITH A GENERALIZED BESSEL-STRUVE OPERATOR ON THE...
HARMONIC ANALYSIS ASSOCIATED WITH A GENERALIZED BESSEL-STRUVE OPERATOR ON THE...irjes
 
The Role of Community Participation in Planning Processes of Emerging Urban C...
The Role of Community Participation in Planning Processes of Emerging Urban C...The Role of Community Participation in Planning Processes of Emerging Urban C...
The Role of Community Participation in Planning Processes of Emerging Urban C...irjes
 
Understanding the Concept of Strategic Intent
Understanding the Concept of Strategic IntentUnderstanding the Concept of Strategic Intent
Understanding the Concept of Strategic Intentirjes
 
The (R, Q) Control of A Mixture Inventory Model with Backorders and Lost Sale...
The (R, Q) Control of A Mixture Inventory Model with Backorders and Lost Sale...The (R, Q) Control of A Mixture Inventory Model with Backorders and Lost Sale...
The (R, Q) Control of A Mixture Inventory Model with Backorders and Lost Sale...irjes
 
Relation Between Stress And Menstrual Cycle At 18-21 Years Of Age
Relation Between Stress And Menstrual Cycle At 18-21 Years Of AgeRelation Between Stress And Menstrual Cycle At 18-21 Years Of Age
Relation Between Stress And Menstrual Cycle At 18-21 Years Of Ageirjes
 
Wave Transmission on Submerged Breakwater with Interlocking D-Block Armor
Wave Transmission on Submerged Breakwater with Interlocking D-Block ArmorWave Transmission on Submerged Breakwater with Interlocking D-Block Armor
Wave Transmission on Submerged Breakwater with Interlocking D-Block Armorirjes
 

More from irjes (20)

Automatic Safety Door Lock System for Car
Automatic Safety Door Lock System for CarAutomatic Safety Door Lock System for Car
Automatic Safety Door Lock System for Car
 
Squared Multi-hole Extrusion Process: Experimentation & Optimization
Squared Multi-hole Extrusion Process: Experimentation & OptimizationSquared Multi-hole Extrusion Process: Experimentation & Optimization
Squared Multi-hole Extrusion Process: Experimentation & Optimization
 
Analysis of Agile and Multi-Agent Based Process Scheduling Model
Analysis of Agile and Multi-Agent Based Process Scheduling ModelAnalysis of Agile and Multi-Agent Based Process Scheduling Model
Analysis of Agile and Multi-Agent Based Process Scheduling Model
 
Effects of Cutting Tool Parameters on Surface Roughness
Effects of Cutting Tool Parameters on Surface RoughnessEffects of Cutting Tool Parameters on Surface Roughness
Effects of Cutting Tool Parameters on Surface Roughness
 
Possible limits of accuracy in measurement of fundamental physical constants
Possible limits of accuracy in measurement of fundamental physical constantsPossible limits of accuracy in measurement of fundamental physical constants
Possible limits of accuracy in measurement of fundamental physical constants
 
Performance Comparison of Energy Detection Based Spectrum Sensing for Cogniti...
Performance Comparison of Energy Detection Based Spectrum Sensing for Cogniti...Performance Comparison of Energy Detection Based Spectrum Sensing for Cogniti...
Performance Comparison of Energy Detection Based Spectrum Sensing for Cogniti...
 
Comparative Study of Pre-Engineered and Conventional Steel Frames for Differe...
Comparative Study of Pre-Engineered and Conventional Steel Frames for Differe...Comparative Study of Pre-Engineered and Conventional Steel Frames for Differe...
Comparative Study of Pre-Engineered and Conventional Steel Frames for Differe...
 
Flip bifurcation and chaos control in discrete-time Prey-predator model
Flip bifurcation and chaos control in discrete-time Prey-predator model Flip bifurcation and chaos control in discrete-time Prey-predator model
Flip bifurcation and chaos control in discrete-time Prey-predator model
 
Energy Awareness and the Role of “Critical Mass” In Smart Cities
Energy Awareness and the Role of “Critical Mass” In Smart CitiesEnergy Awareness and the Role of “Critical Mass” In Smart Cities
Energy Awareness and the Role of “Critical Mass” In Smart Cities
 
A Firefly Algorithm for Optimizing Spur Gear Parameters Under Non-Lubricated ...
A Firefly Algorithm for Optimizing Spur Gear Parameters Under Non-Lubricated ...A Firefly Algorithm for Optimizing Spur Gear Parameters Under Non-Lubricated ...
A Firefly Algorithm for Optimizing Spur Gear Parameters Under Non-Lubricated ...
 
The Effect of Orientation of Vortex Generators on Aerodynamic Drag Reduction ...
The Effect of Orientation of Vortex Generators on Aerodynamic Drag Reduction ...The Effect of Orientation of Vortex Generators on Aerodynamic Drag Reduction ...
The Effect of Orientation of Vortex Generators on Aerodynamic Drag Reduction ...
 
An Assessment of The Relationship Between The Availability of Financial Resou...
An Assessment of The Relationship Between The Availability of Financial Resou...An Assessment of The Relationship Between The Availability of Financial Resou...
An Assessment of The Relationship Between The Availability of Financial Resou...
 
The Choice of Antenatal Care and Delivery Place in Surabaya (Based on Prefere...
The Choice of Antenatal Care and Delivery Place in Surabaya (Based on Prefere...The Choice of Antenatal Care and Delivery Place in Surabaya (Based on Prefere...
The Choice of Antenatal Care and Delivery Place in Surabaya (Based on Prefere...
 
Prediction of the daily global solar irradiance received on a horizontal surf...
Prediction of the daily global solar irradiance received on a horizontal surf...Prediction of the daily global solar irradiance received on a horizontal surf...
Prediction of the daily global solar irradiance received on a horizontal surf...
 
HARMONIC ANALYSIS ASSOCIATED WITH A GENERALIZED BESSEL-STRUVE OPERATOR ON THE...
HARMONIC ANALYSIS ASSOCIATED WITH A GENERALIZED BESSEL-STRUVE OPERATOR ON THE...HARMONIC ANALYSIS ASSOCIATED WITH A GENERALIZED BESSEL-STRUVE OPERATOR ON THE...
HARMONIC ANALYSIS ASSOCIATED WITH A GENERALIZED BESSEL-STRUVE OPERATOR ON THE...
 
The Role of Community Participation in Planning Processes of Emerging Urban C...
The Role of Community Participation in Planning Processes of Emerging Urban C...The Role of Community Participation in Planning Processes of Emerging Urban C...
The Role of Community Participation in Planning Processes of Emerging Urban C...
 
Understanding the Concept of Strategic Intent
Understanding the Concept of Strategic IntentUnderstanding the Concept of Strategic Intent
Understanding the Concept of Strategic Intent
 
The (R, Q) Control of A Mixture Inventory Model with Backorders and Lost Sale...
The (R, Q) Control of A Mixture Inventory Model with Backorders and Lost Sale...The (R, Q) Control of A Mixture Inventory Model with Backorders and Lost Sale...
The (R, Q) Control of A Mixture Inventory Model with Backorders and Lost Sale...
 
Relation Between Stress And Menstrual Cycle At 18-21 Years Of Age
Relation Between Stress And Menstrual Cycle At 18-21 Years Of AgeRelation Between Stress And Menstrual Cycle At 18-21 Years Of Age
Relation Between Stress And Menstrual Cycle At 18-21 Years Of Age
 
Wave Transmission on Submerged Breakwater with Interlocking D-Block Armor
Wave Transmission on Submerged Breakwater with Interlocking D-Block ArmorWave Transmission on Submerged Breakwater with Interlocking D-Block Armor
Wave Transmission on Submerged Breakwater with Interlocking D-Block Armor
 

Recently uploaded

Call Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call GirlsCall Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call Girlsssuser7cb4ff
 
Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...VICTOR MAESTRE RAMIREZ
 
Arduino_CSE ece ppt for working and principal of arduino.ppt
Arduino_CSE ece ppt for working and principal of arduino.pptArduino_CSE ece ppt for working and principal of arduino.ppt
Arduino_CSE ece ppt for working and principal of arduino.pptSAURABHKUMAR892774
 
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)dollysharma2066
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVRajaP95
 
Risk Assessment For Installation of Drainage Pipes.pdf
Risk Assessment For Installation of Drainage Pipes.pdfRisk Assessment For Installation of Drainage Pipes.pdf
Risk Assessment For Installation of Drainage Pipes.pdfROCENODodongVILLACER
 
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerStudy on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerAnamika Sarkar
 
Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.eptoze12
 
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfCCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfAsst.prof M.Gokilavani
 
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...VICTOR MAESTRE RAMIREZ
 
Electronically Controlled suspensions system .pdf
Electronically Controlled suspensions system .pdfElectronically Controlled suspensions system .pdf
Electronically Controlled suspensions system .pdfme23b1001
 
An experimental study in using natural admixture as an alternative for chemic...
An experimental study in using natural admixture as an alternative for chemic...An experimental study in using natural admixture as an alternative for chemic...
An experimental study in using natural admixture as an alternative for chemic...Chandu841456
 
What are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxWhat are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxwendy cai
 
Past, Present and Future of Generative AI
Past, Present and Future of Generative AIPast, Present and Future of Generative AI
Past, Present and Future of Generative AIabhishek36461
 
Concrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptxConcrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptxKartikeyaDwivedi3
 
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxDecoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxJoão Esperancinha
 
IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024Mark Billinghurst
 

Recently uploaded (20)

Call Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call GirlsCall Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call Girls
 
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Serviceyoung call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
 
Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...
 
Arduino_CSE ece ppt for working and principal of arduino.ppt
Arduino_CSE ece ppt for working and principal of arduino.pptArduino_CSE ece ppt for working and principal of arduino.ppt
Arduino_CSE ece ppt for working and principal of arduino.ppt
 
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
 
Risk Assessment For Installation of Drainage Pipes.pdf
Risk Assessment For Installation of Drainage Pipes.pdfRisk Assessment For Installation of Drainage Pipes.pdf
Risk Assessment For Installation of Drainage Pipes.pdf
 
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerStudy on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
 
Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.
 
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfCCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
 
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
 
Electronically Controlled suspensions system .pdf
Electronically Controlled suspensions system .pdfElectronically Controlled suspensions system .pdf
Electronically Controlled suspensions system .pdf
 
An experimental study in using natural admixture as an alternative for chemic...
An experimental study in using natural admixture as an alternative for chemic...An experimental study in using natural admixture as an alternative for chemic...
An experimental study in using natural admixture as an alternative for chemic...
 
young call girls in Green Park🔝 9953056974 🔝 escort Service
young call girls in Green Park🔝 9953056974 🔝 escort Serviceyoung call girls in Green Park🔝 9953056974 🔝 escort Service
young call girls in Green Park🔝 9953056974 🔝 escort Service
 
What are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxWhat are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptx
 
Past, Present and Future of Generative AI
Past, Present and Future of Generative AIPast, Present and Future of Generative AI
Past, Present and Future of Generative AI
 
Concrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptxConcrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptx
 
Design and analysis of solar grass cutter.pdf
Design and analysis of solar grass cutter.pdfDesign and analysis of solar grass cutter.pdf
Design and analysis of solar grass cutter.pdf
 
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxDecoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
 
IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024
 

International Refereed Journal of Engineering and Science (IRJES)

  • 1. International Refereed Journal of Engineering and Science (IRJES) ISSN (Online) 2319-183X, (Print) 2319-1821 Volume 3, Issue 8 (August 2014), PP.01-10 Mining Materializations In Dissimilar Stages Sapram Choudaiah, Associate Professor, Department of MCA, Siddhartha Institute of Engineering and Technology, Puttur, Chittoor (Dist) , Andhra Pradesh, India. Abstract:-Teradata provide the foundation to give a company the authority to grow, to complete in today’s dynamic marketplace, to achieve the goal of ―transforming transactions into Relationships‖ and to progress the business by accomplishment answer to a new generation of questions. Teradata’s scalability allows the system to grow as the business grows, from gigabytes to terabytes and beyond. Teradata’s unique technology has been proven at customer sites across industries and around the world. Teradata is a large database server that accommodates multiple client applications making inquiries against it concurrently. Various client platforms access the database through network connection across an IBM mainframe channel connection. The ability to manage large amounts of data is accomplished using the concept of parallelism, where in many individual processors perform smaller tasks concurrently to accomplish an operation against a huge repository of data. In Teradata mining ―Teradata miner tool‖ analyze the data in efficient meaner, it produce the reports and graphs by using the Teradata operations. Keywords:- Scalability, parallelism, Teradata miner, Repositories, Terabytes of data I. INTRODUCTION The purpose is not to explain decision trees or other data mining algorithms but rather to focus on how to do a decision tree and association analysis data mining tasks using TWM. Before getting started, a brief overview of data mining is provided. II. SYSTEM STUDY Data mining has many definitions and may be called by other names such as knowledge discovery. It is generally considered to be a part of the umbrella of tasks, tools, techniques etc. within business Intelligence (BI). Many corporate managers consider BI to be the heart of all the processes that support decision making at all levels. A definition of data mining typically includes large datasets, discovering previously unknown knowledge and patterns and that this knowledge is actionable. That what is discovered is not trivial but can be usefully applied. BI and its Data Mining component are receiving considerable attention and fanfare as companies utilize BI for competitive advantage. Different authors may address the data mining tasks slightly different from each other but the following terminology provides a helpful and useful basis for discussing data mining. The data mining tasks are: www.irjes.org 1 | Page  Description  Estimation  Classification  Prediction  Association Analysis  Clustering 2.1 Description Uses descriptive statistics to better understand and profile areas of interest. Thus a variety of well known statistical tools and methods are used for this task—including frequency charts and other graphical output, measures of central tendency and variation. 2.2 Data Mining Tasks with a Target or Dependent Variable Estimation, classification and prediction are data mining tasks that have a target (dependent) variable. Sometimes these, are referred to as predictive analysis; however, many authors reserve the term Prediction to use of models for the future. The terms supervised and directed apply to these data mining tasks. Estimation data mining tasks have an interval level dependent target variable whereas classification data mining tasks have a categorical (symbolic) target variable. An example of an estimation data mining task would be estimating family income based on a number of attributes; whereas a model to place families into the three income brackets of Low, Medium or High would be an example of a classification data mining task. Thus,
  • 2. Mining Materializations In Dissimilar Stages the difference between the two tasks is the type of target variable. When either an estimation data mining task or classification task is used to predict future outcomes, the data mining task becomes one of Prediction. Again, estimation and classification are referred to as predictive models because that would be the typical application of models built for these data mining tasks. In summary, the most important concept is that estimation and classification data mining tasks require a target variable. However, the difference lies in the data type of the target variable. 2.3 Data Mining Algorithms for Directed/Supervised Data Mining Tasks Linear regression models are the most common data mining algorithms for estimation data mining tasks. Of course, linear regression is a very well known and familiar technique. A number of data mining algorithms can be used for classification data mining tasks including logistic regression, decision trees, neural networks, memory based reasoning (k-nearest neighbor), and Naïve Bayes. 2.4 Data Mining Tasks without a Target or Dependent Variable Association Analysis and Clustering are data mining tasks that do not have a target (dependent) variable. Affinity analysis is another term that refers to association analysis and is typically used for market basket analysis (MBA) although association analysis can be used for other areas of study. MBA is essentially analyzing what purchases tend to be purchased together—that is what items tend to have an affinity with other items. Clustering, having no target variable, algorithms attempt to put records into groups based on the record’s attributes. The critical concept is that of similarity—those within a cluster are very similar to each other and not similar with those in another cluster. 2.5 Data mining overview summary The following table summarizes the data mining tasks and associated data mining algorithms. Data mining tasks Target Variable Typical Data Mining Algorithm(s) Description No Statistics, including descriptive, & visualization Estimation Yes Interval Numeric Linear Regression Classification Yes Categoric al Logistic Regression, Decision Trees, Neural Networks, Memory Based Reasoning, Naïve Bayes Prediction Yes Estimation and Classification models for prediction Association Analysis No Affinity Analysis (Market Basket Analysis) Clustering No k-means, Kohonen Self Organizing Maps (SOM) Table 1.1: Data mining overview www.irjes.org 2 | Page
  • 3. Mining Materializations In Dissimilar Stages III. TERADATA WAREHOUSE MINER To become a TUN member and details for access to the University of Arkansas Teradata resources, see the ―How to Become a TUN member…‖ document which includes a link to remote access details. Using the icon on the Desktop or the All Programs menu, to open Teradata Warehouse Miner 5.0. Note that TWM is Disconnected when initially opened. Click the Open Connection icon as shown to open the Fig 3.1: Teradata miner Select Data Source dialog. Fig 3.2: ODBC Connections The System Data Source tab should be the default tab; if not, click it. Then locate and select Walton College Teradata from the list of names—only name In this example. Click the OK button to open the logon dialog. Fig 3.3: Teradata Login Note that the grayed out icon to the right of the Open Connection icon is now yellow and the word Disconnected has been replaced with the name of the data source to which the system is now connected. Also displayed is the tooltip for the Add New Project icon on the toolbar. The File menu can also be used to create a new project. www.irjes.org 3 | Page
  • 4. Mining Materializations In Dissimilar Stages Fig 3.4: warehouse miner project selection Also, note that there is a Project Explorer window to the right and an Execution Status window on the bottom. Open a new project by clicking the Add New Project icon—the default name of the new project is Project 1 and is in the Project Explorer window—you can change the project name by clicking it and then changing to a desired name. Fig 3.5: warehouse miner project operations Having a project selected in the Project Explorer window activates additional icons on the toolbar—the standard windows icons for a folder, saving a file and save all. The other two icons are the Add New Analysis and Add Existing Analysis icons which are shown below. Fig 3.6: warehouse analysis Fig 3.7: existing analysis Click the Add New Analysis icon to open its dialog Fig 3.8: Agenda selection www.irjes.org 4 | Page
  • 5. Mining Materializations In Dissimilar Stages The types of analysis are listed in the Categories pane of the Add New Analysis dialog. Note the default Analysis name in the textbox near the bottom of the dialog. Because the type of analysis selected is ADS, the default name is BuildADS1—you can build an Analysis Data Set (ADS) if needed. Click the Descriptive Statistics category to view the types of analysis available in this category. Fig 3.9: new analysis DM is the goal of this illustration so click Analytics in the Categories pane. Fig 3.10: analysis wizard Six data mining techniques are available. Decision Trees, Linear Regression and Logistic Regression are directed or supervised data mining techniques. This means that they have a target variable (dependent variable) and thus are sometimes called predictive models. Association analysis and clustering are undirected or unsupervised data mining techniques and do not have a target or dependent variable. The data for this example will be from a table (WAREHOUSE) in the ua_samsclub database and consists of more than 4 million rows of data. This table was created by joining desired columns from a number of tables in the ua_samsclub data. The data mining problem is to build a model to predict the member status code based on other variables in the Warehouse table. Click the Decision Tree icon in the Analyses pane and provide an Analysis name. Use the naming convention of adding your initials to the end of the Analysis name of MemberStatusiii if you wish. For this example, the Analysis name will be Member Status DED. Click the OK button. Fig 3.11: miner operation The Input source as Table and enter the database in the Available Databases: drop down list box and then press the Enter key. It is important to press the Enter key as it populates the Available Tables: drop down list box with all the tables in the database. In this example, the desired table is WAREHOUSE. www.irjes.org 5 | Page
  • 6. Mining Materializations In Dissimilar Stages Selecting the table WAREHOUSE from the Available Columns drop down list box will populate the Available Columns: pane with the columns of the table. The Selected Columns: pane is context dependent— in this case, it contains an area for the dependent column and an area for the Independent columns because the data mining technique is a decision tree. As shown below, WAREHOUSE has been selected as the table. Fig 3.12: Selection Window After clicking the arrow to move the selected column (MEMBER_STATUS_CD) from the Available Columns pane to the to the Dependent Column pane, the arrow is grayed out. This is because this data mining model can only have one target or dependent variable. The user will then need to click the Independent Columns bar so it will have the focus which allows moving selected columns from the Available Columns pane to the Independent Columns pane. Note that the transfer arrow will not activate until selection of one or more columns from the Available Columns pane is selected. Multiple columns can be selected and moved in a single action. Nine columns have been selected Independent Columns as shown. This completes the data selection process—clicks the analysis parameters tab. Fig 3.13: Column Selection The goal of decision tree analysis is to initially analyze each independent variable to determine which variable best predicts the target or dependent column (variable). Different algorithms are used to determine the predictive accuracy of variables—the default in this case is the Gain Ratio method. Click the drop down box to see that three other options are available—Gini Index, Chaid, and Regression Trees. For this example, accept the default splitting method of Gain Ratio. This process is recursive and at each node, all variables are again examined to determine which one provides the best predictive power – a variable may be split more than once at different nodes in the decision tree. www.irjes.org 6 | Page
  • 7. Mining Materializations In Dissimilar Stages Fig 3.14: Decision Tree For this example, change the Minimum Split Count from 2 to 50 and the Maximum Depth from 100 to 50. Accept all the other defaults. Click the expert options just to view possibilities. In this case, the only option is the amount of data to be held in memory for processing. There is no need to change the default value. Check the Execution Status Window at the bottom. Note that there are more than 4 million rows in the WAREHOUSE table. Also, the status is Executing. When execution is completed, the Executing Status will change to Complete. Be patient as the model will take a few minutes to run—note that at each node, all the variables are evaluated as a candidate for splitting and there are more than 4 million rows for the 9 independent variables. After the model is complete, the RESULTS tab becomes accessible. Click the RESULTS tab. Note that the model has an accuracy of 92.31% and the default tab is the reports tab. On the left under the Decision Tree Report, you can click variables or the Confusion Matrix. Click Variables to notice only 6 independent variables are listed—recall that the original model included 9 independent variables. Which variables have been dropped? www.irjes.org 7 | Page
  • 8. Mining Materializations In Dissimilar Stages Fig 3.15: Variable window Click Confusion Matrix and review the results. Fig 3.16: Confusion Matrix www.irjes.org 8 | Page Click the graphs tab. Fig 3.17: Graph A high-level visual is provided without any detail in the upper window. Click any box in the upper window to get detail in the lower window or just browse the lower graph. Nodes are by default colored yellow and green (green being a leaf node which means no other nodes are split from it). Fig 3.18 Node Details Click the text tree tab and then click on one of the rules—a Node Details dialog appears. It is a leaf node green and is node 5 if you want to reviews the data from the tree browser. You can save the model by clicking the multiple disks on the toolbar. IV. ASSOCIATION ANALYSIS The following example is for association analysis that uses the warehouse table of ua_samsclub. The data format for Association Analysis is a transaction format—consisting of two columns; a customer and a product. Thus, if a customer purchases 4 items, then there would be 4 rows with the customer repeated. The equivalent of this for the warehouse table is visit_nbr and Primary_desc. See the setup below.
  • 9. Mining Materializations In Dissimilar Stages Fig 4.1: association analysis Next, click the analysis parameters tab toward the upper left. Note that the default setting is for one antecedent and one consequent. Fig 4.2: parameter list Click the expert options tab. Enter an SQL statement – in this case, only store number 15. Click the run button. When the run is completed, click results—a load table button should appear. www.irjes.org 9 | Page
  • 10. Mining Materializations In Dissimilar Stages Click the Load button and review the table. www.irjes.org 10 | Page Click graph. Fig 4.3: Graph V. CONCLUSION Teradata is the approaching of the Data Mining. In future the whole world we start using Teradata Database. Now it is costly, works are successful on to reduce its cost. So, it will reach to small professional people also. It is firm occupied location will change organization goal line. REFERENCES [1]. Teradata Warehouse Miner Model Manager User Guide, B035-2303-093A, September 2013 [2]. Teradata Warehouse Miner Release Definition, B035-2494-093C, September 2013 [3]. Teradata Warehouse Miner User Guide, Volume 1 Introduction and Profiling, B035-2300- 093A, September 2013 [4]. Teradata Warehouse Miner User Guide, Volume 2 ADS Generation, B035-2301-093A, September 2013 [5]. Teradata Warehouse Miner User Guide, Volume 3 Analytic Functions, B035-2302-093A, September 2013 AUTHORS First Author- S.Choudaiah Received the MCA POST GRADUATE In computer applications from SVU, Tirupati. He is currently working as an Associate Professor, in Siddhartha institute of Engineering and Technology College, puttur, Andhra Pradesh, India. His field of interest is java Technologies and data mining and data ware house