Ramin Vakili
&
Amin Afzali
DATA MINING APPLICATIONS
2
RAPID MINER
•What’s Rapid Miner?
•Advantages of Rapid
Miner:
•Special features in this
App:
•Open sours
•Wrote by Java
•Developed since 2001
•Try to support all of the data mining algorithms
•Ability of adding the other open sours data mining to App
•Give report and copy of the implementation of the algorithm
•Perfect GUI
•Match with most out puts of softwares like Excel
•Quick correcting and error detection
•Online video learning
•Help of many functions and suitable wiki in website of developer app
•Ability of running different algorithms in same time and comparing
together
•Compatible with different operation systems like Linux, Windows, Mac
•Text finding tool in App
•Add all of Weka learning algorithm after updating
3
.
•What’s Scipy?
•Advantages of Scipy:
•Disadvantages of
Scipy:
•SciPy (pronounced “Sigh Pie”) is a computing environment
and open source ecosystem of software for the Python
programming language that support some data mining
algorithms.
•For mathematic usage
•Data mining with Scipy is simple and complete because of Python!
•Learning Algorithms in Scipy is not complete yet
4
•What’s Weka?
•Advantages:
• Open source data mining App
• Contain many data mining libraries
• Wrote by Java language in the Waikato university
• Have many learning packages
• Suitable GUI
• Specially for data mining
• Simple workplace
• Suitable tool for text mining
• Ability of running several learning algorithms and
comparing
• Wrote by Java and can run in different platforms
5
&
Similarity Difference
Wrote by Java Poor performance in connecting to files
containing Excel data and data bases
aren’t Java base
Published under the GPL license Reading CSV files in Weka have not
suitable organize
Rapid miner upload many Weka
algorithms in own workplace
Rapid miner have better GUI than Weka
6
WHY WE CHOOSE RAPID MINER?
•Western Europe, with 37 percent of votes
•North America, with 35 percent of votes
•Eastern Europe, with 10 percent votes
•Asia, with 6 percent votes
•Oceania, with 4 percent votes
•Latin America, with 4 percent votes
•Africa and middle East, with 4 percent votes
0 5 10 15 20 25 30 35 40
Rapid Miner
R
Excel
SAS
Waka
27.7
23.3
21.8
13.6
11.8
37.8
29.8
24.3
12.1
14.4
Usage of data mining Apps in 2011 & 2010
2010
2011
7
RAPID MINER DATA MINING TOOL
•Analytical processing
design
•Performance and
flexibility
•Scalability
•Input data format
• More than 50000 download Since 2001
• Rapid-I for companies
• First place in IT challenges Europe named open source
business award
• More users that other in KDnuggets challenge for 4th
time in 2011
• Quik fixes
• Meta data transformation
• Breakpoint
• Rapid miner have more than 500 operator like:
• Data processing
• Modeling
• Text mining
• Web mining
• Opinion mining
• Rapid miner have 20 ways for visualize
• Is like Rational database
• On the fly
• Churn reduction
• Customer retention
• Data flows
8
• Compatibility with data bases like:
• Oracle
• IBM DB2
• Microsoft SQL server
• MySQL
• Excel
• Access
• SPSS
HOW TO INSTALL First go to
www.rapidminer.com
9

_rapid_miner

  • 1.
  • 2.
  • 3.
    RAPID MINER •What’s RapidMiner? •Advantages of Rapid Miner: •Special features in this App: •Open sours •Wrote by Java •Developed since 2001 •Try to support all of the data mining algorithms •Ability of adding the other open sours data mining to App •Give report and copy of the implementation of the algorithm •Perfect GUI •Match with most out puts of softwares like Excel •Quick correcting and error detection •Online video learning •Help of many functions and suitable wiki in website of developer app •Ability of running different algorithms in same time and comparing together •Compatible with different operation systems like Linux, Windows, Mac •Text finding tool in App •Add all of Weka learning algorithm after updating 3
  • 4.
    . •What’s Scipy? •Advantages ofScipy: •Disadvantages of Scipy: •SciPy (pronounced “Sigh Pie”) is a computing environment and open source ecosystem of software for the Python programming language that support some data mining algorithms. •For mathematic usage •Data mining with Scipy is simple and complete because of Python! •Learning Algorithms in Scipy is not complete yet 4
  • 5.
    •What’s Weka? •Advantages: • Opensource data mining App • Contain many data mining libraries • Wrote by Java language in the Waikato university • Have many learning packages • Suitable GUI • Specially for data mining • Simple workplace • Suitable tool for text mining • Ability of running several learning algorithms and comparing • Wrote by Java and can run in different platforms 5
  • 6.
    & Similarity Difference Wrote byJava Poor performance in connecting to files containing Excel data and data bases aren’t Java base Published under the GPL license Reading CSV files in Weka have not suitable organize Rapid miner upload many Weka algorithms in own workplace Rapid miner have better GUI than Weka 6
  • 7.
    WHY WE CHOOSERAPID MINER? •Western Europe, with 37 percent of votes •North America, with 35 percent of votes •Eastern Europe, with 10 percent votes •Asia, with 6 percent votes •Oceania, with 4 percent votes •Latin America, with 4 percent votes •Africa and middle East, with 4 percent votes 0 5 10 15 20 25 30 35 40 Rapid Miner R Excel SAS Waka 27.7 23.3 21.8 13.6 11.8 37.8 29.8 24.3 12.1 14.4 Usage of data mining Apps in 2011 & 2010 2010 2011 7
  • 8.
    RAPID MINER DATAMINING TOOL •Analytical processing design •Performance and flexibility •Scalability •Input data format • More than 50000 download Since 2001 • Rapid-I for companies • First place in IT challenges Europe named open source business award • More users that other in KDnuggets challenge for 4th time in 2011 • Quik fixes • Meta data transformation • Breakpoint • Rapid miner have more than 500 operator like: • Data processing • Modeling • Text mining • Web mining • Opinion mining • Rapid miner have 20 ways for visualize • Is like Rational database • On the fly • Churn reduction • Customer retention • Data flows 8 • Compatibility with data bases like: • Oracle • IBM DB2 • Microsoft SQL server • MySQL • Excel • Access • SPSS
  • 9.
    HOW TO INSTALLFirst go to www.rapidminer.com 9