Developing Extensions for RapidMiner …rapidly 
November 17th, 2014 
Sabrina Kirstein
RapidMiner Company Overview 
2 
Easy-to-use, blazing fast, and very easy to integrate with any IT infrastructure 
Support from a thriving communityof contributors creating new extensions and applications 
Processes designed in RapidMiner can be one-click deployedto RapidMiner Server or RapidMiner Cloud 
A unique Marketplacefor independent developers to publish their innovative extensions 
RapidMiner delivers the power of predictive analytics to business users. No programming required. 
More than 60 connectors (incl. SAP, Hadoop, Cloud connectors like Twitter and Zapier) allowing easy access to structured and unstructured data.
RapidMiner History 
3 
Cloud 
•Cloud 
•Hadoop 
Business Source 
•Commercial Editions 
•Community Editions 
•Client and Server 
Open Source 
•Command Line 
•Initial Workbench 
Open Source 
•Complete Workbench 
•CommunityExtensions 
•Marketplace 
Community Growth 
2007 
2010 
2013 
2014 
5,000 
30,000 
150,000 
250,000
RapidMiner Metrics 
4 
60+ 
Employees 
Worldwide 
100+ 
Active Developers 
600+ 
Customers in over 50 Countries 
40,000+ 
Downloads per Month 
35,000+ 
Active Deployments with over 250,000 Users
Product Overview 
5
RapidMiner Studio 
•With access to over 1500 different operators, the Java-based visual environment of RapidMiner allows for rapid data mining process development 
6 
Visual Process Design Environment
Accelerators 
7 
Wizard 
•Selection of data and label (e.g. churn) column. 
•Label column contains missings values if unknown –those will be predicted 
Results 
•Predictions (individuals, churn predictions) 
•Descriptive model 
•Model accuracy and lift chart
RapidMiner Cloud Repository & Execution 
8
RapidMiner Server 
9 
The RapidMiner Server provides enterprise-wide process development and process to web- service conversion with dynamic dashboards and data visualizations.
Extensions and the Marketplace 
10 
http://marketplace.rapidminer.com
ExistingExtensions 
11 
Edda–Extensions for Binominal Text Classification 
Instance selection and Prototype based rules 
RapidMiner Finance and Economics Extension 
Multimedia Mining Extension
RapidMiner Finance and Economics Extension 
Edda–Extensions for Binominal Text Classification 
ExistingExtensions 
Confidential 
12 
Instance selection and Prototype based rules 
Multimedia Mining Extension
Linked Open Data Extension 
•Assume a rating system for books giving us an ISBN number and a rating from 1 to 5 
•Goal: Predict the popularity of new books 
13 
…
Linked Open Data Extension 
•Assume a rating system for books giving us an ISBN number and a rating from 1 to 5 
•Goal: Predict the popularity of new books 
14 
… 
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> 
PREFIX ontology: <http://dbpedia.org/ontology/> 
select distinct ?book ?author ?isbn?country ?abstract ?pages ?language 
where { 
?book rdf:typeontology:Book. 
?book ontology:author?author . 
?book ontology:abstract?abstract . 
?book ontology:isbn?isbn. 
?book ontology:numberOfPages?pages . 
?book ontology:language?language . 
?book ontology:country?country . 
}
Linked Open Data Extension 
•Assume a rating system for books giving us an ISBN number and a rating from 1 to 5 
•Goal: Predict the popularity of new books 
15 
… 
…
Text-/Web-Mining Extensions 
16
Multimedia Mining Extension 
17
WhiBoExtension 
18
MLWizardExtension 
19 
1. Define data location 
2. Evaluation of different models
MLWizardExtension 
20 
3. Load the best model 
4. The process will be designed for you
HowtoextendRapidMiner Studio 
Confidential 21
HowtoextendRapidMiner Studio 
Confidential 22 
gitclone https://github.com/rapidminer/rapidminer-extension-tutorial.gitgradleinstallExtension 
•Live Demo: 
–Extension skeleton 
–Operators 
–Special data objects 
–Advanced Extension elements 
–Accelerators 
•Documentation 
http://www.rapidminer.com/documentation
HowtointegrateRapidMiner 
•By web services: 
23 
Web Service API 
1.Export process as a web 
service in RM Server 
2.Select output format 
(JSON, XML, PNG, …) 
3. 
•HTTP POST to that URL 
•Read process results from HTTP response 
or 
•<iframe> into other Website
HowtointegrateRapidMiner 
•OEM: 
24 
Java 
1.RapidMiner can be easily invoked 
2.Call RapidMiner.init() 
3.Use the code: 
Create processes, run processes or transform data
RapidMinerUSA 
RapidMiner, Inc. (Headquarters) 
10 Fawcett St 
Cambridge, MA 02138 
United States 
E-mailcontact-us@rapidminer.com 
Phone+1 -617 -401 -7708 
Fax+1 -617 -401 -7709 
THANK YOU 
25 
RapidMinerGermany 
RapidMinerGmbH 
StockumerStr. 475 
44227 Dortmund 
Germany 
E-mailcontact-de@rapidminer.com 
Phone+49 -231 -425 786 9-0 
Fax+49 -231 -425 786 9-9 
RapidMinerUK 
RapidMinerLtd. 
QuatroHouse, Frimley Road 
CamberleyGU16 7ER 
United Kingdom 
E-mailcontact-uk@rapidminer.com 
Phone+44 1276 804 426 
Fax+1 -617 -401 –7709 
www.rapidminer.com 
RapidMiner Hungary 
RapidMiner Kft 
Iparutca5 
1095 Budapest 
Hungary 
E-mailcontact-hu@rapidminer.com 
Phone+44 1276 804 426 
Fax+1 -617 -401 -7709

Slides PAPIs.io'14 RapidMiner

  • 1.
    Developing Extensions forRapidMiner …rapidly November 17th, 2014 Sabrina Kirstein
  • 2.
    RapidMiner Company Overview 2 Easy-to-use, blazing fast, and very easy to integrate with any IT infrastructure Support from a thriving communityof contributors creating new extensions and applications Processes designed in RapidMiner can be one-click deployedto RapidMiner Server or RapidMiner Cloud A unique Marketplacefor independent developers to publish their innovative extensions RapidMiner delivers the power of predictive analytics to business users. No programming required. More than 60 connectors (incl. SAP, Hadoop, Cloud connectors like Twitter and Zapier) allowing easy access to structured and unstructured data.
  • 3.
    RapidMiner History 3 Cloud •Cloud •Hadoop Business Source •Commercial Editions •Community Editions •Client and Server Open Source •Command Line •Initial Workbench Open Source •Complete Workbench •CommunityExtensions •Marketplace Community Growth 2007 2010 2013 2014 5,000 30,000 150,000 250,000
  • 4.
    RapidMiner Metrics 4 60+ Employees Worldwide 100+ Active Developers 600+ Customers in over 50 Countries 40,000+ Downloads per Month 35,000+ Active Deployments with over 250,000 Users
  • 5.
  • 6.
    RapidMiner Studio •Withaccess to over 1500 different operators, the Java-based visual environment of RapidMiner allows for rapid data mining process development 6 Visual Process Design Environment
  • 7.
    Accelerators 7 Wizard •Selection of data and label (e.g. churn) column. •Label column contains missings values if unknown –those will be predicted Results •Predictions (individuals, churn predictions) •Descriptive model •Model accuracy and lift chart
  • 8.
  • 9.
    RapidMiner Server 9 The RapidMiner Server provides enterprise-wide process development and process to web- service conversion with dynamic dashboards and data visualizations.
  • 10.
    Extensions and theMarketplace 10 http://marketplace.rapidminer.com
  • 11.
    ExistingExtensions 11 Edda–Extensionsfor Binominal Text Classification Instance selection and Prototype based rules RapidMiner Finance and Economics Extension Multimedia Mining Extension
  • 12.
    RapidMiner Finance andEconomics Extension Edda–Extensions for Binominal Text Classification ExistingExtensions Confidential 12 Instance selection and Prototype based rules Multimedia Mining Extension
  • 13.
    Linked Open DataExtension •Assume a rating system for books giving us an ISBN number and a rating from 1 to 5 •Goal: Predict the popularity of new books 13 …
  • 14.
    Linked Open DataExtension •Assume a rating system for books giving us an ISBN number and a rating from 1 to 5 •Goal: Predict the popularity of new books 14 … PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> PREFIX ontology: <http://dbpedia.org/ontology/> select distinct ?book ?author ?isbn?country ?abstract ?pages ?language where { ?book rdf:typeontology:Book. ?book ontology:author?author . ?book ontology:abstract?abstract . ?book ontology:isbn?isbn. ?book ontology:numberOfPages?pages . ?book ontology:language?language . ?book ontology:country?country . }
  • 15.
    Linked Open DataExtension •Assume a rating system for books giving us an ISBN number and a rating from 1 to 5 •Goal: Predict the popularity of new books 15 … …
  • 16.
  • 17.
  • 18.
  • 19.
    MLWizardExtension 19 1.Define data location 2. Evaluation of different models
  • 20.
    MLWizardExtension 20 3.Load the best model 4. The process will be designed for you
  • 21.
  • 22.
    HowtoextendRapidMiner Studio Confidential22 gitclone https://github.com/rapidminer/rapidminer-extension-tutorial.gitgradleinstallExtension •Live Demo: –Extension skeleton –Operators –Special data objects –Advanced Extension elements –Accelerators •Documentation http://www.rapidminer.com/documentation
  • 23.
    HowtointegrateRapidMiner •By webservices: 23 Web Service API 1.Export process as a web service in RM Server 2.Select output format (JSON, XML, PNG, …) 3. •HTTP POST to that URL •Read process results from HTTP response or •<iframe> into other Website
  • 24.
    HowtointegrateRapidMiner •OEM: 24 Java 1.RapidMiner can be easily invoked 2.Call RapidMiner.init() 3.Use the code: Create processes, run processes or transform data
  • 25.
    RapidMinerUSA RapidMiner, Inc.(Headquarters) 10 Fawcett St Cambridge, MA 02138 United States E-mailcontact-us@rapidminer.com Phone+1 -617 -401 -7708 Fax+1 -617 -401 -7709 THANK YOU 25 RapidMinerGermany RapidMinerGmbH StockumerStr. 475 44227 Dortmund Germany E-mailcontact-de@rapidminer.com Phone+49 -231 -425 786 9-0 Fax+49 -231 -425 786 9-9 RapidMinerUK RapidMinerLtd. QuatroHouse, Frimley Road CamberleyGU16 7ER United Kingdom E-mailcontact-uk@rapidminer.com Phone+44 1276 804 426 Fax+1 -617 -401 –7709 www.rapidminer.com RapidMiner Hungary RapidMiner Kft Iparutca5 1095 Budapest Hungary E-mailcontact-hu@rapidminer.com Phone+44 1276 804 426 Fax+1 -617 -401 -7709