Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Quick presentation for the OpenML workshop in Eindhoven 2014
1. Manuel Martín Salvador
@draxus
msalvador@bournemouth.ac.uk
OpenML workshop
Eindhoven 21/10/2014
2. Background
● MSc. Computer Engineering
● Master in Soft Computing and Intelligent Systems
Currently
● PhD Student – Automatic and adaptive pre-processing for building
predictive models
● Teaching – Data Mining lab
5. Data preparation and pre-processing
Labour
intensive
tasks
(up to 80% of
a data mining
process)
6. Automating pre-processing
A lot of available techniques
No free lunch
Multiple combinations
Order of pre-processing methods matters
No semantic → some approaches use ontologies
Meta-learning → needs a good database of
experiments
7. Scientific workflow platforms and
repositories with experiments
Software Repository Applications
DiscoveryNet (inactive) -
Kepler - Various
Taverna MyExperiment (open) Bioinformatics
Pegasus - Various
Galaxy - Biomedical
Pipeline Pilot Accelrys (commercial)
* MLComp (“open”) Machine Learning
Weka,MOA,R,RapidMiner OpenML (open) Machine Learning
9. TO DO
How to increase the number of pre-processing methods in OpenML?
- The only way right now is using FilteredClassifier in Weka
- What about R, MOA, RapidMiner?
Improving flow representation
- Right now is difficult to see how components are connected
- Clear distinction of parameters
- What about including Weka flows (XML based) and ADAMS flows?
- PMML support?
Statistics for available data, tasks, flows and runs
Flow recommendation system for a given dataset
[dataset, data characteristics, prediction accuracy, flow_id]
Flow validation before executing it
[dataset, data characteristics, flow characteristics, failure]
10. A little bit further
Adapting flows while processing data streams
- Detecting changes in data characteristics
- Locally checking input/output in each flow component
- Change propagation
- Reducing cost of adaptation
11. Photos CC by Cristina Granados
Visit us!
Data Science Institute @ Bournemouth University