Introducing Fusions
BigML Release: Fusions
BigML, Inc Spring 2018 Release Webinar: Fusions
Fusions Release
Speaker: CHARLES PARKER, PH.D. - VP of Machine Learning Algorithms
Moderator: ATAKAN CETINSOY - VP of Predictive Applications
Questions: Please enter questions into chat box. We will answer some via chat and others at the end of the session
Resources: https://bigml.com/releases
Contact: support@bigml.com
Twitter: @bigmlcom
Much Ado About Fusions
1 Diving into Fusions
2 Some Pros and Cons
3 Aside: Prediction Explanations
4 Aside: New Text Options
1 Diving into Fusions
Mixture of Experts
Prediction!
Ensemble?
Aggregate!
Prediction!
Creating a Fusion
Fusion = Diverse Ensemble
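A minimal sketch of the "aggregate, then predict" idea, assuming the aggregation is a plain average of class probabilities. The three component "models" below are hypothetical hand-written functions standing in for, say, a tree, a logistic regression, and a constant prior; they are not BigML resources, and `fuse` is an illustrative name, not a BigML call.

```python
from typing import Callable, Dict, List

Row = Dict[str, float]
Probabilities = Dict[str, float]
Model = Callable[[Row], Probabilities]


def tree_like(row: Row) -> Probabilities:
    # Hypothetical stand-in for a decision tree: a single hard split.
    if row["age"] > 40:
        return {"yes": 0.9, "no": 0.1}
    return {"yes": 0.2, "no": 0.8}


def linear_like(row: Row) -> Probabilities:
    # Hypothetical stand-in for a smoother, linear-style model.
    p = min(max(row["age"] / 100.0, 0.0), 1.0)
    return {"yes": p, "no": 1.0 - p}


def constant_prior(row: Row) -> Probabilities:
    # Hypothetical stand-in for a very weak model.
    return {"yes": 0.5, "no": 0.5}


def fuse(models: List[Model], row: Row) -> Probabilities:
    """Average the class probabilities of several different models."""
    totals: Dict[str, float] = {}
    for model in models:
        for label, p in model(row).items():
            totals[label] = totals.get(label, 0.0) + p
    return {label: total / len(models) for label, total in totals.items()}


fused = fuse([tree_like, linear_like, constant_prior], {"age": 50})
prediction = max(fused, key=fused.get)  # class with the highest averaged probability
print(fused, prediction)
```

The point of the sketch is only that the components need not be of the same type: anything that returns class probabilities can be averaged in.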
Other Techniques?
Stacking
Boosting
2 Some Pros and Cons
Single Models vs. Fusions
Single Models
• A bit wobbly
• Regions of the input space might have under-performing predictions
• Probably pretty fast
• With OptiML, it’s the best thing we could find
Fusions
• More stable
• Errors tend to be “smoothed out” across the entire input space
• Maybe somewhat slow
• You’ll have to do some additional validation to check performance
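A small simulation of the "more stable" claim, under a strong simplifying assumption: every component model is unbiased and its errors are independent Gaussian noise. Real models trained on the same data have correlated errors, so the actual gain is usually smaller. All names and constants here are illustrative.

```python
import random
import statistics

random.seed(0)  # fixed seed so the run is repeatable

TRUE_VALUE = 10.0  # the quantity every model is trying to predict


def noisy_model() -> float:
    # One "wobbly" single model: the truth plus unit-variance noise.
    return TRUE_VALUE + random.gauss(0.0, 1.0)


def fusion(n_models: int = 5) -> float:
    # A fusion here is just the mean of n independent noisy models.
    return statistics.mean([noisy_model() for _ in range(n_models)])


single_runs = [noisy_model() for _ in range(2000)]
fused_runs = [fusion() for _ in range(2000)]

single_spread = statistics.stdev(single_runs)
fused_spread = statistics.stdev(fused_runs)
print(f"single: {single_spread:.3f}  fusion: {fused_spread:.3f}")
```

Under these assumptions the spread of the fused prediction shrinks roughly with the square root of the number of models, which is the "smoothing out" effect; the cost is that every component has to be evaluated for each prediction.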
What About Performance?
• This is not typically a step that will result in huge performance gains, unless you’ve got significant feature diversity (spoiler alert for aside #2!)
• You’re usually better off feature engineering / acquiring more data
• Do it for stability
• ... or to improve the importance profile (spoiler alert for aside #1!)
3 Aside: Prediction Explanations
Importance of Importance
What’s really important? Does it make sense?
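One way to picture how combining models can change the importance profile, assuming the combined profile is a simple average of each component's field importances. That averaging rule is an assumption made for illustration, not a description of how BigML computes it, and the field names and numbers are made up.

```python
from typing import Dict, List

Importance = Dict[str, float]


def average_importance(profiles: List[Importance]) -> Importance:
    """Average per-field importances; a field a model ignores counts as 0."""
    fields = sorted({field for profile in profiles for field in profile})
    return {
        field: sum(profile.get(field, 0.0) for profile in profiles) / len(profiles)
        for field in fields
    }


# Hypothetical importance profiles from two different model types.
tree_profile = {"age": 0.70, "income": 0.30}
logistic_profile = {"age": 0.40, "income": 0.35, "zip": 0.25}

combined = average_importance([tree_profile, logistic_profile])
print(combined)
```

A single model that leans heavily on one field gets tempered by the others, which makes it easier to ask whether the resulting profile makes sense.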
Prediction Explanation
4 Aside: New Text Options
Improved Text Processing
• 15 new languages, 22 in total
• New stop word options and term filters
• Longer n-grams
Text Options Define Features
• Change the level of stop word removal
• Use multi-grams and single term filtering
• Turn off stemming
• Then, learn models and create a Fusion!
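A rough sketch of why text options define features: the same document yields different feature sets depending on stop word removal and n-gram length. The tokenizer, the tiny stop word list, and the function name `text_features` are all assumptions for illustration; stemming is left out, and none of this reflects BigML's actual text analysis.

```python
from typing import Set

# Tiny illustrative stop word list, not a real language resource.
STOP_WORDS = {"the", "a", "is", "of", "to"}


def text_features(text: str, remove_stop_words: bool = True,
                  max_ngram: int = 1) -> Set[str]:
    """Return the set of n-gram features (n = 1..max_ngram) for one document."""
    tokens = text.lower().split()
    if remove_stop_words:
        tokens = [t for t in tokens if t not in STOP_WORDS]
    features: Set[str] = set()
    for n in range(1, max_ngram + 1):
        for i in range(len(tokens) - n + 1):
            features.add(" ".join(tokens[i:i + n]))
    return features


doc = "the price of the house is high"

# Option set A: stop words removed, single terms only.
unigrams = text_features(doc)

# Option set B: stop words kept, terms plus bigrams.
bigrams = text_features(doc, remove_stop_words=False, max_ngram=2)

print(sorted(unigrams))
print(sorted(bigrams))
```

Models trained on option set A and option set B see different inputs, which is the kind of feature diversity that gives a Fusion something to gain from combining them.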
Learn More
https://bigml.com/releases/spring-2018
https://bigml.com/whatsnew
Questions?
@bigmlcom support@bigml.com
