H2O.ai
Machine Intelligence
Meetup Hosted by 6Sense, 9/17/2015
Using H2O GBM for Ad Click Prediction
Company Overview
Company
• Team: 35. Founded in 2012, Mountain View, CA
• Stanford Math & Systems Engineers
Product
• FULLY Open Source Leader in Machine & Deep Learning
• Ease of Use and Smarter Applications
• FULLY Open Source APIs: R, Python, Spark & Hadoop
• Expanding Predictions to Mass Analyst Markets
Executive Team
Sri Satish Ambati // CEO & Co-founder // DataStax
Cliff Click // CTO & Co-founder // Sun, Java HotSpot
Tom Kraljevic // VP of Engineering // Abrizio, Intel
Board of Directors
Jishnu Bhattacharjee // Nexus Ventures
Ash Bhardwaj // Flextronics
Scientific Advisory Council
Trevor Hastie
Stephen Boyd
Rob Tibshirani
Product Overview
• Open Source
• R and Python APIs, Web UI
• Sparkling Water
• Flow interface
• Cutting-edge algorithms
• Smarter applications
Product Overview
Speed Matters!
• Time is valuable
• In-memory is faster
• Intelligence as a service
• High speed AND accuracy
No Sampling
• Scale to big data
• Access data links
• Use all data without sampling
Interactive UI
• Online modeling with H2O Flow
• Model comparison
Cutting-Edge Algos
• Suite of cutting-edge algorithms
• Deep Learning
• NanoFast Scoring Engine
Use Case: Click Prediction
Kaggle Contest
• Overview
• Data
Use Case: Click Prediction
Start with Baseline
Revisit ERD: Features
Aggregations in H2O
• Group By
o Conditional aggregations
o Count records by User
o Sum clicks by query
• Merge
o Join one data frame to another by a key, for example the results of a group by
o E.g. merge the record count by user back into the training frame
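
The group-by-then-merge pattern above can be sketched in plain Python. This is only an illustration of the logic, not H2O code: the toy rows, the column names (`user`, `query`, `click`) and the helper functions `count_by`, `sum_by` and `merge` are all hypothetical. H2O's Python client offers comparable operations on its own frames (`H2OFrame.group_by` and `H2OFrame.merge`), which run in the cluster rather than in local Python.

```python
from collections import defaultdict

# Hypothetical toy rows; the column names are illustrative,
# not the contest schema.
train = [
    {"user": "u1", "query": "shoes", "click": 1},
    {"user": "u1", "query": "boots", "click": 0},
    {"user": "u2", "query": "shoes", "click": 0},
    {"user": "u2", "query": "shoes", "click": 1},
    {"user": "u3", "query": "boots", "click": 0},
]


def count_by(rows, key):
    """Group by `key` and count the records in each group."""
    counts = defaultdict(int)
    for row in rows:
        counts[row[key]] += 1
    return dict(counts)


def sum_by(rows, key, value):
    """Group by `key` and sum the `value` column in each group."""
    totals = defaultdict(int)
    for row in rows:
        totals[row[key]] += row[value]
    return dict(totals)


def merge(rows, lookup, key, new_col, default=0):
    """Left-join a per-key aggregate back onto the rows as a new column."""
    return [{**row, new_col: lookup.get(row[key], default)} for row in rows]


# Group By: count records by user, sum clicks by query.
records_by_user = count_by(train, "user")
clicks_by_query = sum_by(train, "query", "click")

# Merge: attach each aggregate to the training rows by its key.
features = merge(train, records_by_user, "user", "user_records")
features = merge(features, clicks_by_query, "query", "query_clicks")

for row in features:
    print(row)
```

Each training row keeps its original columns and gains one column per aggregate, so the model sees, for instance, how active a user is alongside the raw identifiers.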
Better Features
Better Model
Recipe for More Features
From overview by winner: PDF

400 Million Search Results - Predict Contextual Ad Clicks