SlideShare a Scribd company logo
1 of 31
Declarative Transfer Learning
from Deep CNNs at Scale
Supun Nakandala and Arun Kumar
{snakanda, arunkk}@eng.ucsd.edu
Growth of unstructured data
Source: ETC@USC
65.5 53
18.1 10.3 5.1
0
50
100
%
What type of data is used by Data
Scientists?
Relation Data Text Data Image Data
Other Video Data
Source: 2017 Kaggle Survey
Dark Data
2
Data growth mainly driven by
unstructured data
e-Commerce Healthcare Social Media
unstructured
structured
Deep Convolution Neural Networks (CNN) provide opportunities to
holistically integrate image data with analytics pipelines.
Opportunity: CNN
3
4
CNN: Hierarchical Feature Extractors
Low level features Mid level features High level features
CNN: Training Limitations
Lot of labelled training data Lot of compute power Time consuming
“Dark art” of
hyperparameter
tuning
5
“Transfer Learning” mitigates
these limitations
Outline
6
Example and Motivations
Our System Vista
Experimental Evaluation
Brand Tags Price
Transfer Learning: CNNs for the other 90%
Structured Data
Image Data
Train ML
Model
Evaluate
7
Pre-trained CNN
Brand Tags Price
Transfer Learning: Bottleneck
Structured Data
Image Data
Train ML
Model
Evaluate
8
Which layer will result in the best
accuracy?
Brand Tags Price
Transfer Learning: Current Practice
Structured Data
Image Data
Train ML
Model
Evaluate
9
+ Efficient CNN inference
- Doesn’t support scalable
processing
+ Scalable processing
+ Fault tolerant
- No support for CNNs
Problems with Current Practice
Usability: Manual management of CNN features.
Efficiency: From image inference for all feature layers has
computational redundancies.
Reliability: CNN layers are big, requires careful memory configuration.
Disk spills
System crashes!
10
l1 l2 l3
Outline
11
Example and Motivations
Our System Vista
Overview
System Architecture
System Optimizations
Experimental Evaluation
Vista: Overview
12
Vista is a declarative system for scalable feature transfer from
deep CNNs for multimodal analytics.
Vista takes in:
Brand Tags Price
Structured Data Image Data
l1 l2 l3
Pre-trained CNN and layers of
interest
ML model
Vista optimizes the CNN feature transfer workload and reliably
runs it.
Outline
13
Example and Motivations
Our System Vista
Overview
System Architecture
System Optimizations
Experimental Evaluation
Vista: Architecture
Declarative API Benefit: Usability
Optimizer
Logical Plan Optimizations
Physical Plan Optimizations
System Configuration Optimizations
Benefit: Efficiency
and Reliability
Pre-trained
CNNs
Benefit: Efficiency,
Scalability, and Fault
Tolerance 14
Outline
15
Our System Vista
System Optimizations
Logical Plan Optimizations
Physical Plan Optimizations
System Configuration Optimizations
Current Practice: Repeated Inference
16
Structured Data {S}
Images
l1 l2 l3
Image Features {l1}
⋈
Multimodal Features {S, l1}
ML Model Lazy Materialization
Problem: Repeated inferences
Extract all layers in one go
17
Structured Data {S}
Images
l1 l2 l3
Image Features {l1, l2, l3}
⋈
Multimodal Features {S, l1, l2, l3}
Eager MaterializationML Model
{S, l1}
ML Model
{S, l2}
ML Model
{S, l3} Problem: High Memory
Footprint
- disk spills/cache misses
- system crashes!
Our Novel Plan: Staged CNN Inference
18
Structured Data {S}
Images
l1 l2 l3
Image Features {l1}
Staged Materialization
ML Model
Benefits: No redundant
computations, minimum
memory footprint
⋈
Multimodal Features {S, l1} Multimodal Features
{S, l2}
ML Model
l1 l2 l3
Partial CNN Operator
Our Novel Plan: Staged CNN Inference
19
Structured Data {S}
Images
l1 l2 l3
Image Features {l1}
Staged Materialization
ML Model
Benefits: No redundant
computations, minimum
memory footprint
⋈
Multimodal Features {S, l1} Multimodal Features
{S, l2}
ML Model
l1 l2 l3
Partial CNN Operator
Problem: High join overhead
14 KB
784 KB
Our Novel Plan: Staged CNN Inference - Reordered
20
Images
l1 l2 l3
Image Features {l1}
Staged Materialization
ML Model
Benefits: No redundant
computations, minimum
memory footprint
Multimodal Features {S, l1}
Structured Data {S}
⋈
Multimodal Features
{S, l2}
ML Model
l1 l2 l3
Partial CNN Operator
Problem: High join overhead
Outline
21
Our System Vista
System Optimizations
Logical Plan Optimizations
Physical Plan Optimizations
System Configuration Optimizations
Vista: Physical Plan Optimizations
Benefit: Vista automatically picks the physical plan choices.
22
Join Operator
Options: Broadcast vs Shuffle join
Trade-Offs: Memory footprint vs Network cost
Storage Format
Options: Compressed vs Uncompressed
Trade-Offs: Memory footprint vs Compute cost
Outline
23
Our System Vista
System Optimizations
Logical Plan Optimizations
Physical Plan Optimizations
System Configuration Optimizations
Vista: System Configuration Optimizations
24
Memory allocation
Query parallelism
Data partition size
Memory Allocation
25
Challenge: Default configurations won’t work
CNN features are big
Non trivial CNN model inference memory
In-Memory
Storage
Core
Memory
User
Memory
OS Reserved
Memory
CNN Inference
Memory
UDF Execution
Query Processing
e.g. join processing Store intermediate data CNN model footprints
Benefit: Vista frees the data scientist from manual memory
and system configuration tuning.
Query Parallelism and Data Partition Size
26
Benefit: Vista sets Query Parallelism to improve utilization
and reduce disk spills.
Query Parallelism
Increase Query Parallelism  Increase CNN Inference
Memory  Less Storage Memory
Data Partition Size
Too big  System Crash, Too small  High overhead
Benefit: Vista sets optimal data partition size to reduce
overheads and avoid crashes.
Outline
27
Example and Motivations
Our System Vista
Experimental Evaluation
Experimental Setup
8 worker nodes
and 1 master node
32 GB RAM
Intel Xeon @ 2.00GHz CPU with 8 cores
300 GB HDD
Version 2.2.0
Runs in standalone mode
Version 1.3.0 28
Dataset & Workloads
Amazon product reviews dataset
ML model:
Logistic Regression for 10 iterations
29
Number of Records 200,000
Number of Structured Features 200 – price, category embedding, review
embedding
Image Image of each product item
Target Predict each product is popular or not
Pre-trained CNNs:
AlexNet – Last 4 layers
VGG16 – Last 3 layers
ResNet50 – Last 5 layers
End-to-end reliability and efficiency
162.47
345.6
314.43
51.1
X
163.37
44.37
X
156.73
16.33 X X16.8
61.7
35.86
0
50
100
150
200
250
300
350
400
AlexNet VGG ResNet
Spark-TF/Amazon
Lazy-1 Lazy-5 Lazy-7 Eager Vista X System crashes
10X
Runtime(min)
Experimental results for other data systems, datasets, and drill-down experiments can be found in our paper.
System crashes due
to mismanaged
memory
30
Summary of Vista
Declarative system for scalable feature transfer from
CNNs.
Thank You!
Project Webpage: https://adalabucsd.github.io/vista.html
Performs DBMS inspired logical plan, physical plan, and
system configuration optimizations.
Improves efficiency by up to 90% and avoids unexpected
system crashes.
31

More Related Content

Similar to Vista: A declarative system for large-scale CNN feature transfer for multimodal analytics

2014 IEEE JAVA DATA MINING PROJECT Mining weakly labeled web facial images fo...
2014 IEEE JAVA DATA MINING PROJECT Mining weakly labeled web facial images fo...2014 IEEE JAVA DATA MINING PROJECT Mining weakly labeled web facial images fo...
2014 IEEE JAVA DATA MINING PROJECT Mining weakly labeled web facial images fo...IEEEFINALYEARSTUDENTPROJECT
 
2014 IEEE JAVA DATA MINING PROJECT Mining weakly labeled web facial images fo...
2014 IEEE JAVA DATA MINING PROJECT Mining weakly labeled web facial images fo...2014 IEEE JAVA DATA MINING PROJECT Mining weakly labeled web facial images fo...
2014 IEEE JAVA DATA MINING PROJECT Mining weakly labeled web facial images fo...IEEEMEMTECHSTUDENTSPROJECTS
 
IEEE 2014 JAVA DATA MINING PROJECTS Mining weakly labeled web facial images f...
IEEE 2014 JAVA DATA MINING PROJECTS Mining weakly labeled web facial images f...IEEE 2014 JAVA DATA MINING PROJECTS Mining weakly labeled web facial images f...
IEEE 2014 JAVA DATA MINING PROJECTS Mining weakly labeled web facial images f...IEEEFINALYEARSTUDENTPROJECTS
 
Scalable Learning in Computer Vision
Scalable Learning in Computer VisionScalable Learning in Computer Vision
Scalable Learning in Computer Visionbutest
 
B4UConference_machine learning_deeplearning
B4UConference_machine learning_deeplearningB4UConference_machine learning_deeplearning
B4UConference_machine learning_deeplearningHoa Le
 
Machine learning_ Replicating Human Brain
Machine learning_ Replicating Human BrainMachine learning_ Replicating Human Brain
Machine learning_ Replicating Human BrainNishant Jain
 
Mining weakly labeled web facial images for search based face annotation
Mining weakly labeled web facial images for search based face annotation Mining weakly labeled web facial images for search based face annotation
Mining weakly labeled web facial images for search based face annotation Adz91 Digital Ads Pvt Ltd
 
Faster deep learning solutions from training to inference - Amitai Armon & Ni...
Faster deep learning solutions from training to inference - Amitai Armon & Ni...Faster deep learning solutions from training to inference - Amitai Armon & Ni...
Faster deep learning solutions from training to inference - Amitai Armon & Ni...Codemotion Tel Aviv
 
Improving Resource Availability in Data Center using Deep Learning.pdf
Improving Resource Availability in Data Center using Deep Learning.pdfImproving Resource Availability in Data Center using Deep Learning.pdf
Improving Resource Availability in Data Center using Deep Learning.pdfKundjanasith Thonglek
 
Guiding through a typical Machine Learning Pipeline
Guiding through a typical Machine Learning PipelineGuiding through a typical Machine Learning Pipeline
Guiding through a typical Machine Learning PipelineMichael Gerke
 
Online movie ticket booking
Online movie ticket bookingOnline movie ticket booking
Online movie ticket bookingmrinnovater007
 
AI powered emotion recognition: From Inception to Production - Global AI Conf...
AI powered emotion recognition: From Inception to Production - Global AI Conf...AI powered emotion recognition: From Inception to Production - Global AI Conf...
AI powered emotion recognition: From Inception to Production - Global AI Conf...Vandana Kannan
 
AI powered emotion recognition: From Inception to Production - Global AI Conf...
AI powered emotion recognition: From Inception to Production - Global AI Conf...AI powered emotion recognition: From Inception to Production - Global AI Conf...
AI powered emotion recognition: From Inception to Production - Global AI Conf...Apache MXNet
 
Strata + Hadoop 2015 Slides
Strata + Hadoop 2015 SlidesStrata + Hadoop 2015 Slides
Strata + Hadoop 2015 SlidesJun Liu
 
Intel's Machine Learning Strategy
Intel's Machine Learning StrategyIntel's Machine Learning Strategy
Intel's Machine Learning Strategyinside-BigData.com
 
Developing Tools for “What if…” Testing of Large-scale Software Systems
Developing Tools for “What if…” Testing of Large-scale Software SystemsDeveloping Tools for “What if…” Testing of Large-scale Software Systems
Developing Tools for “What if…” Testing of Large-scale Software Systems James Hill
 
Machine Learning AND Deep Learning for OpenPOWER
Machine Learning AND Deep Learning for OpenPOWERMachine Learning AND Deep Learning for OpenPOWER
Machine Learning AND Deep Learning for OpenPOWERGanesan Narayanasamy
 
Movie recommendation Engine using Artificial Intelligence
Movie recommendation Engine using Artificial IntelligenceMovie recommendation Engine using Artificial Intelligence
Movie recommendation Engine using Artificial IntelligenceHarivamshi D
 
Amazon SageMaker 內建機器學習演算法 (Level 400)
Amazon SageMaker 內建機器學習演算法 (Level 400)Amazon SageMaker 內建機器學習演算法 (Level 400)
Amazon SageMaker 內建機器學習演算法 (Level 400)Amazon Web Services
 

Similar to Vista: A declarative system for large-scale CNN feature transfer for multimodal analytics (20)

2014 IEEE JAVA DATA MINING PROJECT Mining weakly labeled web facial images fo...
2014 IEEE JAVA DATA MINING PROJECT Mining weakly labeled web facial images fo...2014 IEEE JAVA DATA MINING PROJECT Mining weakly labeled web facial images fo...
2014 IEEE JAVA DATA MINING PROJECT Mining weakly labeled web facial images fo...
 
2014 IEEE JAVA DATA MINING PROJECT Mining weakly labeled web facial images fo...
2014 IEEE JAVA DATA MINING PROJECT Mining weakly labeled web facial images fo...2014 IEEE JAVA DATA MINING PROJECT Mining weakly labeled web facial images fo...
2014 IEEE JAVA DATA MINING PROJECT Mining weakly labeled web facial images fo...
 
IEEE 2014 JAVA DATA MINING PROJECTS Mining weakly labeled web facial images f...
IEEE 2014 JAVA DATA MINING PROJECTS Mining weakly labeled web facial images f...IEEE 2014 JAVA DATA MINING PROJECTS Mining weakly labeled web facial images f...
IEEE 2014 JAVA DATA MINING PROJECTS Mining weakly labeled web facial images f...
 
Scalable Learning in Computer Vision
Scalable Learning in Computer VisionScalable Learning in Computer Vision
Scalable Learning in Computer Vision
 
System mldl meetup
System mldl meetupSystem mldl meetup
System mldl meetup
 
B4UConference_machine learning_deeplearning
B4UConference_machine learning_deeplearningB4UConference_machine learning_deeplearning
B4UConference_machine learning_deeplearning
 
Machine learning_ Replicating Human Brain
Machine learning_ Replicating Human BrainMachine learning_ Replicating Human Brain
Machine learning_ Replicating Human Brain
 
Mining weakly labeled web facial images for search based face annotation
Mining weakly labeled web facial images for search based face annotation Mining weakly labeled web facial images for search based face annotation
Mining weakly labeled web facial images for search based face annotation
 
Faster deep learning solutions from training to inference - Amitai Armon & Ni...
Faster deep learning solutions from training to inference - Amitai Armon & Ni...Faster deep learning solutions from training to inference - Amitai Armon & Ni...
Faster deep learning solutions from training to inference - Amitai Armon & Ni...
 
Improving Resource Availability in Data Center using Deep Learning.pdf
Improving Resource Availability in Data Center using Deep Learning.pdfImproving Resource Availability in Data Center using Deep Learning.pdf
Improving Resource Availability in Data Center using Deep Learning.pdf
 
Guiding through a typical Machine Learning Pipeline
Guiding through a typical Machine Learning PipelineGuiding through a typical Machine Learning Pipeline
Guiding through a typical Machine Learning Pipeline
 
Online movie ticket booking
Online movie ticket bookingOnline movie ticket booking
Online movie ticket booking
 
AI powered emotion recognition: From Inception to Production - Global AI Conf...
AI powered emotion recognition: From Inception to Production - Global AI Conf...AI powered emotion recognition: From Inception to Production - Global AI Conf...
AI powered emotion recognition: From Inception to Production - Global AI Conf...
 
AI powered emotion recognition: From Inception to Production - Global AI Conf...
AI powered emotion recognition: From Inception to Production - Global AI Conf...AI powered emotion recognition: From Inception to Production - Global AI Conf...
AI powered emotion recognition: From Inception to Production - Global AI Conf...
 
Strata + Hadoop 2015 Slides
Strata + Hadoop 2015 SlidesStrata + Hadoop 2015 Slides
Strata + Hadoop 2015 Slides
 
Intel's Machine Learning Strategy
Intel's Machine Learning StrategyIntel's Machine Learning Strategy
Intel's Machine Learning Strategy
 
Developing Tools for “What if…” Testing of Large-scale Software Systems
Developing Tools for “What if…” Testing of Large-scale Software SystemsDeveloping Tools for “What if…” Testing of Large-scale Software Systems
Developing Tools for “What if…” Testing of Large-scale Software Systems
 
Machine Learning AND Deep Learning for OpenPOWER
Machine Learning AND Deep Learning for OpenPOWERMachine Learning AND Deep Learning for OpenPOWER
Machine Learning AND Deep Learning for OpenPOWER
 
Movie recommendation Engine using Artificial Intelligence
Movie recommendation Engine using Artificial IntelligenceMovie recommendation Engine using Artificial Intelligence
Movie recommendation Engine using Artificial Intelligence
 
Amazon SageMaker 內建機器學習演算法 (Level 400)
Amazon SageMaker 內建機器學習演算法 (Level 400)Amazon SageMaker 內建機器學習演算法 (Level 400)
Amazon SageMaker 內建機器學習演算法 (Level 400)
 

Recently uploaded

Rapidoform for Modern Form Building and Insights
Rapidoform for Modern Form Building and InsightsRapidoform for Modern Form Building and Insights
Rapidoform for Modern Form Building and Insightsrapidoform
 
From Knowledge Graphs via Lego Bricks to scientific conversations.pptx
From Knowledge Graphs via Lego Bricks to scientific conversations.pptxFrom Knowledge Graphs via Lego Bricks to scientific conversations.pptx
From Knowledge Graphs via Lego Bricks to scientific conversations.pptxNeo4j
 
Effective Strategies for Wix's Scaling challenges - GeeCon
Effective Strategies for Wix's Scaling challenges - GeeConEffective Strategies for Wix's Scaling challenges - GeeCon
Effective Strategies for Wix's Scaling challenges - GeeConNatan Silnitsky
 
Microsoft365_Dev_Security_2024_05_16.pdf
Microsoft365_Dev_Security_2024_05_16.pdfMicrosoft365_Dev_Security_2024_05_16.pdf
Microsoft365_Dev_Security_2024_05_16.pdfMarkus Moeller
 
BusinessGPT - Security and Governance for Generative AI
BusinessGPT  - Security and Governance for Generative AIBusinessGPT  - Security and Governance for Generative AI
BusinessGPT - Security and Governance for Generative AIAGATSoftware
 
COMPUTER AND ITS COMPONENTS PPT.by naitik sharma Class 9th A mittal internati...
COMPUTER AND ITS COMPONENTS PPT.by naitik sharma Class 9th A mittal internati...COMPUTER AND ITS COMPONENTS PPT.by naitik sharma Class 9th A mittal internati...
COMPUTER AND ITS COMPONENTS PPT.by naitik sharma Class 9th A mittal internati...naitiksharma1124
 
Automate your OpenSIPS config tests - OpenSIPS Summit 2024
Automate your OpenSIPS config tests - OpenSIPS Summit 2024Automate your OpenSIPS config tests - OpenSIPS Summit 2024
Automate your OpenSIPS config tests - OpenSIPS Summit 2024Andreas Granig
 
CERVED e Neo4j su una nuvola, migrazione ed evoluzione di un grafo mission cr...
CERVED e Neo4j su una nuvola, migrazione ed evoluzione di un grafo mission cr...CERVED e Neo4j su una nuvola, migrazione ed evoluzione di un grafo mission cr...
CERVED e Neo4j su una nuvola, migrazione ed evoluzione di un grafo mission cr...Neo4j
 
Navigation in flutter – how to add stack, tab, and drawer navigators to your ...
Navigation in flutter – how to add stack, tab, and drawer navigators to your ...Navigation in flutter – how to add stack, tab, and drawer navigators to your ...
Navigation in flutter – how to add stack, tab, and drawer navigators to your ...Flutter Agency
 
Prompt Engineering - an Art, a Science, or your next Job Title?
Prompt Engineering - an Art, a Science, or your next Job Title?Prompt Engineering - an Art, a Science, or your next Job Title?
Prompt Engineering - an Art, a Science, or your next Job Title?Maxim Salnikov
 
Evolving Data Governance for the Real-time Streaming and AI Era
Evolving Data Governance for the Real-time Streaming and AI EraEvolving Data Governance for the Real-time Streaming and AI Era
Evolving Data Governance for the Real-time Streaming and AI Eraconfluent
 
Workshop - Architecting Innovative Graph Applications- GraphSummit Milan
Workshop -  Architecting Innovative Graph Applications- GraphSummit MilanWorkshop -  Architecting Innovative Graph Applications- GraphSummit Milan
Workshop - Architecting Innovative Graph Applications- GraphSummit MilanNeo4j
 
Community is Just as Important as Code by Andrea Goulet
Community is Just as Important as Code by Andrea GouletCommunity is Just as Important as Code by Andrea Goulet
Community is Just as Important as Code by Andrea GouletAndrea Goulet
 
UNI DI NAPOLI FEDERICO II - Il ruolo dei grafi nell'AI Conversazionale Ibrida
UNI DI NAPOLI FEDERICO II - Il ruolo dei grafi nell'AI Conversazionale IbridaUNI DI NAPOLI FEDERICO II - Il ruolo dei grafi nell'AI Conversazionale Ibrida
UNI DI NAPOLI FEDERICO II - Il ruolo dei grafi nell'AI Conversazionale IbridaNeo4j
 
The Evolution of Web App Testing_ An Ultimate Guide to Future Trends.pdf
The Evolution of Web App Testing_ An Ultimate Guide to Future Trends.pdfThe Evolution of Web App Testing_ An Ultimate Guide to Future Trends.pdf
The Evolution of Web App Testing_ An Ultimate Guide to Future Trends.pdfkalichargn70th171
 
Team Transformation Tactics for Holistic Testing and Quality (NewCrafts Paris...
Team Transformation Tactics for Holistic Testing and Quality (NewCrafts Paris...Team Transformation Tactics for Holistic Testing and Quality (NewCrafts Paris...
Team Transformation Tactics for Holistic Testing and Quality (NewCrafts Paris...Lisi Hocke
 
Abortion Pill Prices Jane Furse ](+27832195400*)[ 🏥 Women's Abortion Clinic i...
Abortion Pill Prices Jane Furse ](+27832195400*)[ 🏥 Women's Abortion Clinic i...Abortion Pill Prices Jane Furse ](+27832195400*)[ 🏥 Women's Abortion Clinic i...
Abortion Pill Prices Jane Furse ](+27832195400*)[ 🏥 Women's Abortion Clinic i...Abortion Clinic
 

Recently uploaded (20)

Rapidoform for Modern Form Building and Insights
Rapidoform for Modern Form Building and InsightsRapidoform for Modern Form Building and Insights
Rapidoform for Modern Form Building and Insights
 
From Knowledge Graphs via Lego Bricks to scientific conversations.pptx
From Knowledge Graphs via Lego Bricks to scientific conversations.pptxFrom Knowledge Graphs via Lego Bricks to scientific conversations.pptx
From Knowledge Graphs via Lego Bricks to scientific conversations.pptx
 
Effective Strategies for Wix's Scaling challenges - GeeCon
Effective Strategies for Wix's Scaling challenges - GeeConEffective Strategies for Wix's Scaling challenges - GeeCon
Effective Strategies for Wix's Scaling challenges - GeeCon
 
Microsoft365_Dev_Security_2024_05_16.pdf
Microsoft365_Dev_Security_2024_05_16.pdfMicrosoft365_Dev_Security_2024_05_16.pdf
Microsoft365_Dev_Security_2024_05_16.pdf
 
BusinessGPT - Security and Governance for Generative AI
BusinessGPT  - Security and Governance for Generative AIBusinessGPT  - Security and Governance for Generative AI
BusinessGPT - Security and Governance for Generative AI
 
Abortion Clinic In Springs ](+27832195400*)[ 🏥 Safe Abortion Pills in Springs...
Abortion Clinic In Springs ](+27832195400*)[ 🏥 Safe Abortion Pills in Springs...Abortion Clinic In Springs ](+27832195400*)[ 🏥 Safe Abortion Pills in Springs...
Abortion Clinic In Springs ](+27832195400*)[ 🏥 Safe Abortion Pills in Springs...
 
COMPUTER AND ITS COMPONENTS PPT.by naitik sharma Class 9th A mittal internati...
COMPUTER AND ITS COMPONENTS PPT.by naitik sharma Class 9th A mittal internati...COMPUTER AND ITS COMPONENTS PPT.by naitik sharma Class 9th A mittal internati...
COMPUTER AND ITS COMPONENTS PPT.by naitik sharma Class 9th A mittal internati...
 
Automate your OpenSIPS config tests - OpenSIPS Summit 2024
Automate your OpenSIPS config tests - OpenSIPS Summit 2024Automate your OpenSIPS config tests - OpenSIPS Summit 2024
Automate your OpenSIPS config tests - OpenSIPS Summit 2024
 
CERVED e Neo4j su una nuvola, migrazione ed evoluzione di un grafo mission cr...
CERVED e Neo4j su una nuvola, migrazione ed evoluzione di un grafo mission cr...CERVED e Neo4j su una nuvola, migrazione ed evoluzione di un grafo mission cr...
CERVED e Neo4j su una nuvola, migrazione ed evoluzione di un grafo mission cr...
 
Abortion Clinic In Pretoria ](+27832195400*)[ 🏥 Safe Abortion Pills in Pretor...
Abortion Clinic In Pretoria ](+27832195400*)[ 🏥 Safe Abortion Pills in Pretor...Abortion Clinic In Pretoria ](+27832195400*)[ 🏥 Safe Abortion Pills in Pretor...
Abortion Clinic In Pretoria ](+27832195400*)[ 🏥 Safe Abortion Pills in Pretor...
 
Navigation in flutter – how to add stack, tab, and drawer navigators to your ...
Navigation in flutter – how to add stack, tab, and drawer navigators to your ...Navigation in flutter – how to add stack, tab, and drawer navigators to your ...
Navigation in flutter – how to add stack, tab, and drawer navigators to your ...
 
Prompt Engineering - an Art, a Science, or your next Job Title?
Prompt Engineering - an Art, a Science, or your next Job Title?Prompt Engineering - an Art, a Science, or your next Job Title?
Prompt Engineering - an Art, a Science, or your next Job Title?
 
Evolving Data Governance for the Real-time Streaming and AI Era
Evolving Data Governance for the Real-time Streaming and AI EraEvolving Data Governance for the Real-time Streaming and AI Era
Evolving Data Governance for the Real-time Streaming and AI Era
 
Workshop - Architecting Innovative Graph Applications- GraphSummit Milan
Workshop -  Architecting Innovative Graph Applications- GraphSummit MilanWorkshop -  Architecting Innovative Graph Applications- GraphSummit Milan
Workshop - Architecting Innovative Graph Applications- GraphSummit Milan
 
Community is Just as Important as Code by Andrea Goulet
Community is Just as Important as Code by Andrea GouletCommunity is Just as Important as Code by Andrea Goulet
Community is Just as Important as Code by Andrea Goulet
 
UNI DI NAPOLI FEDERICO II - Il ruolo dei grafi nell'AI Conversazionale Ibrida
UNI DI NAPOLI FEDERICO II - Il ruolo dei grafi nell'AI Conversazionale IbridaUNI DI NAPOLI FEDERICO II - Il ruolo dei grafi nell'AI Conversazionale Ibrida
UNI DI NAPOLI FEDERICO II - Il ruolo dei grafi nell'AI Conversazionale Ibrida
 
The Evolution of Web App Testing_ An Ultimate Guide to Future Trends.pdf
The Evolution of Web App Testing_ An Ultimate Guide to Future Trends.pdfThe Evolution of Web App Testing_ An Ultimate Guide to Future Trends.pdf
The Evolution of Web App Testing_ An Ultimate Guide to Future Trends.pdf
 
Team Transformation Tactics for Holistic Testing and Quality (NewCrafts Paris...
Team Transformation Tactics for Holistic Testing and Quality (NewCrafts Paris...Team Transformation Tactics for Holistic Testing and Quality (NewCrafts Paris...
Team Transformation Tactics for Holistic Testing and Quality (NewCrafts Paris...
 
Abortion Pill Prices Jane Furse ](+27832195400*)[ 🏥 Women's Abortion Clinic i...
Abortion Pill Prices Jane Furse ](+27832195400*)[ 🏥 Women's Abortion Clinic i...Abortion Pill Prices Jane Furse ](+27832195400*)[ 🏥 Women's Abortion Clinic i...
Abortion Pill Prices Jane Furse ](+27832195400*)[ 🏥 Women's Abortion Clinic i...
 
Abortion Pill Prices Mthatha (@](+27832195400*)[ 🏥 Women's Abortion Clinic In...
Abortion Pill Prices Mthatha (@](+27832195400*)[ 🏥 Women's Abortion Clinic In...Abortion Pill Prices Mthatha (@](+27832195400*)[ 🏥 Women's Abortion Clinic In...
Abortion Pill Prices Mthatha (@](+27832195400*)[ 🏥 Women's Abortion Clinic In...
 

Vista: A declarative system for large-scale CNN feature transfer for multimodal analytics

  • 1. Declarative Transfer Learning from Deep CNNs at Scale Supun Nakandala and Arun Kumar {snakanda, arunkk}@eng.ucsd.edu
  • 2. Growth of unstructured data Source: ETC@USC 65.5 53 18.1 10.3 5.1 0 50 100 % What type of data is used by Data Scientists? Relation Data Text Data Image Data Other Video Data Source: 2017 Kaggle Survey Dark Data 2 Data growth mainly driven by unstructured data e-Commerce Healthcare Social Media unstructured structured
  • 3. Deep Convolution Neural Networks (CNN) provide opportunities to holistically integrate image data with analytics pipelines. Opportunity: CNN 3
  • 4. 4 CNN: Hierarchical Feature Extractors Low level features Mid level features High level features
  • 5. CNN: Training Limitations Lot of labelled training data Lot of compute power Time consuming “Dark art” of hyperparameter tuning 5 “Transfer Learning” mitigates these limitations
  • 6. Outline 6 Example and Motivations Our System Vista Experimental Evaluation
  • 7. Brand Tags Price Transfer Learning: CNNs for the other 90% Structured Data Image Data Train ML Model Evaluate 7 Pre-trained CNN
  • 8. Brand Tags Price Transfer Learning: Bottleneck Structured Data Image Data Train ML Model Evaluate 8 Which layer will result in the best accuracy?
  • 9. Brand Tags Price Transfer Learning: Current Practice Structured Data Image Data Train ML Model Evaluate 9 + Efficient CNN inference - Doesn’t support scalable processing + Scalable processing + Fault tolerant - No support for CNNs
  • 10. Problems with Current Practice Usability: Manual management of CNN features. Efficiency: From image inference for all feature layers has computational redundancies. Reliability: CNN layers are big, requires careful memory configuration. Disk spills System crashes! 10 l1 l2 l3
  • 11. Outline 11 Example and Motivations Our System Vista Overview System Architecture System Optimizations Experimental Evaluation
  • 12. Vista: Overview 12 Vista is a declarative system for scalable feature transfer from deep CNNs for multimodal analytics. Vista takes in: Brand Tags Price Structured Data Image Data l1 l2 l3 Pre-trained CNN and layers of interest ML model Vista optimizes the CNN feature transfer workload and reliably runs it.
  • 13. Outline 13 Example and Motivations Our System Vista Overview System Architecture System Optimizations Experimental Evaluation
  • 14. Vista: Architecture Declarative API Benefit: Usability Optimizer Logical Plan Optimizations Physical Plan Optimizations System Configuration Optimizations Benefit: Efficiency and Reliability Pre-trained CNNs Benefit: Efficiency, Scalability, and Fault Tolerance 14
  • 15. Outline 15 Our System Vista System Optimizations Logical Plan Optimizations Physical Plan Optimizations System Configuration Optimizations
  • 16. Current Practice: Repeated Inference 16 Structured Data {S} Images l1 l2 l3 Image Features {l1} ⋈ Multimodal Features {S, l1} ML Model Lazy Materialization Problem: Repeated inferences
  • 17. Extract all layers in one go 17 Structured Data {S} Images l1 l2 l3 Image Features {l1, l2, l3} ⋈ Multimodal Features {S, l1, l2, l3} Eager MaterializationML Model {S, l1} ML Model {S, l2} ML Model {S, l3} Problem: High Memory Footprint - disk spills/cache misses - system crashes!
  • 18. Our Novel Plan: Staged CNN Inference 18 Structured Data {S} Images l1 l2 l3 Image Features {l1} Staged Materialization ML Model Benefits: No redundant computations, minimum memory footprint ⋈ Multimodal Features {S, l1} Multimodal Features {S, l2} ML Model l1 l2 l3 Partial CNN Operator
  • 19. Our Novel Plan: Staged CNN Inference 19 Structured Data {S} Images l1 l2 l3 Image Features {l1} Staged Materialization ML Model Benefits: No redundant computations, minimum memory footprint ⋈ Multimodal Features {S, l1} Multimodal Features {S, l2} ML Model l1 l2 l3 Partial CNN Operator Problem: High join overhead 14 KB 784 KB
  • 20. Our Novel Plan: Staged CNN Inference - Reordered 20 Images l1 l2 l3 Image Features {l1} Staged Materialization ML Model Benefits: No redundant computations, minimum memory footprint Multimodal Features {S, l1} Structured Data {S} ⋈ Multimodal Features {S, l2} ML Model l1 l2 l3 Partial CNN Operator Problem: High join overhead
  • 21. Outline 21 Our System Vista System Optimizations Logical Plan Optimizations Physical Plan Optimizations System Configuration Optimizations
  • 22. Vista: Physical Plan Optimizations Benefit: Vista automatically picks the physical plan choices. 22 Join Operator Options: Broadcast vs Shuffle join Trade-Offs: Memory footprint vs Network cost Storage Format Options: Compressed vs Uncompressed Trade-Offs: Memory footprint vs Compute cost
  • 23. Outline 23 Our System Vista System Optimizations Logical Plan Optimizations Physical Plan Optimizations System Configuration Optimizations
  • 24. Vista: System Configuration Optimizations 24 Memory allocation Query parallelism Data partition size
  • 25. Memory Allocation 25 Challenge: Default configurations won’t work CNN features are big Non trivial CNN model inference memory In-Memory Storage Core Memory User Memory OS Reserved Memory CNN Inference Memory UDF Execution Query Processing e.g. join processing Store intermediate data CNN model footprints Benefit: Vista frees the data scientist from manual memory and system configuration tuning.
  • 26. Query Parallelism and Data Partition Size 26 Benefit: Vista sets Query Parallelism to improve utilization and reduce disk spills. Query Parallelism Increase Query Parallelism  Increase CNN Inference Memory  Less Storage Memory Data Partition Size Too big  System Crash, Too small  High overhead Benefit: Vista sets optimal data partition size to reduce overheads and avoid crashes.
  • 27. Outline 27 Example and Motivations Our System Vista Experimental Evaluation
  • 28. Experimental Setup 8 worker nodes and 1 master node 32 GB RAM Intel Xeon @ 2.00GHz CPU with 8 cores 300 GB HDD Version 2.2.0 Runs in standalone mode Version 1.3.0 28
  • 29. Dataset & Workloads Amazon product reviews dataset ML model: Logistic Regression for 10 iterations 29 Number of Records 200,000 Number of Structured Features 200 – price, category embedding, review embedding Image Image of each product item Target Predict each product is popular or not Pre-trained CNNs: AlexNet – Last 4 layers VGG16 – Last 3 layers ResNet50 – Last 5 layers
  • 30. End-to-end reliability and efficiency 162.47 345.6 314.43 51.1 X 163.37 44.37 X 156.73 16.33 X X16.8 61.7 35.86 0 50 100 150 200 250 300 350 400 AlexNet VGG ResNet Spark-TF/Amazon Lazy-1 Lazy-5 Lazy-7 Eager Vista X System crashes 10X Runtime(min) Experimental results for other data systems, datasets, and drill-down experiments can be found in our paper. System crashes due to mismanaged memory 30
  • 31. Summary of Vista Declarative system for scalable feature transfer from CNNs. Thank You! Project Webpage: https://adalabucsd.github.io/vista.html Performs DBMS inspired logical plan, physical plan, and system configuration optimizations. Improves efficiency by up to 90% and avoids unexpected system crashes. 31