SlideShare a Scribd company logo
1 of 56
Download to read offline
Sparking Science up with
Research Recommendations
Maya Hristakeva
@mayahhf
Overview
•  What is Mendeley Suggest?
•  Computation Layer
•  Conclusions
Read
&
Organize
Search
&
Discover
Collaborate
&
Network
Experiment
&
Synthesize
Mendeley builds tools to help researchers …
Being the best researcher you can be!
•  Good researchers are on top of their game
•  Large amount of research produced
•  Takes time to get what you need
•  Help researchers by recommending relevant research
Mendeley Suggest
Personalized Article
Recommender
Recommender System Components
information flow (components often built in parallel)
Data
(Feature
Engineering)
Algorithms Business Logic
and Analytics
User Experience
Mendeley Suggest Components (Past)
information flow (components often built in parallel)
Data
(Feature
Engineering)
Algorithms Business Logic
and Analytics
User Experience
Mendeley Suggest Components (Present)
information flow (components often built in parallel)
Data
(Feature
Engineering)
Algorithms Business Logic
and Analytics
User Experience
Mendeley Suggest Components (Goal)
information flow (components often built in parallel)
Data
(Feature
Engineering)
Algorithms Business Logic
and Analytics
User Experience
Overview
•  What is Mendeley Suggest?
•  Computation Layer
–  Algorithms
–  Evaluation
–  Implementations & Performance
•  Conclusions
Personalized Article Recommendations
Input:
User libraries
Output:
Suggested
articles to read
Algorithms:
•  Collaborative Filtering
–  Item-based
–  User-Based
–  Matrix Factorization
•  Content-based
Item-based Collaborative Filtering
Recommend articles that are similar to the ones you read
–  Similarity is based on article co-occurrences in users’ libraries
–  “Users who read x also read y”
User-based Collaborative Filtering
Find users who have similar appreciation for articles as you
–  Similarity is based on users’ libraries overlap
Recommend new articles based on what the users similar to
you read
–  “Users similar to you (based on a, b, c) also read x”
Matrix Factorization CF
2 4 5
5 4 1
5 ? 2
1 5 4
4 2
4 5 1
U
n x k
V
k x m
fij= <Ui*,V*j>
E(U,V) = L(Xij, fij) + R(U,V)
X
n x m
Overview
•  What is Mendeley Suggest?
•  Computation Layer
–  Algorithms
–  Evaluation
–  Implementations
•  Conclusions
Performance
Costly & GoodCostly & Bad
Cheap & GoodCheap & Bad
Costly & GoodCostly & Bad
Cheap & GoodCheap & Bad
Performance
Costly & GoodCostly & Bad
Cheap & GoodCheap & Bad
Performance
How to measure quality?
•  Offline Evaluation
–  Parameter sweep is quick
–  Don’t offend real users
•  Methodology
–  n-fold cross-validation
–  time-based validation
•  Metrics
–  precision, recall and f-measure
–  AUC (area under roc curve), NDCG (normalized discounted cumulative gain)
Overview
•  What is Mendeley Suggest?
•  Computation Layer
–  Algorithms
–  Evaluation
–  Implementations
•  Conclusions
Implementations
Mahout
(Hadoop)
Mendeley
(Hadoop)
Mahout
(Spark)
Mendeley
(Spark)
MLlib
(Spark)
Item-based CF
User-based CF
Matrix
Factorization
Setup
•  EMR Cluster
–  Master: 1 x r3.xlarge instance (4 core, 32GB)
–  Core: 10 x r3.2xlarge instances (8 core, 64GB)
•  Data: user libraries
–  15mil documents >>> 1mil users
–  150mil interactions
•  Offline Evaluation
–  Methodology: time-based evaluation
–  Metric: precision@10
Implementations
Mahout
(Hadoop)
Mendeley
(Hadoop)
Mahout
(Spark)
Mendeley
(Spark)
MLlib
(Spark)
Item-based CF
User-based CF
Matrix
Factorization
Apache Mahout
•  Mahout (out-of-the-box)
–  Item-based CF
•  org.apache.mahout.cf.taste.hadoop.item.RecommenderJob	
  
	
  
–  ALS Matrix Factorization
•  org.apache.mahout.cf.taste.hadoop.als.ParallelALSFactorizationJob	
  
•  org.apache.mahout.cf.taste.hadoop.als.RecommenderJob	
  
	
  
•  Implemented User-based CF on top of Mahout at Mendeley
Costly & GoodCostly & Bad
Cheap & GoodCheap & Bad
Orig. item-based
mahout
Tuned item-based
mahout
-0.5K
(-60%)
Performance
~$125
Costly & GoodCostly & Bad
Cheap & GoodCheap & Bad
Orig. item-based
mahout
Tuned item-based
mahout
-0.5K
(-60%)
Orig. user-based
mahout
Tuned user-based
mahout
-0.1K
(-40%)
Performance
~$125
Costly & GoodCostly & Bad
Cheap & GoodCheap & Bad
Orig. item-based
mahout
Tuned item-based
mahout Orig. user-based
mahout
Tuned user-based
mahout
+150%
-0.2K
(-55%)
-0.7K
(-82%)
Performance
~$125
Mahout Performance
•  Mahout’s recommender is already efficient
–  But your data may have unusual properties
•  We’ve got improvements by
–  Tuning Hadoop’s mapper and reducer allocation over the Recommender Job steps
–  Using an appropriate partitioner
•  Improve quality
–  Mahout provides Item-based CF
–  We have many more items than users
–  Typically, user-based is more appropriate
Implementations
Mahout
(Hadoop)
Mendeley
(Hadoop)
Mahout
(Spark)
Mendeley
(Spark)
MLlib
(Spark)
Item-based CF
User-based CF
Matrix
Factorization
Mahout Spark
•  Co-occurrence Recommenders with Spark
–  Item-Item similarity
•  mahout spark-itemsimilarity
SimilarityAnalysis.cooccurrencesIDSs(ratings,	
  …)	
  
	
  
–  User-User similarity
•  mahout spark-rowsimilarity
SimilarityAnalysis.rowSimilarityIDSs(ratings,	
  …)	
  
	
  
•  Only supports Boolean data and log-likelihood similarity
•  Does not generate actual recommendations
Mahout Spark
•  Could not get to run successfully on our data
•  Got further by tuning parameters but still failed with OOM
–  spark.driver.maxResultSize	
  
–  spark.kryoserializer.buffer.max	
  	
  
–  spark.default.parallelism	
  
–  spark.storage.memoryFraction	
  
	
  
•  Gave best runtime performance on MovieLens datasets
Implementations
Mahout
(Hadoop)
Mendeley
(Hadoop)
Mahout
(Spark)
Mendeley
(Spark)
MLlib
(Spark)
Item-based CF
User-based CF
Matrix
Factorization
Mendeley Spark
•  Started as hack-day project
–  Implement Item-based and User-based CF in Spark
•  Can be implemented in two steps
1.  Compute Item-Item or User-User Similarities
•  given user preferences
2.  Compute Recommendations
•  given similarities and user preferences
Spark: Item-Item Similarity
Spark: Item-Item Similarity
Spark: Item-Item Similarity
Spark: Item-Item Similarity
Spark: Item-Based Recs
Spark: Item-Based Recs
Costly & GoodCostly & Bad
Cheap & GoodCheap & Bad
Tuned IB Mahout
Tuned UB Mahout
Orig. UB Spark
Performance
~$50
Costly & GoodCostly & Bad
Cheap & GoodCheap & Bad
Tuned IB Mahout
Tuned UB Mahout
Orig. UB Spark
Tuned UB Spark
Tuned IB Spark
-0.1K
(-40%)
Performance
~$50
Mendeley Spark Performance
•  Spark implementation of User-based CF performs well
•  Managed to run variation of Item-based CF
–  Uses fewer items per user to recommend similar items to
–  Quality not impacted much
•  We’ve got improvements by tuning
–  Resource allocation
–  Parallelism
–  http://blog.cloudera.com/blog/2015/03/how-to-tune-your-apache-spark-jobs-
part-2/
Implementations
Mahout
(Hadoop)
Mendeley
(Hadoop)
Mahout
(Spark)
Mendeley
(Spark)
MLlib
(Spark)
Item-based CF
User-based CF
Matrix
Factorization
Spark MLlib DimSum
•  DimSum: efficient algorithm for computing all-pairs similarity
–  “Dimension Independent Matrix Square using MapReduce”
–  Contributed by Twitter
•  Replace similarity computation with DimSum
–  Only supports cosine similarity
•  Does not generate actual recommendations
–  Compute recommendations as before
MLlib DimSum Item-Item Similarity
MLlib DimSum User-User Similarity
Costly & GoodCostly & Bad
Cheap & GoodCheap & Bad
Tuned IB Mahout
Tuned UB Mahout
Tuned UB Spark
Tuned IB Spark
UB DimSum
Spark MLlib
Performance
~$50
Spark MLlib Matrix Factorization
Implements alternating least squares (ALS)
1.  Compute Model
2.  Compute Recommendations
Costly & GoodCostly & Bad
Cheap & GoodCheap & Bad
Tuned IB Mahout
Tuned UB Mahout
Tuned UB Spark
Tuned IB Spark
UB DimSum
Spark MLlib
ALS Matrix Fact.
Spark MLlib
-50%
Performance
~$50
MLlib Performance
•  Provides good alternative for computing user-user similarities
–  Due to data sparsity, not getting big gains in runtime
–  Only supports cosine similarity
•  Failed to compute item-item similarities
–  Exceeds maximum allowed value of 2G for spark.kryoserializer.buffer.max	
  	
  
	
  
•  User-based CF outperforms ALS CF
•  Need scalable solution for generating recommendations based on ALS
CF model
Implementations
Mahout
(Hadoop)
Mendeley
(Hadoop)
Mahout
(Spark)
Mendeley
(Spark)
MLlib
(Spark)
Item-based CF
User-based CF
Matrix
Factorization
Overview
•  What is Mendeley Suggest?
•  Computation Layer
•  Conclusions
Costly & GoodCostly & Bad
Cheap & GoodCheap & Bad
Tuned IB Mahout
Tuned UB Mahout
Tuned UB Spark
Tuned IB Spark
UB DimSum
Spark MLlib
ALS Matrix Fact.
Spark MLlib
Performance
+100%
+150%
~$50
Mendeley Suggest Components (Future)
information flow (components often built in parallel)
Data
(Feature
Engineering)
Algorithms Business Logic
and Analytics
User Experience
Conclusions
•  Mendeley Suggest is a personalized article recommender
•  Spark is good alternative to Mahout as computation layer
–  Needs some love and tuning
–  Much fewer lines of code – easier to maintain and extend
•  User-based can outperform item-based and matrix factorization
•  Save resources and money by understanding your data
•  Test offline before deploying
–  but also need online tests to get real performance
Thank you!
mendeley.com/suggest

More Related Content

Viewers also liked

DataEngConf: Building a Music Recommender System from Scratch with Spotify Da...
DataEngConf: Building a Music Recommender System from Scratch with Spotify Da...DataEngConf: Building a Music Recommender System from Scratch with Spotify Da...
DataEngConf: Building a Music Recommender System from Scratch with Spotify Da...Hakka Labs
 
методичские указания для самостоятельного изучения предмета высшая математика
методичские указания для самостоятельного изучения предмета высшая математикаметодичские указания для самостоятельного изучения предмета высшая математика
методичские указания для самостоятельного изучения предмета высшая математикаRoman Prilipa
 
Barbara quiroz psicopedagogia nivel 200 diurno trabajo p3
Barbara quiroz psicopedagogia nivel 200 diurno trabajo p3Barbara quiroz psicopedagogia nivel 200 diurno trabajo p3
Barbara quiroz psicopedagogia nivel 200 diurno trabajo p3Barbara Quiroz Donoso
 
Farsi Translation - tough leadership brand management
Farsi Translation  - tough leadership brand managementFarsi Translation  - tough leadership brand management
Farsi Translation - tough leadership brand managementAli Reza Bakhshi
 
Palmarès du concours inter-quartiers des sapins de Noël 2016 - Ville d'Ergué-...
Palmarès du concours inter-quartiers des sapins de Noël 2016 - Ville d'Ergué-...Palmarès du concours inter-quartiers des sapins de Noël 2016 - Ville d'Ergué-...
Palmarès du concours inter-quartiers des sapins de Noël 2016 - Ville d'Ergué-...Ville d'Ergué-Gabéric
 
服务计算:现状,思考与建议
服务计算:现状,思考与建议服务计算:现状,思考与建议
服务计算:现状,思考与建议IBM
 
Accelerating Machine Learning Applications on Spark Using GPUs
Accelerating Machine Learning Applications on Spark Using GPUsAccelerating Machine Learning Applications on Spark Using GPUs
Accelerating Machine Learning Applications on Spark Using GPUsIBM
 
Fichamento do livro_convite_a_filosofia
Fichamento do livro_convite_a_filosofiaFichamento do livro_convite_a_filosofia
Fichamento do livro_convite_a_filosofiaMárcia Quirino
 
Determinación de la vida util de un producto agroindustrial
Determinación de la vida util de un producto agroindustrialDeterminación de la vida util de un producto agroindustrial
Determinación de la vida util de un producto agroindustrialJhonás A. Vega
 
Cartões de Natal - 5º ano
Cartões de Natal - 5º anoCartões de Natal - 5º ano
Cartões de Natal - 5º anoIsabela Afra
 
2 decision making
2 decision making2 decision making
2 decision makingalcheikh
 
今さら聞けないビジネスメールの書き方
今さら聞けないビジネスメールの書き方今さら聞けないビジネスメールの書き方
今さら聞けないビジネスメールの書き方webcampusschoo
 
超絶技巧プログラミングと Ruby 3.0 (大江戸 Ruby 会議 05 コミッタ LT)
超絶技巧プログラミングと Ruby 3.0 (大江戸 Ruby 会議 05 コミッタ LT)超絶技巧プログラミングと Ruby 3.0 (大江戸 Ruby 会議 05 コミッタ LT)
超絶技巧プログラミングと Ruby 3.0 (大江戸 Ruby 会議 05 コミッタ LT)mametter
 
Scala Data Pipelines for Music Recommendations
Scala Data Pipelines for Music RecommendationsScala Data Pipelines for Music Recommendations
Scala Data Pipelines for Music RecommendationsChris Johnson
 
Matrix Factorization Techniques For Recommender Systems
Matrix Factorization Techniques For Recommender SystemsMatrix Factorization Techniques For Recommender Systems
Matrix Factorization Techniques For Recommender SystemsLei Guo
 

Viewers also liked (17)

DataEngConf: Building a Music Recommender System from Scratch with Spotify Da...
DataEngConf: Building a Music Recommender System from Scratch with Spotify Da...DataEngConf: Building a Music Recommender System from Scratch with Spotify Da...
DataEngConf: Building a Music Recommender System from Scratch with Spotify Da...
 
методичские указания для самостоятельного изучения предмета высшая математика
методичские указания для самостоятельного изучения предмета высшая математикаметодичские указания для самостоятельного изучения предмета высшая математика
методичские указания для самостоятельного изучения предмета высшая математика
 
Barbara quiroz psicopedagogia nivel 200 diurno trabajo p3
Barbara quiroz psicopedagogia nivel 200 diurno trabajo p3Barbara quiroz psicopedagogia nivel 200 diurno trabajo p3
Barbara quiroz psicopedagogia nivel 200 diurno trabajo p3
 
Farsi Translation - tough leadership brand management
Farsi Translation  - tough leadership brand managementFarsi Translation  - tough leadership brand management
Farsi Translation - tough leadership brand management
 
Palmarès du concours inter-quartiers des sapins de Noël 2016 - Ville d'Ergué-...
Palmarès du concours inter-quartiers des sapins de Noël 2016 - Ville d'Ergué-...Palmarès du concours inter-quartiers des sapins de Noël 2016 - Ville d'Ergué-...
Palmarès du concours inter-quartiers des sapins de Noël 2016 - Ville d'Ergué-...
 
服务计算:现状,思考与建议
服务计算:现状,思考与建议服务计算:现状,思考与建议
服务计算:现状,思考与建议
 
Accelerating Machine Learning Applications on Spark Using GPUs
Accelerating Machine Learning Applications on Spark Using GPUsAccelerating Machine Learning Applications on Spark Using GPUs
Accelerating Machine Learning Applications on Spark Using GPUs
 
Fichamento do livro_convite_a_filosofia
Fichamento do livro_convite_a_filosofiaFichamento do livro_convite_a_filosofia
Fichamento do livro_convite_a_filosofia
 
Determinación de la vida util de un producto agroindustrial
Determinación de la vida util de un producto agroindustrialDeterminación de la vida util de un producto agroindustrial
Determinación de la vida util de un producto agroindustrial
 
Cartões de Natal - 5º ano
Cartões de Natal - 5º anoCartões de Natal - 5º ano
Cartões de Natal - 5º ano
 
Introduction to Sparse Methods
Introduction to Sparse Methods Introduction to Sparse Methods
Introduction to Sparse Methods
 
Terrorismo no século xxi
Terrorismo no século xxiTerrorismo no século xxi
Terrorismo no século xxi
 
2 decision making
2 decision making2 decision making
2 decision making
 
今さら聞けないビジネスメールの書き方
今さら聞けないビジネスメールの書き方今さら聞けないビジネスメールの書き方
今さら聞けないビジネスメールの書き方
 
超絶技巧プログラミングと Ruby 3.0 (大江戸 Ruby 会議 05 コミッタ LT)
超絶技巧プログラミングと Ruby 3.0 (大江戸 Ruby 会議 05 コミッタ LT)超絶技巧プログラミングと Ruby 3.0 (大江戸 Ruby 会議 05 コミッタ LT)
超絶技巧プログラミングと Ruby 3.0 (大江戸 Ruby 会議 05 コミッタ LT)
 
Scala Data Pipelines for Music Recommendations
Scala Data Pipelines for Music RecommendationsScala Data Pipelines for Music Recommendations
Scala Data Pipelines for Music Recommendations
 
Matrix Factorization Techniques For Recommender Systems
Matrix Factorization Techniques For Recommender SystemsMatrix Factorization Techniques For Recommender Systems
Matrix Factorization Techniques For Recommender Systems
 

Similar to Spark Science Research Recommendations

Sparking Science up with Research Recommendations by Maya Hristakeva
Sparking Science up with Research Recommendations by Maya HristakevaSparking Science up with Research Recommendations by Maya Hristakeva
Sparking Science up with Research Recommendations by Maya HristakevaSpark Summit
 
Mahout Tutorial and Hands-on (version 2015)
Mahout Tutorial and Hands-on (version 2015)Mahout Tutorial and Hands-on (version 2015)
Mahout Tutorial and Hands-on (version 2015)Cataldo Musto
 
Tutorial Mahout - Recommendation
Tutorial Mahout - RecommendationTutorial Mahout - Recommendation
Tutorial Mahout - RecommendationCataldo Musto
 
Collaborative Filtering and Recommender Systems By Navisro Analytics
Collaborative Filtering and Recommender Systems By Navisro AnalyticsCollaborative Filtering and Recommender Systems By Navisro Analytics
Collaborative Filtering and Recommender Systems By Navisro AnalyticsNavisro Analytics
 
Combining Machine Learning frameworks with Apache Spark
Combining Machine Learning frameworks with Apache SparkCombining Machine Learning frameworks with Apache Spark
Combining Machine Learning frameworks with Apache SparkDataWorks Summit/Hadoop Summit
 
A full Machine learning pipeline in Scikit-learn vs in scala-Spark: pros and ...
A full Machine learning pipeline in Scikit-learn vs in scala-Spark: pros and ...A full Machine learning pipeline in Scikit-learn vs in scala-Spark: pros and ...
A full Machine learning pipeline in Scikit-learn vs in scala-Spark: pros and ...Jose Quesada (hiring)
 
Apache Mahout Tutorial - Recommendation - 2013/2014
Apache Mahout Tutorial - Recommendation - 2013/2014 Apache Mahout Tutorial - Recommendation - 2013/2014
Apache Mahout Tutorial - Recommendation - 2013/2014 Cataldo Musto
 
Combining Machine Learning Frameworks with Apache Spark
Combining Machine Learning Frameworks with Apache SparkCombining Machine Learning Frameworks with Apache Spark
Combining Machine Learning Frameworks with Apache SparkDatabricks
 
Buidling large scale recommendation engine
Buidling large scale recommendation engineBuidling large scale recommendation engine
Buidling large scale recommendation engineKeeyong Han
 
Tuning ML Models: Scaling, Workflows, and Architecture
Tuning ML Models: Scaling, Workflows, and ArchitectureTuning ML Models: Scaling, Workflows, and Architecture
Tuning ML Models: Scaling, Workflows, and ArchitectureDatabricks
 
The Analytics Frontier of the Hadoop Eco-System
The Analytics Frontier of the Hadoop Eco-SystemThe Analytics Frontier of the Hadoop Eco-System
The Analytics Frontier of the Hadoop Eco-Systeminside-BigData.com
 
Advanced Hyperparameter Optimization for Deep Learning with MLflow
Advanced Hyperparameter Optimization for Deep Learning with MLflowAdvanced Hyperparameter Optimization for Deep Learning with MLflow
Advanced Hyperparameter Optimization for Deep Learning with MLflowDatabricks
 
Alex mang patterns for scalability in microsoft azure application
Alex mang   patterns for scalability in microsoft azure applicationAlex mang   patterns for scalability in microsoft azure application
Alex mang patterns for scalability in microsoft azure applicationCodecamp Romania
 
Building High Available and Scalable Machine Learning Applications
Building High Available and Scalable Machine Learning ApplicationsBuilding High Available and Scalable Machine Learning Applications
Building High Available and Scalable Machine Learning ApplicationsYalçın Yenigün
 
Running with Elephants: Predictive Analytics with HDInsight
Running with Elephants: Predictive Analytics with HDInsightRunning with Elephants: Predictive Analytics with HDInsight
Running with Elephants: Predictive Analytics with HDInsightChris Price
 
Best Practices for Hyperparameter Tuning with MLflow
Best Practices for Hyperparameter Tuning with MLflowBest Practices for Hyperparameter Tuning with MLflow
Best Practices for Hyperparameter Tuning with MLflowDatabricks
 
Scalable Automatic Machine Learning in H2O
Scalable Automatic Machine Learning in H2OScalable Automatic Machine Learning in H2O
Scalable Automatic Machine Learning in H2OSri Ambati
 
Tag based recommender system
Tag based recommender systemTag based recommender system
Tag based recommender systemKaren Li
 
Evolving The Optimal Relevancy Scoring Model at Dice.com: Presented by Simon ...
Evolving The Optimal Relevancy Scoring Model at Dice.com: Presented by Simon ...Evolving The Optimal Relevancy Scoring Model at Dice.com: Presented by Simon ...
Evolving The Optimal Relevancy Scoring Model at Dice.com: Presented by Simon ...Lucidworks
 
Machine Learning Pipelines
Machine Learning PipelinesMachine Learning Pipelines
Machine Learning Pipelinesjeykottalam
 

Similar to Spark Science Research Recommendations (20)

Sparking Science up with Research Recommendations by Maya Hristakeva
Sparking Science up with Research Recommendations by Maya HristakevaSparking Science up with Research Recommendations by Maya Hristakeva
Sparking Science up with Research Recommendations by Maya Hristakeva
 
Mahout Tutorial and Hands-on (version 2015)
Mahout Tutorial and Hands-on (version 2015)Mahout Tutorial and Hands-on (version 2015)
Mahout Tutorial and Hands-on (version 2015)
 
Tutorial Mahout - Recommendation
Tutorial Mahout - RecommendationTutorial Mahout - Recommendation
Tutorial Mahout - Recommendation
 
Collaborative Filtering and Recommender Systems By Navisro Analytics
Collaborative Filtering and Recommender Systems By Navisro AnalyticsCollaborative Filtering and Recommender Systems By Navisro Analytics
Collaborative Filtering and Recommender Systems By Navisro Analytics
 
Combining Machine Learning frameworks with Apache Spark
Combining Machine Learning frameworks with Apache SparkCombining Machine Learning frameworks with Apache Spark
Combining Machine Learning frameworks with Apache Spark
 
A full Machine learning pipeline in Scikit-learn vs in scala-Spark: pros and ...
A full Machine learning pipeline in Scikit-learn vs in scala-Spark: pros and ...A full Machine learning pipeline in Scikit-learn vs in scala-Spark: pros and ...
A full Machine learning pipeline in Scikit-learn vs in scala-Spark: pros and ...
 
Apache Mahout Tutorial - Recommendation - 2013/2014
Apache Mahout Tutorial - Recommendation - 2013/2014 Apache Mahout Tutorial - Recommendation - 2013/2014
Apache Mahout Tutorial - Recommendation - 2013/2014
 
Combining Machine Learning Frameworks with Apache Spark
Combining Machine Learning Frameworks with Apache SparkCombining Machine Learning Frameworks with Apache Spark
Combining Machine Learning Frameworks with Apache Spark
 
Buidling large scale recommendation engine
Buidling large scale recommendation engineBuidling large scale recommendation engine
Buidling large scale recommendation engine
 
Tuning ML Models: Scaling, Workflows, and Architecture
Tuning ML Models: Scaling, Workflows, and ArchitectureTuning ML Models: Scaling, Workflows, and Architecture
Tuning ML Models: Scaling, Workflows, and Architecture
 
The Analytics Frontier of the Hadoop Eco-System
The Analytics Frontier of the Hadoop Eco-SystemThe Analytics Frontier of the Hadoop Eco-System
The Analytics Frontier of the Hadoop Eco-System
 
Advanced Hyperparameter Optimization for Deep Learning with MLflow
Advanced Hyperparameter Optimization for Deep Learning with MLflowAdvanced Hyperparameter Optimization for Deep Learning with MLflow
Advanced Hyperparameter Optimization for Deep Learning with MLflow
 
Alex mang patterns for scalability in microsoft azure application
Alex mang   patterns for scalability in microsoft azure applicationAlex mang   patterns for scalability in microsoft azure application
Alex mang patterns for scalability in microsoft azure application
 
Building High Available and Scalable Machine Learning Applications
Building High Available and Scalable Machine Learning ApplicationsBuilding High Available and Scalable Machine Learning Applications
Building High Available and Scalable Machine Learning Applications
 
Running with Elephants: Predictive Analytics with HDInsight
Running with Elephants: Predictive Analytics with HDInsightRunning with Elephants: Predictive Analytics with HDInsight
Running with Elephants: Predictive Analytics with HDInsight
 
Best Practices for Hyperparameter Tuning with MLflow
Best Practices for Hyperparameter Tuning with MLflowBest Practices for Hyperparameter Tuning with MLflow
Best Practices for Hyperparameter Tuning with MLflow
 
Scalable Automatic Machine Learning in H2O
Scalable Automatic Machine Learning in H2OScalable Automatic Machine Learning in H2O
Scalable Automatic Machine Learning in H2O
 
Tag based recommender system
Tag based recommender systemTag based recommender system
Tag based recommender system
 
Evolving The Optimal Relevancy Scoring Model at Dice.com: Presented by Simon ...
Evolving The Optimal Relevancy Scoring Model at Dice.com: Presented by Simon ...Evolving The Optimal Relevancy Scoring Model at Dice.com: Presented by Simon ...
Evolving The Optimal Relevancy Scoring Model at Dice.com: Presented by Simon ...
 
Machine Learning Pipelines
Machine Learning PipelinesMachine Learning Pipelines
Machine Learning Pipelines
 

Recently uploaded

Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?XfilesPro
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAndikSusilo4
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 

Recently uploaded (20)

Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & Application
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 

Spark Science Research Recommendations