SlideShare a Scribd company logo
Sparking Science up with
Research Recommendations
Maya Hristakeva
@mayahhf
Overview
•  What is Mendeley Suggest?
•  Computation Layer
•  Conclusions
Read
&
Organize
Search
&
Discover
Collaborate
&
Network
Experiment
&
Synthesize
Mendeley builds tools to help researchers …
Being the best researcher you can be!
•  Good researchers are on top of their game
•  Large amount of research produced
•  Takes time to get what you need
•  Help researchers by recommending relevant research
Mendeley Suggest
Personalized Article
Recommender
Recommender System Components
information flow (components often built in parallel)
Data
(Feature
Engineering)
Algorithms Business Logic
and Analytics
User Experience
Mendeley Suggest Components (Past)
information flow (components often built in parallel)
Data
(Feature
Engineering)
Algorithms Business Logic
and Analytics
User Experience
Mendeley Suggest Components (Present)
information flow (components often built in parallel)
Data
(Feature
Engineering)
Algorithms Business Logic
and Analytics
User Experience
Mendeley Suggest Components (Goal)
information flow (components often built in parallel)
Data
(Feature
Engineering)
Algorithms Business Logic
and Analytics
User Experience
Overview
•  What is Mendeley Suggest?
•  Computation Layer
–  Algorithms
–  Evaluation
–  Implementations & Performance
•  Conclusions
Personalized Article Recommendations
Input:
User libraries
Output:
Suggested
articles to read
Algorithms:
•  Collaborative Filtering
–  Item-based
–  User-Based
–  Matrix Factorization
•  Content-based
Item-based Collaborative Filtering
Recommend articles that are similar to the ones you read
–  Similarity is based on article co-occurrences in users’ libraries
–  “Users who read x also read y”
User-based Collaborative Filtering
Find users who have similar appreciation for articles as you
–  Similarity is based on users’ libraries overlap
Recommend new articles based on what the users similar to
you read
–  “Users similar to you (based on a, b, c) also read x”
Matrix Factorization CF
2 4 5
5 4 1
5 ? 2
1 5 4
4 2
4 5 1
U
n x k
V
k x m
fij= <Ui*,V*j>
E(U,V) = L(Xij, fij) + R(U,V)
X
n x m
Overview
•  What is Mendeley Suggest?
•  Computation Layer
–  Algorithms
–  Evaluation
–  Implementations
•  Conclusions
Performance
Costly & GoodCostly & Bad
Cheap & GoodCheap & Bad
Costly & GoodCostly & Bad
Cheap & GoodCheap & Bad
Performance
Costly & GoodCostly & Bad
Cheap & GoodCheap & Bad
Performance
How to measure quality?
•  Offline Evaluation
–  Parameter sweep is quick
–  Don’t offend real users
•  Methodology
–  n-fold cross-validation
–  time-based validation
•  Metrics
–  precision, recall and f-measure
–  AUC (area under roc curve), NDCG (normalized discounted cumulative gain)
Overview
•  What is Mendeley Suggest?
•  Computation Layer
–  Algorithms
–  Evaluation
–  Implementations
•  Conclusions
Implementations
Mahout
(Hadoop)
Mendeley
(Hadoop)
Mahout
(Spark)
Mendeley
(Spark)
MLlib
(Spark)
Item-based CF
User-based CF
Matrix
Factorization
Setup
•  EMR Cluster
–  Master: 1 x r3.xlarge instance (4 core, 32GB)
–  Core: 10 x r3.2xlarge instances (8 core, 64GB)
•  Data: user libraries
–  15mil documents >>> 1mil users
–  150mil interactions
•  Offline Evaluation
–  Methodology: time-based evaluation
–  Metric: precision@10
Implementations
Mahout
(Hadoop)
Mendeley
(Hadoop)
Mahout
(Spark)
Mendeley
(Spark)
MLlib
(Spark)
Item-based CF
User-based CF
Matrix
Factorization
Apache Mahout
•  Mahout (out-of-the-box)
–  Item-based CF
•  org.apache.mahout.cf.taste.hadoop.item.RecommenderJob	
  
	
  
–  ALS Matrix Factorization
•  org.apache.mahout.cf.taste.hadoop.als.ParallelALSFactorizationJob	
  
•  org.apache.mahout.cf.taste.hadoop.als.RecommenderJob	
  
	
  
•  Implemented User-based CF on top of Mahout at Mendeley
Costly & GoodCostly & Bad
Cheap & GoodCheap & Bad
Orig. item-based
mahout
Tuned item-based
mahout
-0.5K
(-60%)
Performance
~$125
Costly & GoodCostly & Bad
Cheap & GoodCheap & Bad
Orig. item-based
mahout
Tuned item-based
mahout
-0.5K
(-60%)
Orig. user-based
mahout
Tuned user-based
mahout
-0.1K
(-40%)
Performance
~$125
Costly & GoodCostly & Bad
Cheap & GoodCheap & Bad
Orig. item-based
mahout
Tuned item-based
mahout Orig. user-based
mahout
Tuned user-based
mahout
+150%
-0.2K
(-55%)
-0.7K
(-82%)
Performance
~$125
Mahout Performance
•  Mahout’s recommender is already efficient
–  But your data may have unusual properties
•  We’ve got improvements by
–  Tuning Hadoop’s mapper and reducer allocation over the Recommender Job steps
–  Using an appropriate partitioner
•  Improve quality
–  Mahout provides Item-based CF
–  We have many more items than users
–  Typically, user-based is more appropriate
Implementations
Mahout
(Hadoop)
Mendeley
(Hadoop)
Mahout
(Spark)
Mendeley
(Spark)
MLlib
(Spark)
Item-based CF
User-based CF
Matrix
Factorization
Mahout Spark
•  Co-occurrence Recommenders with Spark
–  Item-Item similarity
•  mahout spark-itemsimilarity
SimilarityAnalysis.cooccurrencesIDSs(ratings,	
  …)	
  
	
  
–  User-User similarity
•  mahout spark-rowsimilarity
SimilarityAnalysis.rowSimilarityIDSs(ratings,	
  …)	
  
	
  
•  Only supports Boolean data and log-likelihood similarity
•  Does not generate actual recommendations
Mahout Spark
•  Could not get to run successfully on our data
•  Got further by tuning parameters but still failed with OOM
–  spark.driver.maxResultSize	
  
–  spark.kryoserializer.buffer.max	
  	
  
–  spark.default.parallelism	
  
–  spark.storage.memoryFraction	
  
	
  
•  Gave best runtime performance on MovieLens datasets
Implementations
Mahout
(Hadoop)
Mendeley
(Hadoop)
Mahout
(Spark)
Mendeley
(Spark)
MLlib
(Spark)
Item-based CF
User-based CF
Matrix
Factorization
Mendeley Spark
•  Started as hack-day project
–  Implement Item-based and User-based CF in Spark
•  Can be implemented in two steps
1.  Compute Item-Item or User-User Similarities
•  given user preferences
2.  Compute Recommendations
•  given similarities and user preferences
Spark: Item-Item Similarity
Spark: Item-Item Similarity
Spark: Item-Item Similarity
Spark: Item-Item Similarity
Spark: Item-Based Recs
Spark: Item-Based Recs
Costly & GoodCostly & Bad
Cheap & GoodCheap & Bad
Tuned IB Mahout
Tuned UB Mahout
Orig. UB Spark
Performance
~$50
Costly & GoodCostly & Bad
Cheap & GoodCheap & Bad
Tuned IB Mahout
Tuned UB Mahout
Orig. UB Spark
Tuned UB Spark
Tuned IB Spark
-0.1K
(-40%)
Performance
~$50
Mendeley Spark Performance
•  Spark implementation of User-based CF performs well
•  Managed to run variation of Item-based CF
–  Uses fewer items per user to recommend similar items to
–  Quality not impacted much
•  We’ve got improvements by tuning
–  Resource allocation
–  Parallelism
–  http://blog.cloudera.com/blog/2015/03/how-to-tune-your-apache-spark-jobs-
part-2/
Implementations
Mahout
(Hadoop)
Mendeley
(Hadoop)
Mahout
(Spark)
Mendeley
(Spark)
MLlib
(Spark)
Item-based CF
User-based CF
Matrix
Factorization
Spark MLlib DimSum
•  DimSum: efficient algorithm for computing all-pairs similarity
–  “Dimension Independent Matrix Square using MapReduce”
–  Contributed by Twitter
•  Replace similarity computation with DimSum
–  Only supports cosine similarity
•  Does not generate actual recommendations
–  Compute recommendations as before
MLlib DimSum Item-Item Similarity
MLlib DimSum User-User Similarity
Costly & GoodCostly & Bad
Cheap & GoodCheap & Bad
Tuned IB Mahout
Tuned UB Mahout
Tuned UB Spark
Tuned IB Spark
UB DimSum
Spark MLlib
Performance
~$50
Spark MLlib Matrix Factorization
Implements alternating least squares (ALS)
1.  Compute Model
2.  Compute Recommendations
Costly & GoodCostly & Bad
Cheap & GoodCheap & Bad
Tuned IB Mahout
Tuned UB Mahout
Tuned UB Spark
Tuned IB Spark
UB DimSum
Spark MLlib
ALS Matrix Fact.
Spark MLlib
-50%
Performance
~$50
MLlib Performance
•  Provides good alternative for computing user-user similarities
–  Due to data sparsity, not getting big gains in runtime
–  Only supports cosine similarity
•  Failed to compute item-item similarities
–  Exceeds maximum allowed value of 2G for spark.kryoserializer.buffer.max	
  	
  
	
  
•  User-based CF outperforms ALS CF
•  Need scalable solution for generating recommendations based on ALS
CF model
Implementations
Mahout
(Hadoop)
Mendeley
(Hadoop)
Mahout
(Spark)
Mendeley
(Spark)
MLlib
(Spark)
Item-based CF
User-based CF
Matrix
Factorization
Overview
•  What is Mendeley Suggest?
•  Computation Layer
•  Conclusions
Costly & GoodCostly & Bad
Cheap & GoodCheap & Bad
Tuned IB Mahout
Tuned UB Mahout
Tuned UB Spark
Tuned IB Spark
UB DimSum
Spark MLlib
ALS Matrix Fact.
Spark MLlib
Performance
+100%
+150%
~$50
Mendeley Suggest Components (Future)
information flow (components often built in parallel)
Data
(Feature
Engineering)
Algorithms Business Logic
and Analytics
User Experience
Conclusions
•  Mendeley Suggest is a personalized article recommender
•  Spark is good alternative to Mahout as computation layer
–  Needs some love and tuning
–  Much fewer lines of code – easier to maintain and extend
•  User-based can outperform item-based and matrix factorization
•  Save resources and money by understanding your data
•  Test offline before deploying
–  but also need online tests to get real performance
Thank you!
mendeley.com/suggest

More Related Content

Viewers also liked

DataEngConf: Building a Music Recommender System from Scratch with Spotify Da...
DataEngConf: Building a Music Recommender System from Scratch with Spotify Da...DataEngConf: Building a Music Recommender System from Scratch with Spotify Da...
DataEngConf: Building a Music Recommender System from Scratch with Spotify Da...
Hakka Labs
 
методичские указания для самостоятельного изучения предмета высшая математика
методичские указания для самостоятельного изучения предмета высшая математикаметодичские указания для самостоятельного изучения предмета высшая математика
методичские указания для самостоятельного изучения предмета высшая математика
Roman Prilipa
 
Barbara quiroz psicopedagogia nivel 200 diurno trabajo p3
Barbara quiroz psicopedagogia nivel 200 diurno trabajo p3Barbara quiroz psicopedagogia nivel 200 diurno trabajo p3
Barbara quiroz psicopedagogia nivel 200 diurno trabajo p3
Barbara Quiroz Donoso
 
Farsi Translation - tough leadership brand management
Farsi Translation  - tough leadership brand managementFarsi Translation  - tough leadership brand management
Farsi Translation - tough leadership brand management
Ali Reza Bakhshi
 
Palmarès du concours inter-quartiers des sapins de Noël 2016 - Ville d'Ergué-...
Palmarès du concours inter-quartiers des sapins de Noël 2016 - Ville d'Ergué-...Palmarès du concours inter-quartiers des sapins de Noël 2016 - Ville d'Ergué-...
Palmarès du concours inter-quartiers des sapins de Noël 2016 - Ville d'Ergué-...
Ville d'Ergué-Gabéric
 
服务计算:现状,思考与建议
服务计算:现状,思考与建议服务计算:现状,思考与建议
服务计算:现状,思考与建议
IBM
 
Accelerating Machine Learning Applications on Spark Using GPUs
Accelerating Machine Learning Applications on Spark Using GPUsAccelerating Machine Learning Applications on Spark Using GPUs
Accelerating Machine Learning Applications on Spark Using GPUs
IBM
 
Fichamento do livro_convite_a_filosofia
Fichamento do livro_convite_a_filosofiaFichamento do livro_convite_a_filosofia
Fichamento do livro_convite_a_filosofia
Márcia Quirino
 
Determinación de la vida util de un producto agroindustrial
Determinación de la vida util de un producto agroindustrialDeterminación de la vida util de un producto agroindustrial
Determinación de la vida util de un producto agroindustrialJhonás A. Vega
 
Cartões de Natal - 5º ano
Cartões de Natal - 5º anoCartões de Natal - 5º ano
Cartões de Natal - 5º ano
Isabela Afra
 
Introduction to Sparse Methods
Introduction to Sparse Methods Introduction to Sparse Methods
Introduction to Sparse Methods
Shadi Nabil Albarqouni
 
Terrorismo no século xxi
Terrorismo no século xxiTerrorismo no século xxi
Terrorismo no século xxi
Colégio Nova Geração COC
 
2 decision making
2 decision making2 decision making
2 decision makingalcheikh
 
今さら聞けないビジネスメールの書き方
今さら聞けないビジネスメールの書き方今さら聞けないビジネスメールの書き方
今さら聞けないビジネスメールの書き方webcampusschoo
 
超絶技巧プログラミングと Ruby 3.0 (大江戸 Ruby 会議 05 コミッタ LT)
超絶技巧プログラミングと Ruby 3.0 (大江戸 Ruby 会議 05 コミッタ LT)超絶技巧プログラミングと Ruby 3.0 (大江戸 Ruby 会議 05 コミッタ LT)
超絶技巧プログラミングと Ruby 3.0 (大江戸 Ruby 会議 05 コミッタ LT)
mametter
 
Scala Data Pipelines for Music Recommendations
Scala Data Pipelines for Music RecommendationsScala Data Pipelines for Music Recommendations
Scala Data Pipelines for Music Recommendations
Chris Johnson
 
Matrix Factorization Techniques For Recommender Systems
Matrix Factorization Techniques For Recommender SystemsMatrix Factorization Techniques For Recommender Systems
Matrix Factorization Techniques For Recommender SystemsLei Guo
 

Viewers also liked (17)

DataEngConf: Building a Music Recommender System from Scratch with Spotify Da...
DataEngConf: Building a Music Recommender System from Scratch with Spotify Da...DataEngConf: Building a Music Recommender System from Scratch with Spotify Da...
DataEngConf: Building a Music Recommender System from Scratch with Spotify Da...
 
методичские указания для самостоятельного изучения предмета высшая математика
методичские указания для самостоятельного изучения предмета высшая математикаметодичские указания для самостоятельного изучения предмета высшая математика
методичские указания для самостоятельного изучения предмета высшая математика
 
Barbara quiroz psicopedagogia nivel 200 diurno trabajo p3
Barbara quiroz psicopedagogia nivel 200 diurno trabajo p3Barbara quiroz psicopedagogia nivel 200 diurno trabajo p3
Barbara quiroz psicopedagogia nivel 200 diurno trabajo p3
 
Farsi Translation - tough leadership brand management
Farsi Translation  - tough leadership brand managementFarsi Translation  - tough leadership brand management
Farsi Translation - tough leadership brand management
 
Palmarès du concours inter-quartiers des sapins de Noël 2016 - Ville d'Ergué-...
Palmarès du concours inter-quartiers des sapins de Noël 2016 - Ville d'Ergué-...Palmarès du concours inter-quartiers des sapins de Noël 2016 - Ville d'Ergué-...
Palmarès du concours inter-quartiers des sapins de Noël 2016 - Ville d'Ergué-...
 
服务计算:现状,思考与建议
服务计算:现状,思考与建议服务计算:现状,思考与建议
服务计算:现状,思考与建议
 
Accelerating Machine Learning Applications on Spark Using GPUs
Accelerating Machine Learning Applications on Spark Using GPUsAccelerating Machine Learning Applications on Spark Using GPUs
Accelerating Machine Learning Applications on Spark Using GPUs
 
Fichamento do livro_convite_a_filosofia
Fichamento do livro_convite_a_filosofiaFichamento do livro_convite_a_filosofia
Fichamento do livro_convite_a_filosofia
 
Determinación de la vida util de un producto agroindustrial
Determinación de la vida util de un producto agroindustrialDeterminación de la vida util de un producto agroindustrial
Determinación de la vida util de un producto agroindustrial
 
Cartões de Natal - 5º ano
Cartões de Natal - 5º anoCartões de Natal - 5º ano
Cartões de Natal - 5º ano
 
Introduction to Sparse Methods
Introduction to Sparse Methods Introduction to Sparse Methods
Introduction to Sparse Methods
 
Terrorismo no século xxi
Terrorismo no século xxiTerrorismo no século xxi
Terrorismo no século xxi
 
2 decision making
2 decision making2 decision making
2 decision making
 
今さら聞けないビジネスメールの書き方
今さら聞けないビジネスメールの書き方今さら聞けないビジネスメールの書き方
今さら聞けないビジネスメールの書き方
 
超絶技巧プログラミングと Ruby 3.0 (大江戸 Ruby 会議 05 コミッタ LT)
超絶技巧プログラミングと Ruby 3.0 (大江戸 Ruby 会議 05 コミッタ LT)超絶技巧プログラミングと Ruby 3.0 (大江戸 Ruby 会議 05 コミッタ LT)
超絶技巧プログラミングと Ruby 3.0 (大江戸 Ruby 会議 05 コミッタ LT)
 
Scala Data Pipelines for Music Recommendations
Scala Data Pipelines for Music RecommendationsScala Data Pipelines for Music Recommendations
Scala Data Pipelines for Music Recommendations
 
Matrix Factorization Techniques For Recommender Systems
Matrix Factorization Techniques For Recommender SystemsMatrix Factorization Techniques For Recommender Systems
Matrix Factorization Techniques For Recommender Systems
 

Similar to Sparking Science up with Research Recommendations

Sparking Science up with Research Recommendations by Maya Hristakeva
Sparking Science up with Research Recommendations by Maya HristakevaSparking Science up with Research Recommendations by Maya Hristakeva
Sparking Science up with Research Recommendations by Maya Hristakeva
Spark Summit
 
Mahout Tutorial and Hands-on (version 2015)
Mahout Tutorial and Hands-on (version 2015)Mahout Tutorial and Hands-on (version 2015)
Mahout Tutorial and Hands-on (version 2015)
Cataldo Musto
 
Tutorial Mahout - Recommendation
Tutorial Mahout - RecommendationTutorial Mahout - Recommendation
Tutorial Mahout - Recommendation
Cataldo Musto
 
Collaborative Filtering and Recommender Systems By Navisro Analytics
Collaborative Filtering and Recommender Systems By Navisro AnalyticsCollaborative Filtering and Recommender Systems By Navisro Analytics
Collaborative Filtering and Recommender Systems By Navisro Analytics
Navisro Analytics
 
Combining Machine Learning frameworks with Apache Spark
Combining Machine Learning frameworks with Apache SparkCombining Machine Learning frameworks with Apache Spark
Combining Machine Learning frameworks with Apache Spark
DataWorks Summit/Hadoop Summit
 
A full Machine learning pipeline in Scikit-learn vs in scala-Spark: pros and ...
A full Machine learning pipeline in Scikit-learn vs in scala-Spark: pros and ...A full Machine learning pipeline in Scikit-learn vs in scala-Spark: pros and ...
A full Machine learning pipeline in Scikit-learn vs in scala-Spark: pros and ...
Jose Quesada (hiring)
 
Apache Mahout Tutorial - Recommendation - 2013/2014
Apache Mahout Tutorial - Recommendation - 2013/2014 Apache Mahout Tutorial - Recommendation - 2013/2014
Apache Mahout Tutorial - Recommendation - 2013/2014
Cataldo Musto
 
Combining Machine Learning Frameworks with Apache Spark
Combining Machine Learning Frameworks with Apache SparkCombining Machine Learning Frameworks with Apache Spark
Combining Machine Learning Frameworks with Apache Spark
Databricks
 
Buidling large scale recommendation engine
Buidling large scale recommendation engineBuidling large scale recommendation engine
Buidling large scale recommendation engine
Keeyong Han
 
Tuning ML Models: Scaling, Workflows, and Architecture
Tuning ML Models: Scaling, Workflows, and ArchitectureTuning ML Models: Scaling, Workflows, and Architecture
Tuning ML Models: Scaling, Workflows, and Architecture
Databricks
 
The Analytics Frontier of the Hadoop Eco-System
The Analytics Frontier of the Hadoop Eco-SystemThe Analytics Frontier of the Hadoop Eco-System
The Analytics Frontier of the Hadoop Eco-System
inside-BigData.com
 
Advanced Hyperparameter Optimization for Deep Learning with MLflow
Advanced Hyperparameter Optimization for Deep Learning with MLflowAdvanced Hyperparameter Optimization for Deep Learning with MLflow
Advanced Hyperparameter Optimization for Deep Learning with MLflow
Databricks
 
Alex mang patterns for scalability in microsoft azure application
Alex mang   patterns for scalability in microsoft azure applicationAlex mang   patterns for scalability in microsoft azure application
Alex mang patterns for scalability in microsoft azure application
Codecamp Romania
 
Building High Available and Scalable Machine Learning Applications
Building High Available and Scalable Machine Learning ApplicationsBuilding High Available and Scalable Machine Learning Applications
Building High Available and Scalable Machine Learning Applications
Yalçın Yenigün
 
Running with Elephants: Predictive Analytics with HDInsight
Running with Elephants: Predictive Analytics with HDInsightRunning with Elephants: Predictive Analytics with HDInsight
Running with Elephants: Predictive Analytics with HDInsight
Chris Price
 
Best Practices for Hyperparameter Tuning with MLflow
Best Practices for Hyperparameter Tuning with MLflowBest Practices for Hyperparameter Tuning with MLflow
Best Practices for Hyperparameter Tuning with MLflow
Databricks
 
Scalable Automatic Machine Learning in H2O
Scalable Automatic Machine Learning in H2OScalable Automatic Machine Learning in H2O
Scalable Automatic Machine Learning in H2O
Sri Ambati
 
Tag based recommender system
Tag based recommender systemTag based recommender system
Tag based recommender system
Karen Li
 
Evolving The Optimal Relevancy Scoring Model at Dice.com: Presented by Simon ...
Evolving The Optimal Relevancy Scoring Model at Dice.com: Presented by Simon ...Evolving The Optimal Relevancy Scoring Model at Dice.com: Presented by Simon ...
Evolving The Optimal Relevancy Scoring Model at Dice.com: Presented by Simon ...
Lucidworks
 
Machine Learning Pipelines
Machine Learning PipelinesMachine Learning Pipelines
Machine Learning Pipelines
jeykottalam
 

Similar to Sparking Science up with Research Recommendations (20)

Sparking Science up with Research Recommendations by Maya Hristakeva
Sparking Science up with Research Recommendations by Maya HristakevaSparking Science up with Research Recommendations by Maya Hristakeva
Sparking Science up with Research Recommendations by Maya Hristakeva
 
Mahout Tutorial and Hands-on (version 2015)
Mahout Tutorial and Hands-on (version 2015)Mahout Tutorial and Hands-on (version 2015)
Mahout Tutorial and Hands-on (version 2015)
 
Tutorial Mahout - Recommendation
Tutorial Mahout - RecommendationTutorial Mahout - Recommendation
Tutorial Mahout - Recommendation
 
Collaborative Filtering and Recommender Systems By Navisro Analytics
Collaborative Filtering and Recommender Systems By Navisro AnalyticsCollaborative Filtering and Recommender Systems By Navisro Analytics
Collaborative Filtering and Recommender Systems By Navisro Analytics
 
Combining Machine Learning frameworks with Apache Spark
Combining Machine Learning frameworks with Apache SparkCombining Machine Learning frameworks with Apache Spark
Combining Machine Learning frameworks with Apache Spark
 
A full Machine learning pipeline in Scikit-learn vs in scala-Spark: pros and ...
A full Machine learning pipeline in Scikit-learn vs in scala-Spark: pros and ...A full Machine learning pipeline in Scikit-learn vs in scala-Spark: pros and ...
A full Machine learning pipeline in Scikit-learn vs in scala-Spark: pros and ...
 
Apache Mahout Tutorial - Recommendation - 2013/2014
Apache Mahout Tutorial - Recommendation - 2013/2014 Apache Mahout Tutorial - Recommendation - 2013/2014
Apache Mahout Tutorial - Recommendation - 2013/2014
 
Combining Machine Learning Frameworks with Apache Spark
Combining Machine Learning Frameworks with Apache SparkCombining Machine Learning Frameworks with Apache Spark
Combining Machine Learning Frameworks with Apache Spark
 
Buidling large scale recommendation engine
Buidling large scale recommendation engineBuidling large scale recommendation engine
Buidling large scale recommendation engine
 
Tuning ML Models: Scaling, Workflows, and Architecture
Tuning ML Models: Scaling, Workflows, and ArchitectureTuning ML Models: Scaling, Workflows, and Architecture
Tuning ML Models: Scaling, Workflows, and Architecture
 
The Analytics Frontier of the Hadoop Eco-System
The Analytics Frontier of the Hadoop Eco-SystemThe Analytics Frontier of the Hadoop Eco-System
The Analytics Frontier of the Hadoop Eco-System
 
Advanced Hyperparameter Optimization for Deep Learning with MLflow
Advanced Hyperparameter Optimization for Deep Learning with MLflowAdvanced Hyperparameter Optimization for Deep Learning with MLflow
Advanced Hyperparameter Optimization for Deep Learning with MLflow
 
Alex mang patterns for scalability in microsoft azure application
Alex mang   patterns for scalability in microsoft azure applicationAlex mang   patterns for scalability in microsoft azure application
Alex mang patterns for scalability in microsoft azure application
 
Building High Available and Scalable Machine Learning Applications
Building High Available and Scalable Machine Learning ApplicationsBuilding High Available and Scalable Machine Learning Applications
Building High Available and Scalable Machine Learning Applications
 
Running with Elephants: Predictive Analytics with HDInsight
Running with Elephants: Predictive Analytics with HDInsightRunning with Elephants: Predictive Analytics with HDInsight
Running with Elephants: Predictive Analytics with HDInsight
 
Best Practices for Hyperparameter Tuning with MLflow
Best Practices for Hyperparameter Tuning with MLflowBest Practices for Hyperparameter Tuning with MLflow
Best Practices for Hyperparameter Tuning with MLflow
 
Scalable Automatic Machine Learning in H2O
Scalable Automatic Machine Learning in H2OScalable Automatic Machine Learning in H2O
Scalable Automatic Machine Learning in H2O
 
Tag based recommender system
Tag based recommender systemTag based recommender system
Tag based recommender system
 
Evolving The Optimal Relevancy Scoring Model at Dice.com: Presented by Simon ...
Evolving The Optimal Relevancy Scoring Model at Dice.com: Presented by Simon ...Evolving The Optimal Relevancy Scoring Model at Dice.com: Presented by Simon ...
Evolving The Optimal Relevancy Scoring Model at Dice.com: Presented by Simon ...
 
Machine Learning Pipelines
Machine Learning PipelinesMachine Learning Pipelines
Machine Learning Pipelines
 

Recently uploaded

Quantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIsQuantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIs
Vlad Stirbu
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
Adtran
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
Alan Dix
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
Jemma Hussein Allen
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
Ralf Eggert
 
Free Complete Python - A step towards Data Science
Free Complete Python - A step towards Data ScienceFree Complete Python - A step towards Data Science
Free Complete Python - A step towards Data Science
RinaMondal9
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
91mobiles
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
mikeeftimakis1
 
Assure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyesAssure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyes
ThousandEyes
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
Thijs Feryn
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
Ana-Maria Mihalceanu
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
KAMESHS29
 
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdfSAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
Peter Spielvogel
 
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
UiPathCommunity
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
sonjaschweigert1
 
The Metaverse and AI: how can decision-makers harness the Metaverse for their...
The Metaverse and AI: how can decision-makers harness the Metaverse for their...The Metaverse and AI: how can decision-makers harness the Metaverse for their...
The Metaverse and AI: how can decision-makers harness the Metaverse for their...
Jen Stirrup
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
OnBoard
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
Kari Kakkonen
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
Pierluigi Pugliese
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance
 

Recently uploaded (20)

Quantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIsQuantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIs
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
 
Free Complete Python - A step towards Data Science
Free Complete Python - A step towards Data ScienceFree Complete Python - A step towards Data Science
Free Complete Python - A step towards Data Science
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
 
Assure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyesAssure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyes
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
 
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdfSAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
 
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
 
The Metaverse and AI: how can decision-makers harness the Metaverse for their...
The Metaverse and AI: how can decision-makers harness the Metaverse for their...The Metaverse and AI: how can decision-makers harness the Metaverse for their...
The Metaverse and AI: how can decision-makers harness the Metaverse for their...
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
 

Sparking Science up with Research Recommendations