Crab - A Python Framework for Building Recommendation Systems


Published on

Keynote introducing the Framework Crab: A Python toolkit for bulding recommendation engines. It is a open source project as an alternative for Mahout Taste for Python developers.
Presented at XII Python User Group Pernambuco, 07-05-2011 at CIN/UFPE.

Published in: Technology, Education
1 Comment
  • Very cool framework!
    Are you sure you want to  Yes  No
    Your message goes here
No Downloads
Total views
On SlideShare
From Embeds
Number of Embeds
Embeds 0
No embeds

No notes for slide
  • \n
  • \n
  • \n
  • \n
  • \n
  • \n
  • \n
  • \n
  • \n
  • \n
  • \n
  • \n
  • \n
  • \n
  • \n
  • \n
  • \n
  • \n
  • \n
  • Crab - A Python Framework for Building Recommendation Systems

    1. 1. Crab A Python Framework for Building Recommendation EnginesMarcel Caraciolo Ricardo Caspirro Bruno Melo @marcelcaraciolo @ricardocaspirro @brunomelo
    2. 2. What is Crab ? A python framework for building recommendation enginesA Scikit module for collaborative, content and hybrid filtering Mahout Alternative for Python Developers :D Open-Source under the BSD license
    3. 3. When started ?It began one year agoCommunity-driven, 4 membersSince April,2011 the open-source labs Muriçoca incorporated itSince April,2011 rewritting it as sub-module of Scikit-learn
    4. 4. Knowing ScikitsScikits are Scipy Toolkits - independent and projects hosted under a common namespace. Scikits Image Scikits MlabWrap Scikits AudioLab Scikit Learn ....
    5. 5. Knowing Scikits Scikit-Learn Machine Learning Algorithms + scientific Python packages (Numpy, Scipy and Matplotlib) goal: Incorporate the Crab as Scikit and part of it as submodule of Scikit-Learn
    6. 6. Why Recommendations ?The world is an over-crowded place !"#$%&()$*+$,-$&.#/0&%)#)$1(,0#
    7. 7. Why Recommendations * +,&-.$/).#&0#/"1.#$%234(".# ? $/)#5(&6 7&.2.#"$4,#)$8 We are overloaded * 93((3&/.#&0#:&3".;#5&&<.# $/)#:-.34#2%$4<.#&/(3/"Thousands of news articles and blog posts each day * =/#>$/&3;#?#@A#+B#4,$//"(.;# 2,&-.$/).#&0#7%&6%$:.# Millions of movies, books and music tracks online "$4,#)$8 Several Places, Offers and Events * =/#C"1#D&%<;#.""%$(# Even Friends sometimes we are overloaded ! 2,&-.$/).#&0#$)#:"..$6".# Ask Charlie Sheen (@charliesheen)... ."/2#2&#-.#7"%#)$8
    8. 8. Why Recommendations ?We really need and consume only a few of them! “A lot of times, people don’t know what they want until you show it to them.” Steve Jobs “We are leaving the Information age, and entering into the Recommendation age.” Chris Anderson, from book Long Tail
    9. 9. Why Recommendations ?Can Google help ? Yes, but only when we really know what we are looking for But, what’s does it mean by “interesting” ?Can Facebook help ? Yes, I tend to find my friends’ stuffs interesting What if i had only few friends and what they like do not always attract me ?Can experts help ? Yes, but it won’t scale well. But it is what they like, not me! Exactly same advice!
    10. 10. Why Recommendations ? Recommendation SystemsSystems designed to recommend to me something I may like
    11. 11. Why Recommendations ? ! "#$%"’ $"’ ( ’ ) *#*+ ) & , Recommendation Systems -+ + *#) . - #/ ’ ) 0 #) 1# !2’ 2 3&"+ ) 1 4’ 5, 6 7 ) , *% "& ’ 863 Graph Representation
    12. 12. The current CrabCollaborative Filtering algorithms User-Based, Item-Based and Slope OneEvaluation of the Recommender Algorithms Precision, Recall, F1-Score, RMSE Precision-Recall Charts
    13. 13. The current Crab Demo Movies Recommendation
    14. 14. Why migrate ?Old Crab running only using Pure Python Recommendations demand heavy maths calculations and lots of processingCompatible with Numpy and Scipy libraries High Standard and popular scientific libraries optimized for scientific calculations in PythonScikits projects are amazing! Active Communities, Scientific Conferences and updated projects (e.g. scikit-learn)Turn the Crab framework visible for the community Join the scientific researchers and machine learning developers around the Globe coding with Python to help us in this project Be Fast and Furious
    15. 15. Why migrate ?Numpy optimized with PyPy 2.x - 48.x Faster
    16. 16. How are we working ? Sprints, Online Discussions and Issues
    17. 17. Future Releases Planned Release 0.1 Collaborative Filtering Algorithms working, sample datasets to load and test Planned Release 0.11 Evaluation of Recommendation Algorithms and Database Models support Planned Release 0.12 Recommendation as Services with REST APIs....
    18. 18. Join us!1. Read our Wiki Page Check out our current sprints and open issues Forks, Pull Requests mandatory4. Join us at #muricoca or at our discussion list in work :(
    19. 19. Crab A Python Framework for Building Recommendation Engines Caraciolo Ricardo Caspirro Bruno Melo @marcelcaraciolo @ricardocaspirro @brunomelo {marcel, ricardo,bruno}