Which Algorithms Really Matter
 


This is the position talk that I gave at CIKM. It covers four algorithms that I feel get little academic attention but are very important in industry. It isn't necessarily true that these algorithms *should* get academic attention, but I do feel that they are quite important in practice.


  • A history of what everybody has done. This is just a cartoon: building a real recommender would require large numbers of users and interactions with items. The next step is to predict what a new user might like.
  • Bob is the “new user”, and getting an apple is his entire history.
  • Here is where the recommendation engine needs to go to work. Note to trainer: see whether the audience calls out the answer before you reveal the next slide.
  • Note to trainer: This situation is similar to the one we started with, with three users in our history. The difference is that now everybody got a pony. Bob has an apple and a pony but not a puppy, yet.
  • The binary matrix is stored sparsely.
  • The history is converted into a binary matrix by MapReduce. Note to trainer: whether apple should be considered to have co-occurred with itself is an open question.
  • Old joke: all the world can be divided into two categories, Scotch tape and non-Scotch tape. This is a way to think about co-occurrence.
  • The only important co-occurrence is that puppy follows apple.
  • Take that row of the matrix and combine it with all the metadata we might have. The important thing to get from the co-occurrence matrix is this indicator. Cool thing: this is analogous to what a lot of recommendation engines do. The row forms the indicator field in a Solr document that also contains the metadata (you do NOT have to build a separate index for the indicators). Find the useful co-occurrence and get rid of the rest: sparsify and keep only the anomalous co-occurrence.
  • Note to trainer: take a little time to explore this here and on the next couple of slides. Details are enlarged on the next slide.
  • This indicator field is where the output of the Mahout recommendation engine is stored (the row from the indicator matrix that identifies significant or interesting co-occurrence). Keep in mind that this recommendation indicator data is added to the same original document in the Solr index that contains the metadata for the item in question.
  • This is a diagnostics window in the LucidWorks Solr index (not the web interface a user would see). It is a way for the developer to do a rough evaluation (a laugh test) of the choices offered by the recommendation engine. In other words, do these indicator artists, represented by their indicator IDs, make reasonable recommendations? Note to trainer: artist 303 happens to be The Beatles. Is that a good match for Chuck Berry?
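
The notes above walk through history, binary matrix, co-occurrence, sparsified indicators, and finally a query against the indicator field. A minimal Python sketch of that pipeline follows. Several things here are assumptions rather than content from the slides: the toy history is hypothetical, the "anomalous co-occurrence" test is assumed to be the log-likelihood ratio (G²) statistic, the threshold of 1.0 is arbitrary, and a plain dictionary stands in for the Solr indicator field. All function names are made up for illustration.

```python
from collections import defaultdict
from itertools import permutations
from math import log

# Hypothetical toy history (user -> items); not the data shown on the slides.
# Everybody got a pony, so pony should carry no signal.
history = {
    "alice": {"apple", "puppy", "pony"},
    "charles": {"bicycle", "pony"},
    "dana": {"apple", "puppy", "pony"},
    "erin": {"bicycle", "pony"},
}

def cooccurrence(history):
    """Count, for each ordered item pair (a, b), the users who have both.

    Storing only non-zero pairs is the sparse form of the co-occurrence matrix.
    Self co-occurrence (a, a) is left out, which is one possible answer to the
    open question in the notes.
    """
    counts = defaultdict(int)
    for items in history.values():
        for a, b in permutations(sorted(items), 2):
            counts[(a, b)] += 1
    return counts

def x_log_x(x):
    return x * log(x) if x > 0 else 0.0

def entropy(*counts):
    """Unnormalized Shannon entropy of a set of counts."""
    return x_log_x(sum(counts)) - sum(x_log_x(k) for k in counts)

def llr(k11, k12, k21, k22):
    """G^2 statistic for a 2x2 contingency table (assumed anomaly test)."""
    row = entropy(k11 + k12, k21 + k22)
    col = entropy(k11 + k21, k12 + k22)
    mat = entropy(k11, k12, k21, k22)
    return max(0.0, 2.0 * (row + col - mat))

def indicators(history, threshold=1.0):
    """Keep only the anomalous co-occurrences: item -> sorted indicator items."""
    n_users = len(history)
    item_users = defaultdict(int)
    for items in history.values():
        for item in items:
            item_users[item] += 1
    result = defaultdict(list)
    for (a, b), k11 in cooccurrence(history).items():
        k12 = item_users[a] - k11            # users with a but not b
        k21 = item_users[b] - k11            # users with b but not a
        k22 = n_users - k11 - k12 - k21      # users with neither
        if llr(k11, k12, k21, k22) > threshold:
            result[a].append(b)
    return {item: sorted(others) for item, others in result.items()}

def recommend(user_items, indicator_index):
    """Rank unseen items by how many of their indicators the user already has.

    This imitates querying the indicator field with the user's history.
    """
    scores = {}
    for item, inds in indicator_index.items():
        if item in user_items:
            continue
        overlap = len(user_items & set(inds))
        if overlap:
            scores[item] = overlap
    return sorted(scores, key=lambda item: (-scores[item], item))

index = indicators(history)
bob = {"apple", "pony"}
print(index)
print(recommend(bob, index))
```

In this toy data the pony co-occurs with everything, so its score is zero and it drops out during sparsification. Only the apple and puppy pairing survives, which is why Bob's history of apple and pony leads to a puppy.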

Which Algorithms Really Matter: Presentation Transcript