The Gossip
  Marcus Frödin
Agenda
Project idea and scope
Target environment
Page scraping
Page rank & slope one predictors
AJAX
Project idea

Finding and ranking music on blogs
Scope
1. Find blogs automatically
2. Detect and read feeds
3. Find music in feeds
4. Categorize on similarity
Target environment



...and NetworkX, feedparser, beautifulsoup, ...
Page scraping
1. Parse XHTML to a tree
2. Find the <link> tag for RSS feed
3. Get the <ul> with the blog roll
4. Repeat for each blog in blog roll
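The four scraping steps above can be sketched roughly as follows. This is only an illustration, not the project's code: it uses the standard library's `html.parser` as a stand-in for beautifulsoup so that it runs without extra packages, and the `blogroll` id, the `scrape` and `crawl` names, and the `fetch` callable (URL in, HTML out) are all assumptions.

```python
from collections import deque
from html.parser import HTMLParser

FEED_TYPES = {"application/rss+xml", "application/atom+xml"}


class BlogPageParser(HTMLParser):
    """Collects feed URLs from <link> tags and hrefs inside one <ul>."""

    def __init__(self, blogroll_id):
        super().__init__()
        self.blogroll_id = blogroll_id  # assumed id of the blog roll <ul>
        self.feeds = []
        self.blogroll = []
        self._ul_depth = 0  # > 0 while inside the blog roll <ul>

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        rel = (attrs.get("rel") or "").split()
        if tag == "link":
            # Step 2: the <link> tag that advertises the feed.
            if "alternate" in rel and attrs.get("type") in FEED_TYPES and attrs.get("href"):
                self.feeds.append(attrs["href"])
        elif tag == "ul":
            # Step 3: enter the blog roll list (and track nested lists).
            if self._ul_depth or attrs.get("id") == self.blogroll_id:
                self._ul_depth += 1
        elif tag == "a" and self._ul_depth and attrs.get("href"):
            self.blogroll.append(attrs["href"])

    def handle_endtag(self, tag):
        if tag == "ul" and self._ul_depth:
            self._ul_depth -= 1


def scrape(html, blogroll_id="blogroll"):
    """Step 1: parse the page, then return (feed URLs, blog roll URLs)."""
    parser = BlogPageParser(blogroll_id)
    parser.feed(html)
    return parser.feeds, parser.blogroll


def crawl(start_url, fetch, max_pages=50):
    """Step 4: repeat for each blog in the blog roll, breadth-first."""
    feeds = {}
    queue = deque([start_url])
    seen = {start_url}
    while queue and len(feeds) < max_pages:
        url = queue.popleft()
        page_feeds, blogroll = scrape(fetch(url))
        feeds[url] = page_feeds
        for link in blogroll:
            if link not in seen:
                seen.add(link)
                queue.append(link)
    return feeds
```

Real blogs rarely mark the blog roll with a predictable id, so the lookup in step 3 would likely need per-template heuristics.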
Parsing feeds
1. Parse feed to a tree
2. Iterate over entries
3. Find RSS enclosures
4. Read description: artist - song
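A minimal sketch of these feed-parsing steps, assuming plain RSS 2.0 and using the standard library's `xml.etree.ElementTree` in place of feedparser. The `find_tracks` name and the rule of keeping only `audio/*` enclosures are assumptions made for illustration.

```python
import xml.etree.ElementTree as ET


def find_tracks(rss_xml):
    """Return (description, enclosure URL) for each item with an audio enclosure."""
    root = ET.fromstring(rss_xml)            # 1. parse feed to a tree
    tracks = []
    for item in root.iter("item"):           # 2. iterate over entries
        enclosure = item.find("enclosure")   # 3. find RSS enclosures
        if enclosure is None:
            continue
        if not enclosure.get("type", "").startswith("audio/"):
            continue
        # 4. read the description, expected to look like "artist - song"
        description = (item.findtext("description") or "").strip()
        tracks.append((description, enclosure.get("url")))
    return tracks
```

Atom feeds and namespaced elements are not handled here; a feed library would normalize those differences.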
Parsing feeds
artist - song
artist -song
artist - song
artist -- song
artist-song (download)
artist - “song”
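One way to cope with the title variants listed above is a small normalizing function. The sketch below is an assumption about how this could be done, not the project's actual approach: it prefers a dash with whitespace on at least one side, falls back to a bare dash, drops a trailing parenthesized note, and strips quotes. `split_title` is a hypothetical name.

```python
import re

_DASH = r"[-\u2013\u2014]{1,2}"  # hyphen, en dash, or em dash, single or doubled
_SPACED = re.compile(rf"\s+{_DASH}\s*|\s*{_DASH}\s+")  # dash next to whitespace
_BARE = re.compile(_DASH)                              # dash with no whitespace
_TRAILING_NOTE = re.compile(r"\s*\([^)]*\)\s*$")       # e.g. "(download)"
_QUOTES = "\"'“”‘’"


def split_title(text):
    """Split 'artist - song' style text into (artist, song), or return None."""
    text = _TRAILING_NOTE.sub("", text.strip())
    # Try the spaced separator first so hyphenated names survive.
    for pattern in (_SPACED, _BARE):
        parts = pattern.split(text, maxsplit=1)
        if len(parts) == 2 and all(p.strip() for p in parts):
            artist, song = (p.strip().strip(_QUOTES).strip() for p in parts)
            return artist, song
    return None
```

The bare-dash fallback will still mis-split a hyphenated artist name when the title has no spaces around its separator, so results from that branch deserve less trust.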
Page rank
Slope one
         Dress Sweater Pants
 Frida     5      3      2
 Peter     3      4     n/a
Magnus    n/a     2      5
Slope one
         Dress Sweater Pants
 Frida     5      3      2
 Peter     3      4     n/a
Magnus    n/a     2      5
Magnus(Dress|Sweater) = 2 + [(5-3) + (3-4)]/2 = 2.5
Magnus(Dress|Pants) = 5 + (5-2) = 8
Magnus(Dress) = [2*M(D|S) + M(D|P)]/(2+1) = 4.33
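The worked example on the slides can be reproduced with a short weighted slope one sketch. The data is the table from the slides; the `ratings` layout and the `predict` function are illustrative assumptions rather than the project's code.

```python
ratings = {
    "Frida": {"Dress": 5, "Sweater": 3, "Pants": 2},
    "Peter": {"Dress": 3, "Sweater": 4},
    "Magnus": {"Sweater": 2, "Pants": 5},
}


def predict(ratings, user, target):
    """Weighted slope one prediction of `user`'s rating for `target`."""
    numerator = 0.0
    denominator = 0
    for item, own_rating in ratings[user].items():
        if item == target:
            continue
        # Rating differences (target - item) over users who rated both.
        diffs = [r[target] - r[item]
                 for r in ratings.values()
                 if target in r and item in r]
        if not diffs:
            continue
        avg_diff = sum(diffs) / len(diffs)
        # Each per-item prediction is weighted by its number of co-raters.
        numerator += len(diffs) * (own_rating + avg_diff)
        denominator += len(diffs)
    return numerator / denominator if denominator else None


print(round(predict(ratings, "Magnus", "Dress"), 2))  # 4.33
```

The Sweater-based estimate (2.5) counts twice because two users rated both Dress and Sweater, while the Pants-based estimate (8) counts once, giving 13/3.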
Ajax
Piece of cake!
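// On click, load the search results into the list, then expand the results box.
$('#search').click(function () {
  // .text() reads the element's text; use .val() if #search is an input field.
  var query = encodeURIComponent($('#search').text());
  $('#results ul').load('search?q=' + query, function () {
    $('#results').animate({'height': '300px'}, 150);
  });
});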
questions?

Gossip Project Presentation

1,219 views


Project presentation from a class when I was still in school. Don't mind the yellow box; I just added that in Preview to blur out my name.

Published in: Technology


  1. The Gossip. Marcus Frödin
  2. Agenda: Project idea and scope; Target environment; Page scraping; Page rank & slope one predictors; AJAX
  3. Project idea: Finding and ranking music on blogs
  4. Scope: 1. Find blogs automatically 2. Detect and read feeds 3. Find music in feeds 4. Categorize on similarity
  5. Target environment: ...and NetworkX, feedparser, beautifulsoup, ...
  6. Page scraping: 1. Parse XHTML to a tree 2. Find the <link> tag for RSS feed 3. Get the <ul> with the blog roll 4. Repeat for each blog in blog roll
  7. Parsing feeds: 1. Parse feed to a tree 2. Iterate over entries 3. Find RSS enclosures 4. Read description: artist - song
  8. Parsing feeds: artist - song; artist -song; artist - song; artist -- song; artist-song (download); artist - “song”
  9. Page rank
  10. Slope one: ratings for Dress, Sweater, Pants: Frida 5, 3, 2; Peter 3, 4, n/a; Magnus n/a, 2, 5
  11. Slope one: same ratings table; Magnus(Dress|Sweater) = 2 + [(5-3) + (3-4)]/2 = 2.5
  12. Slope one: same ratings table; Magnus(Dress|Sweater) = 2 + [(5-3) + (3-4)]/2 = 2.5; Magnus(Dress|Pants) = 5 + (5-2) = 8
  13. Slope one: same ratings table; Magnus(Dress|Sweater) = 2 + [(5-3) + (3-4)]/2 = 2.5; Magnus(Dress|Pants) = 5 + (5-2) = 8; Magnus(Dress) = [2*M(D|S) + M(D|P)]/(2+1) = 4.33
  14. Ajax: Piece of cake! $('#search').click(function () { $('#results ul').load('search?q=' + encodeURIComponent($('#search').text()), function () { $('#results').animate({'height': '300px'}, 150); }); });
  15. questions?
