Mining Scipy Lectures

4,107 views

Published on

Lecture given at Scipy 2011, Austin, Tx about mining the Scipy Lectures - Visualization and Clustering

Published in: Technology, Education
0 Comments
1 Like
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total views
4,107
On SlideShare
0
From Embeds
0
Number of Embeds
1,495
Actions
Shares
0
Downloads
42
Comments
0
Likes
1
Embeds 0
No embeds

No notes for slide

Mining Scipy Lectures

  1. 1. Mining Lectures Marcel Caraciolo - @marcelcaraciolo 1
  2. 2. Who’s me ? Marcel Pinheiro Caraciolo Brazilian, lover of crabs Director of P&D - brazilian startup Orygens M.S.C Candidate at Data Mining and Recommender Systems Current moderator of the Local Python User Group at Pernambuco Interested at machine learning, recommender systems and mobile computing Blogging about machine learning with Python since 2008 http://aimotion.blogspot.com Young apprentice with Python programming since 2008. 2
  3. 3. How I started this analysis? 24 hours ago... 3
  4. 4. Question How were the topicsdistributed around the Scipy Conference General Sessions ? 4
  5. 5. Scrapping of Scipy Conference Small Web-Crawler for extracting the approved lectures urllib2, re, BeautifulSoap... 5
  6. 6. Resume41 Lectures820 minutes length 6
  7. 7. It means...=~ 4100 tweets posted. 7
  8. 8. Or watch... Star Wars Trilogy 2x 8
  9. 9. Or finish Super Mario Game... 82 x! 9
  10. 10. Na nossa língua agora...Or open the Eclipse Abrir o Eclipse 2 vezes! 2 x! 11 10
  11. 11. Most popular Authors Dharhas Pothina - 3 Wes McKinney - 2 All the others - 1 11
  12. 12. Playing with the text...The most frequent words at the conference nltk, re 12
  13. 13. But let’s take a deeper look. I used the clustering algorithm K-Means Tool used for visualization Ubigraph 13
  14. 14. Distribution of the Lectures Basic Frameworks matplotlib, ipython Building frameworks performance, models, web services Parallelism performance, gpu, statistical Visualization Numpy data analysis, statistical toolkits using Numpy 14
  15. 15. To sum up...Mining english text is so much easier!!! Submit your work also! Spread the scientific python over the communityI expect to be back to Scipy next year! 15
  16. 16. https://github.com/marcelcaraciolo/clustering_scipy Mining Lectures Marcel Caraciolo - @marcelcaraciolo 16

×