Text prospecting

1,133 views
1,071 views

Published on

Lightning talk at OSDC 2008, Sydney

Published in: Technology
0 Comments
1 Like
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total views
1,133
On SlideShare
0
From Embeds
0
Number of Embeds
3
Actions
Shares
0
Downloads
2
Comments
0
Likes
1
Embeds 0
No embeds

No notes for slide

Text prospecting

  1. 1. Text Digging Prospecting With Zotero ( http://zotero.org ) And Perl (http://catalystframework.org)
  2. 2. Stores documents and metadata Scrapes from web, academic databases, Google Scholar Zotero
  3. 3. Zotero
  4. 4. Timeline view (MIT Smile project)
  5. 5. Perl Zotero database – 43 tables
  6. 6. DBIx::Class::Schema::Loader Firefox speaks SQLite DBIx::Class Speaks SQLite Zotero DB: 43 Tables DBIC::Schema::Loader infers relationships perfectly
  7. 7. Index Store Zotero has it's own limited fulltext index Zotero::Meta extends this with Keyphrases (Lingua::EN::Tagger) Entities (Net::Calais or others)
  8. 8. Catalyst, Template, Jquery, Emastic (css framework) “Pretty” Browsable Index
  9. 9. Browse keywords
  10. 10. Browse related keywords
  11. 11. View Documents
  12. 12. View Text Snippets
  13. 13. Supported Platforms Anywhere that Perl and Firefox run
  14. 14. Windows Support Hostile environment - managed desktops - unresponsive support staff Solution: - Portable Firefox - Portable Strawberry Perl
  15. 15. Cat In A Box On A Stick (insert picture here) (is Dr Seus out of copyright yet?)
  16. 16. Open source release early 2009 may be licence issues X-( doesn't have a name yet

×