© 2013 LucidWorks
Edanz Journal Selector: Case Study: aPrototype based on Solr/Nutch/HadoopLiang SHEN@shenzhuxiEuropean Bioinformatics Insti...
© 2013 LucidWorksEdanz Journal Selectora Prototype based on Solr/Nutch/Hadoop
© 2013 LucidWorksEnglish editing for scientists
© 2013 LucidWorksHelp scientists publish papers
© 2013 LucidWorksTarget journal?
© 2013 LucidWorksJournal Selector
© 2013 LucidWorksOpen AccessPubMed
© 2013 LucidWorksJournal TOCscreated in 200921,498 journals from1,677 publishersInstitute for ComputerBased LearningHeriot...
© 2013 LucidWorksPartner• Springer Metadata APIProvides metadata for over 5 million online documents• Springer Open Access...
© 2013 LucidWorksOpen Source Stack• Infrastructure: Amazon Web Service• Data processing: Hadoop/Hive• Index: Solr/Lucene• ...
© 2013 LucidWorksInfrastructure: Amazon EC2
© 2013 LucidWorksData processingHDFSIndexAPIFeedsWebPages
© 2013 LucidWorks<script>http://global.js.widget.eja.hk/ja/edanz_ja/w.js</script>Web service
© 2013 LucidWorksEmbeddable web widget
© 2013 LucidWorksSplit Index for performanceIndex can be divided without losing ranking, if there is always a facet field.
© 2013 LucidWorks@shenzhuxiThanks!Questions?
Upcoming SlideShare
Loading in...5
×

Edanz journal selector case study a prototype based on solr nutch hadoop

301

Published on

Presented by Liang Shen, Developer, European Bioinformatics Institute

I'm going to introduce a project I built in 2011: Edanz Journal Selector. It's a tool for scholars to find the right journals to publish their manuscripts. It will be a typical “How We Did It” Development Case Study.

We built Edanz Journal Selector based on Solr/Lucene/Hadoop/Hive and deployed it on Amazon web servies. I'm going to share experiences about architecture, cloud and etc. from this project.

Published in: Education, Technology
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total Views
301
On Slideshare
0
From Embeds
0
Number of Embeds
0
Actions
Shares
0
Downloads
3
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

Edanz journal selector case study a prototype based on solr nutch hadoop

  1. 1. © 2013 LucidWorks
  2. 2. Edanz Journal Selector: Case Study: aPrototype based on Solr/Nutch/HadoopLiang SHEN@shenzhuxiEuropean Bioinformatics Institute
  3. 3. © 2013 LucidWorksEdanz Journal Selectora Prototype based on Solr/Nutch/Hadoop
  4. 4. © 2013 LucidWorksEnglish editing for scientists
  5. 5. © 2013 LucidWorksHelp scientists publish papers
  6. 6. © 2013 LucidWorksTarget journal?
  7. 7. © 2013 LucidWorksJournal Selector
  8. 8. © 2013 LucidWorksOpen AccessPubMed
  9. 9. © 2013 LucidWorksJournal TOCscreated in 200921,498 journals from1,677 publishersInstitute for ComputerBased LearningHeriot-Watt University
  10. 10. © 2013 LucidWorksPartner• Springer Metadata APIProvides metadata for over 5 million online documents• Springer Open Access APIProvides metadata, full-text content, and images forover 80,000 open access articles
  11. 11. © 2013 LucidWorksOpen Source Stack• Infrastructure: Amazon Web Service• Data processing: Hadoop/Hive• Index: Solr/Lucene• Web service: Drupal• Secret Sauce/Custom Works
  12. 12. © 2013 LucidWorksInfrastructure: Amazon EC2
  13. 13. © 2013 LucidWorksData processingHDFSIndexAPIFeedsWebPages
  14. 14. © 2013 LucidWorks<script>http://global.js.widget.eja.hk/ja/edanz_ja/w.js</script>Web service
  15. 15. © 2013 LucidWorksEmbeddable web widget
  16. 16. © 2013 LucidWorksSplit Index for performanceIndex can be divided without losing ranking, if there is always a facet field.
  17. 17. © 2013 LucidWorks@shenzhuxiThanks!Questions?
  1. A particular slide catching your eye?

    Clipping is a handy way to collect important slides you want to go back to later.

×