Your SlideShare is downloading. ×
Solr and ManifoldCF
Upcoming SlideShare
Loading in...5
×

Thanks for flagging this SlideShare!

Oops! An error has occurred.

×
Saving this for later? Get the SlideShare app to save on your phone or tablet. Read anywhere, anytime – even offline.
Text the download link to your phone
Standard text messaging rates apply

Solr and ManifoldCF

539

Published on

Published in: Technology
0 Comments
1 Like
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total Views
539
On Slideshare
0
From Embeds
0
Number of Embeds
0
Actions
Shares
0
Downloads
4
Comments
0
Likes
1
Embeds 0
No embeds

Report content
Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
No notes for slide

Transcript

  • 1. Solr And ManifoldCF minoru@apache.org
  • 2. Who am I ? 大須賀 稔 (Minoru Osuka)
 <minoru@apache.org>! Committer and PMC member of ManifoldCF 
 at Apache Software Foundation.! Senior Consultant
 at RONDHUIT CO, Ltd.! Formerly Senior Application Engineer
 at Rakuten, Inc.
  • 3. What I do ? Installation support for Solr as an IT consultant.! ! Solr trainer.! ! Solr and ManifoldCF developer.
  • 4. Contents What is ManifoldCF ?! Project status! Architecture! Use case! Resources! Books! Demonstration
  • 5. What is ManifoldCF ? Open Source Crawler! Admin GUI! Built-in scheduler! Job Management! Get contents from repositories! Status Report! Push contents to another servers! History Report Authority Service! Security Search Component Plugin! REST API
  • 6. Project status Latest version : 1.3! IBM FileNet ! Solr! Atlassian JIRA! Elasticsearch! Dropbox! MetaCarta Geographic Text Search! Google Drive! OpenSearchServer! Windows Shares! Microsoft SharePoint 2003/2007/2010! HDFS ! Alfresco! Generic File System! OpenCMIS! Generic JDBC! EMC Documentum! Generic Web! Autonomy Meridio Generic RSS
  • 7. Architecture Push
 Contents Security
 Search Component
 Plugin Output Connector Job Repository Connector Authority Service Security
 Search Component
 Plugin SharePoint
 Plugin Get
 ACLs
  • 8. Use case Web Search Engine! 3. Indexing the Web contents Solr! Hadoop! HDFS Repository
 Connector Solr
 Connector HDFS / MapReduce! ManifoldCF! Solr Connector! HDFS Connector! Web Connector 2. Reduce the HTML noise / Calculate the page rank HDFS Output
 Connector 1. Crawling the Web contents Web
 Connector
  • 9. Demonstration
  • 10. Resources Project Home
 http://manifoldcf.apache.org/! Javadoc
 http://manifoldcf.apache.org/release/trunk/en_US/javadoc.html! Source code
 http://svn.apache.org/repos/asf/manifoldcf/! JIRA
 https://issues.apache.org/jira/browse/CONNECTORSC! Confluence
 https://cwiki.apache.org/confluence/display/CONNECTORS/Index
  • 11. Books ManifoldCF in Action! http://www.manning.com/wright/
  • 12. PR Seminar in RONDHUIT! Apache Solr ご紹介セミナー! Training in RONDHUIT! Solr 4 基礎 / 応用 / クラウド分散運用 / DIH! ManifoldCF 入門
  • 13. Now Hiring ! We are looking for human resources with the desire to grow together and continue to create the future.! Consultant! Technical Support Engineer
  • 14. Thank you for your attention !

×