October 13-16, 2016 • Austin, TX

Solr & R to deploy Custom Search Interfaces

Patrick Beaucamp
Chairman, Bpm-Conseil, France
patrick.beaucamp@bpm-conseil.com
  
Presentation Agenda

Solr & R Integration inside AklaBox
AklaBox Presentation
AklaBox & Solr + R & GoJS & OSM
Demo Platform: AklaBox
Going further: Vanilla Air, Spark & R & Solr

Certified on Cloudera & HortonWorks
Run on Hadoop: Solr/Cloud, HDFS ...
Ready for OpenStack
  
AklaBox Presentation

User Interface
• Upload your documents
• Share your documents
• Collaborate on documents
• Search on documents
• Synchronize your documents
• Publish your documents

Document Viewer
WorkFlow
Synchro
Mobile
Standard Search Interface
  
Solr & R Integration inside AklaBox

• Why do I get this list when I search inside the document repository?
• What counts when I run a search: the weight of every word?
• If a word appears 100 times in a document, is the document more valuable for my search?
• Maybe the document I'm looking for does not use the exact word spelling?
• How do I take multi-language support into account?
  
• We need to review our module and rethink how we can help users deploy their own search policy
• R was a natural choice to create a new search algorithm
  – We use R for our Data Mining development
  – R contains packages to inspect documents
  – R has virtually no limit when it comes to analyzing and classifying documents
  – We read a lot about R & search engines …
  
• When do we analyze documents with R:
  – Before Solr indexing
  – After Solr indexing
• Choice:
  – Before Solr indexing
  – We add metadata to every document, like top words, class of document ...
  – We create classes for documents, and relations between classes
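
The "analyze before indexing" choice above can be sketched as follows. This is a minimal illustration in Python rather than R, and everything in it is an assumption made for the example: the field names (`keywords`, `doc_class`), the tiny stopword list, and the `enrich` helper are hypothetical and do not come from AklaBox. It only shows the idea of attaching top words and a class to a document record before that record is handed to the indexer.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stopword list for the example.
STOPWORDS = {"the", "a", "an", "and", "of", "to", "in", "is", "for", "on"}

def top_words(text, limit=5):
    """Return the `limit` most frequent non-stopword tokens in `text`."""
    tokens = re.findall(r"[a-z]+", text.lower())
    counts = Counter(t for t in tokens if t not in STOPWORDS)
    # Sort by descending count, then alphabetically, so ties are deterministic.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [word for word, _ in ranked[:limit]]

def enrich(doc_id, text, doc_class, limit=5):
    """Build the record that would be sent to the indexer.

    The field names are hypothetical; a real schema would define its own.
    """
    return {
        "id": doc_id,
        "content": text,
        "keywords": top_words(text, limit),  # metadata: top words
        "doc_class": doc_class,              # metadata: class of document
    }

record = enrich(
    "doc-1",
    "Solr indexes the document and Solr searches the document index",
    "search",
)
print(record["keywords"])
```

Because the metadata is computed before the record reaches the index, the search engine itself needs no change, which matches the "no impact on Solr" point made later in the deck.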
  
	
  
Keywords are added inside the Solr index
  
R Packages:

• tm: text mining functions (stemming, word frequency, word manipulation, etc.)
• TF-IDF function (Term Frequency, Inverse Document Frequency)
• Matrix: for complex matrix manipulation
• cluster: fanny & kmeans functions, to calculate classes on various groups
• libsvm: svm, predict and tune functions, for automatic word classification
• sampling: to create & manipulate different data sets
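
To make the TF-IDF item above concrete, here is a small sketch of the textbook weighting, written in Python for illustration. It is not the R implementation used in the talk: the natural logarithm, the length-normalised term frequency, and the `tf_idf` helper name are assumptions of this example, and R packages may normalise differently.

```python
import math
from collections import Counter

def tf_idf(corpus):
    """Compute TF-IDF weights.

    corpus: list of documents, each a list of tokens.
    Returns one {term: weight} dict per document, where
    weight = (count / document length) * ln(N / document frequency).
    """
    n_docs = len(corpus)
    # Document frequency: in how many documents each term appears.
    df = Counter(term for doc in corpus for term in set(doc))
    weights = []
    for doc in corpus:
        counts = Counter(doc)
        total = len(doc)
        weights.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return weights

corpus = [
    ["solr", "search", "index"],
    ["r", "search", "cluster"],
    ["solr", "r", "search"],
]
weights = tf_idf(corpus)
print(weights[0])
```

In this toy corpus, "search" occurs in every document, so its weight is zero everywhere, while "index" (present in one document only) outweighs "solr" (present in two). This is one way to answer the earlier question about whether a frequently repeated word makes a document more valuable: a term only counts if it also discriminates between documents.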
  
	
  
+
• The R algorithm runs when the document is uploaded
• We keep only a small number of words per document (parameter)
• We create classes for documents
• We can manage other concerns, such as internationalisation
• The R package can be switched (other algorithm, new deployment)
  – easy & flexible to deploy and maintain
• No impact on Solr

-
• The Solr index is a gold mine … and we don't run analysis on it
  
AklaBox & Solr + R & GoJS & OSM

Mind Map with Words association
Map Visualization
OSM Visualization
  
Demonstration

• Other Business Cases
  – Document Management: pre-classification of documents (pharmaceutical industry)
  – Search engine: analysis of websites during the crawling process
• Open Door to New Development
  – Phonetic search (to solve the word spelling problem)
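
The slides do not say which phonetic algorithm is intended. As one possible illustration, the sketch below implements classic American Soundex in Python: words that sound alike map to the same four-character code, so a query can match a document even when the spelling differs. The choice of Soundex and the `soundex` function name are assumptions of this example.

```python
# Consonant groups of American Soundex; vowels and y carry no code.
CODES = {}
for digit, letters in {
    "1": "bfpv", "2": "cgjkqsxz", "3": "dt",
    "4": "l", "5": "mn", "6": "r",
}.items():
    for ch in letters:
        CODES[ch] = digit

def soundex(word):
    """Return the 4-character Soundex code of `word` ('' if it has no ASCII letters)."""
    letters = [c for c in word.lower() if "a" <= c <= "z"]
    if not letters:
        return ""
    first = letters[0]
    result = first.upper()
    prev = CODES.get(first, "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters that share a code
        code = CODES.get(ch, "")
        if code and code != prev:
            result += code
        # A vowel resets prev to "", so the same code may repeat after it.
        prev = code
    return (result + "000")[:4]  # pad with zeros, keep four characters

for name in ("Robert", "Rupert", "Smith", "Smyth"):
    print(name, soundex(name))
```

A phonetic code like this could be stored as one more metadata field at upload time, next to the keywords and the document class, and queried with the code of the search term.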
  
	
  
Vanilla Air, Spark, Spark SQL for Solr

New technologies are emerging … well, they are already here!

• Vanilla Air
  – Can process R packages
  – Can scale with a growing number of documents

www.vanillasmartdata.com

Easy switch in architecture -> scalability

Vanilla Air, Spark, Spark & R & Solr

Spark 1.5
Version 1.5 (Sept 2015): support for YARN cluster mode in R
  
We now have Spark & Solr tools: SolrRDD

Tools for reading data from Solr as a Spark RDD and indexing objects from Spark into Solr using SolrJ
https://github.com/LucidWorks/spark-solr

Admin side: running a complex R program on the Solr index, using Vanilla Air

Lucky One!
  
Solr & R to Deploy Custom Search Interface: Presented by Patrick Beaucamp, Bpm-Conseil

Where Search Meets Science and Style Meets Savings: Nordstrom Rack's Journey ...Where Search Meets Science and Style Meets Savings: Nordstrom Rack's Journey ...
Where Search Meets Science and Style Meets Savings: Nordstrom Rack's Journey ...Lucidworks
 
Apply Knowledge Graphs and Search for Real-World Decision Intelligence
Apply Knowledge Graphs and Search for Real-World Decision IntelligenceApply Knowledge Graphs and Search for Real-World Decision Intelligence
Apply Knowledge Graphs and Search for Real-World Decision IntelligenceLucidworks
 
Webinar: Building a Business Case for Enterprise Search
Webinar: Building a Business Case for Enterprise SearchWebinar: Building a Business Case for Enterprise Search
Webinar: Building a Business Case for Enterprise SearchLucidworks
 
Why Insight Engines Matter in 2020 and Beyond
Why Insight Engines Matter in 2020 and BeyondWhy Insight Engines Matter in 2020 and Beyond
Why Insight Engines Matter in 2020 and BeyondLucidworks
 
Solr & R to Deploy Custom Search Interface: Presented by Patrick Beaucamp, Bpm-Conseil

  • 1. October 13-16, 2016 • Austin, TX
  • 2. Solr & R to Deploy Custom Search Interfaces • Patrick Beaucamp, Chairman, Bpm-Conseil, France • patrick.beaucamp@bpm-conseil.com
  • 3. Presentation Agenda • Solr & R Integration inside AklaBox • AklaBox Presentation • AklaBox & Solr + R & GoJS & OSM • Demo Platform: AklaBox • Going further: Vanilla Air, Spark & R & Solr
  • 4. AklaBox Presentation • Certified on Cloudera & HortonWorks • Runs on Hadoop: Solr/Cloud, HDFS ... • Ready for OpenStack
  • 6. AklaBox Presentation • Upload your documents • Share your documents • Collaborate on documents • Search on documents • Synchronize your documents • Publish your documents • Document Viewer
  • 7. AklaBox Presentation • WorkFlow • Synchro • Mobile
  • 8. AklaBox Presentation • Standard Search Interface
  • 9. Solr & R Integration inside AklaBox • Why do I get this list when I search inside the document repository? • What counts when I run a search: the weight of every word? • If a word appears 100 times in a document, is the document more valuable for my search? • Maybe the document I'm looking for does not use the exact word spelling? • How do I take multi-language support into account?
  • 10. Solr & R Integration inside AklaBox • We need to review our module and rethink how we can help users deploy their own search policy • R was a natural choice to create a new search algorithm • We use R for our data mining development • R contains packages to inspect documents • R has virtually no limit when it comes to analyzing and classifying documents • We read a lot about R & search engines ...
  • 11. Solr & R Integration inside AklaBox • When do we analyze documents with R: • Before Solr indexing • After Solr indexing • Choice: • Before Solr indexing • We add metadata to every document, such as top words, class of document ... • We create classes for documents, and relations between classes
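The "analyze before indexing" choice on slide 11 can be sketched as follows. This is a minimal illustration in Python, not the AklaBox implementation (the slides use R): the stop-word list, the field names `keywords` and `doc_class`, and the helper names are all assumptions made for the example. It computes the top words of a document and attaches them, together with a class label, as metadata fields of a JSON document that could then be sent to an indexer.

```python
import json
import re
from collections import Counter

# Hypothetical stop-word list and field names; neither comes from the slides.
STOP_WORDS = {"the", "a", "an", "of", "and", "to", "in", "is", "for", "on"}


def top_words(text, n=5):
    """Return the n most frequent words in text, ignoring stop words."""
    tokens = re.findall(r"[a-z]+", text.lower())
    counts = Counter(t for t in tokens if t not in STOP_WORDS)
    return [word for word, _ in counts.most_common(n)]


def build_index_document(doc_id, text, doc_class, n=5):
    """Attach the analysis results as metadata fields before indexing."""
    return {
        "id": doc_id,
        "content": text,
        "keywords": top_words(text, n),  # assumed field name
        "doc_class": doc_class,          # assumed field name
    }


doc = build_index_document(
    "doc-1",
    "Solr indexes the document. The document is searched in Solr by keyword.",
    doc_class="search",
    n=2,
)
# JSON array of documents, ready to be handed to whatever indexing client is used.
payload = json.dumps([doc])
```

Because the metadata is computed before indexing, the search engine only ever sees ordinary fields, which matches the "no impact on Solr" point made later in the deck.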
  • 12. Solr & R Integration inside AklaBox • Keywords are added inside the Solr index
  • 13. Solr & R Integration inside AklaBox
  • 14. Solr & R Integration inside AklaBox
  • 15. Solr & R Integration inside AklaBox • R Packages: • tm, text mining functions (stemming, word frequency, word manipulation, etc.) • TF-IDF function (term frequency, inverse document frequency) • Matrix, for complex matrix manipulation • cluster: fanny & kmeans functions, to calculate classes on various groups • libsvm: svm, predict and tune functions, for automatic word classification • Sampling, to create & manipulate different data sets
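To make the TF-IDF idea from slide 15 concrete, here is a small sketch in plain Python. It is not the R `tm` code used in the talk; the function names and the sample corpus are invented for illustration, and the weighting is the simplest textbook form (relative term frequency times the natural log of the inverse document frequency). It also illustrates the question raised on slide 9: a word that is frequent in one document but common to the whole corpus scores low.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lower-case the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def tf_idf_keywords(documents, top_n=3):
    """Return, for each document, its top_n terms ranked by TF-IDF."""
    tokenized = [tokenize(d) for d in documents]
    n_docs = len(tokenized)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    result = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens) or 1
        scores = {
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        }
        # Highest score first; ties are broken alphabetically.
        ranked = sorted(scores, key=lambda t: (-scores[t], t))
        result.append(ranked[:top_n])
    return result


corpus = [
    "solr search index search",
    "r clustering of documents",
    "solr index of documents",
]
keywords = tf_idf_keywords(corpus, top_n=1)
```

The `top_n` argument plays the role of the "number of words per document" parameter mentioned on the following slide.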
  • 16. Solr & R Integration inside AklaBox • (+) • The R algorithm runs when the document is uploaded • We keep only a small number of words per document (parameter) • We create classes for documents • We can manage other concerns, such as internationalisation • The R package can be switched (other algorithm, new deployment) • Easy & flexible to deploy and maintain • No impact on Solr • (-) • The Solr index is a gold mine ... and we don't run analysis on it
  • 17. AklaBox & Solr + R & GoJS & OSM
  • 18. AklaBox & Solr + R & GoJS & OSM • Mind map with word associations
  • 19. AklaBox & Solr + R & GoJS & OSM • Map visualization • OSM visualization
  • 21. Demonstration • Other business cases • Document management: pre-classification of documents (pharmaceutical industry) • Search engine: analysis of websites during the crawling process • Open door to new developments • Phonetic search (to solve the word spelling problem)
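The phonetic search mentioned on slide 21 can be illustrated with the classic American Soundex code, which maps words that sound alike to the same four-character key. The slides do not say which phonetic algorithm is intended, so Soundex is an assumption chosen for brevity, and the Python below is only a sketch of the idea.

```python
def soundex(word):
    """Return the four-character American Soundex key of a word."""
    codes = {}
    for letters, digit in (
        ("bfpv", "1"),
        ("cgjkqsxz", "2"),
        ("dt", "3"),
        ("l", "4"),
        ("mn", "5"),
        ("r", "6"),
    ):
        for ch in letters:
            codes[ch] = digit

    letters = [c for c in word.lower() if c.isalpha()]
    if not letters:
        return ""

    first = letters[0]
    encoded = first.upper()
    previous = codes.get(first, "")
    for ch in letters[1:]:
        digit = codes.get(ch, "")
        # Skip letters without a code and repeats of the previous code.
        if digit and digit != previous:
            encoded += digit
        # 'h' and 'w' do not separate letters with the same code; vowels do.
        if ch not in "hw":
            previous = digit
    # Pad with zeros and keep the first four characters.
    return (encoded + "000")[:4]


# A misspelled query term and the correctly spelled word share one key.
same_key = soundex("serch") == soundex("search")
```

Indexing such a key next to each keyword would let a misspelled query term match the correctly spelled word.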
  • 22. Vanilla Air, Spark, Spark SQL for Solr • New technologies are emerging ... well: they are already here!
  • 23. Vanilla Air, Spark, Spark SQL for Solr • Vanilla Air - Can process R packages - Can scale with a growing number of documents • www.vanillasmartdata.com
  • 24. Vanilla Air, Spark, Spark SQL for Solr • Easy switch in architecture -> scalability
  • 25. Vanilla Air, Spark, Spark & R & Solr • Spark 1.5 • Version 1.5 (Sept. 2015): support for YARN cluster mode in R
  • 26. Vanilla Air, Spark, Spark & R & Solr • We now have Spark & Solr tools: SolrRDD • Tools for reading data from Solr as a Spark RDD and indexing objects from Spark into Solr using SolrJ • https://github.com/LucidWorks/spark-solr
  • 27. Vanilla Air, Spark, Spark & R & Solr • Admin side: running complex R programs on the Solr index, using Vanilla Air
  • 28.