SlideShare a Scribd company logo
1 of 12
Justia’s Cloud Farmer   Software Developer
@NickMoline
http://www.nick.pro/
Pros                                        Cons
Really simple to set up                     No discrete field searching
(for web pages or documents)                (other than things like title:)
Can run inside your firewall                Physical branded box to install and
                                            support
Great highlighting and snippet generation   Very limited control over look / feel of
                                            results
Creates “Cached Version” even of PDFs
                                            No Geospatial
                                            No JSON version
                                            Discontinued
Pros                                           Cons
Really simple to set up                        No discrete field searching
(for web pages or documents)                   (other than things like title:)
If your site is indexed, no wait time to get   Minimal control over new content getting
started                                        indexed on your terms
Great highlighting and snippet generation      Very limited control over look / feel of
                                               results
                                               With JSON/XML version can only return 4
                                               or 8 results at a time
                                               No Geospatial
Pros                                      Cons
Very fast for searching                   Very slow for indexing
Full control of when content is indexed   Requires reindexing ALL content, every
                                          time
Good Geospatial Search built in
Newer versions can be connected to with   Doesn’t return any of the textual content,
MySQL libraries and queried like a DB     so requires a separate database query
                                          ALWAYS
Filters & Faceting                        Only on Numeric fields
Field Boosting!
Pros                                      Cons
Very extendable and configurable          Very difficult to optimize performance
Full control of when content is indexed   Adding Content
Geospatial with “LocalSOLR” plugin
Returns content                           More content you return, slower it gets
Does highlighting                         Highlighting is not good performance
Tons of Faceting options
Sharding and Cores                        Again, hard to optimize
Field Boosting
Document Boosting!
Pros                                    Cons
Extremely fast
Automatically Scales (no thinking)      No control of the scaling
Automatically Shards when adding
content
Easy Re-indexing of content
Returns content for creating snippets
Easy JSON implementation
                                        No geo (yet)
                                        No highlighting/snippet gen (yet)
                                        No field boosting (yet)
• Duplicate Word Mark field as both text and literal
• Do 4 Searches:
   • Exact Word Mark Match
         • bq=(and type:'trademark_case' literal_word_mark:'amazon')
   • Prefix Word Mark Match
         • bq=(and type:'trademark_case' (and (not literal_word_mark:'amazon')
           literal_word_mark:'amazon*'))
   • Anywhere Word Mark Match
         • bq=(and type:'trademark_case' (and (not literal_word_mark:'amazon*')
           word_mark:'amazon'))
   • Full Text Search
         • bq=(and type:'trademark_case' (and (not literal_word_mark:'amazon*')
           (not literal_word_mark:'amazon') (not word_mark:'amazon')))
• Pass counts with pagination links
Justia’s Cloud Farmer   Software Developer
@NickMoline
http://www.nick.pro/

More Related Content

Viewers also liked

Who are SBS.doc
Who are SBS.docWho are SBS.doc
Who are SBS.docSally15
 
まちづくりと復興について
まちづくりと復興についてまちづくりと復興について
まちづくりと復興についてJunichi Toyoshima
 
Bcsl 033 data and file structures lab s3-1
Bcsl 033 data and file structures lab s3-1Bcsl 033 data and file structures lab s3-1
Bcsl 033 data and file structures lab s3-1Dr. Loganathan R
 
Clipacevedoslideshare 091126171514-phpapp01
Clipacevedoslideshare 091126171514-phpapp01Clipacevedoslideshare 091126171514-phpapp01
Clipacevedoslideshare 091126171514-phpapp01liveware82
 
Second Stage Booster: Optimizing Drupal and Wordpress for SEO, Speed and Soci...
Second Stage Booster: Optimizing Drupal and Wordpress for SEO, Speed and Soci...Second Stage Booster: Optimizing Drupal and Wordpress for SEO, Speed and Soci...
Second Stage Booster: Optimizing Drupal and Wordpress for SEO, Speed and Soci...Nick Moline
 
world with problem
world with problemworld with problem
world with problemarcjoan
 
The Afterburner - Optimizing Drupal for Speed and SEO
The Afterburner - Optimizing Drupal for Speed and SEOThe Afterburner - Optimizing Drupal for Speed and SEO
The Afterburner - Optimizing Drupal for Speed and SEONick Moline
 
Kobie Quarterly Review: Retail Edition, June 2013
Kobie Quarterly Review: Retail Edition, June 2013Kobie Quarterly Review: Retail Edition, June 2013
Kobie Quarterly Review: Retail Edition, June 2013Jennifer Lingerfelt
 
Guide To Qrops Pension Planning
Guide To Qrops Pension PlanningGuide To Qrops Pension Planning
Guide To Qrops Pension Planninggregsmithhk
 
#ThroughGlass : An Introduction to Google Glass
#ThroughGlass : An Introduction to Google Glass#ThroughGlass : An Introduction to Google Glass
#ThroughGlass : An Introduction to Google GlassNick Moline
 
Individual differences and call ppt
Individual differences and call pptIndividual differences and call ppt
Individual differences and call pptgianbisa
 
Sarcs sexii power point final
Sarcs sexii power point finalSarcs sexii power point final
Sarcs sexii power point finalMichael Greenwell
 
Who are SBS.doc
Who are SBS.docWho are SBS.doc
Who are SBS.docSally15
 
The story of_intenet_1
The story of_intenet_1The story of_intenet_1
The story of_intenet_1Jorgelcb
 
Reconstruction and town planning of ishinomaki
Reconstruction and town planning of ishinomakiReconstruction and town planning of ishinomaki
Reconstruction and town planning of ishinomakiJunichi Toyoshima
 

Viewers also liked (20)

Who are SBS.doc
Who are SBS.docWho are SBS.doc
Who are SBS.doc
 
まちづくりと復興について
まちづくりと復興についてまちづくりと復興について
まちづくりと復興について
 
Bcsl 033 data and file structures lab s3-1
Bcsl 033 data and file structures lab s3-1Bcsl 033 data and file structures lab s3-1
Bcsl 033 data and file structures lab s3-1
 
Clipacevedoslideshare 091126171514-phpapp01
Clipacevedoslideshare 091126171514-phpapp01Clipacevedoslideshare 091126171514-phpapp01
Clipacevedoslideshare 091126171514-phpapp01
 
Second Stage Booster: Optimizing Drupal and Wordpress for SEO, Speed and Soci...
Second Stage Booster: Optimizing Drupal and Wordpress for SEO, Speed and Soci...Second Stage Booster: Optimizing Drupal and Wordpress for SEO, Speed and Soci...
Second Stage Booster: Optimizing Drupal and Wordpress for SEO, Speed and Soci...
 
world with problem
world with problemworld with problem
world with problem
 
The Afterburner - Optimizing Drupal for Speed and SEO
The Afterburner - Optimizing Drupal for Speed and SEOThe Afterburner - Optimizing Drupal for Speed and SEO
The Afterburner - Optimizing Drupal for Speed and SEO
 
Kobie Quarterly Review: Retail Edition, June 2013
Kobie Quarterly Review: Retail Edition, June 2013Kobie Quarterly Review: Retail Edition, June 2013
Kobie Quarterly Review: Retail Edition, June 2013
 
Guide To Qrops Pension Planning
Guide To Qrops Pension PlanningGuide To Qrops Pension Planning
Guide To Qrops Pension Planning
 
Ning post #2
Ning post #2Ning post #2
Ning post #2
 
Ning post #3
Ning post #3Ning post #3
Ning post #3
 
#ThroughGlass : An Introduction to Google Glass
#ThroughGlass : An Introduction to Google Glass#ThroughGlass : An Introduction to Google Glass
#ThroughGlass : An Introduction to Google Glass
 
Individual differences and call ppt
Individual differences and call pptIndividual differences and call ppt
Individual differences and call ppt
 
Sarcs sexii power point final
Sarcs sexii power point finalSarcs sexii power point final
Sarcs sexii power point final
 
Who are SBS.doc
Who are SBS.docWho are SBS.doc
Who are SBS.doc
 
The story of_intenet_1
The story of_intenet_1The story of_intenet_1
The story of_intenet_1
 
Ning post #4
Ning post #4Ning post #4
Ning post #4
 
Ning post 2
Ning post 2Ning post 2
Ning post 2
 
Stres na drodze - raport
Stres na drodze - raportStres na drodze - raport
Stres na drodze - raport
 
Reconstruction and town planning of ishinomaki
Reconstruction and town planning of ishinomakiReconstruction and town planning of ishinomaki
Reconstruction and town planning of ishinomaki
 

Similar to Justia and Amazon CloudSearch

Introduction to MongoDB
Introduction to MongoDBIntroduction to MongoDB
Introduction to MongoDBJustin Smestad
 
TERMINALFOUR t44u 2012 - Publish Performance
TERMINALFOUR t44u 2012 - Publish PerformanceTERMINALFOUR t44u 2012 - Publish Performance
TERMINALFOUR t44u 2012 - Publish PerformanceTerminalfour
 
Between a Block & a Hard Place
Between a Block & a Hard PlaceBetween a Block & a Hard Place
Between a Block & a Hard PlaceWP Engine
 
N tier web applications
N tier web applicationsN tier web applications
N tier web applicationsMike Feltman
 
DBSight Introduction
DBSight IntroductionDBSight Introduction
DBSight Introductionchrislusf
 
CosmosDB for DBAs & Developers
CosmosDB for DBAs & DevelopersCosmosDB for DBAs & Developers
CosmosDB for DBAs & DevelopersNiko Neugebauer
 
NoSQL, Apache SOLR and Apache Hadoop
NoSQL, Apache SOLR and Apache HadoopNoSQL, Apache SOLR and Apache Hadoop
NoSQL, Apache SOLR and Apache HadoopDmitry Kan
 
WebObjects Developer Tools
WebObjects Developer ToolsWebObjects Developer Tools
WebObjects Developer ToolsWO Community
 
SharePoint 2014: Where to save my data, for devs!
SharePoint 2014: Where to save my data, for devs!SharePoint 2014: Where to save my data, for devs!
SharePoint 2014: Where to save my data, for devs!Ben Steinhauser
 
Implementación de tfs 2010 en entornos complejos (cómo y por qué) v03
Implementación de tfs 2010 en entornos complejos (cómo y por qué) v03Implementación de tfs 2010 en entornos complejos (cómo y por qué) v03
Implementación de tfs 2010 en entornos complejos (cómo y por qué) v03Diego Ferreyra
 
Alfresco 5.0 Technology Review
Alfresco 5.0 Technology ReviewAlfresco 5.0 Technology Review
Alfresco 5.0 Technology ReviewZia Consulting
 
Extending Your Reach with Career Portal and Search Engine Optimization
Extending Your Reach with Career Portal and Search Engine OptimizationExtending Your Reach with Career Portal and Search Engine Optimization
Extending Your Reach with Career Portal and Search Engine OptimizationJeremyOtt5
 
AWS re:Invent 2016: Case Study: How Spokeo Improved Web Application Response ...
AWS re:Invent 2016: Case Study: How Spokeo Improved Web Application Response ...AWS re:Invent 2016: Case Study: How Spokeo Improved Web Application Response ...
AWS re:Invent 2016: Case Study: How Spokeo Improved Web Application Response ...Amazon Web Services
 
Local Storage for Web Applications
Local Storage for Web ApplicationsLocal Storage for Web Applications
Local Storage for Web ApplicationsMarkku Laine
 
Django Deployment with Fabric
Django Deployment with FabricDjango Deployment with Fabric
Django Deployment with FabricJonas Nockert
 
JavaScript Build System Battle Royale | PrDC 2017
JavaScript Build System Battle Royale | PrDC 2017JavaScript Build System Battle Royale | PrDC 2017
JavaScript Build System Battle Royale | PrDC 2017David Wesst
 
Web Test Automation Framework - IndicThreads Conference
Web Test Automation Framework  - IndicThreads ConferenceWeb Test Automation Framework  - IndicThreads Conference
Web Test Automation Framework - IndicThreads ConferenceIndicThreads
 
Exploring MongoDB & Elasticsearch: Better Together
Exploring MongoDB & Elasticsearch: Better TogetherExploring MongoDB & Elasticsearch: Better Together
Exploring MongoDB & Elasticsearch: Better TogetherObjectRocket
 
Modern JavaScript Talk
Modern JavaScript TalkModern JavaScript Talk
Modern JavaScript TalkDerek Binkley
 

Similar to Justia and Amazon CloudSearch (20)

Introduction to MongoDB
Introduction to MongoDBIntroduction to MongoDB
Introduction to MongoDB
 
TERMINALFOUR t44u 2012 - Publish Performance
TERMINALFOUR t44u 2012 - Publish PerformanceTERMINALFOUR t44u 2012 - Publish Performance
TERMINALFOUR t44u 2012 - Publish Performance
 
Between a Block & a Hard Place
Between a Block & a Hard PlaceBetween a Block & a Hard Place
Between a Block & a Hard Place
 
N tier web applications
N tier web applicationsN tier web applications
N tier web applications
 
DBSight Introduction
DBSight IntroductionDBSight Introduction
DBSight Introduction
 
CosmosDB for DBAs & Developers
CosmosDB for DBAs & DevelopersCosmosDB for DBAs & Developers
CosmosDB for DBAs & Developers
 
NoSQL, Apache SOLR and Apache Hadoop
NoSQL, Apache SOLR and Apache HadoopNoSQL, Apache SOLR and Apache Hadoop
NoSQL, Apache SOLR and Apache Hadoop
 
WebObjects Developer Tools
WebObjects Developer ToolsWebObjects Developer Tools
WebObjects Developer Tools
 
50 Shades of SharePoint: SharePoint 2013 Insanity Demystified
50 Shades of SharePoint: SharePoint 2013 Insanity Demystified50 Shades of SharePoint: SharePoint 2013 Insanity Demystified
50 Shades of SharePoint: SharePoint 2013 Insanity Demystified
 
SharePoint 2014: Where to save my data, for devs!
SharePoint 2014: Where to save my data, for devs!SharePoint 2014: Where to save my data, for devs!
SharePoint 2014: Where to save my data, for devs!
 
Implementación de tfs 2010 en entornos complejos (cómo y por qué) v03
Implementación de tfs 2010 en entornos complejos (cómo y por qué) v03Implementación de tfs 2010 en entornos complejos (cómo y por qué) v03
Implementación de tfs 2010 en entornos complejos (cómo y por qué) v03
 
Alfresco 5.0 Technology Review
Alfresco 5.0 Technology ReviewAlfresco 5.0 Technology Review
Alfresco 5.0 Technology Review
 
Extending Your Reach with Career Portal and Search Engine Optimization
Extending Your Reach with Career Portal and Search Engine OptimizationExtending Your Reach with Career Portal and Search Engine Optimization
Extending Your Reach with Career Portal and Search Engine Optimization
 
AWS re:Invent 2016: Case Study: How Spokeo Improved Web Application Response ...
AWS re:Invent 2016: Case Study: How Spokeo Improved Web Application Response ...AWS re:Invent 2016: Case Study: How Spokeo Improved Web Application Response ...
AWS re:Invent 2016: Case Study: How Spokeo Improved Web Application Response ...
 
Local Storage for Web Applications
Local Storage for Web ApplicationsLocal Storage for Web Applications
Local Storage for Web Applications
 
Django Deployment with Fabric
Django Deployment with FabricDjango Deployment with Fabric
Django Deployment with Fabric
 
JavaScript Build System Battle Royale | PrDC 2017
JavaScript Build System Battle Royale | PrDC 2017JavaScript Build System Battle Royale | PrDC 2017
JavaScript Build System Battle Royale | PrDC 2017
 
Web Test Automation Framework - IndicThreads Conference
Web Test Automation Framework  - IndicThreads ConferenceWeb Test Automation Framework  - IndicThreads Conference
Web Test Automation Framework - IndicThreads Conference
 
Exploring MongoDB & Elasticsearch: Better Together
Exploring MongoDB & Elasticsearch: Better TogetherExploring MongoDB & Elasticsearch: Better Together
Exploring MongoDB & Elasticsearch: Better Together
 
Modern JavaScript Talk
Modern JavaScript TalkModern JavaScript Talk
Modern JavaScript Talk
 

Justia and Amazon CloudSearch

  • 1. Justia’s Cloud Farmer Software Developer @NickMoline http://www.nick.pro/
  • 2.
  • 3.
  • 4. Pros Cons Really simple to set up No discrete field searching (for web pages or documents) (other than things like title:) Can run inside your firewall Physical branded box to install and support Great highlighting and snippet generation Very limited control over look / feel of results Creates “Cached Version” even of PDFs No Geospatial No JSON version Discontinued
  • 5. Pros Cons Really simple to set up No discrete field searching (for web pages or documents) (other than things like title:) If your site is indexed, no wait time to get Minimal control over new content getting started indexed on your terms Great highlighting and snippet generation Very limited control over look / feel of results With JSON/XML version can only return 4 or 8 results at a time No Geospatial
  • 6. Pros Cons Very fast for searching Very slow for indexing Full control of when content is indexed Requires reindexing ALL content, every time Good Geospatial Search built in Newer versions can be connected to with Doesn’t return any of the textual content, MySQL libraries and queried like a DB so requires a separate database query ALWAYS Filters & Faceting Only on Numeric fields Field Boosting!
  • 7. Pros Cons Very extendable and configurable Very difficult to optimize performance Full control of when content is indexed Adding Content Geospatial with “LocalSOLR” plugin Returns content More content you return, slower it gets Does highlighting Highlighting is not good performance Tons of Faceting options Sharding and Cores Again, hard to optimize Field Boosting Document Boosting!
  • 8. Pros Cons Extremely fast Automatically Scales (no thinking) No control of the scaling Automatically Shards when adding content Easy Re-indexing of content Returns content for creating snippets Easy JSON implementation No geo (yet) No highlighting/snippet gen (yet) No field boosting (yet)
  • 9.
  • 10. • Duplicate Word Mark field as both text and literal • Do 4 Searches: • Exact Word Mark Match • bq=(and type:'trademark_case' literal_word_mark:'amazon') • Prefix Word Mark Match • bq=(and type:'trademark_case' (and (not literal_word_mark:'amazon') literal_word_mark:'amazon*')) • Anywhere Word Mark Match • bq=(and type:'trademark_case' (and (not literal_word_mark:'amazon*') word_mark:'amazon')) • Full Text Search • bq=(and type:'trademark_case' (and (not literal_word_mark:'amazon*') (not literal_word_mark:'amazon') (not word_mark:'amazon'))) • Pass counts with pagination links
  • 11.
  • 12. Justia’s Cloud Farmer Software Developer @NickMoline http://www.nick.pro/