Friday, August 10, 12
Cassandra summit


  6. • Flexible schema
     • Easy to scale, increased redundancy
     • Fast enough for web requests
     • Consolidate existing services
     • Hadoop support
  9. FUD
     • No ad-hoc queries
     • No indexes
     • No range queries
     • Limited tooling
     • Code complexity
  12. • REST
      • CQL
      • Thrift
  13. SOLR Schema

      <?xml version="1.0" encoding="UTF-8" ?>
      <schema name="my_column_family" version="1.0">
        <types>
          <fieldType name="string" class="solr.StrField"/>
          <fieldType name="date" class="solr.DateField"/>
        </types>
        <fields>
          <field name="id" type="string" indexed="true" stored="true"/>
          <field name="name" type="string" indexed="true" stored="true"/>
          <field name="released_at" type="date" indexed="true" stored="true"/>
        </fields>
        <uniqueKey>id</uniqueKey>
        <defaultSearchField>name</defaultSearchField>
      </schema>
  14. Basic Queries

      http://localhost:8983/solr/my_keyspace.my_column_family/select?q=name:foo

      SELECT * FROM my_column_family WHERE solr_query='name:foo';
  15. Wide Rows

      <?xml version="1.0" encoding="UTF-8" ?>
      <schema name="my_column_family" version="1.0">
        <types>
          <fieldType name="string" class="solr.StrField"/>
          <fieldType name="date" class="solr.DateField"/>
        </types>
        <fields>
          <field name="id" type="string" indexed="true" stored="true"/>
          <field name="name" type="string" indexed="true" stored="true"/>
          <field name="released_at" type="date" indexed="true" stored="true"/>
          <dynamicField name="wide_*" type="string" indexed="true" stored="true"/>
        </fields>
        <uniqueKey>id</uniqueKey>
        <defaultSearchField>name</defaultSearchField>
      </schema>
  16. Fuzzy Search

      <schema name="my_column_family" version="1.0">
        <types>
          <fieldType name="string" class="solr.StrField"/>
          <fieldType name="ngram" class="solr.TextField" positionIncrementGap="100">
            <analyzer type="index">
              <tokenizer class="solr.KeywordTokenizerFactory"/>
              <filter class="solr.WordDelimiterFilterFactory"
                      generateWordParts="1" generateNumberParts="1"
                      catenateWords="1" catenateNumbers="1"
                      catenateAll="1" preserveOriginal="1"/>
              <filter class="solr.NGramFilterFactory" minGramSize="2" maxGramSize="15"/>
            </analyzer>
            <analyzer type="query">
              <tokenizer class="solr.WhitespaceTokenizerFactory"/>
            </analyzer>
          </fieldType>
        </types>
        <fields>
          <field name="id" type="string" indexed="true" stored="true" />
          <field name="name" type="string" indexed="true" stored="true" />
          <field name="name_fuzzy" type="ngram" indexed="true" stored="true" />
        </fields>
        <copyField source="name" dest="name_fuzzy"/>
        <uniqueKey>id</uniqueKey>
        <defaultSearchField>name</defaultSearchField>
      </schema>
  17. • Full-text indexing
      • Trigrams
      • Rich data formats (PDF, Word, HTML)
      • Easy interop (REST, CSV, XML, JSON)
      • Geo-spatial search
      • Highlighting
      • Auto-suggest
      • Faceted search and filtering
  19. Storm
  20. Storm
  21. Increased performance by 700% while growing data by 500%
  22. Reduced operational costs by 40%
  23. Deleted 15,000 lines of code
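
Slide 14 shows the same search issued two ways: through Solr's HTTP select handler and through the solr_query column in CQL. The following is a minimal sketch of building both requests, using the host, port, keyspace, and column family names from the slide. The helper names solr_select_url and cql_solr_query are hypothetical, and passing the query to CQL as a quoted string literal is an assumption about what the server expects.

```python
from urllib.parse import urlencode


def solr_select_url(keyspace, column_family, query, host="localhost", port=8983):
    """Build the Solr select URL for a keyspace.column_family core."""
    core = f"{keyspace}.{column_family}"
    # urlencode percent-encodes the colon in "name:foo" as %3A.
    return f"http://{host}:{port}/solr/{core}/select?" + urlencode({"q": query})


def cql_solr_query(column_family, query):
    """Build the CQL statement that carries a Solr query string."""
    # Assumption: the query is a CQL string literal, so single quotes are doubled.
    escaped = query.replace("'", "''")
    return f"SELECT * FROM {column_family} WHERE solr_query='{escaped}';"


if __name__ == "__main__":
    print(solr_select_url("my_keyspace", "my_column_family", "name:foo"))
    print(cql_solr_query("my_column_family", "name:foo"))
```

Both strings describe the same search on the name field; only the transport differs.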
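
The wide-row schema on slide 15 adds a dynamicField named wide_*, so columns do not have to be declared one by one. The sketch below approximates which column names that schema would cover. It uses Python's fnmatch as a stand-in for Solr's own pattern matching, which is an assumption, and the function name is_covered_by_schema and the sample column names are hypothetical.

```python
from fnmatch import fnmatchcase

# Field names taken from the slide's schema.
STATIC_FIELDS = {"id", "name", "released_at"}
DYNAMIC_FIELD_PATTERN = "wide_*"


def is_covered_by_schema(column_name):
    """Return True if the column matches a declared field or the dynamic pattern."""
    if column_name in STATIC_FIELDS:
        return True
    # Approximation: glob matching stands in for Solr's dynamicField matching.
    return fnmatchcase(column_name, DYNAMIC_FIELD_PATTERN)


if __name__ == "__main__":
    for column in ["name", "wide_2012_08_10", "unrelated_column"]:
        print(column, is_covered_by_schema(column))
```

Under this reading, any column whose name starts with wide_ is picked up by the dynamic field without a schema change.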
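
The fuzzy-search schema on slide 16 copies name into name_fuzzy, which is indexed as n-grams of 2 to 15 characters while the query side is only split on whitespace. A rough sketch of why a fragment of a value then matches follows. It models only the n-gram step and skips the WordDelimiterFilterFactory stage, and the function names ngrams and fuzzy_match are hypothetical.

```python
def ngrams(term, min_size=2, max_size=15):
    """Return every substring of term whose length is between min_size and max_size."""
    grams = set()
    for size in range(min_size, min(max_size, len(term)) + 1):
        for start in range(len(term) - size + 1):
            grams.add(term[start:start + size])
    return grams


def fuzzy_match(indexed_value, query_term):
    """A query term matches when it equals one of the indexed n-grams."""
    # The query analyzer in the schema does not lowercase, so matching is case-sensitive.
    return query_term in ngrams(indexed_value)


if __name__ == "__main__":
    print(sorted(ngrams("abc")))
    print(fuzzy_match("cassandra", "sand"))
```

Because the query term is not itself broken into n-grams, a fragment shorter than minGramSize or longer than maxGramSize has nothing in the index to match.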