• Share
  • Email
  • Embed
  • Like
  • Save
  • Private Content
Cql – cassandra query language
 

Cql – cassandra query language

on

  • 24,916 views

Introduction to CQL v1.0.0

Introduction to CQL v1.0.0

Statistics

Views

Total Views
24,916
Views on SlideShare
19,928
Embed Views
4,988

Actions

Likes
5
Downloads
187
Comments
2

16 Embeds 4,988

http://nosql.mypopescu.com 2494
http://www.publickey1.jp 2404
http://crlog.info 28
url_unknown 25
http://webcache.googleusercontent.com 10
http://www.slideshare.net 9
http://lab.mydns.jp 3
http://twitter.com 3
http://translate.googleusercontent.com 3
http://news.google.com 2
http://feeds.feedburner.com 2
https://twitter.com 1
http://www.hanrss.com 1
http://55.23.254.25:15871 1
http://cache.yahoofs.jp 1
http://paper.li 1
More...

Accessibility

Categories

Upload Details

Uploaded via as Microsoft PowerPoint

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel

12 of 2 previous next

  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment
  • http://wiki.apache.org/cassandra/API#StructuresWritesLevelBehaviorANYEnsure that the write has been written to at least 1 node, including HintedHandoff recipients.ONEEnsure that the write has been written to at least 1 replica's commit log and memory table before responding to the client.QUORUMEnsure that the write has been written to N / 2 + 1 replicas before responding to the client.LOCAL_QUORUMEnsure that the write has been written to  / 2 + 1 nodes, within the local datacenter (requires NetworkTopologyStrategy)EACH_QUORUMEnsure that the write has been written to  / 2 + 1 nodes in each datacenter (requires NetworkTopologyStrategy)ALLEnsure that the write is written to all N replicas before responding to the client. Any unresponsive replicas will fail the operation.ReadsLevelBehaviorANYNot supported. You probably want ONE instead.ONEWill return the record returned by the first replica to respond. A consistency check is always done in a background thread to fix any consistency issues whenConsistencyLevel.ONE is used. This means subsequent calls will have correct data even if the initial read gets an older value. (This is called ReadRepair)QUORUMWill query all replicas and return the record with the most recent timestamp once it has at least a majority of replicas (N / 2 + 1) reported. Again, the remaining replicas will be checked in the background.LOCAL_QUORUMReturns the record with the most recent timestamp once a majority of replicas within the local datacenter have replied.EACH_QUORUMReturns the record with the most recent timestamp once a majority of replicas within each datacenter have replied.ALLWill query all replicas and return the record with the most recent timestamp once all replicas have replied. Any unresponsive replicas will fail the operation.
  • The CREATE KEYSPACE statement creates a new top-level namespace (aka “keyspace”). Valid names are any string constructed of alphanumeric characters and underscores, but must begin with a letter. Properties such as replication strategy and count are specified during creation using the following accepted keyword arguments:Keywordrequireddescriptionreplication_factor yes Numeric argument that specifies the number of replicas for this keyspace.strategy_class yes Class name to use for managing replica placement. Any of the shipped strategies can be used by specifying the class name relative to org.apache.cassandra.locator, others will need to be fully-qualified and located on the classpath.strategy_options no Some strategies require additional arguments which can be supplied by appending the option name to the strategy_options keyword, separated by a colon (:). For example, a strategy option of “DC1″ with a value of “1″ would be specified as strategy_options:DC1 = 1.Synopsis:CREATE COLUMNFAMILY [(name1 type, name2 type, ...)] [WITH keyword1 = arg1 [AND keyword2 = arg2 [AND ...]]]; CREATE COLUMNFAMILY statements create new column family namespaces under the current keyspace. Valid column family names are strings of alphanumeric characters and underscores, which begin with a letter.Specifying Column Type (optional)CREATE COLUMNFAMILY (name1 type, name2 type) ...; It is possible to assign columns a type during column family creation. Columns configured with a type are validated accordingly when a write occurs. Column types are specified as a parenthesized, comma-separated list of column term and type pairs. The list of recognized types are:Type descriptionBytes Arbitrary bytes (no validation)AsciiASCII character stringUtf8 UTF8 encoded stringTimeuuid Type 1 UUIDUuid Type 4 UUIDInt 4-byte integerLong 8-byte longNote: In addition to the recognized types listed above, it is also possible to supply a string containing the name of a class (a sub-class of AbstractType), either fully qualified, or relative to theorg.apache.cassandra.db.marshalpackage.Column Family Options (optional)CREATE COLUMNFAMILY ... WITH keyword1 = arg1 AND keyword2 = arg2; A number of optional keyword arguments can be supplied to control the configuration of a new column family.Keyword default descriptionComparator utf8 Determines sorting and validation of column names. Valid values are identical to the types listed in Specifying Column Type above.Comment none A free-form, human-readable comment.row_cache_size 0 Number of rows whose entire contents to cache in memory.key_cache_size 200000 Number of keys per SSTable whose locations are kept in memory in “mostly LRU” order.read_repair_chance 1.0 The probability with which read repairs should be invoked on non-quorum reads.gc_grace_seconds 864000 Time to wait before garbage collecting tombstones (deletion markers).default_validation utf8 Determines validation of column values. Valid values are identical to the types listed in Specifying Column Type above.min_compaction_threshold 4 Minimum number of SSTables needed to start a minor compaction.max_compaction_threshold 32 Maximum number of SSTables allowed before a minor compaction is forced.row_cache_save_period_in_seconds 0 Number of seconds between saving row caches.key_cache_save_period_in_seconds 14400 Number of seconds between saving key caches.memtable_flush_after_mins 60 Maximum time to leave a dirty table unflushed.memtable_throughput_in_mb dynamic Maximum size of the memtable before it is flushed.memtable_operations_in_millions dynamic Number of operations in millions before the memtable is flushed.replicate_on_write false
  • Synopsis:USE ; A USE statement consists of the USE keyword, followed by a valid keyspace name. Its purpose is to assign the per-connection, current working keyspace. All subsequent keyspace-specific actions will be performed in the context of the supplied value.Synopsis:DROP namespace; DROP statements result in the immediate, irreversible removal of keyspace and column family namespaces.
  • Synopsis:UPDATE [USING CONSISTENCY ] SET name1 = value1, name2 = value2 WHERE KEY = keyname; An UPDATE is used to write one or more columns to a record in a Cassandra column family. No results are returned.Column FamilyUPDATE ... Statements begin with the UPDATE keyword followed by a Cassandra column family name.Consistency LevelUPDATE ... [USING ] ... Following the column family identifier is an optional consistency level specification.Specifying Columns and RowUPDATE ... SET name1 = value1, name2 = value2 WHERE KEY = keyname; Rows are created or updated by supplying column names and values in term assignment format. Multiple columns can be set by separating the name/value pairs using commas. Each update statement requires exactly one key to be specified using a WHERE clause and the KEY keyword.Additionally, it is also possible to send multiple UPDATES to a node at once using a batch syntax:BEGIN BATCH [USING ] UPDATE CF1 SET name1 = value1, name2 = value2 WHERE KEY = keyname1; UPDATE CF1 SET name3 = value3 WHERE KEY = keyname2; UPDATE CF2 SET name4 = value4, name5 = value5 WHERE KEY = keyname3; APPLY BATCH When batching UPDATEs, a single consistency level is used for the entire batch, it appears after the BEGIN BATCHstatement, and uses the standard consistency level specification. Batch UPDATEs default toCONSISTENCY.ONEwhen left unspecified.NOTE: While there are no isolation guarantees, UPDATE queries are atomic within a give record.
  • Synopsis:DELETE [COLUMNS] FROM [USING ] WHERE KEY = keyname1 DELETE [COLUMNS] FROM [USING ] WHERE KEY IN (keyname1, keyname2); A DELETE is used to perform the removal of one or more columns from one or more rows.Specifying ColumnsDELETE [COLUMNS] ... Following the DELETE keyword is an optional comma-delimited list of column name terms. When no column names are specified, the remove applies to the entire row(s) matched by the WHERE clauseColumn FamilyDELETE ... FROM ... The column family name follows the list of column names.Consistency LevelUPDATE ... [USING ] ... Following the column family identifier is an optional consistency level specification.Specifying RowsUPDATE ... WHERE KEY = keyname1 UPDATE ... WHERE KEY IN (keyname1, keyname2) The WHERE clause is used to determine which row(s) a DELETE applies to. The first form allows the specification of a single keyname using the KEY keyword and the = operator. The second form allows a list of keyname terms to be specified using the IN notation and a parenthesized list of comma-delimited keyname terms.
  • Synopsis:SELECT [FIRST N] [REVERSED] FROM [USING ] [WHERE ] [LIMIT N]; A SELECT is used to read one or more records from a Cassandra column family. It returns a result-set of rows, where each row consists of a key and a collection of columns corresponding to the query.Specifying ColumnsSELECT [FIRST N] [REVERSED] name1, name2, name3 FROM ... SELECT [FIRST N] [REVERSED] name1..nameN FROM ... The SELECT expression determines which columns will appear in the results and takes the form of either a comma separated list of names, or a range. The range notation consists of a start and end column name separated by two periods (..). The set of columns returned for a range is start and end inclusive.The FIRST option accepts an integer argument and can be used to apply a limit to the number of columns returned per row. When this limit is left unset it defaults to 10,000 columns.The REVERSED option causes the sort order of the results to be reversed.It is worth noting that unlike the projection in a SQL SELECT, there is no guarantee that the results will contain all of the columns specified. This is because Cassandra is schema-less and there are no guarantees that a given column exists.Column FamilySELECT ... FROM ... The FROM clause is used to specify the Cassandra column family applicable to a SELECT query.Consistency LevelSELECT ... [USING ] ... Following the column family clause is an optional consistency level specification.Filtering rowsSELECT ... WHERE KEY = keyname AND name1 = value1 SELECT ... WHERE KEY >= startkey and KEY =< endkey AND name1 = value1 The WHERE clause provides for filtering the rows that appear in results. The clause can filter on a key name, or range of keys, and in the case of indexed columns, on column values. Key filters are specified using the KEYkeyword, a relational operator, (one of =, >, >=,  and =and  [LIMIT N] ... Limiting the number of rows returned can be achieved by adding the LIMIT option to a SELECT expression. LIMITdefaults to 10,000 when left unset.

Cql – cassandra query language Cql – cassandra query language Presentation Transcript

  • CQL – Cassandra Query Language
    Courtney Robinson – crlog.info@ Eric Evans (Python tests and CQL Spec)
  • Thrift or Avro?
    External dependency
    Community activity dissipates
    Too generic
    Many user reported problems caused by Thrift or the misuse thereof...
    Its thrift! Its Avro!
    Its CQL....
  • What is it?
    Effectively a structured query language
    Replacement for clients? Not really...
    Attempting to push as much server-side as possible
    Familiar syntax
    User friendly API for Cassandra new comers
  • CQL V1.0.0 - Keywords
    USE
    SELECT
    UPDATE
    DELETE
    TRUNCATE
    DROP
    BATCH
    CREATE KEYSPACE
    CREATE COLUMNFAMILY
    CREATE INDEX
    That’s right!No INSERT...?
    Special statements
  • Specifying Consistency
    ... USING <CONSISTENCY> ...
    Made up of the keyword USING, followed by a consistency level identifier.
    Valid consistency levels are:
    CONSISTENCY ZERO
    CONSISTENCY ONE (default)
    CONSISTENCY QUORUM
    CONSISTENCY ALL
    CONSISTENCY DCQUORUM
    CONSISTENCY DCQUORUMSYNC
    E.gBEGIN BATCH USING CONSISTENCY ONE
  • Create keyspce
    CREATE KEYSPACE WITH replication_factor = AND strategy_class = [AND strategy_options. = [AND strategy_options. = ]];
    E.gCREATE KEYSPACE TestKeyspaceWITH strategy_options:DC1 = '1‘ AND strategy_class = 'NetworkTopologyStrategy‘
    CREATE COLUMNFAMILY [(name1 type, name2 type, ...)] [WITH keyword1 = arg1 [AND keyword2 = arg2 [AND ...]]];
    Set column type:
    CREATE COLUMNFAMILY (name1 type, name2 type) ...;
    CREATE INDEX [index_name] ON <column_family> (column_name);
    Used to create a new, automatic secondary index for the named column.
    Create Column Family
    Create INDEX
  • USE
    USE <KEYSPACE>;
    Use keyword followed by a valid Keyspace name
    Set working Keyspace per-connection
    DROP <KEYSPACE|COLUMNFAMILY>;
    DROP KEYSPACE KSName;
    DROP COLUMNFAMILYCFName; Immediate, irreversible removal of keyspace and column family namespaces.
    TRUNCATE <COLUMN FAMILY>
    Accepts a single argument (CF) name
    permanently removes all data from said column family.
    DROP
    TRUNCATE
  • UPDATE (values)
    UPDATE [USING CONSISTENCY ] SET name1 = value1, name2 = value2 WHERE KEY = keyname;
    UPDATE is used to write one or more columns to a record in a Cassandra column family.
    No results are returned.
    Creates or updates rows
    UPDATE CFName SET ‘name' = 10 WHERE KEY = ‘1234’
    UPDATE <COLUMN FAMILY> ...
    Begin with the UPDATE keyword followed by CF name.
    Followed by an optional consistency level specification.
  • BATCH UPDATES
    Where’s my batch mutate gone?
    Similar to code blocks/structs
    Has begining (Begin Batch) and end (Apply Batch)
    BEGIN BATCH [USING ]
    UPDATE CF1 SET name1 = value1, name2 = value2 WHERE KEY = keyname1;
    UPDATE CF1 SET name3 = value3 WHERE KEY = keyname2;
    UPDATE CF2 SET name4 = value4, name5 = value5 WHERE KEY = keyname3;
    APPLY BATCH
    BEGIN BATCH USING CONSISTENCY QUORUM
    UPDATE CFnameSET 1='1', 2='2', 3='3', 4='4' WHERE KEY='aa‘
    UPDATE CFname2 SET 5='5', 6='6', 7='8', 9='9' WHERE KEY='ab'
    UPDATE CFnameSET 9='9', 8='8', 7='7', 6='6' WHERE KEY='ac'
    UPDATE CFname3 SET 5='5', 4='4', 3='3', 2='2' WHERE KEY='ad'
    UPDATE CFnameSET 1='1', 2='2', 3='3', 4='4' WHERE KEY='ae'
    APPLY BATCH
  • DELETE
    DELETE [COLUMNS] FROM [USING ] WHERE KEY = keyname1
    DELETE [COLUMNS] FROM [USING ] WHERE KEY IN (keyname1, keyname2);
    • Remove one or more columns from one or more rows.
    • Optional comma-delimited list of column names following DELETE
    • Removes entire row(s) if no columns are specified
    • DELETE ‘col1', 'col2' FROM CF WHERE KEY = 'key‘
    • DELETE ‘col1', 'col2' FROM CF WHERE KEY = ('key‘, ‘key2’, ‘key3’)
  • SELECT
    SELECT [FIRST N] [REVERSED] <SELECT EXPR> FROM <COLUMN FAMILY> [USING <CONSISTENCY>] [WHERE <CLAUSE>] [LIMIT N];
    FIRST = number of columns, N
    REVERSED = Reverses sort order
    Return a result set, key + columns per row
    Specify columns:
    SELECT [FIRST N] [REVERSED] name1, name2, name3 FROM ...
    Request a range of columns:
    SELECT [FIRST N] [REVERSED] name1..nameN FROM ...
    “..” Notation specifies range
    Inclusive of start and end columns
    E.gSELECT FIRST 2 REVERSED 3..1 FROM CF WHERE KEY = 'aa'
  • Courtney Robinson @zcourts
    Thank you for listening.
    Questions?Links:
    http://crlog.info/2011/03/29/cassandra-query-language-aka-cql-syntax/
    https://svn.apache.org/viewvc/cassandra/trunk/doc/cql/CQL.html?view=co
    https://issues.apache.org/jira/browse/CASSANDRA-1703