14. Cassandra Use Cases [2]

user | site | application | others evaluated
Jonathan Ellis-3 | Rackspace | stats collection (testing, almost production), Mail & Apps division (early testing) | HBase, Hypertable, dynomite, and Voldemort
Ryan King | Twitter | storage for all tweets | a custom mysql impl, voldemort, hbase, mongodb, memcachdb, hypertable, and others
Edmond Lau | Ooyala | store and serve our near real-time video analytics data | HBase, Cassandra, Voldemort, and some others
Joe Stump | SimpleGeo | real-time location infrastructure | -
scott w | Onespot | a subset of our data store | Tokyo, Voldemort and Riak and Cassandra
Vitaly Kushner | Astrails | a project for one of our clients (early development stage) | -
Dan Di Spaltro | Cloudkick | store monitoring statistics and running analytics over the data | -
Eric Lubow | ShermansTravel | mailing system, social network usage | Cassandra and Tokyo Cabinet/Tokyo Tyrant
Richard grossman | bee.tv | smart TV and movies recommendations, index each day all the TV shows in every states + all the new VOD sources | -
matthew hawthorne | Comcast | tons of data that we are migrating into non-relational storage | cassandra, riak, voldemort, and hdfs
Santal Li | Cisco Webex | store User Feed & User Activity data | Voldemort, MemcacheDB, Dynomite
27. Data Model

KEY
  ColumnFamily1   Name: MailList   Type: Simple   Sort: Name
    Name: tid1   Value: <Binary>   TimeStamp: t1
    Name: tid2   Value: <Binary>   TimeStamp: t2
    Name: tid3   Value: <Binary>   TimeStamp: t3
    Name: tid4   Value: <Binary>   TimeStamp: t4
  ColumnFamily2   Name: WordList   Type: Super   Sort: Time
    Name: aloha
      C1 V1 T1, C2 V2 T2, C3 V3 T3, C4 V4 T4
    Name: dude
      C2 V2 T2, C6 V6 T6
  ColumnFamily3   Name: System   Type: Super   Sort: Name
    Name: hint1   <Column List>
    Name: hint2   <Column List>
    Name: hint3   <Column List>
    Name: hint4   <Column List>

Column Families are declared upfront.
SuperColumns are added and modified dynamically.
Columns are added and modified dynamically.
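The slide's three rules (column families declared upfront, super columns and columns added dynamically) can be illustrated with a minimal in-memory sketch. This is not Cassandra's API: the `Column` and `Row` classes and the `put` method are hypothetical names invented for illustration, and resolving two writes to the same column by keeping the higher timestamp is an assumption the slide does not spell out.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Column:
    """One (name, value, timestamp) triple, as drawn on the slide."""
    name: str
    value: bytes
    timestamp: int


class Row:
    """Hypothetical model of one KEY and its column families."""

    def __init__(self, key, declared):
        self.key = key
        # declared maps column family name -> "Simple" or "Super".
        # The set of families is fixed here and never grows afterwards.
        self.declared = dict(declared)
        self.families = {name: {} for name in self.declared}

    def put(self, family, column, super_column=None):
        if family not in self.declared:
            raise KeyError(f"column family {family!r} was not declared upfront")
        is_super = self.declared[family] == "Super"
        if is_super != (super_column is not None):
            raise ValueError("super_column is required for Super families only")
        target = self.families[family]
        if is_super:
            # Super columns come into existence on first write.
            target = target.setdefault(super_column, {})
        current = target.get(column.name)
        # Assumption: the write with the higher timestamp wins.
        if current is None or column.timestamp >= current.timestamp:
            target[column.name] = column


row = Row("KEY", {"MailList": "Simple", "WordList": "Super", "System": "Super"})
row.put("MailList", Column("tid1", b"payload", 1))
row.put("WordList", Column("C1", b"V1", 1), super_column="aloha")
row.put("WordList", Column("C2", b"V2", 2), super_column="dude")
```

Writing to a family that was never declared raises `KeyError`, while new columns and super columns need no prior declaration, which mirrors the distinction the slide draws.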
29. Storage Model-Compactions

Input data files (each sorted by key):
  File 1: K1 <Serialized data>, K2 <Serialized data>, K3 <Serialized data>, ...
  File 2: K2 <Serialized data>, K10 <Serialized data>, K30 <Serialized data>, ...
  File 3: K4 <Serialized data>, K5 <Serialized data>, K10 <Serialized data>, ...

MERGE SORT

Data File (sorted):
  K1 <Serialized data>
  K2 <Serialized data>
  K3 <Serialized data>
  K4 <Serialized data>
  K5 <Serialized data>
  K10 <Serialized data>
  K30 <Serialized data>

Index File: K1 Offset, K5 Offset, K30 Offset
Bloom Filter

Other labels on the diagram: Loaded in memory; DELETED
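The compaction diagram can be sketched roughly as a k-way merge of sorted files that also produces a sparse index and a Bloom filter. Several things here are assumptions rather than anything the slide states: keys are plain integers (so that K10 sorts after K5), the input tables are passed oldest first and the newest copy of a duplicated key wins, index "offsets" are list positions instead of byte offsets, and the sampling interval `index_every` and the `BloomFilter` sizing are arbitrary. The names `compact` and `BloomFilter` are hypothetical.

```python
import hashlib
import heapq


class BloomFilter:
    """Tiny Bloom filter: may report false positives, never false negatives."""

    def __init__(self, size_bits=64, hashes=3):
        self.size = size_bits
        self.hashes = hashes
        self.bits = 0

    def _positions(self, key):
        for seed in range(self.hashes):
            digest = hashlib.sha256(f"{seed}:{key}".encode()).digest()
            yield int.from_bytes(digest[:8], "big") % self.size

    def add(self, key):
        for pos in self._positions(key):
            self.bits |= 1 << pos

    def might_contain(self, key):
        return all((self.bits >> pos) & 1 for pos in self._positions(key))


def compact(sstables, index_every=3):
    """Merge sorted (key, value) lists, given oldest first, into one file."""
    # Tag entries with the negated table position so that, among equal keys,
    # the entry from the newest table is yielded first by the merge.
    tagged = (
        [(key, -age, value) for key, value in table]
        for age, table in enumerate(sstables)
    )
    data_file = []
    bloom = BloomFilter()
    for key, _, value in heapq.merge(*tagged):
        if data_file and data_file[-1][0] == key:
            continue  # an older copy of a key already written
        data_file.append((key, value))
        bloom.add(key)
    # Sparse index: remember the position of every index_every-th key.
    index = [
        (key, offset)
        for offset, (key, _) in enumerate(data_file)
        if offset % index_every == 0
    ]
    return data_file, index, bloom


tables = [
    [(1, "a"), (2, "b"), (3, "c")],      # oldest
    [(2, "B"), (10, "j"), (30, "z")],
    [(4, "d"), (5, "e"), (10, "J")],     # newest
]
data_file, index, bloom = compact(tables)
```

With the three inputs from the slide, the merged file holds each of the seven keys once. A reader would consult the Bloom filter first, then use the index to find where in the data file to start scanning.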
A detailed description of the CAP theorem:
http://www.julianbrowne.com/article/viewer/brewers-cap-theorem
Brewer's original presentation:
http://www.cs.berkeley.edu/%7Ebrewer/cs262b-2004/PODC-keynote.pdf
The proof of the CAP theorem by two professors:
http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.20.1495&rep=rep1&type=pdf
Daniel Abadi's critique of the CAP theorem: C is not always sacrificed for A, because P (a partition) rarely occurs in practice; in most cases C is more likely being sacrificed for L (Latency):
http://dbmsmusings.blogspot.com/2010/04/problems-with-cap-and-yahoos-little.html
About ACID:
http://en.wikipedia.org/wiki/ACID
About BASE:
http://queue.acm.org/detail.cfm?id=1394128
Eventual Consistency:
http://www.allthingsdistributed.com/2008/12/eventually_consistent.html