NoSQL服務器 Cassandra 1.0 穩定版發布
Apache軟件基金會(ASF)發布了 NoSQL 數據庫 Cassandra 1.0 正式版。
主要新特性包括:
1. Windows 下的服務安裝,只需運行:bin\cassandra.bat install
 2. Hinted handoff
 3. 多線程壓縮
 4. 節點替換
 5. 全新的 CQL 客戶端
詳細的改進內容請看
 
Besides compression, memory and disk space management, leveled compaction, and performance improvements, Cassandra 1.0 also includes a number of smaller features and enhancements.
Windows service
You can still run Cassandra 1.0 from the bat file if you want — this is the most convenient way to develop and run quick tests — but for production, installing it as a Windows service is a much better option. To do this, just run bin\cassandra.bat install. (Similarly, uninstall will remove it.)
Hinted handoff
Hinted handoff is one way Cassandra replays missed updates to nodes that were temporarily down or unreachable. Cassandra 1.0 makes two improvements:
- Hint data is stored more efficiently, reducing the load imposed by hint replay on nodes storing them
- Coordinator nodes no longer need to wait for the failure detector to recognize a problem with a replication target, to begin generating hints for it
This means that full repair, while still recommended periodically, will only have to reconcile data missing because of a coordinator failure at the same time, or because of losing a node entirely. Both of these are relatively rare occurrences.
Similarly, read repair is less necessary, so the default read repair probability has been reduced to 10% from 100%. This will result in significantly improved throughput in clusters with more than one replica and lower ConsistencyLevel reads than ALL.
Multithreaded compaction
Cassandra 0.8 introduced concurrent compaction: running multiple compaction sets at a time, each in its own thread. Cassandra 1.0 adds multithreaded compaction, which allows multiple threads to be used per compaction set.
The primary use case for multithreaded compaction is when you either have a small number of column families or you are using leveled compaction, and you want to keep your data as compacted as possible--usually because you have a read-heavy workload.
Multithreaded compaction can be enabled by setting multithreaded_compaction to true in cassandra.yaml.
Node replacement
Replacing a node was a two-step process in earlier versions of Cassandra: one to add the new node, and a second to remove the old. Cassandra 1.0 simplifies this common operation into simply starting the new node with the option -Dcassandra.replace_token=[token].
New CQL clients
After some discussion, we decided that CQL drivers would be better developed as separate projects rather than in the Cassandra tree. The Java and Python drivers have moved to Apache Extras on code.google.com, and new Ruby and PHP drivers have joined them, with node.js coming soon.
Previously
- What's new in Cassandra 1.0, part 4: Performance
- What's new in Cassandra 1.0, part 3: Leveled compaction
- What's new in Cassandra 1.0, part 2: Improved memory and disk space management
- What's new in Cassandra 1.0, part 1: Compression
- What's new in Cassandra 0.8
- What's new in Cassandra 0.7
- What's new in Cassandra 0.6
 
 Apache Cassandra是一套開源分布式Key-Value存儲系統。它最初由非死book開發,用于儲存特別大的數據。非死book目前在使用此系統。
主要特性:
- 分布式
- 基于column的結構化
- 高伸展性
Cassandra的主要特點就是它不是一個數據庫,而是由一堆數據庫節點共同構成的一個分布式網絡服務,對Cassandra 的一個寫操作,會被復制到其他節點上去,對Cassandra的讀操作,也會被路由到某個節點上面去讀取。對于一個Cassandra群集來說,擴展性能 是比較簡單的事情,只管在群集里面添加節點就可以了。
Cassandra是一個混合型的非關系的數據庫,類似于Google的BigTable。其主要功能比 Dynomite(分布式的Key-Value存 儲系統)更豐富,但支持度卻不如文檔存儲MongoDB(介于關系數據庫和非關系數據庫之間的開源產品,是非關系數據庫當中功能最豐富,最像關系數據庫 的。支持的數據結構非常松散,是類似json的bjson格式,因此可以存儲比較復雜的數據類型。)Cassandra最初由非死book開發,后轉變成了開源項目。它是一個網絡社交云計算方面理想的數據庫。以Amazon專有的完全分布式的Dynamo為基礎,結合了Google BigTable基于列族(Column Family)的數據模型。P2P去中心化的存儲。很多方面都可以稱之為Dynamo 2.0。
和其他數據庫比較,有幾個突出特點:
模式靈活 :使用Cassandra,像文檔存儲,你不必提前解決記錄中的字段。你可以在系統運行時隨意的添加或移除字段。這是一個驚人的效率提升,特別是在大型部 署上。 
 真正的可擴展性 :Cassandra是純粹意義上的水平擴展。為給集群添加更多容量,可以指向另一臺電腦。你不必重啟任何進程,改變應用查詢,或手動遷移任何數據。 
 多數據中心識別 :你可以調整你的節點布局來避免某一個數據中心起火,一個備用的數據中心將至少有每條記錄的完全復制。
一些使Cassandra提高競爭力的其他功能:
范圍查詢 :如果你不喜歡全部的鍵值查詢,則可以設置鍵的范圍來查詢。 
 列表數據結構 :在混合模式可以將超級列添加到5維。對于每個用戶的索引,這是非常方便的。 
 分布式寫操作 :有可以在任何地方任何時間集中讀或寫任何數據。并且不會有任何單點失敗。
 
 項目地址:http://cassandra.apache.org/