Cassandra redesigns indexing, storage management for 5.0 release

The Apache Software Foundation Cassandra project has released the 5.0 iteration of the wide-column store database boasting new features to improve vector search, a Java update and enhanced performance.

Cassandra is designed to support highly distributed systems where writes exceed reads, and so-called ACID compliance is not important. Netflix, for example, has been using Cassandra since 2013, replacing Oracle databases and using the NoSQL system to support global accounts and customer data worldwide.

New features for the 5.0 release, made generally available on Thursday last week, include Storage Attached Indexes (SAI) which promises to boost query flexibility and performance, especially for large datasets. The approach to move indexing closer to the data, improves query performance, and replaces the original Secondary Index feature.

The database upgrade also includes Vector Search, which supports a vector data type and indexing for Approximate Nearest Neighbor (ANN).

"Cassandra 5.0 lays the groundwork for advanced AI and machine learning applications. Developers building Generative AI applications can combine Cassandra's scale and distribution with the latest search technology," the project said in a statement.

Sarma Pydipally, a Cassandra contributor and freelance database engineer with experience of large-scale systems in the telecoms sector, said the two updates would work together to improve the database's performance in support of applications based on GenAI.

He explained that in earlier approaches to indexing, each node would only contain the index information for its own data, slowing the performance of distributed queries. "The SAI [Storage Attached Indexes] model is little bit different as it is attached to the data itself."

"Storage attached indexes are going to change the way we create indexes in Cassandra. It seems to solve that indexing problem for Cassandra, and the vector search is totally dependent on it," he said.

Patrick McFadin, developer relations veep at Cassandra vendor DataStax, said the net effect of changes to the way data is stored and the way storage is managed would mean organizations "can run less Cassandra."

"You don't have to use as many nodes to get the same amount of effect... that shows up as node density, you get much higher node density," he told us.

The Cassandra 5.0 release also upgrades Java support to JDK 17, bringing performance improvements of up to 20 percent in some cases, according to the community announcement.

With the emission of 5.0, the Cassandra project announced the end-of-life (EOL) for the 3.x series, including versions 3.0 and 3.11. It said it would evaluate security patches contributed to unmaintained branches on a case-by-case basis while the application of CVE fixes is not guaranteed.

As of last year, a DataStax survey showed around a third of Cassandra users were on the 3.x releases, McFadin said. ®

Search
About Us
Website HardCracked provides softwares, patches, cracks and keygens. If you have software or keygens to share, feel free to submit it to us here. Also you may contact us if you have software that needs to be removed from our website. Thanks for use our service!
IT News
Sep 17
Sainsbury's bags a ticket to RISE with SAP, hoping not to trip like Asda

UK's second largest retailer set to move ERP to the cloud with AWS, Accenture, and the German software gaint

Sep 17
Oracle urged again to give up JavaScript trademark

If there's one thing we know about Big Red, it's being entirely reasonable

Sep 17
Desktop hypervisors are like buses: none for ages, then four at once

VirtualBox, Parallels, and VMware have all upgraded

Sep 16
The empire of C++ strikes back with Safe C++ blueprint

You pipsqueaks want memory safety? We'll show you memory safety! We'll borrow that borrow checker

Sep 16
Ellison declares Oracle 'all in' on AI mass surveillance

Eyes on everyone: From cops to the public

Sep 16
Microsoft's Copilot 'Wave 2' is a tsunami of unanswered questions

What the next few months hold for the productivity chatbot

Sep 16
Snowflake's Unistore still on ice years after announcement

Cloud unicorn struggles to make database that can do everything 'margin positive'