AI is reshaping how we build and run modern applications. From real-time recommendations to agentic assistants, teams need data platforms that can keep up with new demands for performance and flexibility. That's what Couchbase 8.0 is built for: a unified platform that brings operational, analytical, and vector-based workloads together so developers can build faster, smarter, and more cost-effective AI-powered applications.
General availability of Couchbase Server 8.0
Today we introduce Couchbase Server 8.0, our latest release for self-managed and fully managed Capella deployments. With over 400 features and changes, Couchbase 8.0 delivers breakthrough innovations in vector indexing, vector search scale and performance, and cluster security, scalability, and reliability. These new features help transform Couchbase into the AI data backbone needed for tomorrow's generation of AI-powered applications and agentic systems.
Agentic systems are operational applications
We have long argued that agentic systems are best viewed as operational applications because they require the responsiveness, availability, distributed scale, and performance of platforms like Couchbase and Capella. And we have historically contended that assembling an operational application powered by a collection of purpose-built databases is a bad idea. Doing that with AI could be a disaster.
Check out the best multipurpose vector database
Today, we're adding another class of database functionality that should be built into a multipurpose platform, not live alongside one. Couchbase 8.0 becomes the best, most versatile vector database, in addition to being a fantastic JSON, KV-caching, search, eventing, mobile, and analytic multipurpose database platform.
Vector search that scales to a billion and beyond
AI-driven applications depend on finding the right context instantly. That means fast, accurate vector retrieval at massive scale. With the new Hyperscale Vector Index (HVI) in Couchbase 8.0, that's now possible, without tradeoffs between speed, accuracy, and cost.
In independent billion-scale testing, HVI delivered up to 19,000 queries per second with 28-millisecond latency when tuned for a reasonable recall accuracy of 66%. Compared to a leading cloud database, Couchbase ran over 3,000 times faster. And when we tuned for high recall accuracy (93% on modest hardware), Couchbase handled 350 times more queries per second.
Our new Hyperscale Vector Index has already been tested to scale easily beyond one billion vectors with exceptional query throughput, recall accuracy, and millisecond latency. This not only helps customers improve accuracy and trust within their AI applications, it also helps make using GenAI more affordable. It will drive down the total cost of ownership of RAG and agentic use cases, especially when it's difficult to anticipate what users may ask of Large Language Models (LLMs).
Instead of using HNSW, IVF, or DiskANN alone, the Hyperscale Vector Index is powered by a novel hybrid algorithm that combines the strengths of graph- and cluster-based approaches, based on Microsoft's Vamana paper combined with IVF. The advantage of this design is that it uses both distributed in-memory and partitioned on-disk processing, resulting in best-in-class capacity, throughput, latency, and recall. It is the preferred index when a large corpus of data must be vectorized and developers don't fully control the content supplied in prompts, as with chatbots. This implementation has many advantages, which we will explore in the future. But today, we just want to show it off.
Hyperscale Vector Index benchmark competition
In a recent head-to-head vector performance benchmark between Couchbase and MongoDB Atlas, Couchbase's new Hyperscale Vector Index achieved exceptional vector retrieval performance, measured in queries per second (QpS), first against a standard medium-sized vector set and then against a billion-vector dataset with 128 dimensions. The tests used the VDBBench methodology and toolkit, and measured queries per second (QpS), response latency in milliseconds, and recall accuracy percentage.
By varying the breadth of centroid clusters scanned (from 10 to 100), the tests can trade retrieval performance and latency against vector recall accuracy. Centroids are clusters of similar vectors. When scanning fewer centroids, queries per second (QpS) increase, but vector accuracy may be lower. Scanning more centroids improves accuracy, but may also increase latency.
The benchmark results demonstrate that Couchbase's Hyperscale Vector Index can deliver just over 19,000 queries per second with a latency of only 28 milliseconds when tuned for lower accuracy (66%). That is 3,100 times faster than the same test and settings on MongoDB Atlas, which could only execute 6 queries per second at 57% recall accuracy.
When configured to favor recall accuracy, MongoDB's performance dropped to two queries per second, and its latency jumped to over 40 seconds. Couchbase delivered over 700 QpS with sub-second latency of 369 milliseconds. Atlas' recall accuracy was 89% to Couchbase's 93%. When operating at billion-vector scale, Couchbase's Hyperscale Vector Index works harder, faster, smarter, and costs less.
The Hyperscale Vector Index is an extension of the original Index Service in Couchbase, and it inherits its existing deployment options, scale, distributed partitioning, and performance characteristics.
Composite Vector Index (CVI)
While we were at it, we added the Composite Vector Index for situations where developers want to define a prefiltered, narrowed vector result set, also at millisecond speed.
The Composite Vector Index is part of the existing secondary index capabilities (GSI), through which you can build an index combining vectors with other supported data types like strings, numbers, and booleans. It helps narrow the focus of a vector request and is useful when developers control the contents of prompts within an LLM engagement. It can thus apply filtering criteria before requesting specific vectors from Couchbase, minimizing LLM token consumption without compromising accuracy.
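As a rough sketch of how that prefiltering could look in SQL++ (the keyspace, field names, and exact index syntax below are illustrative assumptions, not the definitive 8.0 grammar):

```sql
-- Hypothetical composite index: scalar fields alongside a vector field.
CREATE INDEX idx_products_vec
  ON products(category, price, embedding VECTOR)
  WHERE type = "product";

-- Filter on scalars first, then rank the narrowed set by approximate distance.
SELECT p.name, p.price
FROM products AS p
WHERE p.type = "product"
  AND p.category = "outdoor"
  AND p.price < 100
ORDER BY APPROX_VECTOR_DISTANCE(p.embedding, $query_vec)
LIMIT 5;
```

Because the scalar predicates are evaluated in the same index, only a small, relevant candidate set is ranked by vector distance, which is what keeps the prompt context, and the LLM token bill, small.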
Couchbase deploys vector search on premises, in Capella, and on mobile. Who else does that?
These new, massively scalable vector indexing options join our existing hybrid vector search capabilities powered by our Search Service. Couchbase is now the only database platform to offer three flexible, highly scalable vector search options across self-managed systems on premises, in Kubernetes, and fully managed Capella deployments. Add to that our mobile vector search, and you can see how we can become the backbone for AI applications that serve end users wherever they are located.
What else is in Couchbase 8.0?
Every service in Couchbase gets major enhancements. Let's look at the changes for each Couchbase service:
Data Service
Native encryption at rest with KMS integration for customer-managed keys. The Data Service is the first Couchbase service to be encrypted; others like Query, Index, and Search will follow in a subsequent release
Includes centralized policy control with automatic key rotation
90% lower memory quota for Magma (100MB)
Smaller cluster map option of 128 vBuckets instead of 1,024
Faster node activation as the cache warms, with new bucket warmup options (Background, Blocking, None)
Memcached bucket type is removed, having been deprecated since version 6.5
Query Service
Natural language input for queries from Couchbase Server via the command line shell, SQL++, and Query Workbench, using Capella iQ access credentials. Ask questions with the "USING AI" command or REST API commands starting with "natural_"
Query workload repository and reporting maintains snapshots and reports to ease query troubleshooting. A user-defined collection captures elapsed time, CPU and memory usage, KV fetches, executions, and more
Automatic updates to optimizer statistics for optimal query plan generation as query characteristics evolve
New SQL++ keywords and clauses for vector index creation, including CREATE VECTOR INDEX with optional INCLUDE, PARTITION BY, and WHERE clauses, plus extensions to the WITH clause for vector-specific parameters such as dimension, description, similarity metric, train_list, and num_replicas
New SQL++ function for vector selection: APPROX_VECTOR_DISTANCE
Vector indexes are available through the Query Workbench GUI, Capella UI, REST API to the Query Service, SDKs, and via model frameworks like LlamaIndex and LangChain
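Putting those clauses together, an index definition might look roughly like this. The keyspace and field names are hypothetical, and the exact WITH parameter spellings may differ from the shipping syntax, so treat this as a sketch and check the 8.0 documentation:

```sql
-- Hypothetical Hyperscale Vector Index over a 1024-dimension embedding field,
-- using the INCLUDE, PARTITION BY, WHERE, and WITH clauses named above.
CREATE VECTOR INDEX idx_docs_embedding
  ON docs(embedding VECTOR)
  INCLUDE (title)
  PARTITION BY HASH(META().id)
  WHERE type = "article"
  WITH {
    "dimension": 1024,
    "similarity": "cosine",
    "train_list": 10000,
    "num_replicas": 1
  };
```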
Index Service
New feature settings for vector index creation
Algorithms: IVF for GSI Composite, and IVF + Vamana (hybrid) for Hyperscale
SQL++: CREATE/ALTER/DROP INDEX via SQL++, REST API, and SDKs
Quantization: tune the index with a choice of PQ and SQ variants for reduced memory usage
Similarity distance: Cosine, Dot Product, L2, and Euclidean for different application needs
Partitioned indexes: for scalability into multi-billion vectors and granular indexing requirements
New feature options for vector search
Simple search queries: basic ANN scans with vector fields in ORDER BY
Pre-filtering in the Composite Vector Index and inline filtering in the Hyperscale index with INCLUDE columns for reducing the search space
Pushdowns to the indexer: for filtering and limiting documents to improve performance
Projections: support for projections such as vector distance
Reranking results: for improving recall with a performance tradeoff
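A basic ANN scan of the kind listed above might be written as follows. The keyspace and field names are illustrative, and filter pushdown and reranking are driven by the index configuration rather than the query text:

```sql
-- Hypothetical ANN query: project the approximate distance and rank by it.
SELECT d.title,
       APPROX_VECTOR_DISTANCE(d.embedding, $query_vec) AS dist
FROM docs AS d
WHERE d.type = "article"
ORDER BY APPROX_VECTOR_DISTANCE(d.embedding, $query_vec)
LIMIT 10;
```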
Search Service
User-defined synonyms available to reference in search queries
Filter which documents are indexed by the Search Service
Best match (BM25) scoring for better hybrid search results
Read-replica partitions added to the Search Service for faster query throughput
Search vector index performance has doubled through better SIMD (Single Instruction, Multiple Data) support using the AVX2 instruction set
Eventing Service
The Eventing Service has been re-architected for scale, speed, and security, with dramatic results
Set eventing options at the scope or bucket level of execution
Configure Eventing Service nodes by scope
TLS node-to-node encryption for internal communication
Cluster Manager
Auto-failover for ephemeral buckets and non-responsive disks
Adjust non-KV Multi-Dimensional Scaling (MDS) services without introducing new target nodes
Aggregate SDK client metrics on the cluster for easier monitoring and troubleshooting
Lock/unlock user accounts and track activity
The upgrade path requires version 7.2 or higher; earlier versions must upgrade to 7.2.3 first
Cross Datacenter Replication (XDCR)
New bucket property, "EnableCrossClusterVersioning", designed to enable:
Bi-directional replication with mobile buckets in Sync Gateway or Capella App Services
Target-cluster awareness of inbound replications for easier management
Conflict logging for documents changed on both ends during the conflict timeframe window
An XDCR diagnostic utility to check data consistency between clusters
Backups
Point-in-time recovery, in preview before 8.1 GA
Reduces the data-loss window to user-defined timing: from hours down to a few minutes, or even sub-seconds
Backup retention period and expiration settings to set expiration dates for backups
Auto-resolve naming conflicts with cbbackupmgr
Built for developers, trusted by enterprises
Couchbase 8.0 combines speed, scale, and flexibility in a single platform that runs anywhere: on-prem, in Capella DBaaS, or at the edge. It's designed for the developers shaping tomorrow's AI-powered experiences and for the enterprises that rely on them to run critical applications.
"Our customers can find relevant content based on meaning and context, not just exact keywords. As a Capella customer, we're excited for Couchbase 8.0 and the scalability and TCO benefits that make it the ideal solution for our AI-powered video platform," said Ian Merrington, CTO at Seenit.
Couchbase 8.0 is now generally available. Explore what's new and see how teams are using it to build next-generation AI and agentic systems today.