Auto select CAGRA build algorithm for hnsw::build by tfeher · Pull Request #1719 · NVIDIA/cuvs

tfeher · 2026-01-21T16:20:30Z

Configuring HNSW graph build using CAGRA is complicated because CAGRA offers multiple build algorithms. This PR implements automatic algorithm selection. The goal is a simplified API where the user sets only two parameters, M and ef_construction, which control graph size and quality respectively. This should be familiar to HNSW users and allows easier adoption of cuVS-accelerated HNSW graph building.

  hnsw::index_params params;
  params.M               = 24;
  params.ef_construction = 200;
  params.hierarchy       = cuvs::neighbors::hnsw::HnswHierarchy::GPU;

  auto hnsw_index = hnsw::build(res, params, dataset_host_view);
  cuvs::neighbors::hnsw::serialize(res, "hnsw_index.bin", *hnsw_index);

If we have enough memory (host and GPU) to do both the KNN graph building and the optimization in memory, then we choose the in-memory build and let cagra::index_params::from_hnsw_params derive the additional configuration parameters.

If the build would require more memory than is available, then we choose the ACE method and let the number of partitions be derived using #1603.

For the host, we query the OS for available memory; for the GPU, we assume that the whole device memory is available.
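
The selection rule described above can be sketched roughly as follows. This is only an illustration of the decision, not the code in this PR: `BuildAlgo`, `MemoryEstimate`, `MemoryBudget`, and `choose_build_algo` are hypothetical names, and the memory estimates themselves are assumed to be computed elsewhere.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical names for illustration only; not part of the cuVS API.
enum class BuildAlgo { InMemory, Ace };

// Estimated peak memory needed to build and optimize the graph fully in memory.
struct MemoryEstimate {
  std::size_t host_bytes;
  std::size_t gpu_bytes;
};

// Memory we assume we may use: available host memory as reported by the OS,
// and the device memory of the GPU.
struct MemoryBudget {
  std::size_t host_bytes;
  std::size_t gpu_bytes;
};

// Pick the in-memory build only if it fits on both host and GPU;
// otherwise fall back to the partitioned ACE build.
inline BuildAlgo choose_build_algo(const MemoryEstimate& need, const MemoryBudget& have)
{
  assert(have.host_bytes > 0 && have.gpu_bytes > 0);  // budgets must be known
  const bool fits = need.host_bytes <= have.host_bytes && need.gpu_bytes <= have.gpu_bytes;
  return fits ? BuildAlgo::InMemory : BuildAlgo::Ace;
}
```

With this shape, exceeding either the host or the GPU budget is enough to switch to ACE.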

copy-pr-bot · 2026-01-21T16:20:34Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

mfoerste4

I did not go over all memory estimates in detail, but I suggest aligning the predictions with real data.

Is autotuning of ACE params part of a different PR? Besides the open question on the file location, we might want to at least set the number of partitions dynamically.

mfoerste4 · 2026-01-22T20:31:45Z

-      raft::make_host_matrix_view<const T, int64_t>(dataset, nrow, this->dim_));
-  }
-
+  auto dataset_view = raft::make_host_matrix_view<const T, int64_t>(dataset, nrow, this->dim_);


Is the data expected to always reside in host memory?

ACE only supports host memory right now. The main reason is that we expect the data to be large and memory-mapped. Further, we do the partitioning and reordering on the host, since there is no benefit in moving the data to the GPU only to write it to disk afterwards.

Anyways, I think we can support device datasets easily since these should not end up using ACE with this heuristic. @tfeher What do you think?

mfoerste4 · 2026-01-22T21:09:08Z

+  // ACE build and search example.
+  cagra_build_search_ace(res);


maybe we want to rename this to something generic now that the selection is hidden from the user.

julianmi

I did not get a chance to fully review the memory heuristics yet. I wonder how we can test it though. Should max_host_memory_gb and max_gpu_memory_gb be optional HNSW parameters that we could use to test that the expected algorithm is used based on memory limits set?
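
One way such a test hook could look is sketched below. Only the names max_host_memory_gb and max_gpu_memory_gb come from the suggestion above; the `MemoryLimits` struct and the `effective_limit_gb` helper are hypothetical and not part of the cuVS API.

```cpp
#include <cassert>
#include <optional>

// Hypothetical optional overrides; unset means "use the detected value".
struct MemoryLimits {
  std::optional<double> max_host_memory_gb;
  std::optional<double> max_gpu_memory_gb;
};

// Return the user-provided limit if set, otherwise the detected amount.
inline double effective_limit_gb(std::optional<double> user_limit, double detected_gb)
{
  assert(detected_gb >= 0.0);
  return user_limit.value_or(detected_gb);
}
```

A test could then set a very small max_host_memory_gb and check that the ACE path is taken, without needing a machine that is actually short on memory.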

Co-authored-by: Julian Miller <mail@julian-miller.de>

julianmi

Thanks @mfoerste4. These are great improvements!

julianmi · 2026-06-02T14:06:44Z

I added 2GB static memory consumption for both GPU & Host. This is probably too conservative given that we already reduce the available usage to 80% of the actual value. Especially on small host memory machines this increases the number of partitions drastically. We might want to get rid of one of the limits. On the GPU I would prefer the constant 2GB (e.g. for workspace memory), but on the host side the percentage seems more natural. What do you think?

I agree that the host might stick to the percentage. We have much less control over the host which has many other processes running. Also, high memory pressure reduces the performance significantly. We might want to test with increasing the limit to 90% given the added 2 GB static memory.
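
To make the two limits concrete, here is a minimal sketch of the arithmetic, assuming an 80% fraction on the host and a 2 GiB static reserve on the GPU as discussed above. The function names and default values are illustrative assumptions, not the values or helpers used in the PR.

```cpp
#include <cassert>
#include <cstdint>

constexpr std::uint64_t kGiB = 1ull << 30;

// Host: use only a fraction of the available memory, leaving headroom for
// other processes (fraction is an assumed default, e.g. 0.8 or 0.9).
inline std::uint64_t usable_host_bytes(std::uint64_t available_bytes, double fraction = 0.8)
{
  assert(fraction > 0.0 && fraction <= 1.0);
  return static_cast<std::uint64_t>(static_cast<double>(available_bytes) * fraction);
}

// GPU: subtract a constant reserve (e.g. for workspace memory), clamping at zero.
inline std::uint64_t usable_gpu_bytes(std::uint64_t total_bytes,
                                      std::uint64_t reserve_bytes = 2 * kGiB)
{
  return total_bytes > reserve_bytes ? total_bytes - reserve_bytes : 0;
}
```

For a host with 16 GiB available, the 80% rule alone leaves about 12.8 GiB; subtracting another 2 GiB on top would leave about 10.8 GiB, which shows why applying both limits hits small machines hardest.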

achirkin

Looks good overall. A few small suggestions below.

tfeher

Fixed a few smaller issues.

KyleFromNVIDIA

Just one small request, otherwise looks good

KyleFromNVIDIA

Approving as I will be out for the first few days of next week and don't want to hold this up. Please address my above comment before merging.

achirkin

Thanks for the updates! lgtm

tfeher · 2026-06-22T06:35:41Z

/merge

tfeher requested review from a team as code owners January 21, 2026 16:20

github-project-automation Bot moved this to Todo in Unstructured Data Processing Jan 21, 2026

github-project-automation Bot added this to Unstructured Data Processing Jan 21, 2026

tfeher force-pushed the auto_selec_cagra_build branch from bb78635 to 23a0b16 Compare January 21, 2026 17:43

tfeher removed request for a team January 21, 2026 17:46

tfeher added the breaking (Introduces a breaking change) and improvement (Improves an existing functionality) labels Jan 21, 2026

tfeher commented Jan 21, 2026

Comment thread cpp/src/neighbors/detail/cagra/cagra_helpers.cpp Outdated

tfeher requested a review from mfoerste4 January 21, 2026 17:53

tfeher commented Jan 21, 2026

Comment thread examples/cpp/src/hnsw_openai_example.cu Outdated

tfeher commented Jan 21, 2026

Comment thread examples/cpp/src/hnsw_openai_example.cu

mfoerste4 reviewed Jan 22, 2026

julianmi reviewed Jan 23, 2026

cjnolet assigned tfeher Jan 29, 2026

achirkin reviewed Mar 3, 2026

Comment thread cpp/src/neighbors/detail/cagra/cagra_helpers.cpp Outdated

cjnolet moved this from Todo to In Progress in Unstructured Data Processing Mar 24, 2026

achirkin requested a review from a team as a code owner March 30, 2026 14:40

achirkin requested a review from msarahan March 30, 2026 14:40

tfeher changed the title ~~Auto select CAGRA build algorithom for hnsw::build~~ Auto select CAGRA build algorithm for hnsw::build Mar 31, 2026

tfeher and others added 3 commits May 21, 2026 10:11

Auto select cagra build algo during HNSW build

5297ce0

Update cpp/include/cuvs/neighbors/ivf_pq.hpp

1b5eb73

Co-authored-by: Julian Miller <mail@julian-miller.de>

Update cpp/src/neighbors/detail/cagra/cagra_helpers.cpp

c9ad00a

Co-authored-by: Julian Miller <mail@julian-miller.de>

huuanhhuyn reviewed Jun 2, 2026

Comment thread examples/cpp/CMakeLists.txt Outdated

Merge branch 'main' into auto_selec_cagra_build

9b56a19

huuanhhuyn reviewed Jun 2, 2026

Comment thread cpp/src/neighbors/detail/cagra/cagra_build.cuh

huuanhhuyn reviewed Jun 2, 2026

Comment thread python/cuvs/cuvs/tests/test_hnsw_ace.py Outdated

huuanhhuyn reviewed Jun 2, 2026

Comment thread cpp/src/neighbors/detail/cagra/cagra_build.cuh Outdated

julianmi reviewed Jun 2, 2026

mfoerste4 added 3 commits June 8, 2026 16:57

rename example

c918a3f

review suggestions + heuristic refinement

aa6c26f

Host memory limited via percentage, GPU only via static

2b72f2e

julianmi reviewed Jun 12, 2026

Comment thread cpp/src/neighbors/detail/hnsw.hpp Outdated

mfoerste4 and others added 2 commits June 12, 2026 13:59

Fixed WS pool in example, minor adaptions

871ebb9

Merge branch 'main' into auto_selec_cagra_build

db2a1cf

achirkin requested changes Jun 16, 2026

Fix comments an codebook size estimate

5d52dc4

tfeher commented Jun 16, 2026

Comment thread cpp/src/neighbors/detail/hnsw.hpp

Comment thread cpp/src/neighbors/ivf_pq_index.cu

Comment thread cpp/src/neighbors/ivf_pq_index.cu

Comment thread examples/cpp/CMakeLists.txt Outdated

Comment thread cpp/src/neighbors/detail/cagra/cagra_helpers.cpp Outdated

KyleFromNVIDIA requested changes Jun 16, 2026

Comment thread examples/cpp/CMakeLists.txt

more review suggestions

95daa58

KyleFromNVIDIA approved these changes Jun 16, 2026

KyleFromNVIDIA requested changes Jun 16, 2026

Comment thread examples/cpp/CMakeLists.txt

KyleFromNVIDIA approved these changes Jun 16, 2026

mfoerste4 and others added 3 commits June 16, 2026 21:12

move batchsize from public header to internal

6f232ab

more fixes

1ce8715

Merge branch 'main' into auto_selec_cagra_build

f6d8c35

achirkin approved these changes Jun 17, 2026

dantegd approved these changes Jun 22, 2026

rapids-bot Bot merged commit 0b090ba into NVIDIA:main Jun 22, 2026
272 of 283 checks passed

github-project-automation Bot moved this from In Progress to Done in Unstructured Data Processing Jun 22, 2026

coderabbitai Bot mentioned this pull request Jun 24, 2026

Add HNSW Layered Index Support #2148

Open
