Enable caching of all filters in `knn` queries #134458

jimczi · 2025-09-10T15:03:29Z

This change makes all filters in the knn query eligible for query caching. By default, Lucene considers some simple filters (e.g., term queries) too cheap to cache. In the context of vector search, these filters are eagerly materialized as bitsets, which makes them significantly more expensive to evaluate on every request. Forcing them to be cacheable avoids repeated recomputation.

This is a stop-gap change to support simple use cases such as a single term query used as a filter in knn. The long-term solution is to move this decision logic into the Lucene knn codec itself, but that will require more time.

Benchmark Results

Dataset: 20M 128D vectors, term filter matching ~80% of documents.

With this change:

Precision  QPS      P50 (ms)   P95 (ms)
0.91       632.8    5.763      9.900

Without this change:

Precision  QPS      P50 (ms)   P95 (ms)
0.91        68.2    82.52      193.92

This change makes all filters in the `knn` query eligible for query caching. By default, Lucene considers some simple filters (e.g., term queries) too cheap to cache. In the context of vector search, these filters are eagerly materialized as bitsets, which makes them significantly more expensive to evaluate on every request. Forcing them to be cacheable avoids repeated recomputation. This is a stop-gap change to support simple use cases such as a single term query used as a filter in `knn`. The long-term solution is to move this decision logic into the Lucene `knn` codec itself, but that will require more time. ### Benchmark Results Dataset: **20M 128D vectors**, term filter matching \~80% of documents. **With this change:** ``` Precision QPS P50 (ms) P95 (ms) 0.91 632.8 5.763 9.900 ``` **Without this change:** ``` Precision QPS P50 (ms) P95 (ms) 0.91 68.2 82.52 193.92 ```

elasticsearchmachine · 2025-09-10T15:03:54Z

Pinging @elastic/es-search-relevance (Team:Search Relevance)

elasticsearchmachine · 2025-09-10T15:03:55Z

Hi @jimczi, I've created a changelog YAML for you.

…_query_filter_cache

...n/java/org/elasticsearch/index/cache/query/ElasticsearchUsageTrackingQueryCachingPolicy.java

…sary

jimczi added >enhancement :Search Relevance/Vectors Vector search v9.2.0 labels Sep 10, 2025

elasticsearchmachine added the Team:Search Relevance Meta label for the Search Relevance team in Elasticsearch label Sep 10, 2025

Update docs/changelog/134458.yaml

381f2aa

jimczi added 2 commits September 10, 2025 16:26

fix NPE

515132f

Merge remote-tracking branch 'origin/knn_query_filter_cache' into knn…

5fb66c5

…_query_filter_cache

benwtrent self-requested a review September 10, 2025 15:48

plug the es query caching policy

ea443c4

benwtrent reviewed Sep 10, 2025

View reviewed changes

...n/java/org/elasticsearch/index/cache/query/ElasticsearchUsageTrackingQueryCachingPolicy.java Outdated Show resolved Hide resolved

remove the custom es query caching policy as it is not strictly neces…

43a5394

…sary

benwtrent approved these changes Sep 10, 2025

View reviewed changes

jimczi added 4 commits September 10, 2025 17:40

fix spotless

216767f

fix ut assertion

ab1dbd9

Merge branch 'main' into knn_query_filter_cache

33afc99

Merge branch 'main' into knn_query_filter_cache

896f7c8

jimczi merged commit ad5df9d into elastic:main Sep 11, 2025
34 checks passed

jimczi deleted the knn_query_filter_cache branch September 11, 2025 08:12

john-wagster mentioned this pull request Sep 20, 2025

[CI] CachingEnableFilterQueryTests testBooleanQuery failing #135124

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Enable caching of all filters in `knn` queries #134458

Enable caching of all filters in `knn` queries #134458

Uh oh!

jimczi commented Sep 10, 2025

Uh oh!

elasticsearchmachine commented Sep 10, 2025

Uh oh!

elasticsearchmachine commented Sep 10, 2025

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Enable caching of all filters in knn queries #134458

Enable caching of all filters in knn queries #134458

Uh oh!

Conversation

jimczi commented Sep 10, 2025

Benchmark Results

Uh oh!

elasticsearchmachine commented Sep 10, 2025

Uh oh!

elasticsearchmachine commented Sep 10, 2025

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Enable caching of all filters in `knn` queries #134458

Enable caching of all filters in `knn` queries #134458