Add AnytimeRankingSearcher for SLA-Aware Early Termination with Bin-Based Score Boosting #14525

Status: Open · wants to merge 47 commits into main

Conversation

@atris (Contributor) commented Apr 18, 2025

Add AnytimeRankingSearcher for SLA-aware early termination with bin-based score boosting

This patch adds AnytimeRankingSearcher, a new low-latency search implementation that supports early termination under SLA constraints, combined with bin-aware score boosting.

Architecture

Index-time binning uses a configurable post-indexing pass to assign each document to one of bin.count bins. This pass is activated via field attributes (doBinning=true, bin.count=N, etc.) and is triggered after all standard postings are written. Binning uses a segment-local sparse similarity graph where each node is a document and edges represent cosine similarity between term frequency vectors.

The bin distribution is computed via recursive graph bisection. The graph is recursively split into halves using a seeded heuristic that assigns each document to the closer of two seed nodes based on edge weights. This ensures intra-bin similarity and minimizes cross-bin connectivity. A fixed number of bins is produced, and the assignment is saved to a .binmap file.
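
As a rough illustration of this step, the sketch below recursively splits a doc set by comparing each document's edge weights to two seed nodes. The class name, the naive seed selection, and the power-of-two bin count are simplifying assumptions for the sketch, not the actual DocBinningGraphBuilder code.

import java.util.ArrayList;
import java.util.List;
import java.util.Map;

// Illustrative sketch only: recursive bisection over a segment-local similarity graph.
class BisectionSketch {
  // graph.get(doc) maps neighbor doc IDs to edge weights (cosine similarity of term vectors).
  static void bisect(int[] docs, Map<Integer, Map<Integer, Float>> graph,
                     int depth, int maxDepth, int[] binOf, int firstBin) {
    if (depth == maxDepth || docs.length <= 1) {
      for (int doc : docs) {
        binOf[doc] = firstBin; // leaves of the recursion become bins
      }
      return;
    }
    int seedA = docs[0];                  // naive seed choice, for the sketch only
    int seedB = docs[docs.length - 1];
    List<Integer> left = new ArrayList<>(), right = new ArrayList<>();
    for (int doc : docs) {
      float toA = graph.getOrDefault(doc, Map.of()).getOrDefault(seedA, 0f);
      float toB = graph.getOrDefault(doc, Map.of()).getOrDefault(seedB, 0f);
      // each doc goes to the seed it is more strongly connected to
      (toA >= toB ? left : right).add(doc);
    }
    int binsPerHalf = 1 << (maxDepth - depth - 1); // assumes bin.count is a power of two
    bisect(left.stream().mapToInt(Integer::intValue).toArray(),
           graph, depth + 1, maxDepth, binOf, firstBin);
    bisect(right.stream().mapToInt(Integer::intValue).toArray(),
           graph, depth + 1, maxDepth, binOf, firstBin + binsPerHalf);
  }
}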

In approximate mode (graph.builder=approx), we avoid building explicit term vectors. Instead, token co-occurrence is tracked using per-term BitSets, and documents are grouped using lightweight overlap heuristics. This trades off precision for speed and scales better on large segments.
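
A minimal sketch of the overlap-heuristic idea follows: per-term BitSets mark which docs contain each term, and each doc joins the bin of the seed doc it shares the most terms with. Seed selection, tie-breaking, and data structures (e.g. whether Lucene's FixedBitSet is used) are assumptions here and differ in the real ApproximateDocGraphBuilder.

import java.util.Arrays;
import java.util.BitSet;
import java.util.Collection;

// Sketch only: assign docs to bins by counting terms shared with a few seed docs.
class ApproxBinningSketch {
  static int[] assignBins(Collection<BitSet> docsPerTerm, int maxDoc, int binCount) {
    // Spread seeds evenly across the doc ID space (illustrative choice).
    int[] seeds = new int[binCount];
    for (int b = 0; b < binCount; b++) seeds[b] = (int) ((long) b * maxDoc / binCount);

    int[] binOf = new int[maxDoc];
    int[] overlap = new int[binCount];
    for (int doc = 0; doc < maxDoc; doc++) {
      Arrays.fill(overlap, 0);
      for (BitSet docsWithTerm : docsPerTerm) {
        if (!docsWithTerm.get(doc)) continue;           // term absent from this doc
        for (int b = 0; b < binCount; b++) {
          if (docsWithTerm.get(seeds[b])) overlap[b]++; // term shared with seed b
        }
      }
      int best = 0;
      for (int b = 1; b < binCount; b++) {
        if (overlap[b] > overlap[best]) best = b;
      }
      binOf[doc] = best;                                // doc joins the most similar seed's bin
    }
    return binOf;
  }
}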

At search time, BinMapReader loads the bin assignments, and BinScoreReader makes them accessible to search collectors. BinBoostCalculator assigns a boost score to each bin based on estimated bin quality (e.g. average term frequency or rank share in a warmup run). This boost is applied additively during ranking, allowing the collector to prioritize high-quality bins earlier and exit faster under SLA pressure.
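
Conceptually, the additive boost amounts to something like the following inside a per-leaf scoring path; this is a simplified sketch, and the array-based accessors shown here are assumptions rather than the patch's actual BinMapReader/BinScoreReader APIs.

// Sketch of additive bin boosting during ranking (accessors are hypothetical).
// binOf[doc] comes from the .binmap data loaded by BinMapReader, and
// binBoost[bin] from BinBoostCalculator via BinScoreReader.
float boostedScore(float rawScore, int doc, int[] binOf, float[] binBoost) {
  int bin = binOf[doc];            // segment-local bin assignment
  return rawScore + binBoost[bin]; // boost is applied additively to the raw score
}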

Binning Modes (Index Time)

This patch supports two modes of document binning during indexing:
• Absolute mode: computes exact bin assignments using full document similarity graphs.
• Approximate mode: enabled when document count exceeds a threshold; skips graph construction and uses faster heuristics to assign bins.

Bin assignment is handled by DocBinningGraphBuilder and switches to ApproximateDocGraphBuilder automatically when needed.
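
The automatic switch can be pictured as a doc-count check along these lines; the DocGraphBuilder supertype, the constructor signatures, and the threshold value are assumptions for the sketch, not taken from the patch.

// Sketch of builder selection for the "auto" strategy (threshold is illustrative).
static final int APPROX_THRESHOLD = 100_000;

static DocGraphBuilder chooseBuilder(String binBuilder, int maxDoc) {
  if ("exact".equals(binBuilder)) return new DocBinningGraphBuilder();
  if ("approx".equals(binBuilder)) return new ApproximateDocGraphBuilder();
  // "auto": exact graph on small segments, approximate heuristics on large ones
  return maxDoc > APPROX_THRESHOLD
      ? new ApproximateDocGraphBuilder()
      : new DocBinningGraphBuilder();
}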

To enable binning, field attributes must be set:

fieldType.putAttribute("postingsFormat", "Lucene103");
fieldType.putAttribute("doBinning", "true");
fieldType.putAttribute("bin.count", "4");      // total number of bins
fieldType.putAttribute("bin.builder", "auto"); // binning strategy: "exact", "approx", or "auto"

Search-Time Integration

At search time, bin boosts are loaded using BinScoreReader. To enable anytime ranking:

// topK = number of hits to return, slaMs = per-query latency budget in milliseconds
AnytimeRankingSearcher searcher = new AnytimeRankingSearcher(reader, topK, slaMs, fieldName);
TopDocs results = searcher.search(query);

Internally:
• Bin scores are applied per segment at query time.
• The collector monitors elapsed time and stops scoring once the SLA budget is exhausted (see the sketch below).
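
A rough sketch of the time check inside the collector, using Lucene's CollectionTerminatedException to stop a leaf early; the class structure and field names are illustrative, not the patch's actual collector.

import java.io.IOException;
import java.util.concurrent.TimeUnit;
import org.apache.lucene.search.CollectionTerminatedException;
import org.apache.lucene.search.LeafCollector;
import org.apache.lucene.search.Scorable;

// Sketch of the SLA check inside a leaf collector.
class SlaLeafCollectorSketch implements LeafCollector {
  private final long deadlineNanos; // query start + SLA budget
  private final float binBoost;     // additive boost for the current segment/bin
  private Scorable scorer;

  SlaLeafCollectorSketch(long slaMs, float binBoost) {
    this.deadlineNanos = System.nanoTime() + TimeUnit.MILLISECONDS.toNanos(slaMs);
    this.binBoost = binBoost;
  }

  @Override
  public void setScorer(Scorable scorer) {
    this.scorer = scorer;
  }

  @Override
  public void collect(int doc) throws IOException {
    if (System.nanoTime() > deadlineNanos) {
      // SLA budget exhausted: stop scoring this segment; hits collected so far are kept.
      throw new CollectionTerminatedException();
    }
    float boosted = scorer.score() + binBoost; // bin boost applied additively
    // ... push (doc, boosted) into the top-k priority queue ...
  }
}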

Test Coverage

Includes a full test (TestAnytimeRankingSearchQuality) that:

• Indexes 10k docs with periodically placed relevant content
• Runs baseline and anytime search
• Computes NDCG, precision, and recall (see the metric sketch below)
• Asserts average and max position delta across result sets
• Verifies minimal degradation under SLA constraints
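
For reference, NDCG can be computed along these lines; this is a generic sketch of the metric, not necessarily the exact code used in TestAnytimeRankingSearchQuality.

// Generic NDCG@k: DCG of the returned ranking divided by DCG of the ideal ranking.
static double ndcgAtK(double[] gainsInRankOrder, double[] idealGainsDescending, int k) {
  double dcg = 0, idcg = 0;
  for (int i = 0; i < k; i++) {
    double discount = Math.log(i + 2) / Math.log(2); // log2(rank + 1)
    if (i < gainsInRankOrder.length) dcg += gainsInRankOrder[i] / discount;
    if (i < idealGainsDescending.length) idcg += idealGainsDescending[i] / discount;
  }
  return idcg == 0 ? 0 : dcg / idcg;
}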

Performance

• AnytimeRankingSearcher provides ~2–3x speedup at low SLA targets
• Recall, precision, and NDCG remain at 95% or more of baseline
• Position delta of relevant docs remains bounded

Notes

• Readers are wrapped using BinScoreUtil.wrap(reader) to enable bin-aware scoring (see the usage sketch after these notes)
• Compound readers are tracked and closed explicitly
• BinFilter skipping is not implemented yet — will be added in a follow-up patch
• Fallback to approximate binning ensures indexing remains scalable for large segments
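
Putting the reader wrapping together with the searcher, usage looks roughly like this; it is a sketch based on the notes above, and the exact return types and constructor arguments may differ in the patch. The values 10 (topK), 50 (slaMs), and "body" (field) are example inputs.

// Wrap the reader so collectors can see bin scores, then search with an SLA budget.
DirectoryReader raw = DirectoryReader.open(dir);
IndexReader reader = BinScoreUtil.wrap(raw); // enables bin-aware scoring
AnytimeRankingSearcher searcher = new AnytimeRankingSearcher(reader, 10, 50, "body");
TopDocs results = searcher.search(query);
reader.close(); // compound readers are tracked and closed explicitly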

Benchmarks

Search Latency (Top-K Retrieval): benchmark chart (image not reproduced here)

Relevance Score Comparison: benchmark chart (image not reproduced here)

@atris (Contributor, Author) commented Apr 20, 2025

@jpountz This PR is now ready for review. I will post luceneutil benchmarks tomorrow. Please let me know if anything else is needed from me.

@jpountz (Contributor) commented Apr 22, 2025

Can you link the paper that you implemented? I'll need some time to digest these 5k lines. :)

@atris (Contributor, Author) commented Apr 23, 2025

@jpountz Thanks! Here is the paper: https://arxiv.org/abs/2104.08976

Note that the core inspiration of this PR's approach comes from the paper, but the implementation diverges in certain ways:

The paper uses bins mainly for hard cutoffs and filtering. This PR instead uses bins to compute adaptive score boosts and wires them directly into the new Collector.

The PR also adds:
• index-time graph-based binning (exact + approximate), which adds minimal indexing latency but gives significant improvements in search time
• bin-level boosting at the segment level

So while the high-level idea overlaps, the implementation is more ambitious and also opens the door to follow-ups such as bin skipping and multi-field support, where graph intersection or fusion could identify documents that are strongly connected across multiple semantic dimensions.

@jpountz (Contributor) commented Apr 24, 2025

> the implementation is more ambitious

I like ambition, but it also makes this change harder to review/integrate, especially with the high LOC count. I would suggest splitting this PR into multiple PRs, for instance:

  • First PR just works with indexes created with existing recursive graph bisection and uses basic heuristics to determine which ranges of doc IDs to score first (e.g. using impacts) to hopefully increase the top-k score quickly. No extra data stored in the index. All code under lucene/misc rather than core.
  • Another PR can introduce the SLA-based termination logic.
  • Another PR can introduce the topical clustering mechanism of Kulkarni and Callan that the paper suggests combining with recursive graph bisection.
  • Another PR can discuss augmenting index formats to enhance the range selection logic.

@atris (Contributor, Author) commented Apr 24, 2025

@jpountz thanks for looking!

Just for my understanding: should the first PR contain the index-time binning logic that is currently in this PR, just with a simpler, rank-based model of bin boosting?

github-actions (bot) commented May 9, 2025

This PR has not had activity in the past 2 weeks, labeling it as stale. If the PR is waiting for review, notify the [email protected] list. Thank you for your contribution!

github-actions bot added the Stale label on May 9, 2025.