CNDB-15381: CNDB-15485: Fix ResultRetriever key comparison to prevent… #2053

michaelsembwever · 2025-10-09T21:36:22Z

https://github.com/riptano/cndb/issues/15578

Port into main-5.0 commit 7e7230c

CNDB-15381: Port CASSANDRA-20888 index hints improvements (https://github.com/datastax/cassandra/pull/2004)
Port some of the improvements for index hints done by
[CASSANDRA-20888](https://issues.apache.org/jira/browse/CASSANDRA-20888),
especially the ones in messaging. Also clean up unused methods in index hints.

github-actions · 2025-10-09T21:36:41Z

… dupes in result set (#2024) (cherry picked from commit ada025c) Copy of #2023, but targeting `main` riptano/cndb#15485 This PR fixes a bug introduced to this branch via #1884. The bug only impacts SAI file format `aa` when the index file was produced via compaction, which is why the modified test simply adds coverage to compact the table and hit the bug. The bug happens when an iterator produces the same partition across two different batch fetches from storage. These keys were not collapsed in the `key.equals(lastKey)` logic because compacted indexes use a row id per row instead of per partition, and the logic in `PrimaryKeyWithSource` considers rows with different row ids to be distinct. However, when we went to materialize a batch from storage, we hit this code: ```java ClusteringIndexFilter clusteringIndexFilter = command.clusteringIndexFilter(firstKey.partitionKey()); if (cfs.metadata().comparator.size() == 0 || firstKey.hasEmptyClustering()) { return clusteringIndexFilter; } else { nextClusterings.clear(); for (PrimaryKey key : keys) nextClusterings.add(key.clustering()); return new ClusteringIndexNamesFilter(nextClusterings, clusteringIndexFilter.isReversed()); } ``` which returned `clusteringIndexFilter` for `aa` because those indexes do not have the clustering information. Therefore, each batch fetched the whole partition (which was subsequently filtered to the proper results), and produced a multiplier effect where we saw `batch` many duplicates. This fix works by comparing partition keys and clustering keys directly, which is a return to the old comparison logic from before #1884. There was actually a discussion about this in the PR to `main`, but unfortunately, we missed this case #1883 (comment). A more proper long term fix might be to remove the logic of creating a `PrimaryKeyWithSource` for AA indexes. However, I preferred this approach because it is essentially a `revert` instead of fixing forward solution.

sonarqubecloud · 2025-10-10T22:48:21Z

Quality Gate passed

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
89.5% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

cassci-bot · 2025-10-10T22:52:29Z

❌ Build ds-cassandra-pr-gate/PR-2053 rejected by Butler

1 regressions found
See build details here

Found 1 new test failures

Test	Explanation	Runs	Upstream
o.a.c.cql3.validation.operations.AggregationQueriesTest.testAggregationQueryShouldNotTimeoutWhenItExceedesReadTimeout (compression)	REGRESSION	🔴🔴	2 / 10

Found 6 known test failures

michaelsembwever force-pushed the mck-cndb-15578-main-5.0 branch from 4c7f75e to 024e52f Compare October 10, 2025 20:29

djatnieks approved these changes Oct 13, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

CNDB-15381: CNDB-15485: Fix ResultRetriever key comparison to prevent… #2053

CNDB-15381: CNDB-15485: Fix ResultRetriever key comparison to prevent… #2053

michaelsembwever commented Oct 9, 2025

Uh oh!

github-actions bot commented Oct 9, 2025

Uh oh!

sonarqubecloud bot commented Oct 10, 2025

Uh oh!

cassci-bot commented Oct 10, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

CNDB-15381: CNDB-15485: Fix ResultRetriever key comparison to prevent… #2053

Are you sure you want to change the base?

CNDB-15381: CNDB-15485: Fix ResultRetriever key comparison to prevent… #2053

Conversation

michaelsembwever commented Oct 9, 2025

Uh oh!

github-actions bot commented Oct 9, 2025

Checklist before you submit for review

Uh oh!

sonarqubecloud bot commented Oct 10, 2025

Quality Gate passed

Uh oh!

cassci-bot commented Oct 10, 2025

❌ Build ds-cassandra-pr-gate/PR-2053 rejected by Butler

Found 1 new test failures

Found 6 known test failures

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants