roachtest: follower-reads/survival=region/locality=regional/reads=bounded-staleness/insufficient-quorum failed #147322

Open
cockroach-teamcity opened this issue May 27, 2025 · 1 comment
Labels
B-runtime-assertions-enabled branch-release-25.2 C-test-failure Broken test (automatically or manually discovered). O-roachtest O-robot Originated from a bot. T-kv KV Team

Comments

@cockroach-teamcity
Member

cockroach-teamcity commented May 27, 2025

Note: This build has runtime assertions enabled. If the same failure was hit in a run without assertions enabled, there should be a similar failure without this message. If there isn't one, then this failure is likely due to an assertion violation or (assertion) timeout.

roachtest.follower-reads/survival=region/locality=regional/reads=bounded-staleness/insufficient-quorum failed with artifacts on release-25.2 @ a7ec8f927bf1bb46443080b7d3655869e7507416:

(test_runner.go:2323).func1: task ``task-139`` returned error: error reading data: pq: internal error while retrieving user account memberships: operation "get-user-session" timed out after 10s (given timeout 10s): internal error while retrieving user account: get auth info error: interrupted during singleflight load-value:authinfo-roachprod-2-2: context deadline exceeded
(follower_reads.go:823).verifyHighFollowerReadRatios: roachtestutil.addCookies: unable to extract sessionID: roachtestutil.GetSessionID: failed to authenticate: context canceled
test artifacts and logs in: /artifacts/follower-reads/survival=region/locality=regional/reads=bounded-staleness/insufficient-quorum/run_1

Parameters:

  • arch=amd64
  • cloud=gce
  • coverageBuild=false
  • cpu=4
  • encrypted=false
  • fs=ext4
  • localSSD=true
  • metamorphicBufferedSender=false
  • metamorphicLeases=epoch
  • runtimeAssertionsBuild=true
  • ssd=0
Help

See: roachtest README

See: How To Investigate (internal)

See: Grafana

/cc @cockroachdb/kv-triage

This test on roachdash | Improve this report!

Jira issue: CRDB-50998

cockroach-teamcity added the following labels on May 27, 2025: C-test-failure (Broken test (automatically or manually discovered).), O-robot (Originated from a bot.), O-roachtest, release-blocker (Indicates a release-blocker. Use with branch-release-2x.x label to denote which branch is blocked.), T-kv (KV Team), B-runtime-assertions-enabled, branch-release-25.2
@stevendanna
Collaborator

From what I can see, node 6 was unhealthy for most of this test: from nearly the beginning of its logs onward, it could not update its liveness record.

Looking at the implementation of the test, one thing I noted is that while we wait for the main database to be fully replicated, we don't really wait for any of the system tables to be fully replicated.

stevendanna removed the release-blocker label (Indicates a release-blocker. Use with branch-release-2x.x label to denote which branch is blocked.) on May 28, 2025