You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository has been archived by the owner on Jan 22, 2025. It is now read-only.
Validator crashed due to #33740. On attempt to restart, the validator went into the same bad state that I've seen before after this crash, where a ton of logs of the form:
The solution I have used in this case is to delete the rocksdb from my ledger, download a fresh snapshot, and restart.
After doing that, the validator went into a mode where it would never get to the "voting" phase. Instead it logged like this periodically:
[2023-11-14T18:06:14.733317350Z INFO solana_core::cluster_slot_state_verifier] check_slot_agrees_with_cluster() slot: 230006728, root: 230006574, slot_state_update: DuplicateConfirmed(DuplicateConfirmedState { duplicate_confirmed_hash: 7dZqqGgf1TCaasmxX2eAiZiPxUdNYGPewXBC1xTjH6w9, bank_status: Frozen(7dZqqGgf1TCaasmxX2eAiZiPxUdNYGPewXBC1xTjH6w9) })
[2023-11-14T18:06:14.733363116Z INFO solana_core::replay_stage] Waiting to switch vote to 230006729,
resetting to slot None for now,
switch proof stake: 0,
threshold stake: 153264047423939550,
total stake: 403326440589314600
[2023-11-14T18:06:14.733418090Z INFO solana_core::replay_stage] Haven't landed a vote, so skipping my leader slot
That's potential issue 2.
The full logs for almost 1 day leading up to these issues:
[2023-11-14T01:48:44.265809110Z INFO solana_validator] Starting validator with: ArgsOs {
Line 1910826, validator starting up and going into the POTENTIAL ISSUE 1 mode:
[2023-11-14T01:57:47.176418508Z INFO solana_validator] Starting validator with: ArgsOs {
Line 2043435, validator starting up after being restarted because it was in the POTENTIAL ISSUE 1 mode. A possibly faulty attempt had been made to delete the ledger before start-up. But after this the validator was still in heavy "dropped vote VoteStateUpdate" mode:
[2023-11-14T02:03:48.684757437Z INFO solana_validator] Starting validator with: ArgsOs {
Line 2410319, restarted again because of the continued "VoteStateUpdate" logs, but this time, DEFINITELY deleted rocksdb before the restart:
[2023-11-14T02:14:51.383772737Z INFO solana_validator] Starting validator with: ArgsOs {
Line 10533669, TODAY'S crash (issue #33740), after which the restarted validator was back in the bad "VoteStateUpdate" logging mode (POTENTIAL ISSUE 1):
[2023-11-14T17:47:07.603584201Z INFO solana_validator] Starting validator with: ArgsOs {
Line 10637901: DOWNLOADED FRESH SNAPSHOTS then restarted validator, it went into the POTENTIAL ISSUE 2 mode, just sat there never making it to "voting". I am not sure if I deleted rocksdb right before this start-up.
[2023-11-14T17:54:58.232062722Z INFO solana_validator] Starting validator with: ArgsOs {
Line 10917972: Gave up on the "never getting to voting" start-up and restarted again. Possibly deleted rocksdb first:
[2023-11-14T18:06:47.467202892Z INFO solana_validator] Starting validator with: ArgsOs {
Line10918441: Final restart, this time I definitely cleared out rocksdb and downloaded fresh snapshots before starting up. This restart finally worked.
[2023-11-14T18:10:07.151966666Z INFO solana_validator] Starting validator with: ArgsOs {
Problem
Validator crashed due to #33740. On attempt to restart, the validator went into the same bad state that I've seen before after this crash, where a ton of logs of the form:
are logged. That's potential issue 1.
The solution I have used in this case is to delete the rocksdb from my ledger, download a fresh snapshot, and restart.
After doing that, the validator went into a mode where it would never get to the "voting" phase. Instead it logged like this periodically:
That's potential issue 2.
The full logs for almost 1 day leading up to these issues:
https://s3.us-west-1.amazonaws.com/shinobi-systems.com/BLADE.2014.11.14.log.gz
(not sure why I named the file 2014.11.14, it's from 2023.11.14, some serious fumble fingering there)
Proposed Solution
The text was updated successfully, but these errors were encountered: