Consensus stopped due to syncing issue #4517

Open
junwei0117 opened this issue Dec 17, 2024 · 2 comments
Assignees
Labels
consensus Issues related to the Core Consensus team

Comments

@junwei0117
Collaborator

Bug description

Consensus on our testnet occasionally stops for several minutes, and no new checkpoints are created while it is stopped. The metrics show that the number of missing blocks is higher than normal during these periods.

Image

The following warning appears in the logs during this time:

WARN consensus_core::synchronizer: No results returned while requesting missing blocks

Explore-logs-2024-12-17 18_21_17.txt

@alexsporn alexsporn added the consensus Issues related to the Core Consensus team label Dec 17, 2024
@junwei0117
Collaborator Author

junwei0117 commented Dec 20, 2024

We just encountered another temporary consensus stop:

Image
2024-12-20T09:23:36.466720Z  WARN consensus_core::authority_service: Errored while trying to fetch missing ancestors via synchronizer: Synchronizer for fetching blocks directly from I is saturated
2024-12-20T09:23:36.468730Z  WARN consensus_core::authority_service: Errored while trying to fetch missing ancestors via synchronizer: Synchronizer for fetching blocks directly from P is saturated
2024-12-20T09:23:36.728751Z  WARN consensus_core::authority_service: Errored while trying to fetch missing ancestors via synchronizer: Synchronizer for fetching blocks directly from F is saturated
2024-12-20T09:23:37.241722Z  WARN consensus_core::authority_service: Errored while trying to fetch missing ancestors via synchronizer: Synchronizer for fetching blocks directly from D is saturated
2024-12-20T09:23:40.128437Z  WARN consensus_core::synchronizer: No results returned while requesting missing blocks
2024-12-20T09:23:46.420191Z  WARN consensus_core::synchronizer: Max retries 6 reached while trying to fetch blocks from peer F.

Explore-logs-2024-12-20 20_46_02.txt
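
For triage, it can help to tally the warnings by module and by the peer named in the "saturated" messages. The following is only a rough sketch: the regular expressions are my own guess at the line layout based on the six lines quoted above, and the sample is pasted inline rather than read from the attached export.

```python
import re
from collections import Counter

# Sample lines copied from the excerpt above; in practice, read the exported log file.
LOG = """\
2024-12-20T09:23:36.466720Z  WARN consensus_core::authority_service: Errored while trying to fetch missing ancestors via synchronizer: Synchronizer for fetching blocks directly from I is saturated
2024-12-20T09:23:36.468730Z  WARN consensus_core::authority_service: Errored while trying to fetch missing ancestors via synchronizer: Synchronizer for fetching blocks directly from P is saturated
2024-12-20T09:23:36.728751Z  WARN consensus_core::authority_service: Errored while trying to fetch missing ancestors via synchronizer: Synchronizer for fetching blocks directly from F is saturated
2024-12-20T09:23:37.241722Z  WARN consensus_core::authority_service: Errored while trying to fetch missing ancestors via synchronizer: Synchronizer for fetching blocks directly from D is saturated
2024-12-20T09:23:40.128437Z  WARN consensus_core::synchronizer: No results returned while requesting missing blocks
2024-12-20T09:23:46.420191Z  WARN consensus_core::synchronizer: Max retries 6 reached while trying to fetch blocks from peer F.
"""

# Assumed layout: "<timestamp>  WARN <module>: <message>"
LINE = re.compile(r"^(\S+)\s+WARN\s+(\S+?):\s+(.*)$")
# Assumed wording of the saturation warning, taken from the lines above.
SATURATED = re.compile(r"directly from (\S+) is saturated")

by_module = Counter()
saturated_peers = Counter()
for line in LOG.splitlines():
    m = LINE.match(line)
    if not m:
        continue  # skip anything that is not a WARN line in the assumed layout
    _, module, message = m.groups()
    by_module[module] += 1
    peer = SATURATED.search(message)
    if peer:
        saturated_peers[peer.group(1)] += 1

print(dict(by_module))
print(dict(saturated_peers))
```

On the six lines above this counts four warnings from `consensus_core::authority_service` and two from `consensus_core::synchronizer`, with peers I, P, F and D each reported as saturated once.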

More metrics are available on Grafana for this time range (times in CET):

{"from":"2024-12-20 10:21:20","to":"2024-12-20 10:24:54"}

@junwei0117
Collaborator Author

Here is another occurrence. Consensus didn't stop this time, but consensus latency still became very high. Note that this happened after the node came back online from being offline.

Image

WARN consensus_core::synchronizer: No results returned while requesting missing blocks

Explore-logs-2024-12-20 20_45_11.txt

More metrics are available on Grafana for this time range (times in CET):

{"from":"2024-12-20 10:21:20","to":"2024-12-20 10:24:54"}


4 participants