You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Describe the bug
When connectivity to a postgres database has been lost, an operator step-down will cause vault to hang and becomes unresponsive for an extended period (> 5min). This is an HA issue and is blocking our DR test.
To Reproduce
Steps to reproduce the behavior:
Confirm credentials can be retrieved vault read database/creds/my-role
Confirm which node is leader vault status
Disable traffic to postgres on leader iptables -A OUTPUT -p tcp -d <db host> --dport 9142 -j DROP
Issue vault status, Error checking seal status: context deadline exceeded
Expected behavior
Step-down succeeds and vault is responsive.
Environment:
Vault Server Version (retrieve with vault status):
Key Value
--- -----
Seal Type shamir
Initialized true
Sealed false
Total Shares 1
Threshold 1
Version 1.6.1
Cluster Name vault-cluster-edfc2327
Cluster ID
HA Enabled true
HA Cluster
HA Mode active
Raft Committed Index 388093
Raft Applied Index 388093
Vault CLI Version (retrieve with vault version):
1.5.5
Server Operating System/Architecture:
Linux flatcar Initial Website Import #1 SMP Fri Oct 23 16:42:52 -00 2020 x86_64 Intel(R) Xeon(R) Platinum 8175M CPU @ 2.50GHz GenuineIntel GNU/Linux
Additional context
Seems to be related that the postgres-database-plugin, doesn't handle connection failures resiliently #6792 and a step down forces the plugin to be gracefully shutdown
Find attached goroutine traces of vault process of leader when stepping down. log.json.zip
The text was updated successfully, but these errors were encountered:
Describe the bug
When connectivity to a postgres database has been lost, an
operator step-down
will cause vault to hang and becomes unresponsive for an extended period (> 5min). This is an HA issue and is blocking our DR test.To Reproduce
Steps to reproduce the behavior:
vault read database/creds/my-role
vault status
iptables -A OUTPUT -p tcp -d <db host> --dport 9142 -j DROP
vault read database/creds/my-role
, Error reading database/creds/my-role: context deadline exceededvault operator step-down
, Success! Stepped down: https://vault.xxxxvault status
, Error checking seal status: context deadline exceededExpected behavior
Step-down succeeds and vault is responsive.
Environment:
vault status
):Vault CLI Version (retrieve with
vault version
):1.5.5
Server Operating System/Architecture:
Linux flatcar Initial Website Import #1 SMP Fri Oct 23 16:42:52 -00 2020 x86_64 Intel(R) Xeon(R) Platinum 8175M CPU @ 2.50GHz GenuineIntel GNU/Linux
Additional context
Seems to be related that the postgres-database-plugin, doesn't handle connection failures resiliently #6792 and a step down forces the plugin to be gracefully shutdown
Find attached goroutine traces of vault process of leader when stepping down.
log.json.zip
The text was updated successfully, but these errors were encountered: