I have the same problem as well.
Tested on all 4.x versions (4.0.0, 4.0.1, 4.0.2, and 4.0.3).
I am unable to add a node to the cluster.
Partial logs below.
On the server side:
WARN [PROXY:1:C] Re-init the cache on error: all SubConns are in TransientFailure, latest connection error: . logrus/entry.go:188
DEBU [PROXY:1:C] Reloading Linear(attempt=38, duration=10s). cache/cache.go:346
WARN [PROXY:1] Re-init the watcher on error: all SubConns are in TransientFailure, latest connection error: . services/proxywatcher.go:180
DEBU [PROXY:1] Reloading Linear(attempt=38, duration=10s). services/proxywatcher.go:184
WARN [REVERSE:D] Re-init the cache on error: all SubConns are in TransientFailure, latest connection error: . logrus/entry.go:188
DEBU [REVERSE:D] Reloading Linear(attempt=38, duration=10s). cache/cache.go:346
WARN [PROXY:1:C] Re-init the cache on error: all SubConns are in TransientFailure, latest connection error: . logrus/entry.go:188
DEBU [PROXY:1:C] Reloading Linear(attempt=39, duration=10s). cache/cache.go:346
WARN [PROXY:1] Re-init the watcher on error: all SubConns are in TransientFailure, latest connection error: . services/proxywatcher.go:180
DEBU [PROXY:1] Reloading Linear(attempt=39, duration=10s). services/proxywatcher.go:184
WARN [REVERSE:D] Re-init the cache on error: all SubConns are in TransientFailure, latest connection error: . logrus/entry.go:188
DEBU [REVERSE:D] Reloading Linear(attempt=39, duration=10s). cache/cache.go:346
WARN [PROXY:1:C] Re-init the cache on error: all SubConns are in TransientFailure, latest connection error: . logrus/entry.go:188
DEBU [PROXY:1:C] Reloading Linear(attempt=40, duration=10s). cache/cache.go:346
WARN [PROXY:1] Re-init the watcher on error: all SubConns are in TransientFailure, latest connection error: . services/proxywatcher.go:180
DEBU [PROXY:1] Reloading Linear(attempt=40, duration=10s). services/proxywatcher.go:184
WARN [REVERSE:D] Re-init the cache on error: all SubConns are in TransientFailure, latest connection error: . logrus/entry.go:188
DEBU [REVERSE:D] Reloading Linear(attempt=40, duration=10s). cache/cache.go:346
On the client side:
WARN [NODE:BEAT] Heartbeat failed transport is closing. logrus/entry.go:188
WARN [PROC:1] Sync rotation state cycle failed: watcher has disconnected, going to retry after 10s. logrus/entry.go:188
WARN [NODE:BEAT] Heartbeat failed all SubConns are in TransientFailure, latest connection error: . logrus/entry.go:188
WARN [NODE:BEAT] Heartbeat failed transport is closing. logrus/entry.go:188
WARN [PROC:1] Sync rotation state cycle failed: watcher has disconnected, going to retry after 10s. logrus/entry.go:188
Looking forward to any suggestions.
Regards,
P.