I am unable to install Gravity on an AWS node. The install appears to fail at the planet health check, but I am not sure what is going wrong. Is there any way to debug planet? I am getting the following error:
Sat Sep 21 00:49:29 UTC Executing "/health" locally
Sat Sep 21 00:49:29 UTC Waiting for the planet to start
Sat Sep 21 00:49:29 UTC Wait for cluster to pass health checks
Sat Sep 21 00:49:39 UTC Still waiting for the planet to start (10 seconds elapsed)
Sat Sep 21 00:50:29 UTC Still waiting for the planet to start (1 minute elapsed)
Sat Sep 21 00:51:29 UTC Still waiting for the planet to start (2 minutes elapsed)
Sat Sep 21 00:55:49 UTC Still waiting for the planet to start (6 minutes elapsed)
Sat Sep 21 00:57:49 UTC Still waiting for the planet to start (8 minutes elapsed)
Sat Sep 21 00:57:51 UTC Executing operation finished in 12 minutes
Sat Sep 21 00:57:51 UTC Saving debug report to /home/ubuntu/crashreport.tgz
[ERROR]: not all planets have come up yet: &{degraded []}, failed to execute phase "/health"
Gravity Plan
Phase Description State Node Requires Updated
----- ----------- ----- ---- -------- -------
✓ checks Execute preflight checks Completed - - Fri Sep 20 17:01 UTC
✓ configure Configure packages for all nodes Completed - - Fri Sep 20 17:01 UTC
✓ bootstrap Bootstrap all nodes Completed - - Fri Sep 20 17:01 UTC
✓ ip-10-151-20-200 Bootstrap master node ip-10-151-20-200 Completed 10.151.20.200 - Fri Sep 20 17:01 UTC
✓ pull Pull configured packages Completed - /configure,/bootstrap Fri Sep 20 17:02 UTC
✓ ip-10-151-20-200 Pull packages on master node ip-10-151-20-200 Completed 10.151.20.200 /configure,/bootstrap Fri Sep 20 17:02 UTC
✓ masters Install system software on master nodes Completed - /pull Fri Sep 20 17:02 UTC
✓ ip-10-151-20-200 Install system software on master node ip-10-151-20-200 Completed - /pull/ip-10-151-20-200 Fri Sep 20 17:02 UTC
✓ teleport Install system package teleport:3.2.7 on master node ip-10-151-20-200 Completed 10.151.20.200 /pull/ip-10-151-20-200 Fri Sep 20 17:02 UTC
✓ planet Install system package planet:6.0.6-11402 on master node ip-10-151-20-200 Completed 10.151.20.200 /pull/ip-10-151-20-200 Fri Sep 20 17:02 UTC
✓ wait Wait for Kubernetes to become available Completed - /masters Fri Sep 20 17:03 UTC
✓ rbac Bootstrap Kubernetes roles and PSPs Completed - /wait Fri Sep 20 17:03 UTC
✓ coredns Configure CoreDNS Completed - /wait Fri Sep 20 17:03 UTC
✓ resources Create user-supplied Kubernetes resources Completed - /rbac Fri Sep 20 17:03 UTC
✓ export Export applications layers to Docker registries Completed - /wait Fri Sep 20 17:04 UTC
✓ ip-10-151-20-200 Populate Docker registry on master node ip-10-151-20-200 Completed 10.151.20.200 /wait Fri Sep 20 17:04 UTC
× health Wait for cluster to pass health checks Failed - /export Fri Sep 20 17:13 UTC
* runtime Install system applications Unstarted - /rbac -
* dns-app Install system application dns-app:0.3.0 Unstarted - /rbac -
* logging-app Install system application logging-app:6.0.2 Unstarted - /rbac -
* monitoring-app Install system application monitoring-app:6.0.4 Unstarted - /rbac -
* tiller-app Install system application tiller-app:6.0.0 Unstarted - /rbac -
* site Install system application site:6.0.1 Unstarted - /rbac -
* kubernetes Install system application kubernetes:6.0.1 Unstarted - /rbac -
* app Install user application Unstarted - /runtime -
* test-appliance Install application test-appliance:0.0.1 Unstarted - /runtime -
* connect-installer Connect to installer Unstarted - /runtime -
* election Enable cluster leader elections Unstarted - /app -
The /health phase ("Wait for cluster to pass health checks") has failed
not all planets have come up yet: &{unknown [{ 10.151.19.194 master offline []} { 10.151.20.200 master degraded []}]}
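For anyone reading the message above, a minimal Python sketch like the following can split it into per-node states. The layout it assumes (an overall status followed by node entries of the form `{ <address> <role> <status> [<probes>]}`) is inferred only from how the message is printed here, and `parse_planet_error` is a hypothetical helper, not part of Gravity.

```python
import re

# Assumption: each node entry in the message looks like
#   "{ <address> <role> <status> [<probes>]}"
# as in the error text above. This is inferred from the printed message,
# not from Gravity's source.
NODE_RE = re.compile(
    r"\{\s*(?P<addr>\d+\.\d+\.\d+\.\d+)\s+(?P<role>\S+)\s+(?P<status>\S+)\s+\[[^\]]*\]\}"
)
# Assumption: the overall status is the first word after "&{".
CLUSTER_RE = re.compile(r"&\{(?P<status>\w+)\s+\[")


def parse_planet_error(message):
    """Extract the overall status and per-node states from the error text."""
    cluster = CLUSTER_RE.search(message)
    nodes = [m.groupdict() for m in NODE_RE.finditer(message)]
    return {
        "cluster": cluster.group("status") if cluster else None,
        "nodes": nodes,
    }


if __name__ == "__main__":
    msg = (
        "not all planets have come up yet: &{unknown [{ 10.151.19.194 master "
        "offline []} { 10.151.20.200 master degraded []}]}"
    )
    summary = parse_planet_error(msg)
    print("cluster:", summary["cluster"])
    for node in summary["nodes"]:
        print(node["addr"], node["role"], node["status"])
```

Read this way, the message reports two master nodes: 10.151.19.194 as offline and 10.151.20.200 as degraded.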
Planet status
{"nodes":[{"name":"10_151_19_194.awesomeleakey7861","member_status":{"name":"10_151_19_194.awesomeleakey7861","addr":"10.151.19.194:7496","status":"alive","tags":{"publicip":"10.151.19.194","role":"master"}}},{"name":"10_151_20_200.youthfulshannon8034","member_status":{"name":"10_151_20_200.youthfulshannon8034","addr":"10.151.20.200:7496","status":"alive","tags":{"publicip":"10.151.20.200","role":"master"}},"status":"degraded","probes":[{"checker":"br-netfilter","status":"running"},{"checker":"docker","status":"running"},{"checker":"ip-forward","status":"running"},{"checker":"disk-space","detail":"disk utilization on /var/lib/gravity is below 80 percent (55 GB is available out of 83 GB)","status":"running","checker_data":"eyJoaWdoX3dhdGVybWFyayI6ODAsInBhdGgiOiIvdmFyL2xpYi9ncmF2aXR5IiwidG90YWxfYnl0ZXMiOjgzMjA0MTQxMDU2LCJhdmFpbGFibGVfYnl0ZXMiOjU0Nzg3MzEzNjY0fQ=="},{"checker":"etcd-healthz","status":"running"},{"checker":"dns","status":"running"},{"checker":"ping-checker","status":"running"},{"checker":"kube-apiserver","status":"running"},{"checker":"nodestatus","status":"running"},{"checker":"docker-registry","status":"running"},{"checker":"system-version","detail":"Linux ip-10-151-20-200 4.4.0-1094-aws #105-Ubuntu SMP Mon Sep 16 13:08:01 UTC 2019 x86_64 GNU/Linux\n","status":"running"},{"checker":"systemd-version","detail":"systemd 241 (241)\n+PAM +AUDIT +SELINUX +IMA +APPARMOR +SMACK +SYSVINIT +UTMP +LIBCRYPTSETUP +GCRYPT +GNUTLS +ACL +XZ +LZ4 +SECCOMP +BLKID +ELFUTILS +KMOD -IDN2 +IDN -PCRE2 default-hierarchy=hybrid\n","status":"running"},{"checker":"docker-version","detail":"Containers: 0\n Running: 0\n Paused: 0\n Stopped: 0\nImages: 2\nServer Version: 18.09.5\nStorage Driver: overlay2\n Backing Filesystem: extfs\n Supports d_type: true\n Native Overlay Diff: true\nLogging Driver: json-file\nCgroup Driver: cgroupfs\nPlugins:\n Volume: local\n Network: bridge host macvlan null overlay\n Log: awslogs fluentd gcplogs gelf journald json-file local logentries splunk 
syslog\nSwarm: inactive\nRuntimes: runc\nDefault Runtime: runc\nInit Binary: docker-init\ncontainerd version: bb71b10fd8f58240ca47fbb579b9d1028eea7c84\nrunc version: 2b18fe1d885ee5083ef9f0838fee39b62d653e30\ninit version: fec3683\nSecurity Options:\n seccomp\n Profile: default\nKernel Version: 4.4.0-1094-aws\nOperating System: Debian GNU/Linux 9 (stretch)\nOSType: linux\nArchitecture: x86_64\nCPUs: 4\nTotal Memory: 15.67GiB\nName: ip-10-151-20-200\nID: CHHW:54CN:XNYI:A6XL:UP62:DVJW:VDC3:WUIV:FHS3:RYXT:4ZYM:OXKG\nDocker Root Dir: /ext/docker\nDebug Mode (client): false\nDebug Mode (server): false\nNo Proxy: 0.0.0.0/0,.local\nRegistry: https://index.docker.io/v1/\nLabels:\nExperimental: false\nInsecure Registries:\n 127.0.0.0/8\nLive Restore Enabled: false\nProduct License: Community Engine\n\nWARNING: No swap limit support\n","status":"running"},{"checker":"etcd-version","detail":"etcd Version: 3.3.12\nGit SHA: d57e8b8\nGo Version: go1.10.8\nGo OS/Arch: linux/amd64\n","status":"running"},{"checker":"kubelet-version","detail":"Kubernetes v1.14.2\n","status":"running"},{"checker":"coredns-version","detail":"CoreDNS-1.3.1\nlinux/amd64, go1.11.4, 6b56a9c\n","status":"running"},{"checker":"dbus-version","detail":"D-Bus Message Bus Daemon 1.10.28\nCopyright (C) 2002, 2003 Red Hat, Inc., CodeFactory AB, and others\nThis is free software; see the source for copying conditions.\nThere is NO warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.\n","status":"running"},{"checker":"serf-version","detail":"Serf v0.8.0\nAgent Protocol: 4 (Understands back to: 2)\n","status":"running"},{"checker":"flanneld-version","detail":"0.5.3+git\n","status":"running"},{"checker":"registry-version","detail":"/usr/bin/registry planet/docker/distribution v2.7.1-gravitational\n","status":"running"}]}],"timestamp":"2019-09-20T17:12:32.703532082Z"}[ERROR]: status degraded
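The status JSON above is hard to read as a single blob. A rough Python sketch such as the one below can summarize it per node. The field names (`nodes`, `member_status`, `status`, `probes`, `checker`, `checker_data`) are taken from the output above; the helper names are hypothetical, and the assumption that `checker_data` is base64-encoded JSON is only a guess from its appearance.

```python
import base64
import json


def summarize_planet_status(status):
    """Return one summary dict per node in a planet status document."""
    summaries = []
    for node in status.get("nodes", []):
        probes = node.get("probes", [])
        summaries.append({
            "name": node.get("name"),
            "member": node.get("member_status", {}).get("status"),
            # A node entry without "status" carries no health report at all.
            "status": node.get("status", "<no status reported>"),
            "probe_count": len(probes),
            "not_running": [
                p["checker"] for p in probes if p.get("status") != "running"
            ],
        })
    return summaries


def decode_checker_data(probe):
    """Decode checker_data, assuming it holds base64-encoded JSON."""
    raw = probe.get("checker_data")
    return json.loads(base64.b64decode(raw)) if raw else None


if __name__ == "__main__":
    # Trimmed sample mirroring the shape of the status above. In practice the
    # full JSON would be loaded from a file with json.load().
    sample = {
        "nodes": [
            {"name": "10_151_19_194.awesomeleakey7861",
             "member_status": {"status": "alive"}},
            {"name": "10_151_20_200.youthfulshannon8034",
             "member_status": {"status": "alive"},
             "status": "degraded",
             "probes": [{"checker": "docker", "status": "running"},
                        {"checker": "etcd-healthz", "status": "running"}]},
        ]
    }
    for summary in summarize_planet_status(sample):
        print(summary)
```

Two things stand out in the status above. The entry for 10_151_19_194.awesomeleakey7861 has a `member_status` of `alive` but no `status` and no `probes`. The entry for 10_151_20_200.youthfulshannon8034 is `degraded` even though every probe it lists reports `running`. The plan also bootstraps only ip-10-151-20-200, so the second member may be worth investigating first. To run the sketch on the real output, the JSON would need to be saved intact, without the line break inside the `docker-version` detail and without the trailing `[ERROR]: status degraded` text.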
In Gravity system logs
2019-09-20T17:12:58Z DEBU Unsuccessful attempt 99/100: not all planets have come up yet: &{degraded []}, retry in 5s. utils/logginghook.go:56
2019-09-20T17:13:03Z DEBU Unsuccessful attempt 100/100: not all planets have come up yet: &{unknown [{ 10.151.19.194 master offline []} { 10.151.20.200 master degraded []}]}, retry in 5s. utils/logginghook.go:56
2019-09-20T17:13:04Z DEBU [KEYGEN] generated user key for [root] with expiry on (1569035584) 2019-09-21 03:13:04.859572377 +0000 UTC m=+36724.079232598 utils/logginghook.go:56
2019-09-20T17:13:04Z INFO [CA] Generating TLS certificate {0x6065a68 0xc0001b8260 CN=opscenter@gravitational.io,O=@teleadmin+O=default-implicit-role,L=root 2019-09-21 03:13:04.863799398 +0000 UTC []}. common_name:opscenter@gravitational.io dns_names:[] locality:[root] not_after:2019-09-21 03:13:04.863799398 +0000 UTC org:[@teleadmin default-implicit-role] org_unit:[] utils/logginghook.go:56
2019-09-20T17:13:04Z DEBU [TELEPROXY] Renewed certificate for opscenter@gravitational.io. utils/logginghook.go:56
2019-09-20T17:13:08Z WARN All attempts failed. error:[
  ERROR REPORT:
  Original Error: *trace.BadParameterError not all planets have come up yet: &{unknown [{ 10.151.19.194 master offline []} { 10.151.20.200 master degraded []}]}
  Stack Trace:
    /gopath/src/github.com/gravitational/gravity/lib/install/phases/postsystem.go:177 github.com/gravitational/gravity/lib/install/phases.(*healthExecutor).Execute.func1
    /gopath/src/github.com/gravitational/gravity/lib/utils/retry.go:88 github.com/gravitational/gravity/lib/utils.Retry
    /gopath/src/github.com/gravitational/gravity/lib/install/phases/postsystem.go:168 github.com/gravitational/gravity/lib/install/phases.(*healthExecutor).Execute
    /gopath/src/github.com/gravitational/gravity/lib/fsm/fsm.go:453 github.com/gravitational/gravity/lib/fsm.(*FSM).executeOnePhase
    /gopath/src/github.com/gravitational/gravity/lib/fsm/fsm.go:385 github.com/gravitational/gravity/lib/fsm.(*FSM).executePhaseLocally
    /gopath/src/github.com/gravitational/gravity/lib/fsm/fsm.go:345 github.com/gravitational/gravity/lib/fsm.(*FSM).executePhase
    /gopath/src/github.com/gravitational/gravity/lib/fsm/fsm.go:206 github.com/gravitational/gravity/lib/fsm.(*FSM).ExecutePhase
    /gopath/src/github.com/gravitational/gravity/lib/fsm/fsm.go:163 github.com/gravitational/gravity/lib/fsm.(*FSM).ExecutePlan
    /gopath/src/github.com/gravitational/gravity/lib/install/operation.go:81 github.com/gravitational/gravity/lib/install.(*Installer).ExecuteOperation
    /gopath/src/github.com/gravitational/gravity/lib/install/engine/cli/cli.go:111 github.com/gravitational/gravity/lib/install/engine/cli.(*Engine).execute
    /gopath/src/github.com/gravitational/gravity/lib/install/engine/cli/cli.go:80 github.com/gravitational/gravity/lib/install/engine/cli.(*Engine).Execute
    /gopath/src/github.com/gravitational/gravity/lib/install/install.go:263 github.com/gravitational/gravity/lib/install.(*Installer).execute
    /gopath/src/github.com/gravitational/gravity/lib/install/install.go:204 github.com/gravitational/gravity/lib/install.(*Installer).startExecuteLoop.func1
    /go/src/runtime/asm_amd64.s:1333 runtime.goexit
  User Message: not all planets have come up yet: &{unknown [{ 10.151.19.194 master offline []} { 10.151.20.200 master degraded []}]}
] utils/logginghook.go:56
2019-09-20T17:13:08Z ERRO Phase execution failed: not all planets have come up yet: &{unknown [{ 10.151.19.194 master offline []} { 10.151.20.200 master degraded []}]}. phase:/health utils/logginghook.go:56
2019-09-20T17:13:08Z DEBU [FSM:INSTA] Applied StateChange(Phase=/health, State=failed, Error=not all planets have come up yet: &{unknown [{ 10.151.19.194 master offline []} { 10.151.20.200 master degraded []}]}). opid:77aede92-9d19-4c66-822c-1ff6869f32c7 utils/logginghook.go:56
2019-09-20T17:13:08Z WARN [INSTALLER] Failed to execute operation plan. error:[
  ERROR REPORT:
  Original Error: *trace.BadParameterError not all planets have come up yet: &{unknown [{ 10.151.19.194 master offline []} { 10.151.20.200 master degraded []}]}
  Stack Trace:
    /gopath/src/github.com/gravitational/gravity/lib/install/phases/postsystem.go:177 github.com/gravitational/gravity/lib/install/phases.(*healthExecutor).Execute.func1
    /gopath/src/github.com/gravitational/gravity/lib/utils/retry.go:88 github.com/gravitational/gravity/lib/utils.Retry
    /gopath/src/github.com/gravitational/gravity/lib/install/phases/postsystem.go:168 github.com/gravitational/gravity/lib/install/phases.(*healthExecutor).Execute
    /gopath/src/github.com/gravitational/gravity/lib/fsm/fsm.go:453 github.com/gravitational/gravity/lib/fsm.(*FSM).executeOnePhase
    /gopath/src/github.com/gravitational/gravity/lib/fsm/fsm.go:385 github.com/gravitational/gravity/lib/fsm.(*FSM).executePhaseLocally
    /gopath/src/github.com/gravitational/gravity/lib/fsm/fsm.go:345 github.com/gravitational/gravity/lib/fsm.(*FSM).executePhase
    /gopath/src/github.com/gravitational/gravity/lib/fsm/fsm.go:206 github.com/gravitational/gravity/lib/fsm.(*FSM).ExecutePhase
    /gopath/src/github.com/gravitational/gravity/lib/fsm/fsm.go:163 github.com/gravitational/gravity/lib/fsm.(*FSM).ExecutePlan
    /gopath/src/github.com/gravitational/gravity/lib/install/operation.go:81 github.com/gravitational/gravity/lib/install.(*Installer).ExecuteOperation
    /gopath/src/github.com/gravitational/gravity/lib/install/engine/cli/cli.go:111 github.com/gravitational/gravity/lib/install/engine/cli.(*Engine).execute
    /gopath/src/github.com/gravitational/gravity/lib/install/engine/cli/cli.go:80 github.com/gravitational/gravity/lib/install/engine/cli.(*Engine).Execute
    /gopath/src/github.com/gravitational/gravity/lib/install/install.go:263 github.com/gravitational/gravity/lib/install.(*Installer).execute
    /gopath/src/github.com/gravitational/gravity/lib/install/install.go:204 github.com/gravitational/gravity/lib/install.(*Installer).startExecuteLoop.func1
    /go/src/runtime/asm_amd64.s:1333 runtime.goexit
  User Message: not all planets have come up yet: &{unknown [{ 10.151.19.194 master offline []} { 10.151.20.200 master degraded []}]}, failed to execute phase "/health"
] utils/logginghook.go:56