Hmm weird. Can you deploy anything else on your EKS cluster? Are there other workloads running on said EKS cluster?
Usually something that is pending for so long is usually a resource issue e.g Kubernetes can’t place the work. By default the Harness Delegate needs 8gb of memory. Not sure of your machine types that are running at EKS Worker Nodes if they have enough space.
Are you using this for an example? If so I would suggest deleting the deployment/pod and giving that another whirl.
Can check out these Minikube steps for pruning the memory required [not recommended for production workloads]: “Harness K8s Steps”
I am using t3.medium - 4 GB. So I will have to upgrade to t3.xlarge - 16GB
I dont have anything else running on the cluster. It is just example/testing
If you can’t deploy that, your cluster most likely needs to be re-created.
In training class I have students all share an EKS cluster so multiple delegates running. You can re-download the delegate YAML and give another name and re-deploy since your EKS cluster is for learning,
try running kubectl get pods -n harness-delegate instead. If you aren’t changing the context before you run kubectl get pods then you will be running the command under the default namespace.
K8s version 1.14 (EKS default), used the dashboard to create the cluster (not CLI).
In AWS/EKS First you create the cluster (VPC/Subnets). Next step create and attach the nodes/node group to the Cluster. You size the node group - number of instances and size/type of instance.
How did you create your EKS Cluster? EKS for me is a funny one, if you don’t get it right the first go I never had any luck re-sizing infra, etc. The closest success I got is with EKSTL.
I downloaded new delegate with new name: harness-delegate1 - changed to 4Gi
If I try apply still get same - namespace/harness-delegate unchanged
get pods - returns nothing
kubectl delete pods harness-sample-k8s-delegate-skjrdc-0 --grace-period=0 --force
(using the old pod name)
warning: Immediate deletion does not wait for confirmation that the running resource has been terminated. The resource may continue to run on the cluster indefinitely.
Error from server (NotFound): pods “harness-sample-k8s-delegate-skjrdc-0” not found
So I have done everything except recreating the cluster itself (I did re-create the node group)
kubectl get pods -n harness-delegate
NAME READY STATUS RESTARTS AGE
harness-delegate1-skjrdc-0 0/1 Pending 0 24m
So this returned something, but still pending. What does this mean - which one is running and which one is not?
kubectl -n harness-delegate delete pods harness-sample-k8s-delegate-skjrdc-0 --grace-period=0 --force
warning: Immediate deletion does not wait for confirmation that the running resource has been terminated. The resource may continue to run on the cluster indefinitely.
pod “harness-sample-k8s-delegate-skjrdc-0” force deleted
kubectl get pods -n harness-delegate
NAME READY STATUS RESTARTS AGE
harness-delegate1-skjrdc-0 0/1 Pending 0 30m
harness-sample-k8s-delegate-skjrdc-0 1/1 Running 0 70s
So 1 still pending after 30m ??
kubectl -n harness-delegate get deployments
No resources found in harness-delegate namespace.
So maybe I will start form scratch - destroy and re-create the cluster itself.
Note: There is some limit on this site - you can only reply so many times a day, so i can only edit a post
Update
I recreated the cluster and the nodes with 16GB. It finally worked
kubectl -n harness-delegate get pods
NAME READY STATUS RESTARTS AGE
harness-delegate1-skjrdc-0 1/1 Running 0 95s
I feel like there is something up with your EKS Cluster. Depending how you created your EKS cluster, Kubernetes land is one of many interpretations. Even the AWS provided Cloud Formation Stack when kicking off an EKS cluster or modifying an EKS from the AWS Web Console
UI has a ton of nuances. Even EKS limits you to K8s 1.14, not even a 1.14.x release for bug fixes as Kubernetes 1.16.x is out.
Have you tried installing on another right-sized EKS cluster? How did you go about re-sizing your worker nodes EC2 instance sizes inside your EKS cluster?
I created a Sam-B-Test Cluster using Weave’s/Amazon’s CLI, EKSTL [https://eksctl.io/]