When you run workloads on a Kubernetes cluster, scheduling each Pod to a suitable node is a crucial part of the process. Scheduling follows specific rules and technicalities that I’d like to explore in this article. Understanding them can help you deploy to your cluster faster and more reliably.
The Pod will be running in a matter of seconds, but a lot happens in the background to make that possible.
This is the hardest part of the Scheduler’s job, as it must apply several algorithms to make this decision. Some of those algorithms depend on user-supplied options, while Kubernetes calculates others itself.
A node may be overloaded, with busy pods consuming most of its CPU and memory. So, when the scheduler has a Pod to place, it first determines whether each node has the resources the Pod requests. If a Pod were placed on a node that cannot satisfy its memory request, for example, the hosted application might behave unexpectedly or even crash.
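A Pod declares the resources it needs through resource requests in its spec; the scheduler filters out any node that cannot satisfy them. A minimal sketch (the Pod name and image are illustrative):

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: app-pod          # illustrative name
spec:
  containers:
  - name: app
    image: nginx:1.25    # illustrative image
    resources:
      requests:
        cpu: "500m"      # scheduler only considers nodes with at least 0.5 CPU free
        memory: "256Mi"  # and at least 256 MiB of allocatable memory
```

Nodes whose allocatable CPU or memory cannot cover these requests are filtered out before scoring even begins.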
In addition to true/false decisions, a.k.a. predicates, the scheduler executes scoring functions to determine which of the remaining nodes is best suited to host the pod in question.
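To make this concrete, here is a minimal sketch in the spirit of the scheduler’s “least requested” scoring, which favors nodes with more free capacity. This is a simplification for illustration, not the scheduler’s actual implementation:

```python
def least_requested_score(capacity: int, requested: int) -> float:
    """Score from 0 to 100: higher means more free capacity on the node."""
    if capacity <= 0:
        return 0.0
    return (capacity - requested) * 100.0 / capacity


def node_score(cpu_capacity: int, cpu_requested: int,
               mem_capacity: int, mem_requested: int) -> float:
    # Average the per-resource scores; the node with the highest
    # overall score wins the pod.
    return (least_requested_score(cpu_capacity, cpu_requested) +
            least_requested_score(mem_capacity, mem_requested)) / 2


# A node with half its CPU and memory already requested scores 50;
# an idle node scores 100.
print(node_score(4000, 2000, 8192, 4096))  # 50.0
print(node_score(4000, 0, 8192, 0))        # 100.0
```

In practice the scheduler combines several such scoring plugins, each with its own weight, before picking the highest-scoring node.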
After the previous steps, how does the scheduler actually decide which node will run the pod?
In busy Kubernetes clusters, the time between the Scheduler choosing the right node and the kubelet on that node starting the pod may be long enough for the cluster to change. Even if that window is only a few milliseconds, a pod may terminate on one of the nodes that was filtered out for insufficient memory. That node might have scored higher on the priority tests had it not been overloaded at the time, so a less suitable node may end up hosting the pod.
You can run a pod directly on a chosen set of nodes using the nodeSelector field, specifically .spec.nodeSelector in the pod definition. The nodeSelector matches nodes by one or more labels. However, user requirements sometimes get more complicated: a nodeSelector only selects nodes that carry all the labels defined in the field. What if you want a more flexible selection?
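A minimal nodeSelector example (the label key and value are illustrative; you would first label a node with something like `kubectl label nodes <node-name> disktype=ssd`):

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: ssd-pod         # illustrative name
spec:
  nodeSelector:
    disktype: ssd       # only nodes labeled disktype=ssd are eligible
  containers:
  - name: app
    image: nginx:1.25   # illustrative image
```

If no node carries every label listed under nodeSelector, the pod stays Pending.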
Node affinity lets you attach pods to nodes based on rules you define over node labels. You can label nodes by disk type, CPU class, or other traits, and then write affinity rules that find nodes of that specific nature.
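Affinity rules come in a hard form (`requiredDuringSchedulingIgnoredDuringExecution`) and a soft, weighted form (`preferredDuringSchedulingIgnoredDuringExecution`). A sketch combining both; the label keys and values are illustrative:

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: affinity-pod    # illustrative name
spec:
  affinity:
    nodeAffinity:
      # Hard rule: the node MUST have disktype=ssd or disktype=nvme.
      requiredDuringSchedulingIgnoredDuringExecution:
        nodeSelectorTerms:
        - matchExpressions:
          - key: disktype
            operator: In          # matches any of the listed values
            values: ["ssd", "nvme"]
      # Soft rule: among qualifying nodes, prefer cpu-class=high.
      preferredDuringSchedulingIgnoredDuringExecution:
      - weight: 1
        preference:
          matchExpressions:
          - key: cpu-class
            operator: In
            values: ["high"]
  containers:
  - name: app
    image: nginx:1.25   # illustrative image
```

Operators such as `In`, `NotIn`, `Exists`, and `Gt` make this far more expressive than a plain nodeSelector, which can only demand exact label matches.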
Some scenarios require that one or more nodes be reserved for particular pods. Think of the nodes that host your monitoring stack. Those nodes may be provisioned with limited resources, given the nature of their role. Thus, if pods other than the monitoring ones are scheduled to those nodes, they can starve the monitoring workload and also suffer degraded performance themselves. In such a case, you need node anti-affinity to keep pods away from a set of nodes.
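In Kubernetes, node anti-affinity is expressed through node affinity with a negating operator such as `NotIn` or `DoesNotExist`. A sketch that keeps a regular workload off nodes labeled for monitoring (the label key and values are illustrative):

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: regular-workload   # illustrative name
spec:
  affinity:
    nodeAffinity:
      requiredDuringSchedulingIgnoredDuringExecution:
        nodeSelectorTerms:
        - matchExpressions:
          - key: role
            operator: NotIn        # repel this pod from monitoring nodes
            values: ["monitoring"]
  containers:
  - name: app
    image: nginx:1.25   # illustrative image
```

Note that this only works if every non-monitoring pod carries the rule; to enforce the reservation from the node’s side instead, taints and tolerations are the usual mechanism.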
Hopefully, you’ve come to understand how the Kubernetes Scheduler works and how you can take advantage of it for your cluster deployments. The Kubernetes documentation offers more examples and commands to explore.