## Installation Tools

Installing and configuring Kubernetes can be approached in several ways. Depending on your goals, you might choose a cloud service, a local development environment, or a more manual installation method.

### Cloud Services

1. **Google Kubernetes Engine (GKE)**: A managed service from Google Cloud Platform that lets you create and manage Kubernetes clusters easily. GKE keeps you on a recent stable version of Kubernetes and is an excellent way to get started quickly.
2. **Amazon Elastic Kubernetes Service (EKS)**: AWS's managed Kubernetes service, in which AWS runs the control plane for you while you manage the worker nodes.

### Local Development

1. **Minikube**: A local Kubernetes solution that deploys a single-node cluster on your laptop using a VM or container driver such as VirtualBox or Docker. Minikube is great for learning, testing, and development.
2. **MicroK8s**: Developed by Canonical, MicroK8s offers a lightweight, single-command installation on Ubuntu.

To interact with your Kubernetes cluster, you will need the Kubernetes command-line tool, **kubectl**. This powerful CLI allows you to create, manage, and delete Kubernetes resources.

### Using kubeadm

In this course, we will focus on **kubeadm**, a tool that simplifies Kubernetes installation. The process involves two main commands:

- `kubeadm init`: Run on the control plane node to initialize the cluster.
- `kubeadm join`: Run on worker nodes to join them to the cluster.

Other tools like **kubespray** and **kops** can also create Kubernetes clusters on various cloud providers, offering more customization options.

## Installing kubectl

To manage your cluster, you will primarily use **kubectl**. Available in most Linux distribution repositories, **kubectl** can also be downloaded and compiled from source if needed. It uses `$HOME/.kube/config` as its configuration file, containing cluster definitions, credentials, and contexts.
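A kubeconfig file ties those three pieces together. The sketch below shows the general shape of the file; every name, path, and the server address are placeholder values, not from a real cluster:

```yaml
apiVersion: v1
kind: Config
clusters:                      # cluster definitions: API server endpoints
- name: my-cluster
  cluster:
    server: https://203.0.113.10:6443
    certificate-authority: /home/student/.kube/ca.crt
users:                         # credentials used to authenticate
- name: my-user
  user:
    client-certificate: /home/student/.kube/client.crt
    client-key: /home/student/.kube/client.key
contexts:                      # a context pairs a cluster with a user
- name: my-context
  context:
    cluster: my-cluster
    user: my-user
current-context: my-context    # the context kubectl uses by default
```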
You can switch contexts with:

```bash
$ kubectl config use-context my-context
```

## Using Google Kubernetes Engine (GKE)

Google rigorously tests each Kubernetes release and makes it available via GKE. To use GKE, you need a Google Cloud account, a payment method, and the **gcloud** command-line tool. Follow the [GKE quickstart guide](https://cloud.google.com/kubernetes-engine/docs/quickstart) to create your first Kubernetes cluster:

```bash
$ gcloud container clusters create my-cluster
$ gcloud container clusters list
$ kubectl get nodes
```

Remember to delete your cluster when done to avoid ongoing charges:

```bash
$ gcloud container clusters delete my-cluster
```

## Using Minikube

Minikube is an excellent tool for local Kubernetes development. Install Minikube (the example below fetches the macOS binary; substitute `minikube-linux-amd64` on Linux):

```bash
$ curl -Lo minikube https://storage.googleapis.com/minikube/releases/latest/minikube-darwin-amd64
$ chmod +x minikube
$ sudo mv minikube /usr/local/bin
```

Start a local Kubernetes cluster with:

```bash
$ minikube start
$ kubectl get nodes
```

## Installing with kubeadm

Once familiar with Kubernetes using Minikube, you can start building a real cluster with **kubeadm**. Follow these steps:

1. Run `kubeadm init` on the control plane node.
2. Apply a network plugin for Pod communication.
3. Run `kubeadm join` on worker nodes.

For example, to use the Weave network plugin:

```bash
$ kubectl create -f https://git.io/weave-kube
```

## Upgrading with kubeadm

If you built your cluster with **kubeadm**, you can upgrade it using `kubeadm upgrade`. The process involves:

1. Planning the upgrade:

   ```bash
   $ kubeadm upgrade plan
   ```

2. Applying the upgrade on the control plane:

   ```bash
   $ kubeadm upgrade apply v1.21.1
   ```

3. Updating worker nodes one at a time: drain the node, upgrade its kubeadm and kubelet packages, then return it to service:

   ```bash
   $ kubectl drain <node-name>
   $ kubectl uncordon <node-name>
   ```

## Installing a Pod Network

Before initializing the cluster, you need to select a network plugin and a Pod IP range that does not conflict with your existing network. Popular options include:

1. **Calico**: A Layer 3 network providing policy management.
2. **Flannel**: A Layer 3 IPv4 network focused on inter-node communication.
3. **Kube-Router**: An all-in-one networking solution.
4. **Cilium**: A powerful network plugin built on eBPF.

Install a network plugin with `kubectl create -f <plugin-url>`.

## Additional Installation Tools

Beyond **kubeadm**, other tools can automate Kubernetes installation:

1. **kubespray**: An Ansible playbook for setting up Kubernetes clusters.
2. **kops**: A tool for creating Kubernetes clusters on AWS.
3. **kube-aws**: Uses AWS CloudFormation to provision Kubernetes.
4. **kind**: Runs Kubernetes locally using Docker.

For a step-by-step manual setup, check out [Kelsey Hightower's Kubernetes the Hard Way](https://github.com/kelseyhightower/kubernetes-the-hard-way).

## Installation Considerations

When deploying a Kubernetes cluster, consider the following:

- **Provider**: Public or private cloud, physical or virtual servers.
- **Operating System**: Choose an OS compatible with Kubernetes (e.g., Ubuntu, CentOS).
- **Networking**: Select a suitable networking solution.
- **etcd**: Determine where to run your etcd cluster.
- **High Availability**: Plan for HA configurations for control plane nodes.

## Main Deployment Configurations

You have four main deployment configurations:

1. **Single-node**: All components run on one server. Ideal for learning and development.
2. **Single control plane, multiple workers**: Suitable for small production environments.
3. **HA control plane, multiple workers**: Adds durability with multiple control plane nodes.
4. **HA etcd, HA control plane, multiple workers**: The most resilient setup, with both etcd and the control plane highly available.

Choose a configuration based on your goals and experience level.

## Compiling from Source

Kubernetes can be compiled from source using Golang.
Install Golang, clone the repository, and use `make` to build the binaries:

```bash
$ cd $GOPATH
$ git clone https://github.com/kubernetes/kubernetes
$ cd kubernetes
$ make
```

This will generate binaries in the `_output/bin` directory.

### Bridging to Machine Learning

Kubernetes is particularly powerful for machine learning workloads, especially for scaling GPU resources. Here's how Kubernetes supports ML:

1. **GPU Support**: Kubernetes supports scheduling and managing GPU workloads through device plugins such as NVIDIA's.
2. **Custom Resource Definitions (CRDs)**: Define custom resources to manage ML-specific requirements.
3. **Kubeflow**: An ML toolkit for Kubernetes, providing components for training, hyperparameter tuning, and serving models.
4. **Scalability**: Leverages Kubernetes' scaling capabilities to handle dynamic ML workloads efficiently.

By integrating Kubernetes with your machine learning pipeline, you can achieve efficient resource utilization, scalability, and seamless deployment of ML models.

### Conclusion

Kubernetes is a robust and flexible platform for managing containerized applications. Understanding its installation methods, architecture, and deployment configurations is crucial for leveraging its full potential. Moreover, using Kubernetes for machine learning workloads demonstrates its versatility and power in handling diverse and demanding use cases.

Continue: [[03-K8s Architecture]]