How We Architected and Run Kubernetes on OpenStack at Scale at Yahoo! JAPAN

Monday, October 24, 2016

How We Architected and Run Kubernetes on OpenStack at Scale at Yahoo! JAPAN

_Editor’s note: today’s post is by the Infrastructure Engineering team at Yahoo! JAPAN, talking about how they run OpenStack on Kubernetes. This post has been translated and edited for context with permission – originally published on the Yahoo! JAPAN engineering blog. _

Intro
This post outlines how Yahoo! JAPAN, with help from Google and Solinea, built an automation tool chain for “one-click” code deployment to Kubernetes running on OpenStack.

We’ll also cover the basic security, networking, storage, and performance needs to ensure production readiness.

Finally, we will discuss the ecosystem tools used to build the CI/CD pipeline, Kubernetes as a deployment platform on VMs/bare metal, and an overview of Kubernetes architecture to help you architect and deploy your own clusters.

Preface
Since our company started using OpenStack in 2012, our internal environment has changed quickly. Our initial goal of virtualizing hardware was achieved with OpenStack. However, due to the progress of cloud and container technology, we needed the capability to launch services on various platforms. This post will provide our example of taking applications running on OpenStack and porting them to Kubernetes.

Coding Lifecycle
The goal of this project is to create images for all required platforms from one application code, and deploy those images onto each platform. For example, when code is changed at the code registry, bare metal images, Docker containers and VM images are created by CI (continuous integration) tools, pushed into our image registry, then deployed to each infrastructure platform.

We use following products in our CICD pipeline:

Function	Product
Code registry	GitHub Enterprise
CI tools	Jenkins
Image registry	Artifactory
Bug tracking system	JIRA
deploying Bare metal platform	OpenStack Ironic
deploying VM platform	OpenStack
deploying container platform	Kubernetes

Image Creation. Each image creation workflow is shown in the next diagram.

VM Image Creation :

1.push code to GitHub
2.hook to Jenkins master
3.Launch job at Jenkins slave
4.checkout Packer repository
5.Run Service Job
6.Execute Packer by build script
7.Packer start VM for OpenStack Glance
8.Configure VM and install required applications
9.create snapshot and register to glance 10.10.Download the new created image from Glance 11.11.Upload the image to Artifactory

Bare Metal Image Creation:

1.push code to GitHub
2.hook to Jenkins master
3.Launch job at Jenkins slave
4.checkout Packer repository
5.Run Service Job
6.Download base bare metal image by build script
7.build script execute diskimage-builder with Packer to create bare metal image
8.Upload new created image to Glance
9.Upload the image to Artifactory

Container Image Creation:

1.push code to GitHub
2.hook to Jenkins master
3.Launch job at Jenkins slave
4.checkout Dockerfile repository
5.Run Service Job
6.Download base docker image from Artifactory
7.If no docker image found at Artifactory, download from Docker Hub
8.Execute docker build and create image
9.Upload the image to Artifactory

Platform Architecture.

Let’s focus on the container workflow to walk through how we use Kubernetes as a deployment platform. This platform architecture is as below.

Function	Product
Infrastructure Services	OpenStack
Container Host	CentOS
Container Cluster Manager	Kubernetes
Container Networking	Project Calico
Container Engine	Docker
Container Registry	Artifactory
Service Registry	etcd
Source Code Management	GitHub Enterprise
CI tool	Jenkins
Infrastructure Provisioning	Terraform
Logging	Fluentd, Elasticsearch, Kibana
Metrics	Heapster, Influxdb, Grafana
Service Monitoring	Prometheus

We use CentOS for Container Host (OpenStack instances) and install Docker, Kubernetes, Calico, etcd and so on. Of course, it is possible to run various container applications on Kubernetes. In fact, we run OpenStack as one of those applications. That’s right, OpenStack on Kubernetes on OpenStack. We currently have more than 30 OpenStack clusters, that quickly become hard to manage and operate. As such, we wanted to create a simple, base OpenStack cluster to provide the basic functionality needed for Kubernetes and make our OpenStack environment easier to manage.

Kubernetes Architecture

Let me explain Kubernetes architecture in some more detail. The architecture diagram is below.

Tenant Isolation To enable multi-tenant usage like OpenStack, we utilize OpenStack Keystone for authentication and authorization.

Authentication With a Kubernetes plugin, OpenStack Keystone can be used for Authentication. By Adding authURL of Keystone at startup Kubernetes API server, we can use OpenStack OS_USERNAME and OS_PASSWORD for Authentication. Authorization We currently use the ABAC (Attribute-Based Access Control) mode of Kubernetes Authorization. We worked with a consulting company, Solinea, who helped create a utility to convert OpenStack Keystone user and tenant information to Kubernetes JSON policy file that maps Kubernetes ABAC user and namespace information to OpenStack tenants. We then specify that policy file when launching Kubernetes API Server. This utility also creates namespaces from tenant information. These configurations enable Kubernetes to authenticate with OpenStack Keystone and operate in authorized namespaces. Volumes and Data Persistence Kubernetes provides “Persistent Volumes” subsystem which works as persistent storage for Pods. “Persistent Volumes” is capable to support cloud-provider storage, it is possible to utilize OpenStack cinder-volume by using OpenStack as cloud provider. Networking Flannel and various networking exists as networking model for Kubernetes, we used Project Calico for this project. Yahoo! JAPAN recommends to build data center with pure L3 networking like redistribute ARP validation or IP CLOS networking, Project Calico matches this direction. When we apply overlay model like Flannel, we cannot access to Pod IP from outside of Kubernetes clusters. But Project Calico makes it possible. We also use Project Calico for Load Balancing we describe later.

In Project Calico, broadcast production IP by BGP working on BIRD containers (OSS routing software) launched on each nodes of Kubernetes. By default, it broadcast in cluster only. By setting peering routers outside of clusters, it makes it possible to access a Pod from outside of the clusters. External Service Load Balancing

There are multiple choices of external service load balancers (access to services from outside of clusters) for Kubernetes such as NodePort, LoadBalancer and Ingress. We could not find solution which exactly matches our requirements. However, we found a solution that almost matches our requirements by broadcasting Cluster IP used for Internal Service Load Balancing (access to services from inside of clusters) with Project Calico BGP which enable External Load Balancing at Layer 4 from outside of clusters.

Service Discovery

Service Discovery is possible at Kubernetes by using SkyDNS addon. This is provided as cluster internal service, it is accessible in cluster like ClusterIP. By broadcasting ClusterIP by BGP, name resolution works from outside of clusters. By combination of Image creation workflow and Kubernetes, we built the following tool chain which makes it easy from code push to deployment.

Summary

In summary, by combining Image creation workflows and Kubernetes, Yahoo! JAPAN, with help from Google and Solinea, successfully built an automated tool chain which makes it easy to go from code push to deployment, while taking multi-tenancy, authn/authz, storage, networking, service discovery and other necessary factors for production deployment. We hope you found the discussion of ecosystem tools used to build the CI/CD pipeline, Kubernetes as a deployment platform on VMs/bare-metal, and the overview of Kubernetes architecture to help you architect and deploy your own clusters. Thank you to all of the people who helped with this project. –Norifumi Matsuya, Hirotaka Ichikawa, Masaharu Miyamoto and Yuta Kinoshita. _This post has been translated and edited for context with permission – originally published on the Yahoo! JAPAN engineer blog where this was one in a series of posts focused on Kubernetes._

Kubernetes 1.11: In-Cluster Load Balancing and CoreDNS Plugin Graduate to General Availability Jun 27
Dynamic Ingress in Kubernetes Jun 7
4 Years of K8s Jun 6
Say Hello to Discuss Kubernetes May 30
Introducing kustomize; Template-free Configuration Customization for Kubernetes May 29
Kubernetes Containerd Integration Goes GA May 24
Getting to Know Kubevirt May 22
Gardener - The Kubernetes Botanist May 17
Docs are Migrating from Jekyll to Hugo May 5
Announcing Kubeflow 0.1 May 4
Current State of Policy in Kubernetes May 2
Developing on Kubernetes May 1
Zero-downtime Deployment in Kubernetes with Jenkins Apr 30
Kubernetes Community - Top of the Open Source Charts in 2017 Apr 25
Kubernetes Application Survey 2018 Results Apr 24
Local Persistent Volumes for Kubernetes Goes Beta Apr 13
Migrating the Kubernetes Blog Apr 11
Container Storage Interface (CSI) for Kubernetes Goes Beta Apr 10
Fixing the Subpath Volume Vulnerability in Kubernetes Apr 4
Kubernetes 1.10: Stabilizing Storage, Security, and Networking Mar 26
Principles of Container-based Application Design Mar 15
Expanding User Support with Office Hours Mar 14
How to Integrate RollingUpdate Strategy for TPR in Kubernetes Mar 13
Apache Spark 2.3 with Native Kubernetes Support Mar 6
Kubernetes: First Beta Version of Kubernetes 1.10 is Here Mar 2
Reporting Errors from Control Plane to Applications Using Kubernetes Events Jan 25
Core Workloads API GA Jan 15
Introducing client-go version 6 Jan 12
Extensible Admission is Beta Jan 11
Introducing Container Storage Interface (CSI) Alpha for Kubernetes Jan 10
Kubernetes v1.9 releases beta support for Windows Server Containers Jan 9
Five Days of Kubernetes 1.9 Jan 8

Creating a Raspberry Pi cluster running Kubernetes, the installation (Part 2) Dec 22
Managing Kubernetes Pods, Services and Replication Controllers with Puppet Dec 17
How Weave built a multi-deployment solution for Scope using Kubernetes Dec 12
Creating a Raspberry Pi cluster running Kubernetes, the shopping list (Part 1) Nov 25
Monitoring Kubernetes with Sysdig Nov 19
One million requests per second: Dependable and dynamic distributed systems at scale Nov 11
Kubernetes 1.1 Performance upgrades, improved tooling and a growing community Nov 9
Kubernetes as Foundation for Cloud Native PaaS Nov 3
Some things you didn’t know about kubectl Oct 28
Kubernetes Performance Measurements and Roadmap Sep 10
Using Kubernetes Namespaces to Manage Environments Aug 28
Weekly Kubernetes Community Hangout Notes - July 31 2015 Aug 4
The Growing Kubernetes Ecosystem Jul 24
Weekly Kubernetes Community Hangout Notes - July 17 2015 Jul 23
Strong, Simple SSL for Kubernetes Services Jul 14
Weekly Kubernetes Community Hangout Notes - July 10 2015 Jul 13
Announcing the First Kubernetes Enterprise Training Course Jul 8
Kubernetes 1.0 Launch Event at OSCON Jul 2
How did the Quake demo from DockerCon Work? Jul 2
The Distributed System ToolKit: Patterns for Composite Containers Jun 29
Slides: Cluster Management with Kubernetes, talk given at the University of Edinburgh Jun 26
Cluster Level Logging with Kubernetes Jun 11
Weekly Kubernetes Community Hangout Notes - May 22 2015 Jun 2
Kubernetes on OpenStack May 19
Weekly Kubernetes Community Hangout Notes - May 15 2015 May 18
Docker and Kubernetes and AppC May 18
Kubernetes Release: 0.17.0 May 15
Resource Usage Monitoring in Kubernetes May 12
Weekly Kubernetes Community Hangout Notes - May 1 2015 May 11
Kubernetes Release: 0.16.0 May 11
AppC Support for Kubernetes through RKT May 4
Weekly Kubernetes Community Hangout Notes - April 24 2015 Apr 30
Borg: The Predecessor to Kubernetes Apr 23
Kubernetes and the Mesosphere DCOS Apr 22
Weekly Kubernetes Community Hangout Notes - April 17 2015 Apr 17
Kubernetes Release: 0.15.0 Apr 16
Introducing Kubernetes API Version v1beta3 Apr 16
Weekly Kubernetes Community Hangout Notes - April 10 2015 Apr 11
Faster than a speeding Latte Apr 6
Weekly Kubernetes Community Hangout Notes - April 3 2015 Apr 4
Paricipate in a Kubernetes User Experience Study Mar 31
Weekly Kubernetes Community Hangout Notes - March 27 2015 Mar 28
Kubernetes Gathering Videos Mar 23
Welcome to the Kubernetes Blog! Mar 20

Kubernetes Blog

Monday, October 24, 2016

How We Architected and Run Kubernetes on OpenStack at Scale at Yahoo! JAPAN

« Prev

Next >>

2018

2017

2016

2015