My KubeCon China 2023 Summary

First of all, I would like to thank the karmada community for providing the tickets to KubeCon, and I met with core contributors and maintainers of karmada such as zhen chang, hongcai Ren, and Wei jiang.

In previous years, attending the technology conference left me with no deep impression and no harvest. This time I forced myself to record it, deepen my impression and summarize my gains.

I am interested in colocation, so the sharing I listened to is basically related to this.

updated: all of the kubeCon china 2023 videos is released. youtube video list, WeChat subscription account

slides: https://kccncosschn2023.sched.com/?iframe=no

istioCon china 2023 slides: https://istioconchina2023.sched.com/ https://github.com/cloudnativeto/academy/tree/master/istiocon-china-2023

使用KubeRay和Kueue在Kubernetes中托管Sailing Ray工作负载 | Sailing Ray Workloads with KubeRay and Kueue in Kubernetes - Jason Hu, Volcano Engine & Kante Yin, DaoCloud

video: YouTube WeChat subscription account

Previously, I only knew that kueue was a project of the Kubernetes Sig Group, which manages job queues. This time, I learned about its working principle based on MutatingWebhook, which intercepts various job resources for queue management and adds NodeAffinity.

image-20230929175613267

image-20230929180013728

image-20230929180716881

image-20230929180755335

controller-runtime每天的新功能–使您的Operator更高效 | What’s New on Controller-Runtime of the Day – Make Your Operator More Efficient - Siyu Wang, Alibaba Cloud

video: YouTube WeChat subscription

It mainly discusses new features and some pitfalls between 0.14 and 0.16. Controller-runtime is a framework for developing various controllers and operators, but it lacks comprehensive documentation. To understand the updates in each version, one needs to dig into the code themselves.

I discussed with Jiuzhu on-site that only the Name and Namespace fields can be saved in the workQueue, which is not very friendly for certain scenarios. For example, in the case of associations with deployments or pods related to VPA (Vertical Pod Autoscaler), writing code using this framework would become quite complex. It was mentioned that the community is unlikely to change this design pattern.

Pitfalls in Version Compatibility

image-20230929183208775

Support for Multiple Clusters

Abstract the concept of clusters; one manager can handle multiple clusters, and one controller corresponds to multiple clusters.

image-20230929183403165

Disabling DeepCopy

Address excessive memory usage. In version 1.28, Reflector has a similar optimization as well. GitHub Pull Request

image-20230929183825750

Architecture Diagrams for Various Components

image-20230929184923201

Controller Mesh

Enables controller gray releases and monitoring and circuit breakers.

image-20230929185255761

image-20230929185353505

image-20230929185424986

image-20230929185454928

项目更新和深入探讨:containerd | Project Update and Deep Dive: Containerd - Wei Fu, Microsoft & Iceber Gu, DaoCloud

video: YouTube Wechat subscription

Introduction to the new features of containerd version 1.7.

1.6 is the First LTS Version

image-20230929204638473

Changes in 1.7

image-20230929204831543

Release Plan for 2.0

image-20230929210005252

WebAssembly (wasm)

image-20230929210300022

使用Volcano更轻松高效地在Kubernetes上运行您的AI工作负载和微服务 | Run Your AI Workloads and Microservices on Kubernetes More Easily and Efficiently with Volcano - William Wang, Huawei Cloud

video: YouTube Wechat subscription

It introduced the functions of Volcano, its use cases, and future plans. It addresses various issues related to running big data on Kubernetes, which is very helpful. Further research will be conducted.

Regarding the difference between Volcano and Kueue, inquiries were made to Kueue’s maintainer, Kante Yin. Kueue is designed to solve queue-related problems but is not implemented on the scheduler; it is an independent component. Volcano, on the other hand, solves queue-related issues within the scheduler itself. They share some similar functionalities. Koordinator also has similar features to Volcano, particularly in offline hybrid deployments.

no slide

基于生产案例详细解析和重现Istio访问日志的各种应答标记 | Detailed Parse and Reproduce Response Flags of Istio Access Log Based on Production Use Case - Chaomeng Zhang, Huawei

video: YouTube Wechat subscription

It introduced various meanings of the “Response Flags” field in the Istio proxy logs and their corresponding scenarios. The PowerPoint (PPT) contains a detailed summary of various intricacies, making it an excellent troubleshooting manual. Chaomeng Zhang must be quite experienced, having encountered a multitude of Envoy-related issues. It reminds me of the time when I was well-versed in various errors with Nginx.

Unfortunately, there is no slide available at the moment, but I am looking forward to updates.

构建一个主动-主动的高可用Kubernetes控制平面集群 | Building an Active-Active HA Kubernetes Control Plane Cluster - MinJie Huang & WenJie Song, DaoCloud; Jiashun Dai, SAIC General Motors

video: YouTube Wechat subscription

It introduced an active-active Kubernetes cluster high-availability solution. To be honest, I didn’t fully understand it, so I’ll have to watch the recording later and study it carefully.

image-20230929214645538

image-20230929214744302

云原生技术与文化背景:跨境最大化业务价值 | Cloud Native Technology and Cultural Context: Maximizing Business Value Across Borders - Katerina Arzhayev, SUSE

video: YouTube Wechat subscription

It introduced the differences between Eastern and Western approaches to enterprise management, primarily focusing on the Western working style. To test my English listening skills, I intentionally chose an English lecture. I found that I could understand most of it, perhaps because the speaker had very clear pronunciation. However, I had difficulty understanding the questions from someone with an Indian accent.

在Kubernetes上构建一个精细化和智能化的资源管理系统 | Building a Fine-Grained and Intelligent Resource Management System on Kubernetes - He Cao & Wei Shao, ByteDance

video: YouTube Wechat subscription

It introduced various features of Katalyst, an open-source project by ByteDance, and ByteDance’s practices in resource management. Multiple enhancements and secondary development were made to kubelet in this context, as well as kernel enhancements.

image-20230929221934438

image-20230929222014210

ByteDance also conducted secondary development on kubelet:

image-20230929222118267

Machine Learning-Based Service Profiling

image-20230929222337225

Kernel Enhancements by ByteDance

image-20230929222455243

image-20230929222609027

GPU Sharing

image-20230929222716687

Topology-Aware Scheduling

image-20230929222759689

Resource Efficiency Suite

image-20230929222848523

Kubernetes上的干扰检测和资源隔离增强的最佳实践 | Best Practice for Interference Detection and Resource Isolation Enhancement on Kubernetes - Haogang Wang, Kuaishou

video: YouTube Wechat subscription

It introduced the work done by Kuaishou in resource isolation and topology awareness. Similar work was done in projects like Koordinator, Crane, and ByteDance’s Katalyst.

image-20230929223809251

image-20230929224008953

image-20230929224228870

image-20230929225029806

只是噪音还是真正的字节?云原生中的eBPF | Just Buzz or Real Byte? eBPF in Cloud Native - Bill Mulligan, Isovalent

video: YouTube WeChat Subscription

Because I am a contributor to Cilium, I attended this presentation, which was relatively straightforward. It mainly introduced eBPF and its applications, as well as related projects. Currently, eBPF is still in its early stages and has not been widely adopted on a large scale.

After the presentation, I had a discussion with Bill Mulligan and people from VMware. Although my spoken English is not very fluent, the conversation went naturally, and I didn’t feel nervous. I plan to practice more in the future.

The attendance at this KubeCon was much lower than in 2019, and some cloud vendors no longer sponsored the event. The presentation topics didn’t seem as groundbreaking (lacking novelty), but there were still many high-quality presentations.

Related Content