Category: Tech

Troubleshooting GKE Pods in CrashLoopBackOff: How to Start with Strace

Mar 29, 2025

—

by

haopengzhan

in Tech

In most real-world Kubernetes cases, infrastructure engineers don’t need to understand how a container itself works. This is reasonable because container image developers are typically responsible for it. However, a problem often arises when a container is deployed on Kubernetes, which is such a complex deployment and orchestration system. Sometimes, unexpected components can influence a…
How do we know if Kubelet leaks Inotify watchers

Apr 18, 2024

—

by

haopengzhan

in Tech

Kubelet as the node agent of Kubernetes OSS, always needs to monitor paths. Using Inotify to do so, Kubelet exposes to possibility of leaking of Inotify watchers. In recent, I observed a case where the Kubelet was hung for enormous Inotify usage. This Post briefly discussed how I debug the process and locate the problem.…
Practical debugging methods for Kubelet

Apr 10, 2024

—

by

haopengzhan

in Tech

Kubelet, a vital component in Kubernetes, runs on each node in your cluster. It acts as the field manager, receiving instructions from the Kubernetes API server and ensuring containerized applications run smoothly. Kubelet is responsible for downloading container images, pulling secrets, and launching pods – the basic units containing your application containers. It also monitors…
Selected Labs for CS350 courses in Binghamton University

Feb 22, 2022

—

by

haopengzhan

in Tech

TL; DR. This will be a series regarding labs I gave during the spring 2022 semester. The reason why I am writing this down is that it has been a week and no students have asked for the solution for the last Lab. I realize that the learning gap between students is huge, especially when…
EDDL: How do we train neural networks on limited edge devices – PART 2

Oct 31, 2021

—

by

haopengzhan

in Tech

In the last post, part1, our idea of distributed learning on edge environment was generally addressed.I introduced the reason why edge distributed learning is needed and what improvements it can achieve.In this post, I will talk about our motivation study and how our framework works.