KAO YI CHEN (YC Kao)
Senior Site Reliability Engineer
with 7+ years in backend development, Kubernetes and cloud platforms.
Experience
Machine Learning Operation Engineer
Mar. 2025 - Jul. 2025PicCollage
· The easiest photo and video editing app to add magic to your treasured memories
- Experimented and built Ray clusters using KubeRay, accelerating data-preprocessing speed by 30x.
- Redesigned multiple part of the machine-learning service lifecycle workflow — including cloning and removing features.
- Implemented an automated recovery mechanism that detects unrecoverable service states and triggers immediate restarts.
- Wrote multiple optimization plans and proposals that enhanced service manageability, efficiency, and stability.
RayKnativeKubernetesGCPCogPythonN8N
Sr. Site Reliability Engineer
Nov. 2022 - Mar. 2025Dcard
· Taiwan's leading social media platform for youth engagement
- Managed multiple kubernetes clusters with in-depth expertise in cloud platforms.
- Maintained in-house kubernetes operator and open-source kubernetes operator for effortless deployment and management of workloads.
- Refactored the user behavior event pipeline to handle 13k RPS and 120k msg/s traffic efficiently, reducing costs by integrating multiple messaging systems and implemented high-performance batching logic.
- Researched and introduced solutions for observability, resiliency, and performance improvements, including tools like Linkerd, Gateway API, and ArgoCD.
- Administered and optimized the observability stack, including Grafana, VictoriaMetrics, and Tempo.
- Acted as the first responder for incident management, collaborating with cross-functional teams to resolve critical issues.
KubernetesKubebuilderGo kitLinkerdGCPPulumiGrafanaVictoria MetricsTempo
Sr. Site Reliability Engineer
Feb. 2022 - Nov. 2022Autopass
· Innovative automotive payment solutions for seamless mobility
- Mastered the company's systems within one month, successfully completing the annual PCI-DSS external audit.
KubernetesCrossplanePulumiDrone CIGCPRuby On Rails
Backend Engineer
GoCircleCIGCPKubernetesPulumi
Dcard
· Taiwan's leading social media platform for youth engagement
Backend Engineer
Full-RemoteGCPKubernetesDroneCI
Shenlearn
· Cloud-based education management platform for learning centers
Technical Consultant
Full-RemoteAWSKubernetesGitLabCI
Leukocyte-Lab
· Premier cybersecurity guardian specializing in breach simulation
Web Development Intern
GoNode.jsGCPKubernetesReact
Dcard
· Taiwan's leading social media platform for youth engagement
Projects
gta
GTA is a command-line tool for managing temporary IAM roles across cloud providers
- Automatic role revocation on program exit or interrupt
- Configurable time-to-live (TTL) for permissions
- Unique identifiers for each temporary binding for easy cleanup
go-dataloader
A clean, safe, user-friendly implementation of GraphQL's Dataloader, written with generics in go.
- Written in generics with strong type
- Use shared memory via a communicating pattern
- Provide almost the same interface as the original Node.js Dataloader
- Implement a promise like thunk design, simply call `val, err := loader.Load(ctx, id).Get(ctx)`
go-batcher
This is a batcher inspired by GraphQL's Dataloader and yckao's dataloader implementation.
Skills
Languages
Chinese · Native
Japanese · Intermediate (JLPT N1)
English · Intermediate (TOEIC 825)
Programming Languages
JavaScript · 9+ years
TypeScript · 7+ years
Go · 7+ years
Python · ~1 years
Infrastructure
Kubernetes · 7+ years
GCP · 7+ years
Docker · 8+ years
AWS · 1 years
Observability
Grafana · 3+ years
VictoriaMetrics · 2+ years
Tempo · 2+ years
Loki · 1 years
CI/CD
CircleCI · 2+ years
DroneCI · 2+ years
GithubActions · 1 years
GitLabCI · 1 years
Community Activities
GopherDay Taiwan
2024Speaker: "Reflect 用太多,都不好意思跟別人說 Go 靜態了"
Conference for Open Source Coders, Users and Promoters
2023Speaker: "深入淺出 Go AST,動手做自己的 Linter"
Dcard Tech Sharing
2023Speaker: "千萬用戶的大流量下 Dcard 後端團隊怎麼面對?"
Golang Taiwan Gathering
2022Speaker: "擺脫 Lock 實作 GraphQL Dataloader"