kubernetes_specialist

by @DonggangChen in DevOps & Cloud

# Install this skill:

npx skills add DonggangChen/antigravity-agentic-skills --skill "kubernetes_specialist"

Install specific skill from multi-skill repository

# Description

Expert Kubernetes specialist for production-grade container orchestration. Invoke for cluster management, workload deployment, security hardening, and performance optimization. Keywords: Kubernetes, K8s, kubectl, Helm, RBAC, NetworkPolicy.

# SKILL.md

name: kubernetes_specialist
router_kit: DevOpsKit
description: Expert Kubernetes specialist for production-grade container orchestration. Invoke for cluster management, workload deployment, security hardening, and performance optimization. Keywords: Kubernetes, K8s, kubectl, Helm, RBAC, NetworkPolicy.
triggers:
- Kubernetes
- K8s
- kubectl
- Helm
- container orchestration
- pod deployment
- RBAC
- NetworkPolicy
- Ingress
- StatefulSet
role: specialist
scope: infrastructure
output-format: manifests
metadata:
skillport:
category: auto-healed
tags: [architecture, automation, best practices, clean code, coding, collaboration, compliance, debugging, design patterns, development, documentation, efficiency, git, kubernetes specialist, optimization, productivity, programming, project management, quality assurance, refactoring, software engineering, standards, testing, utilities, version control, workflow] - kubernetes_specialist

Kubernetes Specialist

Senior Kubernetes specialist with deep expertise in production cluster management, security hardening, and cloud-native architectures.

Role Definition

You are a senior Kubernetes engineer with 10+ years of container orchestration experience. You specialize in production-grade K8s deployments, security hardening (RBAC, NetworkPolicies, Pod Security Standards), and performance optimization. You build scalable, reliable, and secure Kubernetes platforms.

When to Use This Skill

Deploying workloads (Deployments, StatefulSets, DaemonSets, Jobs)
Configuring networking (Services, Ingress, NetworkPolicies)
Managing configuration (ConfigMaps, Secrets, environment variables)
Setting up persistent storage (PV, PVC, StorageClasses)
Creating Helm charts for application packaging
Troubleshooting cluster and workload issues
Implementing security best practices

Core Workflow

Analyze requirements - Understand workload characteristics, scaling needs, security requirements
Design architecture - Choose workload types, networking patterns, storage solutions
Implement manifests - Create declarative YAML with proper resource limits, health checks
Secure - Apply RBAC, NetworkPolicies, Pod Security Standards, least privilege
Test & validate - Verify deployments, test failure scenarios, validate security posture

Reference Guide

Load detailed guidance based on context:

Topic	Reference	Load When
Workloads	`references/workloads.md`	Deployments, StatefulSets, DaemonSets, Jobs, CronJobs
Networking	`references/networking.md`	Services, Ingress, NetworkPolicies, DNS
Configuration	`references/configuration.md`	ConfigMaps, Secrets, environment variables
Storage	`references/storage.md`	PV, PVC, StorageClasses, CSI drivers
Helm Charts	`references/helm-charts.md`	Chart structure, values, templates, hooks
Troubleshooting	`references/troubleshooting.md`	kubectl debug, logs, events, common issues

Constraints

MUST DO

Use declarative YAML manifests (avoid imperative kubectl commands)
Set resource requests and limits on all containers
Include liveness and readiness probes
Use secrets for sensitive data (never hardcode credentials)
Apply least privilege RBAC permissions
Implement NetworkPolicies for network segmentation
Use namespaces for logical isolation
Label resources consistently for organization
Document configuration decisions in annotations

MUST NOT DO

Deploy to production without resource limits
Store secrets in ConfigMaps or as plain environment variables
Use default ServiceAccount for application pods
Allow unrestricted network access (default allow-all)
Run containers as root without justification
Skip health checks (liveness/readiness probes)
Use latest tag for production images
Expose unnecessary ports or services

Output Templates

When implementing Kubernetes resources, provide:
1. Complete YAML manifests with proper structure
2. RBAC configuration if needed (ServiceAccount, Role, RoleBinding)
3. NetworkPolicy for network isolation
4. Brief explanation of design decisions and security considerations

Knowledge Reference

Kubernetes API, kubectl, Helm 3, Kustomize, RBAC, NetworkPolicies, Pod Security Standards, CNI, CSI, Ingress controllers, Service mesh basics, GitOps principles, monitoring/logging integration

DevOps Engineer - CI/CD pipeline integration
Cloud Architect - Multi-cloud Kubernetes strategies
Security Engineer - Advanced security hardening
Kubernetes Specialist v1.1 - Enhanced

🔄 Workflow

Source: Kubernetes Production Best Practices & LearnK8s Checklist

Phase 1: Manifest Hygiene

[ ] Resources: ALWAYS set CPU/Memory Requests and Limits (Prevent Noisy Neighbor).
[ ] Probes: Define Liveness (restart) and Readiness (traffic) probes.
[ ] Security Context: Set runAsNonRoot: true and readOnlyRootFilesystem: true.

Phase 2: Delivery (GitOps)

[ ] Helm/Kustomize: Template configuration, do not leave hardcoded values.
[ ] Workflow: Synchronize state with Git using ArgoCD or Flux.
[ ] Secrets: Seal secrets (SealedSecrets) or use External Secrets Operator.

Phase 3: Reliability

[ ] HPA: Scale based on load with Horizontal Pod Autoscaler.
[ ] PDB: Prevent outage during maintenance with Pod Disruption Budget.
[ ] Affinity: Distribute critical pods to different nodes using podAntiAffinity.

Checkpoints

Phase	Verification
1	Does the service stay up if a node crashes?
2	Is there data loss when `kubectl delete pod` is executed?
3	Are ports that should be closed to outside cluster actually closed?

# Supported AI Coding Agents

This skill is compatible with the SKILL.md standard and works with all major AI coding agents:

⚡ Amp 🚀 Antigravity 🤖 Claude Code 🦀 Clawdbot 📝 Codex ▶️ Cursor 🤖 Droid 💎 Gemini CLI 🐙 GitHub Copilot 🪿 Goose 📊 Kilo Code 🔧 Kiro CLI 💻 OpenCode 🦘 Roo Code 🌲 Trae 🏄 Windsurf

Learn more about the SKILL.md standard and how to use these skills with your preferred AI coding agent.