Description:
- Design, deploy, and manage Kubernetes clusters at scale across multiple production sites using VMware Tanzu Kubernetes Grid (TKG) and VMware Telco Cloud Automation (TCA).
- Operate and maintain VMware-based infrastructure including vSphere, VCF, NSX-T, TCA, and TKG/VKS.
- Manage cluster lifecycle activities including upgrades, patching, capacity planning, and security hardening.
- Contribute to platform automation, monitoring, observability, and disaster recovery practices.
- Troubleshoot complex production issues spanning Kubernetes, networking, storage, and underlying infrastructure.
- Configure and maintain ingress, load balancing (AVI/AKO), and service mesh solutions.
- Implement and maintain GitOps-based deployment pipelines and Infrastructure as Code.
- Work closely with application teams to onboard workloads and improve developer experience.
- Document architectures, runbooks, and operational procedures for knowledge sharing across teams.
- Collaborate with cross-functional teams across networking, security, and application engineering.
Required Skills & Experience
- 8+ years of experience in enterprise infrastructure, including at least 4 years focused on Kubernetes/TKG.
- Strong hands-on expertise with Kubernetes administration (CKA certification preferred).
- Hands-on experience with VMware Tanzu Kubernetes Grid (TKG) and VMware Telco Cloud Automation (TCA), including cluster provisioning, lifecycle management, and CNF/VNF onboarding.
- Solid VMware background including vSphere, VCF, NSX-T, AVI, and Tanzu/TKG/VKS.
- Proficiency with Infrastructure as Code tools (Terraform, Ansible) and GitOps workflows (ArgoCD).
- Experience operating Kubernetes platforms in production at scale across multiple sites.
- Working knowledge of container registries (Harbor), Helm, and OCI standards.
- Familiarity with monitoring and observability tooling.
- Strong understanding of Linux systems, networking, and storage fundamentals.
- Excellent troubleshooting and problem-solving skills in complex distributed systems.
Nice to Have
- VMware certifications (VCP-DCV, VCAP-DCV, VMware Telco Cloud).
- Experience in telecom or large-scale enterprise environments.
- Scripting skills in Bash and at least one of Python or Go.