What You'll Be Doing:
Cluster Management and Automation:
- Monitor and maintain the health of Kubernetes clusters to ensure optimal performance and availability.
- Assist in routine maintenance tasks, including cluster upgrades, control plane upgrades, and security-focused updates for operational and client node groups.
- Participate in triaging and resolving issues related to Kubernetes infrastructure, networking, storage, and security.
- Perform routine AWS module upgrades, manage SSL certificate rotations and secrets, and handle Terraform state refreshes and updates.
- Learn and apply Kubernetes best practices and automation techniques to improve operational efficiency.
Security and Vulnerability Management:
- Remediate image vulnerabilities across 75+ Docker images, rolling out updated images after remediation to keep deployments secure.
- Address Kyverno policy violations on customer workloads and remediate security concerns.
- Perform ArgoCD version upgrades to maintain secure GitOps workflows, and Helm chart upgrades for operational security.
Collaboration & Support:
- Work closely with development teams to understand and support containerized applications, ensuring seamless integration and deployment.
- Assist in implementing infrastructure as code (IaC) using tools like Terraform for managing Kubernetes clusters and automating operational tasks.
- Collaborate on the rollout of new Helm chart updates across multiple environments, ensuring smooth transitions.
Documentation:
- Contribute to internal documentation and knowledge-sharing initiatives to enhance team capabilities and collaboration.
Qualifications:
- Bachelor's degree in Computer Science or Engineering
- Basic understanding of Kubernetes concepts and containerization technologies such as Docker
- Familiarity with at least one managed Kubernetes platform (EKS, AKS, GKE)
- Knowledge of scripting languages (e.g., Bash, Python)
- Exposure to infrastructure as code tools (e.g., Terraform, Ansible) is a plus
- Fundamental understanding of cloud platforms (AWS, Azure, GCP)
- Strong problem-solving and troubleshooting mindset
- Excellent communication and documentation skills
- Ability to work in a collaborative team environment
- Eagerness to learn and grow in the field of Kubernetes and cloud-native technologies