About the Role
We are looking for a Platform Engineer to join our engineering team. In this role, you will build the paved road for our software developerscreating the internal developer platform (IDP), automating infrastructure, and ensuring our systems are scalable, secure, and resilient.
You will bridge the gap between software engineering and operations, treating infrastructure as a product and our developers as your customers. If you are passionate about Kubernetes, Cloud Native technologies, and reducing toil through automation, we want to meet you.
Key Responsibilities
- Build & Maintain Infrastructure: Design and manage scalable cloud infrastructure on [AWS / GCP] using Infrastructure as Code (Terraform/Ansible).
- Enable Developer Velocity: Create self-service tools and internal developer platforms (IDP) that allow product teams to spin up environments, deploy code, and monitor services autonomously.
- Kubernetes Management: Administer production-grade Kubernetes clusters (EKS/AKS/GKE), including upgrades, scaling, and security hardening.
- CI/CD Optimization: Design, maintain, and optimize CI/CD pipelines (GitLab CI, GitHub Actions, Jenkins) to ensure fast and reliable software delivery.
- Observability: Implement and manage monitoring, logging, and tracing stacks (Prometheus, Grafana, ELK, Datadog) to ensure system health and rapid incident response.
- Reliability Engineering: Participate in on-call rotations (if applicable), lead incident post-mortems, and implement SRE practices like SLIs/SLOs.
- Security (DevSecOps): Integrate security best practices into the infrastructure and pipelines (IAM, secret management, vulnerability scanning).
Qualifications
Must-Have:
- Experience: 5+ years of experience in DevOps, Site Reliability Engineering (SRE), and Platform Engineering.
- Cloud Proficiency: Strong hands-on experience with at least one major cloud provider (Preferably AWS).
- Containerization: Deep understanding of Docker and container orchestration with Kubernetes (administering clusters, not just deploying to them).
- Infrastructure as Code: Proficiency with Terraform (preferred), Ansible, or Pulumi.
- Scripting: Solid programming skills in Python, Go (Golang), or Bash for automation and tooling.
- OS Knowledge: Strong background in Linux administration and networking fundamentals (DNS, Load Balancing, VPCs).
Nice-to-Have:
- Experience with Gitlab tools (DAST, SAST)
- Experience building Internal Developer Portals (e.g., Backstage.io).
- Certifications: CKA (Certified Kubernetes Administrator), AWS Certified Solutions Architect.
- Background in software development (backend engineering) before moving to ops.
Our Tech Stack (Experienced)
- Cloud: [e.g., AWS]
- IaC:Terraform
- Orchestration:Kubernetes (EKS)
- CI/CD:GitLab CI
- Observability:Datadog / Prometheus / Grafana
- Languages: Python, Go, TypeScript