Bachelor's or master's degree in computer science, Engineering, or related field.
Proven experience in SRE, DevOps, or software engineering roles.
Strong scripting skills (Python, Bash, Ruby).
Expertise in Linux systems, networking (TCP/IP, DNS), and cloud platforms (AWS).
Familiarity with CI/CD tools (GitLab, Github Actions, Jenkins) and container orchestration (Kubernetes, Docker).
Experience with automation and configuration management tools.
Excellent problem-solving and communication skills.
Preferably with background in Application Development (Web Applications)
Ability to quickly learn and apply enterprise AI tools and technologies to support technical workflows and business objectives.
Accountabilities
Infrastructure & Cloud Management: Design and maintain scalable, secure, and highly available infrastructure ( AWS).
Implement Infrastructure as Code (IaC) using tools like Terraform, CloudFormation, and Ansible. Manage container orchestration platforms (ECS/ECR).
Automation & CI/CD: Build and maintain CI/CD pipelines for automated testing, deployment, and rollback. Automate routine operational tasks to reduce manual effort and improve reliability. Integrate security and compliance checks into pipelines (DevSecOps).
Monitoring & Observability: Set up and maintain monitoring, logging, and alerting systems (e.g., Prometheus, Grafana, ELK). Define and track SLIs, SLOs, and SLAs. Implement distributed tracing and performance profiling.
Incident Management & Reliability Engineering: Participate in on-call rotations and lead incident response efforts. Conduct root cause analysis and write postmortems. Design self-healing systems and automated recovery mechanisms. Apply chaos engineering principles to test system resilience.
Security & Compliance: Manage secrets, access controls, and identity policies (IAM, Vault). Ensure infrastructure and deployments meet compliance standards (e.g., SOC 2, ISO 27001). Remediate vulnerability issues.
Collaboration & Mentorship: Work closely with software engineers, QA, and product teams to ensure smooth releases. Mentor junior engineers and contribute to team knowledge sharing. Participate in architectural reviews and technical planning.