
Search by job, company or skills
We are seeking an experienced Cloud Engineer (Operations - Day 2 Support) to manage, support, and optimize AWS cloud environments supporting business-critical applications and infrastructure. This role focuses on cloud operations, incident management, system reliability, performance monitoring, security compliance, and continuous improvement of cloud platforms.
The ideal candidate will possess strong hands-on AWS experience, excellent troubleshooting capabilities, and a proactive approach to maintaining highly available and secure cloud environments.
Provide Day 2 operational support for AWS cloud infrastructure and hosted applications.
Monitor cloud environments to ensure high availability, reliability, and optimal performance.
Investigate, troubleshoot, and resolve production incidents, service disruptions, and infrastructure issues.
Perform root cause analysis (RCA) and implement preventive measures to reduce recurring incidents.
Manage and maintain AWS services including EC2, VPC, RDS, S3, IAM, Route 53, ELB/ALB, CloudWatch, Lambda, Auto Scaling, SNS, and SQS.
Support patch management, system maintenance, backup, restoration, and recovery activities.
Configure and maintain monitoring, alerting, logging, and observability solutions.
Ensure cloud infrastructure complies with security, governance, and operational best practices.
Collaborate with application, infrastructure, security, and DevOps teams to support production workloads.
Participate in change management, deployment support, and release activities.
Automate operational tasks using scripting and Infrastructure as Code (IaC) methodologies.
Develop and maintain operational runbooks, standard operating procedures (SOPs), and technical documentation.
Support disaster recovery testing and business continuity initiatives.
Participate in on-call support rotations and incident response activities when required.
Degree in Computer Science, Information Technology, Engineering, or a related discipline.
Minimum 3 years of hands-on experience supporting AWS cloud environments in a production setting.
Strong knowledge of AWS services including EC2, VPC, S3, RDS, IAM, Route 53, CloudWatch, ELB/ALB, Auto Scaling, Lambda, SNS, and SQS.
Experience administering Linux and/or Windows Server environments.
Strong troubleshooting, incident management, and problem-resolution skills.
Hands-on experience with Infrastructure as Code tools such as Terraform or AWS CloudFormation.
Proficiency in scripting using Bash, Python, or PowerShell.
Experience with monitoring and logging platforms such as CloudWatch, Splunk, ELK, Datadog, or Prometheus.
Good understanding of backup, recovery, high availability, and disaster recovery concepts.
Knowledge of networking fundamentals including TCP/IP, DNS, VPNs, load balancing, and firewalls.
Familiarity with ITIL-based service management processes including Incident, Problem, Change, and Service Request Management.
Strong communication, stakeholder management, and documentation skills.
Ability to work effectively in a fast-paced operational support environment.
AWS Certified Solutions Architect - Associate or Professional.
AWS Certified SysOps Administrator.
Experience with container technologies such as Docker and Kubernetes (EKS).
Exposure to CI/CD tools including Jenkins, GitLab CI/CD, Azure DevOps, or GitHub Actions.
Experience supporting large-scale enterprise cloud environments.
Knowledge of cloud security best practices and compliance frameworks.
Interested applicants may send their CV directly to [Confidential Information] for consideration.
Job ID: 149257187
Skills:
Aws Cloud, Cloud Infrastructure, Kubernetes, Cloud Migration, Jira, Itsm, Cicd, Git
We don’t charge any money for job offers