
Design, implement, and maintain highly available, scalable, and reliable production systems.
Develop monitoring, alerting, and observability solutions to improve system reliability.
Automate operational tasks and reduce manual intervention through scripting and tooling.
Troubleshoot production incidents, perform root cause analysis, and drive long-term improvements.
Collaborate with development teams to improve system performance, deployment processes, and operational efficiency.
Participate in capacity planning, disaster recovery planning, and incident response activities.
Create and maintain operational documentation and runbooks.
Bachelor's degree in Computer Science, Computer Engineering, or a related field.
3+ years of experience in Site Reliability Engineering, DevOps, or Infrastructure Engineering.
Strong experience with Linux systems, Kubernetes, containers, and distributed systems.
Familiarity with monitoring and observability tools such as Prometheus, Grafana, ELK, or OpenTelemetry.
Experience with CI/CD pipelines, automation, and scripting languages such as Bash, Python, or Go.
Knowledge of cloud platforms and Infrastructure as Code tools is preferred.
Excellent problem-solving abilities and strong collaboration skills.
If you are passionate about this role and meet the above requirements, please don't hesitate to apply. Please note that only shortlisted candidates will be contacted. We appreciate your understanding. Data provided is for recruitment purposes only.
About Us
Dada Consultants was established in 2017 with a commitment to providing the best recruitment services in Singapore. We are a dynamic head-hunting team dedicated to sourcing highly competent professionals in the IT industry. We provide enterprises with customized talent solutions and help talent advance their careers.
Dada Consultants Pte Ltd
Website: www.dadaconsultants.com
EA License No.: 18S9037 | EA Registration No.: R25128548
Business Registration Number: 201735941W
Job ID: 149232821
Skills:
GCP, Datadog, Prometheus, Azure, Terraform, Grafana, Jenkins, Ansible, GitHub Actions, AI-Ops, GCP Operations Suite, Azure Monitor