Description
We are seeking a Site Reliability Engineer to join our team in Cebu, SEA. The ideal candidate will have a strong background in system reliability and automation, with a focus on maintaining the uptime and performance of our critical services.
Responsibilities
- Monitor and maintain system reliability and uptime of critical services.
- Implement automation tools and frameworks to improve operational efficiency.
- Collaborate with development teams to design and implement scalable systems and infrastructure.
- Troubleshoot and resolve production issues in a timely manner.
- Participate in on-call rotations and incident management processes.
- Develop and maintain documentation for system architecture and operational procedures.
Skills and Qualifications
- 5-10 years of experience in Site Reliability Engineering or related fields.
- Strong understanding of cloud computing platforms (AWS, Azure, GCP).
- Proficiency in scripting languages (Python, Bash, etc.) and configuration management tools (Ansible, Puppet, etc.).
- Experience with containerization technologies (Docker, Kubernetes).
- Solid grasp of networking concepts and protocols.
- Ability to work in a fast-paced, collaborative environment.