Job Description:
- Strong understanding of observability tools and platforms (e.g., Dynatrace, Prometheus, Grafana, Splunk, New Relic, Datadog).
- Proven experience establishing and scaling observability and monitoring frameworks for large-scale enterprise systems.
- Expertise in incident management, monitoring, alerting, and performance optimization within regulated industries.
- Familiarity with AI/ML applications in predictive monitoring, anomaly detection, and incident analysis.
- Good to have: advanced certifications in observability or monitoring platforms, SRE practices, or cloud infrastructure.
Qualifications:
- Degree in Computer Science, Engineering, Information Systems, or a related field.
- 8+ years of experience in IT operations, observability, monitoring, or a related field, with a minimum of 3 years in a leadership or Center of Excellence (CoE) role.
- Excellent analytical skills and experience with data analytics, anomaly detection, and automation in observability.
- Strong project management skills, with experience leading cross-functional initiatives within Agile or DevOps environments.
- Understanding of the banking or financial services domain.
- Ability to balance competing priorities, workloads, and tight timelines in a fast-paced, dynamic work environment.