We are seeking an Expert-Level Integration Engineer with primary expertise in Apache Kafka administration. This role is critical for ensuring the stability, security, and scalability of our integration ecosystem. While the main focus is Kafka administration, you will also engage in engineering/build activities in close partnership with vendors and manage other integration platforms. The ideal candidate is a subject matter expert (SME) who thrives in complex environments, drives operational excellence, and champions continuous improvement.
Key Responsibilities
- Lead the administration and management of Apache Kafka, ensuring high availability, performance, and security.
- Perform Kafka engineering/build activities, including cluster setup, scaling, and upgrades in collaboration with vendors.
- Maintain integration environment stability through patching, vulnerability remediation, and cost optimization (FinOps).
- Manage and optimize other integration platforms: IBM MQ, IBM ACE, Microsoft Identity Manager (MIM), MeshIQ, and Stonebranch File Transfer.
- Deliver end-to-end implementation, covering engineering (build) and support (run) activities.
- Develop automation scripts and workflows to improve operational efficiency (development experience is a plus).
- Act as a subject matter expert, providing guidance and mentorship to team members.
- Manage and resolve critical incidents, ensuring timely communication and root cause analysis.
- Participate in on-call rotations, including weekend support as needed.
Qualifications & Requirements
- Expert-level knowledge and hands-on experience in Apache Kafka administration (priority skill).
- Experience in Kafka engineering/build activities, including cluster setup, scaling, and upgrades.
- Strong experience managing integration platforms: IBM MQ, IBM ACE, MIM, MeshIQ, Stonebranch File Transfer.
- Proven ability to maintain environment stability, including patching, vulnerability remediation, and cost optimization.
- Experience in incident management, troubleshooting, and performance tuning.
- Familiarity with automation tools and scripting (Python, Shell, Ansible); development experience is a plus.
- Excellent communication and collaboration skills.
- Willingness to participate in on-call support, including weekends.
Preferred Skills
- Certifications in Kafka or IBM Integration products.
- Experience with cloud platforms (Azure, GCP).
- Knowledge of container orchestration (Kubernetes, Docker).
- Familiarity with monitoring tools and observability frameworks.
Soft Skills & Leadership Qualities
- Strong analytical and problem-solving skills.
- Leadership and mentorship capabilities.
- Adaptability and resilience in dynamic environments.
- Proactive mindset with ownership and accountability.