Role Purpose
Ensure availability, performance, monitoring, operating & maintaining of existing and new functionalities\systems within defined SLA and SLO KPIs. provide technical guidance to team members.
Key Accountabilities & Decision Ownership
• Ensure environment stability, availability, security and performance through SLO’s, SLA’s and automation enforcement
• Develop and maintain CI/CD across a range of languages and environments.
• Working closely with software engineers, product owners and systems engineers to understand requirements, participate in implementation and operate.
• Participating in On-Call rotations, fixing support escalation Issues, helping to improve incident response and conducting post-Incident reviews.
• Responsible for the end-to-end monitoring activities (application, system, setup)
• Vendor management and follow up vendor escalated issues and planning for systems upgrades.
• Technical Guidance to build team knowledge and skills.
Core Competencies, Knowledge, and Experience
• Advanced trouble shooting, analytical and problem-solving skills.
• Ability to work in a team and under stress in 24/7 work environment.
• Agile and DevOps methodologies and tools knowledge
• Cloud computing awareness
• Excellent communication skills, both written and verbal.
• Excellent time management with strong sense of urgency.
Must have technical / professional qualifications:
• Computer Science/Engineering graduate or equivalent related degree.
• 3+ years of working experience in supporting and administrating scaled systems.
• Hands-on experience in following technologies: RedHat OpenShift, Kubernetes, Database, Unix/Linux, Git, Jenkins, Ansible, Docker, Jira, and Shell scripting.