- Implement and improve monitoring and alerting.
- Build and maintain highly available systems on Kubernetes.
- Implement and manage CI/CD pipelines.
- Implement an auto-scaling system for our Kubernetes nodes.
- Ability to work with agile development teams.
- Demonstrated proficiency with automated configuration management tools (Puppet, Ansible, Chef)
- Ensure security through access controls, backups and firewalls.
- Upgrade systems with new releases and models.
- Responsible for application administration activities in support of development, user acceptance test and production systems supporting web-deployed applications
- Monitoring internal and production hosts using Nagios, Cacti, and other application performance monitoring systems.
- Working with Developers and other Engineers in troubleshooting internal production issues.
- Performing network and operational tasks in the internal and production systems.
- Scripting operational tasks for faster and less error-prone execution.
- Working with the Operations and Product Development teams in continually improving the company’s automated software deployment process.
- Maintaining and evolving the security infrastructure on both internal and production environments.
- Working with the Operations and Product Development teams in continually improving the company’s automated software deployment process.
- Maintaining and evolving the security infrastructure on both internal and production environments.
Bachelor's Degree in Computer Science or any related fieldMinimum 3 years of experienceFluency in Arabic & English (Reading, Writing & Oral). (Preferred)Special Certificates: preferably Red Hat Certified.Operations or systems administration experience, particularly on Linux.At least 3 years of experience with Kubernetes, Docker, and/or cloud deployment technologies.Experience with container networking on Docker.Experience with application deployment by using CI/CD.Experience with monitoring tools like Prometheus, Grafana, Datadog, etc.Experience with alerting tools like OpsGenie, PagerDuty, etc.Manage servers ( install – maintenance - rescue - idrac )