- Design, build, and maintain scalable, secure, and highly available infrastructure across cloud environments (e.g., AWS, Azure, GCP).
- Implement and manage CI/CD pipelines to enable fast, reliable deployments.
- Define and enforce infrastructure best practices including Infrastructure as Code (IaC), configuration management, and monitoring.
- Collaborate with software engineers, QA, and security teams to improve deployment and release processes.
- Build and maintain system monitoring, alerting, and logging stacks (e.g., Prometheus, Grafana, ELK, Datadog).
- Drive automation efforts to reduce manual work and improve system resilience.
- Troubleshoot and resolve issues in dev, staging, and production environments.
- Participate in incident management and post-incident analysis.
- Lead infrastructure security initiatives and ensure compliance with company policies.
- Config the server, install the app or software as well as other features the project needs.
- Setting up tools and required infrastructure
- Stay up-to-date on the latest DevOps tools and technologies.
3+ years of experience in DevOps, SRE, or infrastructure engineering roles.Strong expertise in at least one major cloud provider (AWS preferred).Solid experience with IaC tools (e.g., Terraform, Pulumi, CloudFormation).Deep understanding of CI/CD tools (e.g., GitHub Actions, GitLab CI, Jenkins, ArgoCD).Strong skills in scripting and automation (Bash, Python, Go, etc.).Experience working on Linux-based infrastructureExperience with containerization and orchestration (Docker, Kubernetes, Helm).Configuration and managing databases such as MSQL Server, MySQL, Mongo…etc.Solid knowledge of monitoring/logging tools and practices.Familiarity with secure cloud architecture and DevSecOps practices.Proven ability to work independently and mentor junior engineers.Problem solver and troubleshooterExcellent communication and problem-solving skills.