Design, deploy, and automate infrastructure (on-prem and cloud: AWS/GCP/VNG Cloud) with a focus on availability, scalability, cost efficiency, and security.
Build and maintain CI/CD pipelines and Infrastructure as Code (IaC) to streamline deployment workflows and accelerate software delivery.
Manage and optimize Kubernetes and Docker environments for mission-critical services.
Implement monitoring, logging, tracing, and alerting solutions to deliver observability, reliability, and SLA/SLO compliance.
Apply cloud security and compliance best practices aligned with industry standards (e.g., ISO 27001, SOX 404, ITGC).
Lead reliability engineering initiatives, including incident response, root cause analysis, and performance optimization.
Continuously improve DevOps workflows, scalability, reliability, automation, and development productivity.