Home / Jobs / Paya Lebar Air Base Jobs / Operations / Cloud Operations engineer

Cloud Operations engineer

YEPEESOFT PTE. LTD.

Full Time Paya Lebar Air Base, East Region Mid Level Competitive
Apply Now

Description

Key Responsibilities


  • Design and implement scalable, secure, and highly available cloud infrastructure across multi-region environments

  • Manage and optimize Kubernetes clusters, ensuring reliability, performance, and scalability

  • Develop and maintain CI/CD pipelines and automated deployment frameworks to improve release efficiency and reduce errors

  • Lead cloud migration initiatives, ensuring zero downtime and data integrity

  • Build and enhance observability frameworks (monitoring, logging, alerting) using tools such as Prometheus, Grafana, and ELK

  • Drive system reliability improvements, including incident response, root cause analysis, and performance tuning

  • Optimize infrastructure costs and resource utilization without compromising system stability

  • Implement traffic management and capacity planning strategies for high-concurrency systems

  • Develop tools and platforms to improve automation, operational efficiency, and developer productivity

  • Collaborate with cross-functional teams to ensure high-quality system design and delivery


Requirements

  • Minimum 5–8 years of experience in SRE / DevOps / Cloud Engineering roles

  • Strong hands-on experience with:Kubernetes, Docker, and container orchestrationCloud platforms (AWS, Alibaba Cloud, or similar)CI/CD tools (Jenkins, GitHub Actions, etc.)

  • Proficiency in infrastructure as code (e.g., Terraform, Ansible)

  • Experience with monitoring and logging tools (Prometheus, Grafana, ELK stack)

  • Strong knowledge of Linux systems, networking (VPC, DNS, CDN), and security best practices

  • Programming/scripting experience in Go, Python, or Shell

  • Experience with high-scale distributed systems and microservices architecture


Preferred Qualifications

  • Experience in large-scale internet or e-commerce platforms

  • Proven track record in cloud migration and cost optimization initiatives

  • Exposure to multi-cluster Kubernetes management and automation platforms

  • Experience in AI/ML platform infrastructure or data-intensive systems

  • Leadership experience or ability to mentor junior engineers


Key Competencies

  • Strong problem-solving and analytical skills

  • Ability to work in fast-paced, high-availability environments

  • Excellent communication and stakeholder management skills

  • Proactive mindset with a focus on automation and continuous improvement

About YEPEESOFT PTE. LTD.

Description pending