Job Description
About the Role
We are looking for an experienced Engineering Manager - Cloud Operations to lead and scale our Cloud Operations and DevOps initiatives. You will manage a high-performing team, drive operational excellence across multiple cloud platforms, and ensure our infrastructure is secure, scalable, and reliable as we grow.
If you thrive in fast-paced environments, have strong technical depth, and are passionate about building resilient platforms with a start-up mindset, we'd love to talk to you.
Responsibilities
• Lead and manage the Cloud Operations / DevOps team, fostering a culture of ownership, reliability, and continuous improvement.
• Drive operational excellence across cloud infrastructure (AWS / GCP / Azure), ensuring high availability, performance, and cost efficiency.
• Define and own SLOs, SLIs, and SLAs, and ensure effective monitoring, incident management, and on-call processes.
• Build and maintain scalable, secure, and cost-optimized environments across multi-region and multi-cloud deployments.
• Manage and improve CI/CD pipelines and adopt GitOps practices.
• Establish strong observability frameworks (Prometheus, Grafana, Datadog, etc.) and reduce MTTR through automation.
• Drive cloud security and compliance best practices (IAM, secrets management, auditing, SOC2, ISO27001, GDPR, HIPAA as relevant).
• Ensure disaster recovery, backup strategies, and business continuity planning are in place and regularly tested.
• Collaborate with engineering, product, and data teams to balance speed and reliability in deployments.
• Identify opportunities for automation, performance tuning, and infrastructure cost optimization.
• Mentor and grow the team; partner with leadership on hiring, career development, and retention of top talent.
• Align cloud strategy with business goals and contribute to long-term architectural decisions.
Requirements
• 8–12 years of experience in Cloud Operations, DevOps, or SRE roles.
• 2–3 years of experience managing small to medium-sized technical teams.
• Strong experience with at least one major cloud platform (AWS, GCP, or Azure).
• Proficiency with Infrastructure as Code (Terraform, Pulumi, CloudFormation).
• Hands-on experience with Kubernetes, Docker, and microservices at scale.
• Strong knowledge of CI/CD pipelines, observability tools, and incident response frameworks (PagerDuty, Opsgenie, Squadcast).
• Experience managing cloud costs and driving optimization initiatives.
• Strong communication, stakeholder management, and cross-functional collaboration skills.
• Bachelor's or Master's degree in Computer Science or equivalent.
Good to Have
• Familiarity with multi-cloud or hybrid cloud environments.
• Exposure to data platforms and ML infrastructure (MLOps tools like SageMaker, Vertex AI, Kubeflow, MLflow).
• Cloud certifications (AWS/GCP/Azure).
Why Join Us?
• Build and scale modern cloud platforms from the ground up.
• Define the culture, processes, and growth path of the Cloud Ops function.
• Work with cutting-edge technologies and best practices.
• Impact a rapidly growing organization by shaping infrastructure at scale.
• Thrive in a dynamic, fast-paced environment with autonomy and ownership.