Job Description
About the Role:
We are seeking an accomplished and visionary
DevOps Leader
to spearhead our entire DevOps function. In this pivotal role, you will be the strategic architect and technical authority, responsible for guiding the evolution and optimization of our infrastructure, operations, and deployment practices. You will lead the DevOps team, ensuring our systems are highly available, scalable, secure, fault-tolerant, and cost-efficient. This position demands a blend of deep technical expertise, exceptional leadership, and a commitment to fostering a culture of operational excellence across the engineering organization.
Key Responsibilities:
Strategic DevOps Leadership & Architecture:
Lead the entire DevOps organization , taking full ownership of the
architecture and technical leadership of the entire DevOps infrastructure and team .
Define, communicate, and execute the long-term DevOps strategy, roadmap, and vision, aligning it directly with broader business and engineering objectives.
Drive the adoption of cutting-edge practices in infrastructure as code, continuous integration/delivery, and site reliability engineering.
Platform Operations & Reliability Engineering:
Deploy, manage, and operate scalable, highly available, fault-tolerant, and cost-optimized systems
in a dynamic production environment.
Setup and champion the Application Monitoring Framework , establishing robust logging, alerting, and performance monitoring best practices and processes
across all engineering teams .
Oversee incident response, root cause analysis, and proactive measures to ensure maximum uptime and system health.
Platform Security & Compliance Management:
Manage Platform Security and Compliance , ensuring the entire platform consistently
meets the latest security and compliance requirements .
Proactively
identify vulnerabilities and critical business risks
within our infrastructure and applications.
Collaborate strategically with engineering teams to
plan and drive the timely closure
of all identified security and compliance gaps.
Implement and enforce security-first principles throughout the operational lifecycle.
Team Leadership & Development:
Recruit, mentor, coach, and develop a high-performing team of DevOps/SRE engineers, cultivating a culture of innovation, continuous learning, and shared ownership.
Provide clear direction, set performance expectations, and foster career growth for team members.
Technical Vendor Management & Negotiation:
Manage critical vendor relationships (tech) , including cloud providers, SaaS tools, and specialized services.
Lead engagement with vendors during critical issues,
driving accountability across defined Service Level Agreements (SLAs)
to ensure optimal performance and support.
Qualifications:
Bachelor's or Master's degree in Computer Science, Engineering.
12+ years of progressive experience
in DevOps or Infrastructure roles, with
at least 5+ years in a leadership position
managing and mentoring DevOps teams.
Proven track record of architecting, deploying, and managing highly scalable, secure, and resilient cloud-native infrastructure
(specifically AWS).
Expert-level proficiency in CI/CD methodologies and tools (e.g., Jenkins, GitLab CI, ArgoCD, Spinnaker).
Deep expertise with Infrastructure as Code (IaC) tools such as Terraform, CloudFormation, or Ansible.
Extensive experience with containerization (Docker) and container orchestration platforms (Kubernetes).
Strong background in setting up and leveraging comprehensive observability stacks (monitoring, logging, tracing – e.g., Prometheus, Grafana, ELK Stack, Datadog).
Demonstrated ability to implement and enforce robust security practices and manage compliance frameworks (e.g., ISO 27001, SOC 2).
Strong experience in
vendor management, including contract negotiations and driving SLA adherence .
Exceptional leadership, strategic thinking, and problem-solving abilities.
Excellent communication, interpersonal, and stakeholder management skills, with the ability to influence technical and non-technical audiences.
Preferred Qualifications:
Experience in a high-growth, fast-paced SaaS or logistics technology environment.
Relevant industry certifications (e.g., AWS Certified DevOps Engineer - Professional, Certified Kubernetes Administrator).
Experience with advanced networking concepts, distributed systems, and microservices architectures.
Proficiency in programming/scripting languages such as Python, Go, or Java for automation and tooling development