Job Description
Overview
We are seeking a skilled and passionate Site Reliability Engineer (SRE) to ensure the reliability, scalability, and performance of our hybrid and cloud-native infrastructure. You will play a critical role in automating operations, improving system resilience, and supporting mission-critical services running across Kubernetes and cloud environments. This role is ideal for engineers who enjoy solving complex infrastructure challenges, building automation, and improving platform reliability at scale.
Responsibilities
Maintain high availability, scalability, and performance of production systems.
Define and monitor SLIs, SLOs, and error budgets to ensure service reliability.
Perform root cause analysis, incident response, and postmortem reviews.
Implement reliability improvements and proactive failure prevention.
Support multi-cluster and hybrid infrastructure environments.
Implement autoscaling and high-availability architecture.
CI/CD, GitOps & Release Engineering
Design and maintain CI/CD pipelines using GitLab CI/CD.
Provision and manage infrastructure using Terraform / OpenTofu.
Develop and maintain Helm charts for Kubernetes deployments.
Automate operational tasks using Python scripting to reduce manual toil.
Observability, Monitoring & Distributed Tracing
Implement centralized logging using Grafana Loki and ELK Stack.
Build dashboards and alerts using Grafana and Datadog.
Implement distributed tracing using OpenTelemetry to improve system visibility.
Improve monitoring coverage and alert accuracy.
Conduct load and stress testing using tools such as k6, Locust, or JMeter.
Analyze performance bottlenecks and implement tuning strategies.
Support capacity planning and performance optimization.
Support Change Data Capture (CDC) and real-time data streaming pipelines.
Work with Confluent Platform / Apache Kafka to ensure reliable event-driven data flow.
Security & Secret Management
Manage secrets securely using Google Cloud Secret Manager, Kubernetes Secrets, and HashiCorp Vault.
Implement secure CI/CD and platform access practices.
Education
Bachelor’s degree in Computer Science, Informatics, Information Systems, Electrical Engineering, Mathematics/Statistics, or related field.
Experience
0–4 years of experience in SRE, DevOps, Cloud Engineering, or Platform Engineering.
Hands-on experience supporting production systems and cloud infrastructure.
Technical Skills
Strong Linux system administration and networking fundamentals.
Hands-on experience with Kubernetes and containerized environments.
Experience designing and maintaining CI/CD pipelines.
Infrastructure as Code experience (Terraform, Ansible).
Monitoring, logging, and observability best practices.
Programming/scripting skills in Bash and Python (Go is a plus).
Familiarity with Google Cloud Platform (GCP).
Contact
Email Group Legal & Corporate: corporate.secretary@ioh.co.id
Email Investor Communication: investor@ioh.co.id
Job Details
Posted Date: February 24, 2026
Job Type: Construction
Location: Indonesia
Company: Indosat